> For the complete documentation index, see [llms.txt](https://wiki.clay-wangzhi.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://wiki.clay-wangzhi.com/shell/part3/10_manipulating_variables/10_1_manipulating_strings.md). # 10.1 字符串处理 Bash 支持的字符串操作数量达到了一个惊人的数目。但可惜的是，这些操作工具缺乏一个统一的核心。他们中的一些是[参数代换](http://tldp.org/LDP/abs/html/parameter-substitution.html#PARAMSUBREF)的子集，另外一些则是 UNIX 下 [`expr`](http://tldp.org/LDP/abs/html/moreadv.html#EXPRREF) 函数的子集。这将会导致语法前后不一致或者功能上出现重叠，更不用说那些可能导致的混乱了。 ## 字符串长度 ### `$` ### `expr length $string` 上面两个表达式等价于C语言中的 `strlen()` 函数。 ### `expr "$string" : '.*'` ```bash stringZ=abcABC123ABCabc echo ${#stringZ} # 15 echo `expr length $stringz` # 15 echo `expr "$stringZ" : '.*'` # 15 ``` 样例 10-1. 在文本的段落之间插入空行 ```bash #!/bin/bash # paragraph-space.sh # 版本 2.1，发布日期 2012年7月29日 # 在无空行的文本文件的段落之间插入空行。 # 像这样使用: $0 "$filename.$SUFFIX" # 将转换结果重定向到新的文件。 rm -f $file # 在转换后删除原文件。 echo "$filename.$SUFFIX" # 将记录输出到 stdout 中。 done exit 0 # 练习： # ----- # 这个脚本会将当前工作目录下的所有文件进行转换。 # 修改脚本，使得它仅转换 ".mac" 后缀的文件。 # *** 还可以使用另外一种方法。 *** # #!/bin/bash # 将图像批处理转换成不同的格式。 # 假设已经安装了 imagemagick。（在大部分 Linux 发行版中都有） INFMT=png # 可以是 tif, jpg, gif 等等。 OUTFMT=pdf # 可以是 tif, jpg, gif, pdf 等等。 for pic in *"$INFMT" do p2=$(ls "$pic" | sed -e s/\.$INFMT//) # echo $p2 convert "$pic" $p2.$OUTFMT done exit $? ``` 样例 10-4. 将流音频格式转换成 ogg 格式 ```bash #!/bin/bash # ra2ogg.sh: 将流音频文件 (*.ra) 转换成 ogg 格式。 # 使用 "mplayer" 媒体播放器程序： # http://www.mplayerhq.hu/homepage # 使用 "ogg" 库与 "oggenc"： # http://www.xiph.org/ # # 脚本同时需要安装一些解码器，例如 sipr.so 等等一些。 # 这些解码器可以在 compat-libstdc++ 包中找到。 OFILEPREF=${1%%ra} # 删除 "ra" 后缀。 OFILESUFF=wav # wav 文件后缀。 OUTFILE="$OFILEPREF""$OFILESUFF" E_NOARGS=85 if [ -z "$1" ] # 必须指定一个文件进行转换。 then echo "Usage: `basename $0` [filename]" exit $E_NOAGRS fi ###################################################### mplayer "$1" -ao pcm:file=$OUTFILE oggenc "$OUTFILE" # 由 oggenc 自动加上正确的文件后缀名。 ###################################################### rm "$OUTFILE" # 立即删除 *.wav 文件。 # 如果你仍需保留原文件，注释掉上面这一行即可。 exit $? # 注意： # ----- # 在网站上，点击一个 *.ram 的流媒体音频文件 #+ 通常只会下载到 *.ra 音频文件的 URL。 # 你可以使用 "wget" 或者类似的工具下载 *.ra 文件本身。 # 练习： # ----- # 这个脚本仅仅转换 *.ra 文件。 # 修改脚本增加适应性，使其可以转换 *.ram 或其他文件格式。 # # 如果你非常有热情，你可以扩展这个脚本使其 #+ 可以自动下载并且转换流媒体音频文件。 # 给定一个 URL，自动下载流媒体音频文件 (使用 "wget")， #+ 然后转换它。 ``` 下面是使用字符串截取结构对 [`getopt`](http://tldp.org/LDP/abs/html/extmisc.html#GETOPTY) 的一个简单模拟。样例 10-5. 模拟 `getopt` ```bash #!/bin/bash # getopt-simple.sh # 作者: Chris Morgan # 允许在高级脚本编程指南中使用。 getopt_simple() { echo "getopt_simple()" echo "Parameters are '$*'" until [ -z "$1" ] do echo "Processing parameter of: '$1'" if [ ${1:0:1} = '/' ] then tmp=${1:1} # 删除开头的 '/' parameter=${tmp%%=*} # 取出名称。 value=${tmp##*=} # 取出值。 echo "Parameter: '$parameter', value: '$value'" eval $parameter=$value fi shift done } # 将所有参数传递给 getopt_simple()。 getopt_simple $* echo "test is '$test'" echo "test2 is '$test2'" exit 0 # 可以查看该脚本的修改版 UseGetOpt.sh。 --- sh getopt_example.sh /test=value1 /test2=value2 Parameters are '/test=value1 /test2=value2' Processing parameter of: '/test=value1' Parameter: 'test', value: 'value1' Processing parameter of: '/test2=value2' Parameter: 'test2', value: 'value2' test is 'value1' test2 is 'value2' ``` ## 子串替换 ### `${string/substring/replacement}` 替换匹配到的第一个 `$substring` 为 `$replacement`。 ### `${string//substring/replacement}` 替换匹配到的所有 `$substring` 为 `$replacement`。 ```bash stringZ=abcABC123ABCabc echo ${stringZ/abc/xyz} # xyzABC123ABCabc # 将匹配到的第一个 'abc' 替换为 'xyz'。 echo ${stringZ//abc/xyz} # xyzABC123ABCxyz # 将匹配到的所有 'abc' 替换为 'xyz'。 echo --------------- echo "$stringZ" # abcABC123ABCabc echo --------------- # 字符串本身并不会被修改！ # 匹配以及替换的字符串可以是参数么？ match=abc repl=000 echo ${stringZ/$match/$repl} # 000ABC123ABCabc # ^ ^ ^^^ echo ${stringZ//$match/$repl} # 000ABC123ABC000 # Yes! ^ ^ ^^^ ^^^ echo # 如果没有给定 $replacement 字符串会怎样？ echo ${stringZ/abc} # ABC123ABCabc echo ${stringZ//abc} # ABC123ABC # 仅仅是将其删除而已。 ``` ### `${string/#substring/replacement}` 替换 `$string` 中最前端匹配到的 `$substring` 为 `$replacement`。 ### `${string/%substring/replacement}` 替换 `$string` 中最末端匹配到的 `$substring` 为 `$replacement`。 ```bash stringZ=abcABC123ABCabc echo ${stringZ/#abc/XYZ} # XYZABC123ABCabc # 将前端的 'abc' 替换为 'XYZ' echo ${stringZ/%abc/XYZ} # abcABC123ABCXYZ # 将末端的 'abc' 替换为 'XYZ' ``` --- # Agent Instructions This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com. ## Querying This Documentation If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question. Perform an HTTP GET request on the current page URL with the `ask` query parameter: ``` GET https://wiki.clay-wangzhi.com/shell/part3/10_manipulating_variables/10_1_manipulating_strings.md?ask= ``` The question should be specific, self-contained, and written in natural language. The response will contain a direct answer to the question and relevant excerpts and sources from the documentation. Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.