bash:wordsplitting
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
bash:wordsplitting [2021/01/09 15:15] – peter | bash:wordsplitting [2021/01/09 15:51] (current) – [Controlling Word Splitting] peter | ||
---|---|---|---|
Line 4: | Line 4: | ||
Understanding how your original command will be transformed by the shell is of paramount importance in writing robust scripts. | Understanding how your original command will be transformed by the shell is of paramount importance in writing robust scripts. | ||
+ | |||
+ | <WRAP info> | ||
+ | **NOTE: | ||
+ | |||
+ | * **Word splitting** is part of the process that determines what each of those arguments will be. | ||
+ | * After **Word splitting** and **pathname expansion**, | ||
+ | |||
+ | Word splitting is performed on the results of almost all unquoted expansions. | ||
+ | |||
+ | * The result of the expansion is broken into separate words based on the characters of the IFS variable. | ||
+ | |||
+ | </ | ||
---- | ---- | ||
Line 10: | Line 22: | ||
< | < | ||
- | The order of expansions is: brace expansion, tilde expansion, parameter, variable and arithmetic expansion and command substitution (done in a left-to-right fashion), word splitting, and pathname expansion. | + | The order of expansions is: brace expansion, tilde expansion, parameter, variable and arithmetic expansion |
+ | and command substitution (done in a left-to-right fashion), word splitting, and pathname expansion. | ||
For additional information on word splitting and argument handling in Bash, consider reading Arguments. | For additional information on word splitting and argument handling in Bash, consider reading Arguments. | ||
Line 16: | Line 29: | ||
---- | ---- | ||
- | |||
===== What is Word Splitting? ===== | ===== What is Word Splitting? ===== | ||
Line 22: | Line 34: | ||
Let's write a little helper script that will show us the arguments as passed by the shell: | Let's write a little helper script that will show us the arguments as passed by the shell: | ||
- | <code bash> | + | <file bash test.sh> |
#!/bin/sh - | #!/bin/sh - | ||
printf "%d args:" " | printf "%d args:" " | ||
printf " < | printf " < | ||
echo | echo | ||
+ | </ | ||
+ | |||
+ | ---- | ||
+ | |||
+ | ==== Make it executable ==== | ||
+ | |||
+ | <code bash> | ||
+ | chmod +x test.sh | ||
</ | </ | ||
- | If you create a file named args with the above contents, make it executable with chmod +x args, and put it in one of the directories listed in echo " | + | ---- |
+ | |||
+ | ==== Run it ==== | ||
<code bash> | <code bash> | ||
- | args hello world "how are you?" | + | ./ |
</ | </ | ||
Line 41: | Line 63: | ||
</ | </ | ||
- | The ultimate result of most shell commands is to execute some program | + | <WRAP info> |
+ | **NOTE: | ||
- | Word splitting is part of the process that determines what each of those arguments will be -- after word splitting and pathname expansion, every resulting word becomes an argument to the program that the shell executes. | + | </ |
- | Our helper program above receives the argument list as constructed by the shell, and shows it to us. | + | ---- |
- | Word splitting | + | ===== Test when IFS is not SET ===== |
+ | |||
+ | If IFS is not set, then it will be performed as if IFS contained a space, a tab, and a newline. | ||
For example: | For example: | ||
Line 53: | Line 78: | ||
<code bash> | <code bash> | ||
var=" | var=" | ||
- | args $var | + | test.sh |
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
4 args: < | 4 args: < | ||
</ | </ | ||
---- | ---- | ||
+ | |||
+ | ===== Test when IFS is SET ===== | ||
An example using IFS: | An example using IFS: | ||
Line 63: | Line 95: | ||
<code bash> | <code bash> | ||
log=/ | log=/ | ||
- | args $log | + | test.sh |
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
args: <> <var> <log> < | args: <> <var> <log> < | ||
unset IFS | unset IFS | ||
Line 70: | Line 107: | ||
---- | ---- | ||
- | An example | + | ===== Test with Command Substitution ===== |
<code bash> | <code bash> | ||
ls -l | ls -l | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
total 2864 | total 2864 | ||
-rw-r--r-- 1 greg greg 2919154 2001-05-23 00:48 Yello - Oh Yeah.mp3 | -rw-r--r-- 1 greg greg 2919154 2001-05-23 00:48 Yello - Oh Yeah.mp3 | ||
+ | </ | ||
- | args $(ls -l) | + | Now run against the test script: |
+ | |||
+ | <code bash> | ||
+ | test.sh | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
11 args: < | 11 args: < | ||
</ | </ | ||
Line 85: | Line 136: | ||
===== Controlling Word Splitting ===== | ===== Controlling Word Splitting ===== | ||
- | As you can see above, we usually do not want to let word splitting occur when filenames are involved. | + | As you can see above, we usually do not want to let word splitting occur when filenames are involved. |
- | Double quoting an expansion suppresses word splitting, except in the special cases of " | + | Double quoting an expansion suppresses word splitting, except in the special cases of < |
+ | |||
+ | <code bash> | ||
+ | var=" | ||
+ | </ | ||
+ | |||
+ | returns: | ||
<code bash> | <code bash> | ||
- | var=" | ||
1 args: <This is a variable> | 1 args: <This is a variable> | ||
+ | </ | ||
- | array=(testing, | + | ---- |
+ | |||
+ | <code bash> | ||
+ | array=(testing, | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
3 args: < | 3 args: < | ||
</ | </ | ||
- | " | + | <WRAP info> |
+ | **NOTE: | ||
There are very complicated rules involving whitespace characters in IFS. Quoting the man page again: | There are very complicated rules involving whitespace characters in IFS. Quoting the man page again: | ||
Line 108: | Line 174: | ||
We won't explore those rules in depth here, except to note the part about sequences of non-whitespace characters. | We won't explore those rules in depth here, except to note the part about sequences of non-whitespace characters. | ||
+ | |||
+ | </ | ||
+ | |||
+ | ---- | ||
+ | |||
+ | ===== IFS with non-whitespace characters ===== | ||
If IFS contains non-whitespace characters, then empty words can be generated: | If IFS contains non-whitespace characters, then empty words can be generated: | ||
Line 113: | Line 185: | ||
<code bash> | <code bash> | ||
getent passwd sshd | getent passwd sshd | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
sshd: | sshd: | ||
+ | </ | ||
- | IFS=:; | + | ---- |
+ | |||
+ | Set IFS | ||
+ | |||
+ | <code bash> | ||
+ | IFS=:; | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
7 args: < | 7 args: < | ||
+ | </ | ||
+ | Unset IFS | ||
+ | |||
+ | <code bash> | ||
unset IFS | unset IFS | ||
</ | </ | ||
- | There was another empty word generated in one of our previous examples, where IFS was set to /. The observant reader will have noticed, therefore, that non-whitespace IFS characters are not ignored at the beginning and end of expansions, the way whitespace IFS characters are. | + | <WRAP info> |
+ | **NOTE: | ||
- | Whitespace | + | * non-whitespace |
- | Finally, we note that pathname | + | Whitespace IFS characters get consolidated. |
+ | |||
+ | * Multiple spaces in a row, for example, have the same effect as a single space, when IFS contains a space (or is not set at all). | ||
+ | * Newlines also count as whitespace for this purpose, which has important ramifications when attempting to load an array with lines of input. | ||
+ | |||
+ | </ | ||
+ | |||
+ | Pathname | ||
<code bash> | <code bash> | ||
Line 131: | Line 231: | ||
qmaild: | qmaild: | ||
- | IFS=:; | + | IFS=:; |
737 args: < | 737 args: < | ||
Line 137: | Line 237: | ||
</ | </ | ||
- | The * word, produced by the shell' | + | The **< |
+ | |||
+ | * This can be disastrous if it happens unexpectedly. | ||
+ | * As with most of the dangerous features of the shell, it is retained because " | ||
+ | * In fact, it can be used for good, if you're very careful: | ||
<code bash> | <code bash> | ||
files=' | files=' | ||
- | args $files | + | test.sh |
2 args: <Yello - Oh Yeah.mp3> | 2 args: <Yello - Oh Yeah.mp3> | ||
</ | </ | ||
- | Pathname expansion can be disabled with set -f or set -o noglob; though this can lead to surprising and confusing code. | + | <WRAP info> |
+ | **NOTE: | ||
+ | </ | ||
---- | ---- | ||
Line 153: | Line 260: | ||
* Word splitting is not performed on expansions inside Bash keywords such as [[ ... ]] and case. | * Word splitting is not performed on expansions inside Bash keywords such as [[ ... ]] and case. | ||
* Word splitting is not performed on expansions in assignments. Thus, one does not need to quote anything in a command like these: | * Word splitting is not performed on expansions in assignments. Thus, one does not need to quote anything in a command like these: | ||
- | |||
* foo=$bar | * foo=$bar | ||
- | |||
* bar=$(a command) | * bar=$(a command) | ||
- | |||
* logfile=$logdir/ | * logfile=$logdir/ | ||
- | |||
* PATH=/ | * PATH=/ | ||
- | |||
* In either case, quoting anyway will not break anything. So if in doubt, quote! | * In either case, quoting anyway will not break anything. So if in doubt, quote! | ||
- | |||
* When using the read command (which you generally don't want to use without -r), word splitting is performed on the input. | * When using the read command (which you generally don't want to use without -r), word splitting is performed on the input. | ||
bash/wordsplitting.1610205307.txt.gz · Last modified: 2021/01/09 15:15 by peter