bash:wordsplitting
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
bash:wordsplitting [2021/01/09 15:33] – [BASH - WordSplitting] peter | bash:wordsplitting [2021/01/09 15:51] (current) – [Controlling Word Splitting] peter | ||
---|---|---|---|
Line 4: | Line 4: | ||
Understanding how your original command will be transformed by the shell is of paramount importance in writing robust scripts. | Understanding how your original command will be transformed by the shell is of paramount importance in writing robust scripts. | ||
+ | |||
+ | <WRAP info> | ||
+ | **NOTE: | ||
+ | |||
+ | * **Word splitting** is part of the process that determines what each of those arguments will be. | ||
+ | * After **Word splitting** and **pathname expansion**, | ||
+ | |||
+ | Word splitting is performed on the results of almost all unquoted expansions. | ||
+ | |||
+ | * The result of the expansion is broken into separate words based on the characters of the IFS variable. | ||
+ | |||
+ | </ | ||
---- | ---- | ||
Line 15: | Line 27: | ||
For additional information on word splitting and argument handling in Bash, consider reading Arguments. | For additional information on word splitting and argument handling in Bash, consider reading Arguments. | ||
</ | </ | ||
- | |||
- | <WRAP info> | ||
- | **NOTE: | ||
- | |||
- | * **Word splitting** is part of the process that determines what each of those arguments will be. | ||
- | * After **Word splitting** and **pathname expansion**, | ||
- | |||
- | </ | ||
---- | ---- | ||
Line 66: | Line 70: | ||
---- | ---- | ||
- | ===== IFS ===== | + | ===== Test when IFS is not SET ===== |
- | + | ||
- | Word splitting is performed on the results of almost all unquoted expansions. | + | |
- | + | ||
- | The result of the expansion is broken into separate words based on the characters of the IFS variable. | + | |
If IFS is not set, then it will be performed as if IFS contained a space, a tab, and a newline. | If IFS is not set, then it will be performed as if IFS contained a space, a tab, and a newline. | ||
Line 78: | Line 78: | ||
<code bash> | <code bash> | ||
var=" | var=" | ||
- | args $var | + | test.sh |
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
4 args: < | 4 args: < | ||
</ | </ | ||
---- | ---- | ||
+ | |||
+ | ===== Test when IFS is SET ===== | ||
An example using IFS: | An example using IFS: | ||
Line 88: | Line 95: | ||
<code bash> | <code bash> | ||
log=/ | log=/ | ||
- | args $log | + | test.sh |
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
args: <> <var> <log> < | args: <> <var> <log> < | ||
unset IFS | unset IFS | ||
Line 95: | Line 107: | ||
---- | ---- | ||
- | An example | + | ===== Test with Command Substitution ===== |
<code bash> | <code bash> | ||
ls -l | ls -l | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
total 2864 | total 2864 | ||
-rw-r--r-- 1 greg greg 2919154 2001-05-23 00:48 Yello - Oh Yeah.mp3 | -rw-r--r-- 1 greg greg 2919154 2001-05-23 00:48 Yello - Oh Yeah.mp3 | ||
+ | </ | ||
- | args $(ls -l) | + | Now run against the test script: |
+ | |||
+ | <code bash> | ||
+ | test.sh | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
11 args: < | 11 args: < | ||
</ | </ | ||
Line 110: | Line 136: | ||
===== Controlling Word Splitting ===== | ===== Controlling Word Splitting ===== | ||
- | As you can see above, we usually do not want to let word splitting occur when filenames are involved. | + | As you can see above, we usually do not want to let word splitting occur when filenames are involved. |
- | Double quoting an expansion suppresses word splitting, except in the special cases of " | + | Double quoting an expansion suppresses word splitting, except in the special cases of < |
+ | |||
+ | <code bash> | ||
+ | var=" | ||
+ | </ | ||
+ | |||
+ | returns: | ||
<code bash> | <code bash> | ||
- | var=" | ||
1 args: <This is a variable> | 1 args: <This is a variable> | ||
+ | </ | ||
- | array=(testing, | + | ---- |
+ | |||
+ | <code bash> | ||
+ | array=(testing, | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
3 args: < | 3 args: < | ||
</ | </ | ||
- | " | + | <WRAP info> |
+ | **NOTE: | ||
There are very complicated rules involving whitespace characters in IFS. Quoting the man page again: | There are very complicated rules involving whitespace characters in IFS. Quoting the man page again: | ||
Line 133: | Line 174: | ||
We won't explore those rules in depth here, except to note the part about sequences of non-whitespace characters. | We won't explore those rules in depth here, except to note the part about sequences of non-whitespace characters. | ||
+ | |||
+ | </ | ||
+ | |||
+ | ---- | ||
+ | |||
+ | ===== IFS with non-whitespace characters ===== | ||
If IFS contains non-whitespace characters, then empty words can be generated: | If IFS contains non-whitespace characters, then empty words can be generated: | ||
Line 138: | Line 185: | ||
<code bash> | <code bash> | ||
getent passwd sshd | getent passwd sshd | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
sshd: | sshd: | ||
+ | </ | ||
- | IFS=:; | + | ---- |
+ | |||
+ | Set IFS | ||
+ | |||
+ | <code bash> | ||
+ | IFS=:; | ||
+ | </ | ||
+ | |||
+ | returns: | ||
+ | |||
+ | <code bash> | ||
7 args: < | 7 args: < | ||
+ | </ | ||
+ | Unset IFS | ||
+ | |||
+ | <code bash> | ||
unset IFS | unset IFS | ||
</ | </ | ||
- | There was another empty word generated in one of our previous examples, where IFS was set to /. The observant reader will have noticed, therefore, that non-whitespace IFS characters are not ignored at the beginning and end of expansions, the way whitespace IFS characters are. | + | <WRAP info> |
+ | **NOTE: | ||
- | Whitespace | + | * non-whitespace |
- | Finally, we note that pathname | + | Whitespace IFS characters get consolidated. |
+ | |||
+ | * Multiple spaces in a row, for example, have the same effect as a single space, when IFS contains a space (or is not set at all). | ||
+ | * Newlines also count as whitespace for this purpose, which has important ramifications when attempting to load an array with lines of input. | ||
+ | |||
+ | </ | ||
+ | |||
+ | Pathname | ||
<code bash> | <code bash> | ||
Line 156: | Line 231: | ||
qmaild: | qmaild: | ||
- | IFS=:; | + | IFS=:; |
737 args: < | 737 args: < | ||
Line 162: | Line 237: | ||
</ | </ | ||
- | The * word, produced by the shell' | + | The **< |
+ | |||
+ | * This can be disastrous if it happens unexpectedly. | ||
+ | * As with most of the dangerous features of the shell, it is retained because " | ||
+ | * In fact, it can be used for good, if you're very careful: | ||
<code bash> | <code bash> | ||
files=' | files=' | ||
- | args $files | + | test.sh |
2 args: <Yello - Oh Yeah.mp3> | 2 args: <Yello - Oh Yeah.mp3> | ||
</ | </ | ||
- | Pathname expansion can be disabled with set -f or set -o noglob; though this can lead to surprising and confusing code. | + | <WRAP info> |
+ | **NOTE: | ||
+ | </ | ||
---- | ---- |
bash/wordsplitting.1610206397.txt.gz · Last modified: 2021/01/09 15:33 by peter