Do parentheses really put the command in a subshell?
A subshell starts out as an almost identical copy of the original shell process. Under the hood, the shell calls the fork
system call1, which creates a new process whose code and memory are copies2. When the subshell is created, there are very few differences between it and its parent. In particular, they have the same variables. Even the $$
special variable keeps the same value in subshells: it's the original shell's process ID. Similarly $PPID
is the PID of the parent of the original shell.
A few shells change a few variables in the subshell. Bash sets BASHPID
to the PID of the shell process, which changes in subshells. Bash, zsh and mksh arrange for $RANDOM
to yield different values in the parent and in the subshell. But apart from built-in special cases like these, all variables have the same value in the subshell as in the original shell, the same export status, the same read-only status, etc. All function definitions, alias definitions, shell options and other settings are inherited as well.
A subshell created by (…)
has the same file descriptors as its creator. Some other means of creating subshells modify some file descriptors before executing user code; for example, the left-hand side of a pipe runs in a subshell3 with standard output connected to the pipe. The subshell also starts out with the same current directory, the same signal mask, etc. One of the few exceptions is that subshells do not inherit custom traps: ignored signals (trap '' SIGNAL
) remain ignored in the subshell, but other traps (trap CODE
SIGNAL) are reset to the default action4.
A subshell is thus different from executing a script. A script is a separate program. This separate program might coincidentally be also a script which is executed by the same interpreter as the parent, but this coincidence doesn't give the separate program any special visibility on internal data of the parent. Non-exported variables are internal data, so when the interpreter for the child shell script is executed, it doesn't see these variables. Exported variables, i.e. environment variables, are transmitted to executed programs.
Thus:
x=1
(echo $x)
prints 1
because the subshell is a replication of the shell that spawned it.
x=1
sh -c 'echo $x'
happens to run a shell as a child process of a shell, but the x
on the second line has no more connection with the x
on the second line than in
x=1
perl -le 'print $x'
or
x=1
python -c 'print x'
1 An exception is the ksh93
shell where the forking is optimised out and most of its side effects are emulated.
2 Semantically, they're copies. From an implementation perspective, there's a lot of sharing going on.
3 For the right-hand side, it depends on the shell.
4 If you test this out, note that things like $(trap)
may report the traps of the original shell. Note also that many shells have bugs in corner cases involving traps. For example ninjalj notes that as of bash 4.3, bash -x -c 'trap "echo ERR at \$BASH_SUBSHELL \$BASHPID" ERR; set -E; false; echo one subshell; (false); echo two subshells; ( (false) )'
runs the ERR
trap from the nested subshell in the “two subshells” case, but not the ERR
trap from the intermediate subshell — set -E
option should propagate the ERR
trap to all subshells but the intermediate subshell is optimized away and so isn't there to run its ERR
trap.
Obviously, yes, as all the documentation says, a parenthesized command is run in a subshell.
The subshell inherits a copy of all the parent's variables. The difference is that any changes you make in the subshell aren't also made in the parent.
The ksh man page makes this a little clearer than the bash one:
man ksh
:
A parenthesized command is executed in a sub-shell without removing non-exported variables.
man bash
:
(
list)
list is executed in a subshell environment (see COMMAND EXECUTION ENVIRONMENT below). Variable assignments and builtin commands that affect the shell's environment do not remain in effect after the command completes.
COMMAND EXECUTION ENVIRONMENT
The shell has an execution environment, which consists of the following: [...] shell parameters that are set by variable assignment [...].
Command substitution, commands grouped with parentheses, and asynchronous commands are invoked in a subshell environment that is a duplicate of the shell environment, [...]