gitlab-org--gitlab-foss/doc/development/shell_commands.md

# Guidelines for shell commands in the GitLab codebase

## References

- [Google Ruby Security Reviewer's Guide](https://code.google.com/p/ruby-security/wiki/Guide)
- [OWASP Command Injection](https://www.owasp.org/index.php/Command_Injection)
- [Ruby on Rails Security Guide Command Line Injection](http://guides.rubyonrails.org/security.html#command-line-injection)

## Use File and FileUtils instead of shell commands

Sometimes we invoke basic Unix commands via the shell when there is also a Ruby API for doing it. Use the Ruby API if it exists. <http://www.ruby-doc.org/stdlib-2.0.0/libdoc/fileutils/rdoc/FileUtils.html#module-FileUtils-label-Module+Functions>

```ruby
# Wrong
system "mkdir -p tmp/special/directory"
# Better (separate tokens)
system *%W(mkdir -p tmp/special/directory)
# Best (do not use a shell command)
FileUtils.mkdir_p "tmp/special/directory"

# Wrong
contents = `cat #{filename}`
# Correct
contents = File.read(filename)

# Sometimes a shell command is just the best solution. The example below has no
# user input, and is hard to implement correctly in Ruby: delete all files and
# directories older than 120 minutes under /some/path, but not /some/path
# itself.
Gitlab::Popen.popen(%W(find /some/path -not -path /some/path -mmin +120 -delete))
```

This coding style could have prevented CVE-2013-4490.

## Bypass the shell by splitting commands into separate tokens

When we pass shell commands as a single string to Ruby, Ruby will let `/bin/sh` evaluate the entire string. Essentially, we are asking the shell to evaluate a one-line script. This creates a risk for shell injection attacks. It is better to split the shell command into tokens ourselves. Sometimes we use the scripting capabilities of the shell to change the working directory or set environment variables. All of this can also be achieved securely straight from Ruby

```ruby
# Wrong
system "cd /home/git/gitlab && bundle exec rake db:#{something} RAILS_ENV=production"
# Correct
system({'RAILS_ENV' => 'production'}, *%W(bundle exec rake db:#{something}), chdir: '/home/git/gitlab')

# Wrong
system "touch #{myfile}"
# Better
system "touch", myfile
# Best (do not run a shell command at all)
FileUtils.touch myfile
```

This coding style could have prevented CVE-2013-4546.

## Separate options from arguments with --

Make the difference between options and arguments clear to the argument parsers of system commands with `--`. This is supported by many but not all Unix commands.

To understand what `--` does, consider the problem below.

```
# Example
$ echo hello > -l
$ cat -l
cat: illegal option -- l
usage: cat [-benstuv] [file ...]
```

In the example above, the argument parser of `cat` assumes that `-l` is an option. The solution in the example above is to make it clear to `cat` that `-l` is really an argument, not an option. Many Unix command line tools follow the convention of separating options from arguments with `--`.

```
# Example (continued)
$ cat -- -l
hello
```

In the GitLab codebase, we avoid the option/argument ambiguity by _always_ using `--`.

```ruby
# Wrong
system(*%W(git branch -d #{branch_name}))
# Correct
system(*%W(git branch -d -- #{branch_name}))
```

This coding style could have prevented CVE-2013-4582.

## Do not use the backticks

Capturing the output of shell commands with backticks reads nicely, but you are forced to pass the command as one string to the shell. We explained above that this is unsafe. In the main GitLab codebase, the solution is to use `Gitlab::Popen.popen` instead.

```ruby
# Wrong
logs = `cd #{repo_dir} && git log`
# Correct
logs, exit_status = Gitlab::Popen.popen(%W(git log), repo_dir)

# Wrong
user = `whoami`
# Correct
user, exit_status = Gitlab::Popen.popen(%W(whoami))
```

In other repositories, such as gitlab-shell you can also use `IO.popen`.

```ruby
# Safe IO.popen example
logs = IO.popen(%W(git log), chdir: repo_dir).read
```

Note that unlike `Gitlab::Popen.popen`, `IO.popen` does not capture standard error.
Add shell commands guide 2014-02-25 10:53:01 +00:00			`# Guidelines for shell commands in the GitLab codebase`

References for the issues the guide addresses. 2014-03-24 11:04:43 +00:00			`## References`

			`- [Google Ruby Security Reviewer's Guide](https://code.google.com/p/ruby-security/wiki/Guide)`
			`- [OWASP Command Injection](https://www.owasp.org/index.php/Command_Injection)`
Can deeplink after all. 2014-03-24 13:31:35 +00:00			`- [Ruby on Rails Security Guide Command Line Injection](http://guides.rubyonrails.org/security.html#command-line-injection)`
References for the issues the guide addresses. 2014-03-24 11:04:43 +00:00
Add shell commands guide 2014-02-25 10:53:01 +00:00			`## Use File and FileUtils instead of shell commands`

Update docs to markdown style guide. 2014-04-24 22:48:22 +00:00			`Sometimes we invoke basic Unix commands via the shell when there is also a Ruby API for doing it. Use the Ruby API if it exists. <http://www.ruby-doc.org/stdlib-2.0.0/libdoc/fileutils/rdoc/FileUtils.html#module-FileUtils-label-Module+Functions>`
Add shell commands guide 2014-02-25 10:53:01 +00:00
			```ruby
			`# Wrong`
			`system "mkdir -p tmp/special/directory"`
			`# Better (separate tokens)`
			`system *%W(mkdir -p tmp/special/directory)`
			`# Best (do not use a shell command)`
			`FileUtils.mkdir_p "tmp/special/directory"`

			`# Wrong`
			contents = `cat #{filename}`
			`# Correct`
			`contents = File.read(filename)`
Add a counterexample to 'do it in Ruby' 2014-10-02 16:27:18 +00:00
			`# Sometimes a shell command is just the best solution. The example below has no`
			`# user input, and is hard to implement correctly in Ruby: delete all files and`
			`# directories older than 120 minutes under /some/path, but not /some/path`
			`# itself.`
			`Gitlab::Popen.popen(%W(find /some/path -not -path /some/path -mmin +120 -delete))`
Add shell commands guide 2014-02-25 10:53:01 +00:00			```

			`This coding style could have prevented CVE-2013-4490.`

			`## Bypass the shell by splitting commands into separate tokens`

Update docs to markdown style guide. 2014-04-24 22:48:22 +00:00			When we pass shell commands as a single string to Ruby, Ruby will let `/bin/sh` evaluate the entire string. Essentially, we are asking the shell to evaluate a one-line script. This creates a risk for shell injection attacks. It is better to split the shell command into tokens ourselves. Sometimes we use the scripting capabilities of the shell to change the working directory or set environment variables. All of this can also be achieved securely straight from Ruby
Add shell commands guide 2014-02-25 10:53:01 +00:00
			```ruby
			`# Wrong`
			`system "cd /home/git/gitlab && bundle exec rake db:#{something} RAILS_ENV=production"`
			`# Correct`
			`system({'RAILS_ENV' => 'production'}, *%W(bundle exec rake db:#{something}), chdir: '/home/git/gitlab')`

			`# Wrong`
			`system "touch #{myfile}"`
			`# Better`
			`system "touch", myfile`
			`# Best (do not run a shell command at all)`
			`FileUtils.touch myfile`
			```

			`This coding style could have prevented CVE-2013-4546.`

			`## Separate options from arguments with --`

Update docs to markdown style guide. 2014-04-24 22:48:22 +00:00			Make the difference between options and arguments clear to the argument parsers of system commands with `--`. This is supported by many but not all Unix commands.
Add shell commands guide 2014-02-25 10:53:01 +00:00
			To understand what `--` does, consider the problem below.

			```
			`# Example`
			`$ echo hello > -l`
			`$ cat -l`
			`cat: illegal option -- l`
			`usage: cat [-benstuv] [file ...]`
			```

Update docs to markdown style guide. 2014-04-24 22:48:22 +00:00			In the example above, the argument parser of `cat` assumes that `-l` is an option. The solution in the example above is to make it clear to `cat` that `-l` is really an argument, not an option. Many Unix command line tools follow the convention of separating options from arguments with `--`.
Add shell commands guide 2014-02-25 10:53:01 +00:00
			```
			`# Example (continued)`
			`$ cat -- -l`
			`hello`
			```

			In the GitLab codebase, we avoid the option/argument ambiguity by _always_ using `--`.

			```ruby
			`# Wrong`
			`system(*%W(git branch -d #{branch_name}))`
			`# Correct`
			`system(*%W(git branch -d -- #{branch_name}))`
			```

			`This coding style could have prevented CVE-2013-4582.`

			`## Do not use the backticks`

Update docs to markdown style guide. 2014-04-24 22:48:22 +00:00			Capturing the output of shell commands with backticks reads nicely, but you are forced to pass the command as one string to the shell. We explained above that this is unsafe. In the main GitLab codebase, the solution is to use `Gitlab::Popen.popen` instead.
Add shell commands guide 2014-02-25 10:53:01 +00:00
			```ruby
			`# Wrong`
			logs = `cd #{repo_dir} && git log`
			`# Correct`
			`logs, exit_status = Gitlab::Popen.popen(%W(git log), repo_dir)`

			`# Wrong`
			user = `whoami`
			`# Correct`
			`user, exit_status = Gitlab::Popen.popen(%W(whoami))`
			```

			In other repositories, such as gitlab-shell you can also use `IO.popen`.

			```ruby
			`# Safe IO.popen example`
			`logs = IO.popen(%W(git log), chdir: repo_dir).read`
			```

			Note that unlike `Gitlab::Popen.popen`, `IO.popen` does not capture standard error.