If you're someone like me who learned coding in higher-level languages, like Python, R, and Java, bash can be quite strange. Luckily for you, bash has the same structures as other languages, albeit with different syntax. Note that this tutorial assumes you're familiar with basic bash commands for file manipulation.
This tutorial will cover the following:
- Bash script basics: Creating bash scripts, Variables, Backticks, User input, Arguments
- Control structures: For loops, If/else statements, While loops
- Intermediate: Reading a file line by line, Flags and options
Creating Bash Scripts
Bash scripts are simply a list of bash commands that the computer executes line by line, much like you use commands line by line. Create a bash script like any file.
nano myscript.sh
Let's add these commands
# Inside myscript.sh
echo Hello world!
touch newfile.txt
echo Hello new file >> newfile.txt
We can run bash scripts like so:
bash myscript.sh
# Hello world!
# Creates newfile.txt with 'Hello new file' inside
It's as simple as that! But we can make our scripts more complicated!
Variables
Variables hold information, like numbers and strings (which include file names and paths). Think of them as placeholders.
You can set values to variables as in the following. Note that variable names are case-sensitive
a=10
A=5
# Note: a does not equal A
file=/path/to/file.txt
path=/another/path
newFile=test.txt
You must ensure that there are no spaces between the variable name, the equals sign, and the value
# Good: a=10
# Bad: a = 10
You use variables in scripts by adding a dollar sign ($) at the front of the variable name
echo $a
# 10
echo $file
# /path/to/file.txt
Combine variables with strings
touch "$path"/created_file.txt
# Creates a new file called created_file.txt at /another/path
Or combine variables
mv $file "$path"/"$newFile"
# Saves /path/to/file.txt at /another/path/test.txt
In general, it is good practice to place quotes around your variable names to prevent them from blending into your strings, e.g. $varName
vs "$var"Name
may have different outputs.
Backticks (`)
Commands within backticks will be run before the main command
For example, you know that wc -l file
will output the number of lines within a given file, but if you are writing a script, you might not want the output to simply be a number.
You can do the following:
echo The number of lines is `wc -l file.txt` in file.txt
# The number of lines is 1303 in file.txt
Backticks also work for declaring variables
count=`grep -c "Apples" fruitList.txt`
echo The number of apples is $count
# The number of apples is 107
User input
Getting user input is actually quite easy. Use the read
command in front of the desired variable name, and you can use the input like a variable.
echo What is your name?
read name
echo Hello "$name"!
# What is your name
# $ Bailee
# Hello Bailee!
Note that this is single line only. Pressing enter will submit the input.
Arguments
You know how commands like ls
can take additional arguments such as ls newDir
to list the files within newDir
? You can make your script to accept arguments as well.
Arguments are denoted as $1
for the first parameter, $2
for the second, and so on. You use them like variables.
Let's say you want to make a script that counts the number of items in a given directory
# counter.sh
numItems=`ls $1 | wc -l`
echo The number of items in $1 is $numItems
You can run it using:
bash counter.sh newDir
# The number of items in newDir is 125
Note that flags (like ls -l -h
) will be covered below.
For Loops
We use for loops to perform the same set of commands for a particular list of things (such as a sequence of numbers or a list of files).
Let's loop over this list of numbers. We generate this list using seq
for x in `seq 1 5`
do
echo My number is $x
done
# My number is 1
# My number is 2
# My number is 3
# My number is 4
# My number is 5
You can use a list of strings separated by spaces such as egg milk flour
. If you have a string with spaces already, you must surround the string with quotes, such as "brown egg" milk "wheat flour"
for str in `"brown egg" milk "wheat flour"`
do
echo Don't forget to add the $str
done
# Don't forget to add the brown egg
# Don't forget to add the milk
# Don't forget to add the wheat flour
Output from commands, such as ls
, can be used in loops. Here you can output the first line within each file of a given directory
dir=$1
for file in `ls $dir`
do
echo File: $file
head -n 1
done
# File: dog_names.txt
# Spot
# File: cat_names.txt
# Whiskers
# File: fish_names.txt
# Goldie
If/Else Statements
At this moment, you're scripts are linear, meaning that A -> B -> C -> D. With if/else statements, you can create more complex scripts where A -> B and depending on B, you can go to C or D. Here is a simple if-else statement.
echo What is your name?
read name
if [[ $name == "Tom" ]]; then
echo Hello, you must be Tom
else
echo I don't know you
fi
# What is your name?
# > Tom
# Hello, you must be Tom
Like declaring variables, spacing matters. You must have a space between the condition and the surrounding brackets, or you will get an error.
You can also omit the else
statement if you don't need a command for that case.
For multiple if
conditions within the same stack, use elif
(i.e. else-if) following the initial if
. The else
statement can be omitted here as well, but if it is included, it must be at the end.
if [[ $name == "Tom" ]]; then
echo Hello, you must be Tom
elif [[ $name == "Brad" ]]; then
echo Brad, please, come in
else
echo I don't know you
fi
Here are some examples of condition statements that would be useful. There are plenty more if you search for them on the internet.
if [[ $name == "Tom" ]]; then # If $name equals Tom
if [[ $name != "Tom" ]]; then # If $name does not equal Tom
if [[ -d raw_files ]]; then # If directory raw_files exists and is a directory
if [[ -f testFile.txt ]]; then # If testFile.txt exists and is a file
if [[ ! -f testFile.txt ]]; then # If testFile.txt does NOT exist OR testFile.txt is NOT a file
if [[ $x -ge 10 ]]; then # If $x is Greater or Equal than 10
if [[ $n -lt 15 ]]; then # If $n is Less Than than 15
Combine multiple conditions into one if-statement using && (AND) and || (OR).
if [[ $name == "Tom" || $name == "Brad" ]]; then # If $name equals Tom OR $name equals Brad
if [[ $x -gt 10 && $x -lt 20 ]]; then # If $x is greater than 10 AND less than 20
Finally, you can put if-else statements with for loops. Let's say you ran an analysis on a batch different data files. Each analysis generates a separate results folder and a log inside that folder. You might want to read the logs of each analysis without having to go into each folder manually. You also want to note if the log file doesn't exist, suggesting that the analysis might have an error.
for dir in `ls Results`
do
echo $dir
if [[ -f Results/$dir/log.txt ]]; then
cat Results/$dir/log.txt
else
cat ERROR: NO LOG FOUND
fi
done | less
You can note the pipe after the for-loop statement. Like head
and sort
, output from loops can be shunted to other commands. Output from other commands can also be piped into loops.
While-loops
While-loops are similar to for-loops in that they repeat a set of commands. However, for-loops are finite. On the other hand, while-loops will repeat commands until their condition is met (while <condition is true>; do something; done
). Think of them as if-statements that repeat forever until the condition is not true.
secret=''
while [[ $secret != "there is no cost without sacrifice" ]]
do
echo "What is the password?"
read secret
done
echo "Welcome!"
Here, you are asked for a password. If you put in the wrong password, you will be asked again until it is correct. Then you will receive a warm welcome. Note that this is not a good way to code passwords because a user can simply view the file for the right phrase.
Reading a file line by line
While loops can be used to read files line by line. To read a file, you must pipe the output of a command such as cat
into the while loop. The format of the while loop should now be while read varName
, shown below.
# file.txt
TO DO LIST:
1) Eat
2) Clean
3) Walk the dog
cat file.txt | while read line; do
echo $line
done
# TO DO LIST:
# 1) Eat
# 2) Clean
# 3) Walk the dog
Flags and options
Adding flags and options can make your script not only more complex but also more user friendly, given that users don't need to input arguments in an exact order.
report=0
file=''
while getopts 'rf:' flag
do
case "$flag" in
r) report=1 ;;
f) file="$OPTARG" ;;
*) echo "Unknown option. Exiting"
exit ;;
esac
if [[ $report == "1" ]]; then
echo "Deleting" $file >> report.log
fi
rm $file
done
Let's go through this in chunks. The following below are your variables. They don't necessarily need to match your flags, but they are useful for holding information if a flag is used or if a flag has an argument
report=0
file=''
This is another type of while loop. Here, flag will be the variable, and you're going to use getopts
to loop through abf:
, setting $flag
as each flag.
Note that options followed by a colon (:) take their own arguments. Here the -f
flag will take a file argument while -a
and -b
don't need arguments
while getopts 'rf:' flag
This is a case statement. Think of it as a large if-else stack, where we check if $flag
matches any of the strings denoted as <string>)
. Each case must end with double semicolons (;;). Use $OPTARG
to get the argument of a flag. Finally *)
denotes the commands for erroneous options. If there is an error, we exit. Otherwise, the rest of the script is run normally.
case "$flag" in
r) report=1 ;;
f) file="$OPTARG" ;;
*) echo "Unknown option. Exiting"
exit ;;
esac
if [[ $report == "1" ]]; then
echo "Deleting" $file >> report.log
fi
rm $file
Conclusions
This tutorial should hopefully set you up to writing more complex bash scripts. I know this isn't everything, but thanks for reading, and let me know if there are any typos, errors, or things can be added.