Difference between revisions of "Linux"

From CSE330 Wiki
Jump to navigationJump to search
 
(30 intermediate revisions by 6 users not shown)
Line 7: Line 7:
 
The open-source community is responsible for the development of many different [[wikipedia:Linux distributions|distributions]] of Linux.  Distributions, or ''distros'', are different "flavors" of the Linux operating system with different objectives.
 
The open-source community is responsible for the development of many different [[wikipedia:Linux distributions|distributions]] of Linux.  Distributions, or ''distros'', are different "flavors" of the Linux operating system with different objectives.
  
There are hundreds of distributions of Linux.  Three of the main banches are [[wikipedia:Debian|Debian]], [[wikipedia:SuSE|SuSE]] (based on [[wikipedia:Slackware|Slackware]]), and [[wikipedia:Red Hat Enterprise Linux|Red Hat Enterprise Linux (RHEL)]].
+
There are hundreds of distributions of Linux.  Three of the main branches are [[wikipedia:Debian|Debian]], [[wikipedia:SuSE|SuSE]] (based on [[wikipedia:Slackware|Slackware]]), and [[wikipedia:Red Hat Enterprise Linux|Red Hat Enterprise Linux (RHEL)]].
  
The sections below discuss two of these branches and give recommendations of Linux distributions suitable for a web server.  The CSE330 wiki provides instructions for both Debian-based and RHEL-based Linux distributions, so the choice of which to use is up to you.
+
For CSE 330, you'll be using a version of Red Hat Enterprise Linux.  
 
 
=== Debian ===
 
 
 
'''Debian''' was first introduced in 1993.  Debian has a passionate following, and its repositories contain more software packages than any other mainstream Linux distribution.  There are hundreds of derivatives of Debian.  Debian's most popular desktop derivative is '''[[wikipedia:Ubuntu|Ubuntu]]'''.
 
 
 
Base Debian has a slow upgrade schedule, and because of this, it is an extremely stable operating system, making it well-suited for servers.  '''[[wikipedia:Ubuntu Server|Ubuntu Server]]''' is also an excellent choice for a Debian-based web server.
 
  
 
=== Red Hat Enterprise Linux (RHEL) ===
 
=== Red Hat Enterprise Linux (RHEL) ===
Line 24: Line 18:
  
 
=== Special Note: Linux Kernel and Modules ===
 
=== Special Note: Linux Kernel and Modules ===
 +
{{XKCD
 +
|id=619
 +
|name=supported_features
 +
}}
  
 
What separates Linux from other Unix variants is its kernel. The kernel is the most important component of the operating system and is responsible for scheduling processes, providing access to the hardware devices, allocating memory to the programs, and so on.
 
What separates Linux from other Unix variants is its kernel. The kernel is the most important component of the operating system and is responsible for scheduling processes, providing access to the hardware devices, allocating memory to the programs, and so on.
  
 
The Linux kernel uses both monolithic and modular approaches. A monolithic kernel is a single program that contains all the code so any addition to kernel (such as code to access a driver) requires recompiling the code. A monolithic kernel is usually a little faster and could have a smaller size since only the absolutely necessary code is there. The modular kernel, on the other hand, enables dynamic loading and unloading  of kernel code, called ''modules''. Typical modules include device drivers. Thanks to this modular approach, Linux seldom requires a reboot after installing a new device.
 
The Linux kernel uses both monolithic and modular approaches. A monolithic kernel is a single program that contains all the code so any addition to kernel (such as code to access a driver) requires recompiling the code. A monolithic kernel is usually a little faster and could have a smaller size since only the absolutely necessary code is there. The modular kernel, on the other hand, enables dynamic loading and unloading  of kernel code, called ''modules''. Typical modules include device drivers. Thanks to this modular approach, Linux seldom requires a reboot after installing a new device.
 +
 +
== Bash ==
 +
 +
In this course, you will be interacting with Linux primarily on the command line.  The most widespread ''command line language'' is '''Bash'''.  This is described in more detail in [[Bash|the Bash guide]].
  
 
== Files and Permissions ==
 
== Files and Permissions ==
Line 37: Line 39:
 
The root directory of Linux contains a dozen or so subdirectories, each with a specific purpose:
 
The root directory of Linux contains a dozen or so subdirectories, each with a specific purpose:
  
* '''/bin''' contains [[#Running Programs|binaries]] used by all users
+
* '''/bin''' contains [[Bash#Running Programs|binaries]] used by all users
 
* '''/sbin''' contains system binaries typically used only by the system administrator
 
* '''/sbin''' contains system binaries typically used only by the system administrator
 
* '''/lib''' contains libraries for the binaries found in '''/bin''' and '''/sbin'''
 
* '''/lib''' contains libraries for the binaries found in '''/bin''' and '''/sbin'''
Line 126: Line 128:
 
</source>
 
</source>
  
For more information, see http://www.tuxfiles.org/linuxhelp/filepermissions.html
+
For more information, see https://www.linux.com/training-tutorials/how-manage-file-and-folder-permissions-linux/
  
 
=== The . and .. Directories ===
 
=== The . and .. Directories ===
Line 152: Line 154:
  
 
<source lang="bash">
 
<source lang="bash">
$ sudo useradd -r -m -c "My Full Name" myUserName
+
$ sudo useradd -r -m -c "<My Full Name>" <myUserName>
$ sudo passwd myUserName
+
$ sudo passwd <myUserName>
 
Enter new UNIX password:  
 
Enter new UNIX password:  
 
Retype new UNIX password:  
 
Retype new UNIX password:  
Line 160: Line 162:
  
 
'''Note:''' If Linux doesn't like your command, first check your syntax and then try it with single quotes around your full name and password instead of double quotes. If this works it is likely that you used some special characters in your password such as '!', which serves a special purpose in the shell. This is because Linux takes parameters enclosed in double quotes as one string, but still looks for special characters inside of the string and acts in accordance with their normal meaning. On the other hand, single quotes tell the shell to ignore any special characters within.
 
'''Note:''' If Linux doesn't like your command, first check your syntax and then try it with single quotes around your full name and password instead of double quotes. If this works it is likely that you used some special characters in your password such as '!', which serves a special purpose in the shell. This is because Linux takes parameters enclosed in double quotes as one string, but still looks for special characters inside of the string and acts in accordance with their normal meaning. On the other hand, single quotes tell the shell to ignore any special characters within.
 
'''IMPORTANT:''' If you are using Ubuntu Server, new users are created with /bin/sh as their default shell.  You want to use /bin/bash.  To change the shell for your user, run this command:
 
 
<source lang="bash">
 
$ sudo chsh -s /bin/bash myUserName  # changes shell for myUserName to /bin/bash
 
$
 
</source>
 
  
 
=== Adding a User to the Sudoers List ===
 
=== Adding a User to the Sudoers List ===
Line 182: Line 177:
 
When you are finished, save and close the file.  For more information on editing files on the command line, refer to [[#File Editors|the section about File Editors in this guide]].
 
When you are finished, save and close the file.  For more information on editing files on the command line, refer to [[#File Editors|the section about File Editors in this guide]].
  
In this case, '''alice''' will be able to run any command with SUDO privileges on the computer.  For more detail on SUDO configuration, see http://www.linuxhelp.net/guides/sudo/
+
<!-- AS of 02/19/2017, this link isn't available in HTTPS. -->
 +
In this case, '''alice''' will be able to run any command with SUDO privileges on the computer.   
  
 
'''Note:''' The visudo utility is especially useful because it not only locks the sudoers file against simultaneous edits, but it also checks the file for syntax errors when you quit. Therefore, it is always recommended that you use visudo to change the sudoers list rather than attempting to modify it directly.
 
'''Note:''' The visudo utility is especially useful because it not only locks the sudoers file against simultaneous edits, but it also checks the file for syntax errors when you quit. Therefore, it is always recommended that you use visudo to change the sudoers list rather than attempting to modify it directly.
 
== Bash ==
 
 
[[wikipedia:Bash|Bash]] is the default shell environment in Linux; that is, it is the interface in which you will be interacting with your Linux server.  Bash is a derivative of ''sh'', one of the first shells.  Other popular shells include ''csh'' and ''tcsh'', shells with c-like syntax for scripting, and ''zsh'' a bash-like shell which focuses on extending the capabilities of the shell environment.
 
 
=== Displaying a Value ===
 
 
To display a value at the shell prompt, use the command '''echo'''.
 
 
<source lang="bash">
 
$ echo "Hello World" # displays Hello World
 
Hello World
 
$
 
</source>
 
 
Note: In examples, code written at the prompt is conventionally denoted by a line starting with a currency symbol.  Lines without a currency symbol represent output.
 
 
==== Seeing the contents of a file ====
 
 
If you want to see the contents of a file, use the '''cat''' command.
 
 
<source lang="bash">
 
$ cat myfile.txt
 
Hello World
 
$
 
</source>
 
 
'''cat''' is one of a number of useful Linux command-line binaries, the rest of which we will see later.
 
 
=== Working Directory ===
 
 
Whenever you are interacting with the shell, you will be executing commands from a ''working directory''.  To see the current working directory, run the command '''pwd''' ('''p'''ath to '''w'''orking '''d'''irectory).  To change the working directory, run the command '''cd''' ('''c'''hange '''d'''irectory).
 
 
<source lang="bash">
 
$ pwd
 
/home/todd
 
$ cd projects
 
$ pwd
 
/home/todd/projects
 
$ cd ./  # recall that . is the current directory
 
$ pwd
 
/home/todd/projects
 
$ cd ../  # recall that .. is the next directory up in the filesystem
 
$ pwd
 
/home/todd
 
$
 
</source>
 
 
If you run commands that interact with the filesystem (e.g. ones that create or edit files), they will be saved in your current working directory.
 
 
=== Variables ===
 
 
Bash supports the use of variables.  There are system-defined variables, and you can also define your own custom variables.
 
 
==== Defining and Accessing Variables ====
 
 
<source lang="bash">
 
$ MYVARIABLE="Hello World"    # assigns the value Hello World to the variable MYVARIABLE
 
$ echo $MYVARIABLE    # notice that you need to put a currency symbol in front of the variable in order to access its value
 
Hello World
 
$ export $MYVARIABLE    # allows MYVARIABLE to be accessed in child processes (e.g., in a program you call from the shell)
 
$ export MYVARIABLE="Hello Moon"    # a shortcut for defining a variable and exporting it to subprocesses
 
$ set    # displays a list of all currently set variables
 
MYVARIABLE=Hello World
 
$
 
</source>
 
 
==== System Variables ====
 
 
Bash comes pre-loaded with certain environment variables.  Some of the variables with which you may find yourself interacting include:
 
 
* '''PATH''': search path for the commands
 
* '''PWD''': name of the current directory
 
* '''SHELL''': type of shell
 
* '''TERM''': type of the terminal
 
* '''USER''': the account name
 
* '''HOME''': the user's home directory
 
* '''PS1''': the prompt at command line
 
* '''$$''': the process id of current shell
 
* '''$RANDOM''': a random value
 
* '''$?''': the return value of the last command
 
* '''$_''': the last argument of the previous command
 
* '''$#''': where # is a number, the value of the #th argument
 
* '''IFS''': input field separator
 
 
Try echoing some of the system variables to examine your current environment.
 
 
=== Running Programs ===
 
 
To run an executable file, simply enter its filename into the shell prompt:
 
 
<source lang="bash">
 
$ /usr/bin/perl -v  # runs the binary executable located at /usr/bin/perl with the flag -v
 
This is perl 5, version 12, subversion 3 (v5.12.3)
 
$ ../mydir/myprogram  # runs the binary located one level up in the file system, then in mydir/myprogram
 
You just ran myprogram!
 
$
 
</source>
 
 
==== Programs in your PATH ====
 
 
Many commonly-used executable binaries are located in /bin, /usr/bin, and similar directories.  In order to avoid typing paths to these directories every time you want to execute a command, you define these directories in your PATH system variable:
 
 
<source lang="bash">
 
$ echo $PATH  # displays the current value of the PATH variable
 
/opt/local/bin:/opt/local/sbin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin
 
$ PATH=$PATH:/my/favorite/bin  # adds a directory to your PATH variable
 
$ echo $PATH
 
/opt/local/bin:/opt/local/sbin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/my/favorite/bin
 
$
 
</source>
 
 
Notice that the different PATH directories are separated by colons.  Now, when you execute a command, Bash will scan all of the directories in your PATH variable.  To see the path to the binary that Bash found, use the '''which''' command.
 
 
<source lang="bash">
 
$ perl -v
 
This is perl 5, version 12, subversion 3 (v5.12.3)
 
$ which perl
 
/usr/bin/perl
 
$
 
</source>
 
 
Note: it is unwise to have '''.''' in your PATH.  Instead, if you want to run an executable in the current directory, do so by calling ./myprogram:
 
 
<source lang="bash">
 
$ ./myprogram
 
You just ran myprogram!
 
$ myprogram
 
-bash: myprogram: command not found
 
$
 
</source>
 
 
==== Foreground and Background Processes ====
 
 
A program runs in the foreground (unless it detaches itself from the terminal) by default. You can run a program in the background by adding '''&''' at the end of the command (after arguments). In this case, the shell would fork a process for that program and enable the command prompt back for input. At any time, '''jobs''' command can be used to see the processes running at the background. '''fg''' command brings the specified process back to foreground. A program running in the background can be stopped by typing ''ctrl-c'' in most cases. Typing ''ctrl-z'' interrupts a program running in the foreground. If a program is interrupted, it will not continue executing until it is resumed. An interrupted program can be brought back to foreground by '''fg''', or it could be send to background by '''bg'''.
 
 
<source lang="bash">
 
$ ./myprogram
 
You just ran myprogram!
 
I'm taking a long time to run.
 
^C
 
$ jobs
 
$ ./myprogram
 
You just ran myprogram!
 
I'm taking a long time to run.
 
^Z
 
[1]+  Stopped                ./myprogram
 
$ jobs
 
[1]+  Stopped                ./myprogram
 
$ bg
 
[1]+ ./myprogram &
 
$ jobs
 
[1]+  Running                ./myprogram &
 
$ fg
 
^C
 
$ jobs
 
$ ./myprogram &
 
[1] 64741
 
$ jobs
 
[1]+  Running                ./myprogram &
 
$
 
</source>
 
 
==== Killing Processes ====
 
 
A process can be killed by using the '''kill''' command: <code>kill process-number</code>
 
 
In some cases the kill signal can be ignored, so it may be necessary to force kill the program by sending an absolute KILL signal: <code>kill -9 process-number</code>
 
 
The current processes can be listed using the '''ps''' command.
 
 
<source lang="bash">
 
$ ps  # list currently running processes in the current shell
 
  PID TTY          TIME CMD
 
19107 ttys000    0:00.75 -bash
 
1873 ttys001    0:00.05 -bash
 
57267 ttys002    0:00.20 -bash
 
50721 ttys003    0:00.55 -bash
 
$ ps -eaf  # list all currently running processes
 
  UID  PID  PPID  C STIME  TTY          TIME CMD
 
    0    1    0  0 31Dec00 ??        3:24.45 /sbin/launchd
 
    0 19106  327  0  1Aug12 ttys000    0:00.03 login -pfl sffc /bin/bash -c exec -la bash /bin/bash
 
  501 19107 19106  0  1Aug12 ttys000    0:00.75 -bash
 
    0  1872  327  0 31Jul12 ttys001    0:00.02 login -pfl sffc /bin/bash -c exec -la bash /bin/bash
 
  501  1873  1872  0 31Jul12 ttys001    0:00.05 -bash
 
    0 57266  327  0 Mon05AM ttys002    0:00.08 login -pfl sffc /bin/bash -c exec -la bash /bin/bash
 
  501 57267 57266  0 Mon05AM ttys002    0:00.20 -bash
 
    0 64747 57267  0  9:58AM ttys002    0:00.00 ps -eaf
 
    0 50720  327  0 Fri12AM ttys003    0:00.03 login -pfl sffc /bin/bash -c exec -la bash /bin/bash
 
  501 50721 50720  0 Fri12AM ttys003    0:00.55 -bash
 
$
 
</source>
 
 
==== Directing Output ====
 
 
A program's standard output can be send to a file by typing ''>filename'' at the end. Similarly, ''>>'' appends to a file. In Linux, there are three default file handlers, ''standard input'' or ''STDIN'', ''standard output'' or ''STDOUT'', and ''standard error'' or ''STDERR''. STDOUT has a file handler number 1 and STDERR has a number of 2. In bash, you can direct either of these handlers to a file.  You can also redirect one file handler to another.
 
 
<source lang="bash">
 
$ ./myprogram >filename.txt  # redirects all output to filename.txt
 
$ cat filename.txt
 
You just ran myprogram!
 
$ ./myprogram >>filename.txt  # appends the output to filename.txt
 
$ cat filename.txt
 
You just ran myprogram!
 
You just ran myprogram!
 
$ ./myprogram 1>filename.txt  # redirects the standard output to filename.txt
 
$ cat filename.txt
 
You just ran myprogram!
 
$ ./myprogram 2>filename.txt  #redirects the error output to filename.txt
 
You just ran myprogram!
 
$ ./myprogram 2>&1  # STDERR is redirected to STDOUT
 
You just ran myprogram!
 
$
 
</source>
 
 
Output of one program can be redirected to the input of another program using pipes.
 
 
<source lang="bash">
 
$ ./program1 | ./program2  # send program1's output as an input to program2
 
You just ran program2 with the input: You just ran program1!
 
$
 
</source>
 
 
Redirection is possible for STDIN too. A program can get its input by redirecting STDIN using '''<'''
 
 
<source lang="bash">
 
$ ./myprogram < inputfile.txt
 
You just ran myprogram with input from inputfile.txt!
 
$
 
</source>
 
 
Finally, '''`''' (a backtick) can be used to capture the output of a program, and use it as a string such as in setting a variable
 
 
<source lang="bash">
 
$ MYVARIABLE=`./myprogram`
 
$ echo $MYVARIABLE
 
You just ran myprogram!
 
</source>
 
 
==== SUDO ====
 
 
Some commands require root privileges to run.  In order to run a command as root without logging in as root, use '''sudo'''.
 
 
<source lang="bash">
 
$ yum install lynx
 
You need to be root to perform this command.
 
$ sudo yum install lynx
 
[sudo] password:
 
.....
 
Complete!
 
$
 
</source>
 
 
==== Automatically Running Programs ====
 
 
You will often find it useful for binaries to be executed at predefined intervals, certain days of the week, or at startup.  Linux provides you with the tools you need to make these configurations.
 
 
===== Scheduled Programs in Cron =====
 
 
''Cron'' is a system service that will run programs in a periodic manner.  For more details on how to configure cron, see [[Cron|the Cron guide]].
 
 
===== Programs at Startup =====
 
 
When a Linux system boots there are a series of scripts that are called to start up system processes, daemons, and other programs (such as SSH servers, web servers, database programs, etc).  The simplest way to add something to the boot process is to add it to ''/etc/rc.local'', which is a script that is called automatically at the very end of the boot process.  Simply write a script that does what you want and then call it from with in ''/etc/rc.local'' to ensure that your script is called at the end of the boot process.
 
 
You can also add scripts which run at different times during the boot process.  The way to do this varies by Linux distribution.  For Fedora, see  http://www.yolinux.com/TUTORIALS/LinuxTutorialInitProcess.html (specifically the section entitled ''Init Script Activation'').
 
 
=== Shell Scripting ===
 
 
Programs can be scripted using Bash.  For more information, see [[Shell Scripting]].
 
  
 
== Networking ==
 
== Networking ==
Line 468: Line 194:
 
In order to avoid setting your system's time manually at every daylight savings change, you can use a Network Time Server via the Network Time Protocol (NTP).
 
In order to avoid setting your system's time manually at every daylight savings change, you can use a Network Time Server via the Network Time Protocol (NTP).
  
The NTP Daemon comes pre-installed on EC2 AMI instances. To install it on Debian, install the '''ntp''' package from apt.
+
The NTP Daemon comes pre-installed on EC2 AMI instances.
 +
 
 +
{{RequiredInstructions|content=
  
 
=== Setting Timezone ===
 
=== Setting Timezone ===
Line 483: Line 211:
  
 
''ntp'' uses ''/etc/ntp.conf'' configuration file to find the ostnames of remote time servers.  The defaults here are probably fine.
 
''ntp'' uses ''/etc/ntp.conf'' configuration file to find the ostnames of remote time servers.  The defaults here are probably fine.
 +
 +
}}
  
 
== Installing Software ==
 
== Installing Software ==
  
The package management tool in Red Hat Enterprise Linux (and therefore also your Amazon EC2 instance) is '''rpm'''. (In Debian, it is '''dpkg'''.)  If you have an ''rpm'' package, you can install it by
+
The package management tool in Red Hat Enterprise Linux (and therefore also your Amazon EC2 instance) is '''rpm'''.   If you have an ''rpm'' package, you can install it by
  
 
<source lang="bash">
 
<source lang="bash">
Line 497: Line 227:
 
=== Repository-Based Package Managers ===
 
=== Repository-Based Package Managers ===
  
A better alternative is to use a repository-based package manager.  In RHEL, this is '''yum'''; in Debian, it is '''apt'''.
+
A better alternative is to use a repository-based package manager.  In RHEL, this is '''yum'''.
 +
 
 +
Before you install new software, you need to ensure that your local list of available packages is up-to-date. Run the following commands to perform this operation:
  
Before you install new software, you need to ensure that your local list of available packages is up-to-date.  Run one of the following commands to perform this operation:
+
<source lang="bash>
 +
yum check-update
 +
</source>
  
* In RHEL: '''yum check-update'''
+
After you have ensured that your package list is synced with the remote repository, you can start installing packages.  To install a package, the following command:
* In Debian: '''apt-get update'''
 
  
After you have ensured that your package list is synced with the remote repository, you can start installing packages.  To install a package, use one of the following commands:
+
<source lang="bash>
 +
yum install <package-name>
 +
</source>
  
* In RHEL: '''yum install package-name'''
+
If you get a permission denied error, try sudoing the command, like '''sudo apt-get install <xxxx>'''
* In Debian: '''apt-get install package-name'''
 
  
 
For example, this is how you would install ''lynx'', a command-line web browser, in your RHEL Linux distribution:
 
For example, this is how you would install ''lynx'', a command-line web browser, in your RHEL Linux distribution:
Line 548: Line 282:
  
 
The list of repositories that '''yum''' searches is located at <code>/etc/yum.conf</code>.  The list of repositories that '''apt''' searches is located at <code>/etc/apt/sources.list</code>.
 
The list of repositories that '''yum''' searches is located at <code>/etc/yum.conf</code>.  The list of repositories that '''apt''' searches is located at <code>/etc/apt/sources.list</code>.
 +
 +
{{RequiredInstructions|content=
 +
 +
=== Essential Packages ===
 +
 +
To save you some headaches later on, it is recommended that you install a few essential package bundles when you first create a new Linux instance.  These packages include things like Make and a C compiler.
 +
 +
'''RHEL:'''
 +
 +
<source lang="bash">
 +
$ sudo yum groupinstall "Development Tools"
 +
$ sudo yum install kernel-devel kernel-headers
 +
</source>
 +
 +
}}
 +
 +
=== Searching for Packages ===
 +
 +
You can search for packages by name, command, and so on.  For example, to search for all packages relating to "lynx", you could do:
 +
 +
 +
'''RHEL:'''
 +
<source lang="bash">
 +
$ yum search lynx
 +
</source>
  
 
== Command Reference ==
 
== Command Reference ==
 
 
Earlier, you saw that '''cat''' is a command that shows the contents of a file.  Below is a list of other useful commands in Linux.
 
Earlier, you saw that '''cat''' is a command that shows the contents of a file.  Below is a list of other useful commands in Linux.
  
=== Navigation and FIle Management ===
+
=== Navigation and File Management ===
  
 
* '''ls''' List file(s) in current working directory
 
* '''ls''' List file(s) in current working directory
Line 605: Line 363:
 
* '''lsof''' Open files in the system
 
* '''lsof''' Open files in the system
 
* '''netstat''' Statistics related to open sockets
 
* '''netstat''' Statistics related to open sockets
 +
* '''vmstat''' Reports information about processes, memory, IO, and CPU
  
 
=== File Editors ===
 
=== File Editors ===
 +
{{XKCD
 +
|name=real_programmers
 +
|id=378
 +
}}
  
 
It is sometimes convenient to edit files using the command line.  Three widely-used command line text editors are '''vi''', '''emacs''', and '''nano'''.
 
It is sometimes convenient to edit files using the command line.  Three widely-used command line text editors are '''vi''', '''emacs''', and '''nano'''.
Line 667: Line 430:
 
To save and then close your file, type the command ''':wq'''.  (To close the file without saving, use the command ''':q!''')
 
To save and then close your file, type the command ''':wq'''.  (To close the file without saving, use the command ''':q!''')
  
For more vi commands, see http://ss64.com/vi.html
+
 
 +
For more vi commands, see https://ss64.com/vi.html
  
 
==== Emacs ====
 
==== Emacs ====
Line 740: Line 504:
 
[http://en.wikipedia.org/wiki/Linux_kernel Linux Kernel]
 
[http://en.wikipedia.org/wiki/Linux_kernel Linux Kernel]
  
[http://www.eng.hawaii.edu/Tutor/vi.html VI Tutorial]
+
[http://www.yolinux.com/TUTORIALS/LinuxTutorialAdvanced_vi.html VI Tutorial]
  
 
[http://students.cec.wustl.edu/~jg18/guide/ Making the Transition to Linux: A Guide to the Linux Command Line Interface for Students]
 
[http://students.cec.wustl.edu/~jg18/guide/ Making the Transition to Linux: A Guide to the Linux Command Line Interface for Students]
  
[[Category:Module 1]]
+
 
 +
[[Category:Module 2]]

Latest revision as of 04:07, 23 January 2024

Linux is an open-source operating system based on UNIX. Linux is highly versatile and is used in a wide range of applications. Desktop Linux is Linux with a GUI (like Microsoft Windows or Mac OS X); Desktop Linux is popular in niche markets, and it is used widely in developing countries.

Linux is the most widely used operating system for web servers. In CSE330, we will be interacting with Linux from the command line. This article covers the tools you need to make the best use of Linux.

Linux Distributions

The open-source community is responsible for the development of many different distributions of Linux. Distributions, or distros, are different "flavors" of the Linux operating system with different objectives.

There are hundreds of distributions of Linux. Three of the main branches are Debian, SuSE (based on Slackware), and Red Hat Enterprise Linux (RHEL).

For CSE 330, you'll be using a version of Red Hat Enterprise Linux.

Red Hat Enterprise Linux (RHEL)

Red Hat Enterprise Linux, or RHEL, was first introduced in 1994. RHEL is known for being a good choice for enterprises that wish to use Linux as their primary OS. RHEL has an abundance of administration tools.

The Linux Lab in Lopata Hall uses Fedora Linux, a desktop distribution based on RHEL. CentOS is a popular RHEL derivative that is widely used in web servers. The Amazon EC2 Linux AMI is derived from CentOS, making it a distribution in the RHEL family.

Special Note: Linux Kernel and Modules

XKCD Comic: supported_features

What separates Linux from other Unix variants is its kernel. The kernel is the most important component of the operating system and is responsible for scheduling processes, providing access to the hardware devices, allocating memory to the programs, and so on.

The Linux kernel uses both monolithic and modular approaches. A monolithic kernel is a single program that contains all the code so any addition to kernel (such as code to access a driver) requires recompiling the code. A monolithic kernel is usually a little faster and could have a smaller size since only the absolutely necessary code is there. The modular kernel, on the other hand, enables dynamic loading and unloading of kernel code, called modules. Typical modules include device drivers. Thanks to this modular approach, Linux seldom requires a reboot after installing a new device.

Bash

In this course, you will be interacting with Linux primarily on the command line. The most widespread command line language is Bash. This is described in more detail in the Bash guide.

Files and Permissions

At the core of a Unix-based operating system is a directory structure with files and permissions.

Filesystem Hierarchy

The root directory of Linux contains a dozen or so subdirectories, each with a specific purpose:

  • /bin contains binaries used by all users
  • /sbin contains system binaries typically used only by the system administrator
  • /lib contains libraries for the binaries found in /bin and /sbin
  • /etc contains configuration files
    • /etc/yum.conf Configuration file for yum
    • /etc/yum/yum.repos.d Directory containing .repo files for online repositories
    • /etc/crontab System-wide crontab file
    • /etc/fstab Information about default partitions to be mounted
    • /etc/group List of groups in the system
    • /etc/hosts List of IP addresses with their names
    • /etc/inittab What to do at each run-level
    • /etc/inetd.conf Configuration file for some internet services (replaced by xinetd.* in most systems)
    • /etc/modules.conf Module information for the boot
    • /etc/motd Message to be seen at the login prompt
    • /etc/passwd User information
    • /etc/profile System level initial file for sh and its derivatives
    • /etc/shadow User passwords
  • /dev contains device files
  • /proc contains information on currently running processes
  • /var contains files whose contents is expected to change
    • /var/log contains system log files
      • /var/log/messages System/Kernel messages
      • /var/log/syslog System log (mostly for Daemons)
      • /var/log/wtmp' User access log (binary)
      • /var/log/dmesg Boot-up messages
      • /var/log/auth.log Authorization logs
    • /var/lib contains packages and database files
    • /var/spool contains print queues
  • /tmp contains temporary files that are deleted at system reboot
  • /usr contains user programs
    • /usr/bin contains binaries for user programs
    • /usr/sbin contains binaries for system administrators
    • /usr/lib contains libraries for /usr/bin and /usr/sbin
    • /usr/local contains programs that you install from source
  • /home contains users' home directories
  • /root is root's home directory
  • /boot contains boot loader files (do not touch unless you know what you are doing!)
  • /opt contains optional add-on applications
  • /mnt is where system administrators can mount filesystems
  • /media contains links to removable media devices (for example, CDs)
  • /srv contains site-specific data which are served by the system


For more information, see the Wikipedia article on the Filesystem Hierarchy Standard.

File Permissions

Every file in Linux has permissions that define which users can Read, Write, and Execute it. Every file has an owner and a group. The permissions for a file are set on three levels: User (owner), Group, and Other.

Symbolic Notation

When you view the permissions of a file in Linux, they will most often be displayed in symbolic notation. Symbolic notation consists of 10 characters: the first defines the file type, and then there are three characters each for User, Group, and Other permissions.

  • -r--r--r-- is a normal file that is readable by all users but writable or executable by no one.
  • -rwxr-xr-x is a normal file that is readable and executable by everyone but only writable by User (the file's owner). This is the most common permission set.

Viewing File Permissions

To view the permissions of all files in a certain directory, run the binary ls -l in Bash:

$ ls -l   # displays a list of all files in a directory with their permissions in symbolic notation
total 16
lrwxr-xr-x  1 sffc  wheel   6 Aug  9 09:13 link -> myfile.txt
-rwxr--r--  1 sffc  wheel  12 Aug  9 09:13 myfile.txt
$ ls -l myfile.txt   # displays the permissions of only myfile.txt
-rwxr--r--  1 sffc  wheel  12 Aug  9 09:13 myfile.txt
$

Setting File Permissions

Linux comes with several useful binaries for setting file permissions.

  • chmod is used for setting permissions
  • chown is used for setting a file's owner
  • chgrp is used for setting a file's group

Some examples are shown below.

$ chmod a+x myfile.txt   # turns on the Execute option for all users
$ chmod o-w myfile.txt   # turns off the Write option for Other users
$ chmod u+wx-r myfile.txt   # turns on the Write and Execute options for User (the file's owner) and also turns off the Read option for User
$ chown todd myfile.txt   # sets the owner of myfile.txt to the user todd.  Note: First comes the user, then comes the filename: not the other way around!
$ chgrp staff myfile.txt   # sets the group of myfile.txt to usergroup staff
$

For more information, see https://www.linux.com/training-tutorials/how-manage-file-and-folder-permissions-linux/

The . and .. Directories

The . directory is a reference to the current directory. The .. directory brings you one level up in the filesystem.

Symbolic Links

A symbolic link, or symlink, is basically a link from one spot in the filesystem to another. You can think of them like aliases in Mac OS X. To create a a symlink, use the ln -s command:

$ ln -s /path/to/file.txt /path/to/link   # creates a symlink to file.txt at /path/to/link

# Example:
$ ln -s /home/todd/instructions.doc /var/www/public_html/classes/instructions.doc   # creates a symlink in the web server to instructions.doc
$ vi /var/www/public_html/classes/instructions.doc   # changes to the symbolic link will be reflected in the original file
$

User Management

Adding a New User

To create a new user, use the useradd command. Then, set a password for that user using the passwd command:

$ sudo useradd -r -m -c "<My Full Name>" <myUserName>
$ sudo passwd <myUserName>
Enter new UNIX password: 
Retype new UNIX password: 
$

Note: If Linux doesn't like your command, first check your syntax and then try it with single quotes around your full name and password instead of double quotes. If this works it is likely that you used some special characters in your password such as '!', which serves a special purpose in the shell. This is because Linux takes parameters enclosed in double quotes as one string, but still looks for special characters inside of the string and acts in accordance with their normal meaning. On the other hand, single quotes tell the shell to ignore any special characters within.

Adding a User to the Sudoers List

For security reasons, you should never SSH into your server as the root user. Instead, you should use a normal user to whom you give sudo privileges. (For more detail on sudo, see the Linux guide.)

To give a user sudo privileges, use the command visudo, which opens up the SUDO configuration file in the system's default text editor. (Never edit the file /etc/sudoers directly!) SUDO users are specified using lines similar to

alice   ALL=(ALL) ALL

Add that line immediately below the line defining root:

root   ALL=(ALL) ALL

When you are finished, save and close the file. For more information on editing files on the command line, refer to the section about File Editors in this guide.

In this case, alice will be able to run any command with SUDO privileges on the computer.

Note: The visudo utility is especially useful because it not only locks the sudoers file against simultaneous edits, but it also checks the file for syntax errors when you quit. Therefore, it is always recommended that you use visudo to change the sudoers list rather than attempting to modify it directly.

Networking

In Linux, you can see your network information by typing ifconfig. This command shows the status information of each network interface, including the IP address you will need to remotely connect to your instance. The interface lo is the special loopback interface with IP address 127.0.0.1. This refers to your local machine and any connection from your machine to your machine goes through this pseudo-interface. Typical network interfaces include eth0, eth1,..., wlan0, etc. Ethernet cards are represented with ethX. In the past, most wireless cards showed up as wlanX, but it is also common now for them to be represented with ethX names. ifconfig also gives information such as hardware address (MAC), netmask, and broadcast addresses.

You can start or stop networking by calling /etc/init.d/networking script. As with most /etc/init.d scripts, this script takes several options, such as start, stop, restart. Note even if you stop networking, you would still have your lo interface. You can look at the code of the script to find out what it actually does. You can also stop or start individual interfaces by using the ifup and ifdown commands.

The network configuration files are stored in /etc/network. /etc/network/interfaces contains the defaults for each interface. For xample, you could specify static IP, netmask, network, broadcast and default gateway for an interface here, but you should not need to edit this files in general. These default options can be changed with the ifconfig command. The /etc/network/if-down.d and /etc/network/if-up.d directories contain the scripts that are going to be executed when an interface is turned on or off. Of course, most modern Linux distributions have GUI tools for doing network configuration more easily, and you shouldn't need to change anything for the purposes of this course.

Synchronizing Date and Time

In order to avoid setting your system's time manually at every daylight savings change, you can use a Network Time Server via the Network Time Protocol (NTP).

The NTP Daemon comes pre-installed on EC2 AMI instances.

Setting Timezone

Your server is probably not set to the correct timezone by default.

The timezone files are in the directory /usr/share/zoneinfo. They are further organized within subdirectories grouped by region. For instance, Rome's time zone file is stored within /usr/share/zoneinfo/Europe.

In order to set the time zone, simply copy the desired time zone file to our /etc directory as a new file named "localtime". For example, to set the the machine's system time to Rome's time zone, we would enter the command

sudo cp /usr/share/zoneinfo/Europe/Rome /etc/localtime

ntp uses /etc/ntp.conf configuration file to find the ostnames of remote time servers. The defaults here are probably fine.

Installing Software

The package management tool in Red Hat Enterprise Linux (and therefore also your Amazon EC2 instance) is rpm. If you have an rpm package, you can install it by

$ rpm -i somepackage.rpm
$

This requires that somepackage.rpm be in your current directory, which means you will have to download the file yourself (or create it). It requires you to manually install any dependencies the package has.

Repository-Based Package Managers

A better alternative is to use a repository-based package manager. In RHEL, this is yum.

Before you install new software, you need to ensure that your local list of available packages is up-to-date. Run the following commands to perform this operation:

yum check-update

After you have ensured that your package list is synced with the remote repository, you can start installing packages. To install a package, the following command:

yum install <package-name>

If you get a permission denied error, try sudoing the command, like sudo apt-get install <xxxx>

For example, this is how you would install lynx, a command-line web browser, in your RHEL Linux distribution:

$ lynx --version   # is lynx installed?
-bash: lynx: command not found
$ sudo yum check-update   # sync package lists with the remote repositories
$ sudo yum install lynx    # install the lynx package
Downloading Packages:
lynx-2.8.6-27.6.amzn1.i686.rpm        | 1.8 MB     00:00     
Running rpm_check_debug
Running Transaction Test
Transaction Test Succeeded
Running Transaction
  Installing : lynx-2.8.6-27.6.amzn1.i686             1/1 

Installed:
  lynx.i686 0:2.8.6-27.6.amzn1                                                                                                                                                                                                                

Complete!
$ lynx --version   # test again to see if we have lynx installed
Lynx Version 2.8.6rel.5 (09 May 2007)
$

You can also search for available packages by name or by the name of a file that they install.

$ yum search lynx   # search for packages whose name contains lynx
======= N/S Matched: lynx =======
lynx.i686 : A text-based Web browser
$ yum provides lynx   # search for packages that install a file or command named lynx
lynx-2.8.6-27.6.amzn1.i686 : A text-based Web browser
Repo        : installed
Matched from:
Other       : Provides-match: lynx
$

The list of repositories that yum searches is located at /etc/yum.conf. The list of repositories that apt searches is located at /etc/apt/sources.list.

Essential Packages

To save you some headaches later on, it is recommended that you install a few essential package bundles when you first create a new Linux instance. These packages include things like Make and a C compiler.

RHEL:

$ sudo yum groupinstall "Development Tools"
$ sudo yum install kernel-devel kernel-headers

Searching for Packages

You can search for packages by name, command, and so on. For example, to search for all packages relating to "lynx", you could do:


RHEL:

$ yum search lynx

Command Reference

Earlier, you saw that cat is a command that shows the contents of a file. Below is a list of other useful commands in Linux.

Navigation and File Management

  • ls List file(s) in current working directory
    • ll Shortcut to ls -l. List files with more details than ls. Only available in certain distributions
  • cd Change working directory. Note: cd called without any arguments moves you to your home directory
  • cp Copy a file
  • mv Move or rename a file
  • rm Remove a file
    • rm -r Remove a directory and all files in it
  • ln -s Create a symlink to a file
  • mkdir Create a directory
  • rmdir Remove a directory (directory must be empty; if it's not, use rm -r)
  • cat Display the contents of a file
  • less Display the contents of a file, wait for the user at each page
  • tail Display the last 20 lines of a file
    • tail -f Display the last 20 lines of a file and then wait for changes, displaying them as they occur. Useful for monitoring log files.
  • chown Change the owner of a file
  • chgrp Change the group of a file
  • chmod Change the security permissions of a file
  • grep Display the lines of a file matching a user specified string
  • diff Display the difference between two files

System Administration

  • df Display free diskspace
  • du Display disk usage
  • free Display memory usage information
  • date Display current time and date
  • top Display the CPU and Memory usages of current processes
  • ps Display current processes
  • kill Terminate a running process
  • killall Terminate the running process matching user specified criterias
  • ping hostname Ping a host
  • host Get the IP address of a host
  • passwd Change the user password
  • su user Switch to the privileges of another user
  • shutdown Power off the computer
  • reboot Reboot the computer
  • clear Clear the terminal
  • ifconfig Display/Configure a network device
  • file Show the file type
  • lsmod Display loaded kernel modules
  • insmod Install a kernel module
  • modprobe Load a kernel module (also load the dependencies)
  • adduser Add a new user
  • exit Exit from a shell
  • lpr Print a file
  • head Display lines at the beginning of a file
  • tail Display lines at the end of a file
  • pwd Display the name of the current working directory
  • lsof Open files in the system
  • netstat Statistics related to open sockets
  • vmstat Reports information about processes, memory, IO, and CPU

File Editors

XKCD Comic: real_programmers

It is sometimes convenient to edit files using the command line. Three widely-used command line text editors are vi, emacs, and nano.

Vi

To edit a file using Vi, use the command vi. You will see something like this:

.
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
"myfile.txt" [New File]

To insert text into the file, press i once, then type away:

Hello World
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
-- INSERT -- 

To leave insert mode, press ESC.

To save your file, type the command :w and press Enter.

Hello World
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
~                                                     
"myfile.txt" [New] 1L, 12C written

To save and then close your file, type the command :wq. (To close the file without saving, use the command :q!)


For more vi commands, see https://ss64.com/vi.html

Emacs

To edit a file using emacs, use the command emacs. You can start typing immediately:

Hello World











-uuu:**-F1  myfile.txt     All L1     (Text)----------

To save a file, type C-x C-s (that means Control-X, then Control-S):

Hello World











-uuu:---F1  myfile.txt     All L1     (Text)----------
Wrote /home/todd/myfile.txt

To quit emacs, type C-x C-c. (It will ask you whether or not to save the file if you've made changes.)

For more emacs commands, see http://souptonuts.sourceforge.net/chirico/emacs_ref.html

Nano

To edit a file using nano, use the command nano. You can start editing the file immediately:

  GNU nano 2.0.6               File: myfile.txt                                      

Hello World









^G Get Help   ^O WriteOut   ^R Read File  ^Y Prev Page  ^K Cut Text   ^C Cur Pos
^X Exit       ^J Justify    ^W Where Is   ^V Next Page  ^U UnCut Text ^T To Spell

Nano tells you the commands you need right there so you don't have to always keep looking them up like with Vi and Emacs.

For more detail on Nano commands, see http://www.nano-editor.org/dist/v2.2/nano.html

Linux Resources

Linux System Administration Tutorial

Working with the Shell (SUSE Documentation)

Linux Kernel

VI Tutorial

Making the Transition to Linux: A Guide to the Linux Command Line Interface for Students