System Administration Original
This page serves as a reference for all things related to system administration of Linux. You will be installing Fedora Linux on a Amazon EC2 instance (virtual server) which you will then use for all of your assignments for the rest of the course. You will be responsible for maintaining your instance for the entire semester, including the installation of new software you will need for each assignment. There are also notes here on other topics which are not necessary for the completion of the course, but which you may find interesting.
Contents
- 1 Working as Linux User
- 2 Networking
- 3 Configuring Your System
- 4 Additional Information
Working as Linux User
This section will cover some basics of functioning as a Linux user on the command line.
Bash
Bash is your default shell environment, i.e., it is the command line environment you will be in when you open a terminal (or remotely access your VM over SSH later). Bash is one of the shells installed by default (and it is default in most systems). It is located at /bin directory. Bash is a derivative of sh, one of the first shells. Other popular shells include csh and tcsh, shells with c-like syntax for scripting, and zsh a bash-like shell which focuses on extending the capabilities of the shell environment.
Variables
It is possible to set and use variables in your bash environment. When setting a variable, that variable is written by its name (no pre-declaration required). However, to use it you need to put special character $ in front of the variable. For example,
MYVARIABLE="this is a test"
sets the value of MYVARIABLE whereas
echo $MYVARIABLE
shows the value of MYVARIABLE. If you want your variable to be accessible to the child processes (e.g., in a script or program you call from the shell), you need to export it with
export $MYVARIABLE
after you set it. Alternatively, you can set and export it at the same time.
export MYVARIABLE="this is a test"
You can see the available variables by typing
set
Some of interesting variables are:
PATH: search path for the commands
PWD: name of the current directory
SHELL:type of shell
TERM: type of the terminal
USER: the account name
HOME: the user's home directory
PS1: the prompt at command line
$$: the process id of current shell
$RANDOM: a random value
$?: the return value of the last command
$_: the last argument of the previous command
$#: where # is a number, the value of the #th argument
IFS: input field separator
Running Programs
In Linux, it is enough to type the name of a program to execute it. The name could be absolute (i.e., full name including the directories, such as /mydir/myprogram), or relative (the location with respect to current directory, such as ../../mydir/myprogram). In order to avoid typing the directory names every time you want to run a program, a list of directories can be specified by default in $PATH variable. If you type a command, the directories in that variable are going to be added to your program in the order of their occurrence. The first program found is then going to be executed. While it is tempting to put . in your PATH (. means the current directory), this is an extremely unwise thing to do. To see why, consider that you have changed your current directory to a different user's home area. They could have a program in their base directory called ls. If . is first in your path, it will execute that program instead of the intended system ls program. Since the other user wrote the program, it could do anything, and you are running it as yourself, meaning that it could potentially harm your account, e.g., deleting all the files your home directory. You might also consider putting . last in your path so that any system programs will be found first. This is still very dangerous, however. Perhaps this other malicious program was called sl instead. Just on typo from ls to sl and the worst could happen.
A program runs in the foreground (unless it detaches itself from the terminal) by default. You can run a program in the background by adding & at of the command (after arguments). In this case, the shell would fork a process for that program and enable the command prompt back for input. At any time, jobs command can be used to see the processes running at the background. fg command brings the specified process back to foreground. A program running in the background can be stopped by typing ctrl-c in most cases. Typing ctrl-z interrupts a program running in the foreground. If a program is interrupted, it will not continue executing until it is resumed. An interrupted program can be brought back to foreground by fg, or it could be send to background by bg.
A process can be killed by kill command.
kill process-number
in some cases the kill signal can be ignored, so it may be necessary to force kill the program by sending an absolute KILL signal.
kill -9 process-number
The running processes can be found by typing ps (see the processes in the current shell), or ps -eaf (see all processes).
A program's standard output can be send to a file by typing >filename at the end. Similarly, >> appends to a file. In Linux, there are three default file handlers, standard input or STDIN, standard output or STDOUT, and standard error or STDERR. STDOUT has a file handler number 1 and STDERR has a number of 2. In bash, you can direct either of these handlers to a file. For example
someprogram 1>filename
redirects the standard output, where as
someprogram 2>filename
redirects the error output. Alternatively, you can redirect one file handler to another, such as
someprogram 2>&1
where STDERR is redirected to STDOUT.
Similarly, output of program can be redirected to the input of another program through pipes, e.g.,
program1 | program2
where program1's output is sent as an input to program2. Redirection is possible for STDIN too. A program can get its input by redirecting STDIN using <
myprogram < inputfile
Finally, ` can be used to capture the output of a program, and use it as a string such as in setting a variable
MYVARIABLE=`someprogram` echo $MYVARIABLE
Programming the shell
As bash is nothing but a command interpreter, it actually comes with a built-in programming language. Users can take advantage of this powerful language to simplify and automate various tasks. Programs written in shell languages (and other interpreted languages) are referred to as scripts. They can be run from the command line like any other program using the correct shell program as the interpreter. The scripts themselves are just text files with lists of commands. For example,
bash commandfile
reads and executes the commands from the text file named commandfile. A better approach is to make commandfile executable and run it as if it were a compiled program
chmod a+x commandfile ./commandfile
In this case, the shell is going to execute commandfile. For this to work, you must also specify the interpreter of commandfile on the first line of the script file, starting with #! (pronounced sha-bang), e.g.
#!/bin/sh ls
would simply calls the sh program to run the remaining commands (ls in this case). It is a good idea to always specify the interpreter at the first line of the scripts. For bash scripts, it would be
#!/bin/bash
Statements
Any line in a bash script is a program to be executed. Lines are broken with ;.
Conditional statements
Bash supports if statements. The format is
if [ CONDITION ] then somecommand fi
or
if [ CONDITION ] then somecommand else someothercommand fi
CONDITION could be a logical statement or it could be a test (run man test for more details). For example
if [ $val = 5 ]
or
if [ $val -eq 5 ]; then echo value is 5 fi
if [ somefile1 -ot somefile2 ]; then echo somefile1 is older than somefile2 fi
Bash also has case statements. The format is
case $mywar in value1) commands; ;; value2) commands; ;; *) commands; ;; esac
In this case, ;; means end of a case block and * means catch anything.
In general, you will nearly always put string variables in quotes, ". To see why, remember that shell variables are simply expanded to their content when used. For example,
myvar="Some very good text was here. Now it is gone and all that is left is this boring message" if [ $myvar = "This is very good text" ]
would fail with an error message as $myvar would be expanded to its content, like this:
if [ Some very good text was here. Now it is gone and all that is left is this boring message = "This is very good text" ]
To avoid this, you should have the statement as
if [ "$myvar" = "This is very good text" ]
Loop statements
Bash provides standard loop statements, for, while, until. They can be executed in a script or it could be typed at the command prompt.
The format of for statement is
for VAR in somevalue1 somevalue2 .... somevaluen do executesomecommand done
This loop will execute the for block for each value of VAR. For example,
sum=0 for i in 1 2 3 4 5 6 7 8 9 10 do sum=$[$sum+$i] done
would sum numbers from 1 to 10. We can also use other techniques in for line, e.g. replacing for in the above code with
for i in `seq 1 1000`
would get the sum from 1 to 1000. Note the usage of `
The format of while and until are very similar
while [ CONDITION ] do execute some command done
and
until [ CONDITION ] do execute some command done
For both of these commands, CONDITION is the same as for the if statement.
Functions
Bash also provides functions. They could be defined at the command prompt and then can be called from command prompt. The structure of a function is similar to most modern languages.
myfunction(){ execute some commands }
The function can then be called with:
myfunction
You can send parameters to the function by adding them next to the function name:
myfunction arg1 arg2 ....
Within a function, you can access the arguments using $#, i.e., $1 for first argument, $2 for second argument, etc.
Additional Information
Advanced Bash-Scripting Guide.
Permissions
This is a very helpful tutorial on file permissions in Linux.
If you're still having a bit of trouble with using numerical chmod, you may want to review counting in binary up to 7.
Networking
In Linux, you can see your network information by typing ifconfig. This command shows the status information of each network interface, including the IP address you will need to remotely connect to your instance. The interface lo is the special loopback interface with IP address 127.0.0.1. This refers to your local machine and any connection from your machine to your machine goes through this pseudo-interface. Typical network interfaces include eth0, eth1,..., wlan0, etc. Ethernet cards are represented with ethX. In the past, most wireless cards showed up as wlanX, but it is also common now for them to be represented with ethX names. ifconfig also gives information such as hardware address (MAC), netmask, and broadcast addresses.
You can start or stop networking by calling /etc/init.d/networking script. As with most /etc/init.d scripts, this script takes several options, such as start, stop, restart. Note even if you stop networking, you would still have your lo interface. You can look at the code of the script to find out what it actually does. You can also stop or start individual interfaces by using the ifup and ifdown commands.
The network configuration files are stored in /etc/network. /etc/network/interfaces contains the defaults for each interface. For xample, you could specify static IP, netmask, network, broadcast and default gateway for an interface here, but you should not need to edit this files in general. These default options can be changed with the ifconfig command. The /etc/network/if-down.d and /etc/network/if-up.d directories contain the scripts that are going to be executed when an interface is turned on or off. Of course, most modern Linux distributions have GUI tools for doing network configuration more easily, and you shouldn't need to change anything for the purposes of this course.
Configuring Your System
This section gives details about how to further configure your Fedora instance.
Updating repositories and installing new software
The package management tool in Red Hat Linux/Fedora is rpm, and the package format is called rpm. If you have a rpm package, you can install the package by typing
rpm -i somepackage.rpm
This requires that somepackage.rpm be in your current directory, which means you will have to download the file yourself (or create it). It requires you to manually install any dependencies the package has. An alternative is to use yum. yum searches online repositories and creates a list of available packages. The locations of the packages are specified by the .repo files in the directory /etc/yum/yum.repos., generally you don't need to modify it, but in some cases the default repositories do not have the packages for some of the latest updates to software for a while after various independent repositories. In the case that you do utilize an independent repository, it is critical that you do your homework and are sure that you can trust the repository. You can install a package with
yum install package-name
if you don't know the exact name of your package, you can search the name by typing the command
yum search keyword
or, when you don't know the name of the package that provides a desired function or file.
yum provides keyword
Remote Access
SSH
The primary mode of remote access to your machine is SSH (secure shell). SSH access requires that the sshd daemon is running in your machine. By default, SSH is preinstalled on your EC2 instance. On a local linux box you can install ssh by issuing the command.
yum install openssh-server
As you likely know, it is often desirable to use SSH to access your machine without specifying your password. This is done by generating a public/private encryption key pair on your local host, and copying the public key to the remote machine. This is the default way Amazon uses to log in to its EC2 instances, and it is recommended that you utilize this method. Instructions on setting this up can be found on the course website. Additional details can be found here [1].
The configuration files for SSH are in /etc/ssh. You can modify the files to affect SSH permissions, among other things. For example, it is always a good idea to disable root access over ssh. This could be done by editing /etc/ssh/sshd_config and setting
PermitRootLogin no
Note that you must restart the ssh process for this to take effect. Should that fail, resetting your server should do the trick.
Warning: Disabling root access over SSH for your EC2 instance should only be done after setting up an additional user account and adding that account to the sudoers list.
SSH For Windows: PuTTY
It is recommended that you run Linux locally. However, if you have yet to free yourself from the bonds of the Microsoft hegemony, you can easily access your instance by running a windows SSH client.
The best SSH client available for windows is PuTTY, which is completely free and can be downloaded here.
PuTTY is fairly simple and straight forward with one caveat: Amazon's .pem key pair files are not compatible with PuTTY keys. In order to convert .pem keys to a PuTTY .ppk privte key file, you should use the puttygen.exe utility available from the same page [2] as PuTTY. Next select import under the conversions menu,load the amazon .pem key file and press the save private key button. Be sure to save the file in the directory where PuTTY looks for its keys.
Copy and paste works similarly to the X Window System in Unix. You use the left mouse button to select text in the PuTTY window. The act of selection automatically copies the text to the clipboard: there is no need to press Ctrl-Ins or Ctrl-C or anything else. In fact, pressing Ctrl-C will send a Ctrl-C character to the other end of your connection (just like it does the rest of the time), which may have unpleasant effects. The only thing you need to do, to copy text to the clipboard, is to select it.
To paste the clipboard contents into a PuTTY window, by default you click the right mouse button. If you have a three-button mouse and are used to X applications, you can configure pasting to be done by the middle button instead, but this is not the default because most Windows users don't have a middle button at all.
Also, here is a good PuTTY tutorial that you might find useful to get started helpful tutorial.
SSHFS
SSHFS is a filesystem client which allows secure mounting of remote file systems. While there are other ways to mount remote file systems, SSHFS has the advantage of being able to mount a file system located on any host that has an SSH daemon running without any host side installation or configuration. This means that you can easily access and edit your files using all of your local applications including IDEs.
As you may have inferred from the name, the underlying implementation utilizes SSH File Transfer Protocol in combination with FUSE, a package now included in the kernel that allows unprivileged users to easily create their own file systems in userspace (see the wikipedia entry for more information [3]).
To mount a share using password based authentication, the command is
sshfs user@domain:/path/to/remote/directory /path/to/local/mountpoint
e.g. To mount the directory /home/joe/myfiles in the user joe's home directory for a machine with the domain schmoesfiles.org using SSHFS you would enter the command
sshfs joe@www.schmoesfiles.org:myfiles
Note that if you are using public key authentication, the command to mount the remote share is slightly different
sshfs -o IdentityFile=/path/to/private/key user@domain:/path/to/remote/directory /path/to/local/mountpoint
To unmount the filesystem you can use the following command
fusermount -u /path/to/local/mountpoint
Disabling Root User
The root account, also called the super user account, is the equivalent of the Administrator account in Windows. These days, it is common practice to disable the root account in order to increase security. Indeed, many Linux distributions have the root user disabled by default. Of course, without the root user we need to have a way to access super user privileges. The sudo command provides this. Sudo enables individual users to run some commands as the root user. It has a configuration file, /etc/sudoers, where the access privileges are specified. For example, a line such as
alice ALL=(ALL) ALL
gives permission to alice to run any command as root. The first time alice runs sudo, it will ask alice for her password, but then it remembers the successful sudo for a certain time period so that additional sudo commands will not require a password again. Given the above configuration, alice can use sudo to become root by typing
sudo bash
or
sudo su -
The sudoers file can also be configured to never ask for a password (but still allow the command to be run), and to specify that only certain commands can be run:
bruce ALL= NOPASSWD: /usr/sbin/kill,/usr/bin/killall
In this case, bruce can only run /usr/sbin/kill, and /usr/bin/killall as root (without typing his password).
You can use any editor to edit /etc/sudoers, but standard practice is to use visudo or sudoedit which locks the sudoers file to ensure that only one person is editing the file at a time. Additionally, visudo performs syntax checking before modifying the actual sudoers file.
Setting System Time
In order to avoid setting your system's time manually at every daylight savings change, you can use a Network Time Server. The NTP (Network Time Protocol) daemon comes pre-installed on EC2 instances, however it may not be set to the correct time zone.
The time zone files are in the directory /usr/share/zoneinfo. They are further organized within subdirectories grouped by region.
For instance, Rome's time zone file is stored within /usr/share/zoneinfo/Europe In order to set the time zone, simply copy the desired time zone file to our /etc directory as a new file named "localtime" e.g. To set the the machine's System time to Rome's time zone we would enter the command
sudo cp /usr/share/zoneinfo/Europe /etc/localtime
ntp uses /etc/ntp.conf configuration file to find out the IP address of remote time servers. By default, Fedora points ntp at a Fedora server, which is perfectly acceptable.
Cron
Cron is a system service which is used to invoke programs or scripts in a periodic manner. The crontab is the file which contains the schedule of what programs should be called when. Cron itself is a system daemon which runs in the background, wakes up periodically, and runs anything in the schedule whose time has come. crontab is also the command you run to edit the schedule file.
The crontab command can run run with -l option to show the current schedule, and with -e option to launch an editor to modify the schedule. The $VISUAL environment variable determines which editor is launched. For example, set $VISUAL to '/usr/bin/vi' to run vi to edit the schedule.
Each line in the crontab file can be a comment, a variable declaration, or an event line.
Comments
Comments begin with a comment mark #, and must be the first character on the line.
Variable declarations
Variable declarations are of the form
name=value
Unlike bash scripts, you can get away with putting spaces around the = sign.
Event lines
Each event line specifies a time and a date, and a command which is to be executed, in the format
minute hour date month day command
The first five fields can be numbers or ranges, in the format described below. Note that you can specify either the date (i.e. within the month) or the day (of the week), but not both. The other field should be set to *.
The sixth field is a command with parameters. Percent signs, unless escaped with a \ backslash, will be turned into newlines and everything after the first one of these will be fed into the command's STDIN stream.
It is also possible to execute shell scripts or run various applications with cron. For example. if you want it to start an alarm at 6AM every weekday morning, here is the crontab line you would use:
0 6 * * 1-5 /home/user/alarm.sh
Normally, the crontab file contains a MAILTO variable that directs output (STDOUT and STDERR) to an email to the address (e.g. MAILTO=dave). If this is not working, the script may quit unexpectedly when its output has nowhere to go.
Range format
* | Any number |
*/5 | Any number, in steps of 5 |
1-6 | Any number between 1 and 6 (inclusive) |
0-30/5 | Any number between 0 and 30, in steps of 5 |
1,4,9 | 1, 4 or 9 |
Months
Months can be specified in numbers or in words.
1 = jan
2 = feb
...
12 = dec
Days of the week
Days of the week also can be specified in numbers or words.
0 = Sunday
1 = Monday
2 = Tuesday
...
6 - Saturday
7 - Sunday
Examples
# fetch e-mail every ten minutes */10 * * * * fetchmail # send myself a birthday greeting 0 9 7 28 * mail -s'Happy Birthday' ajs318%Many Happy Returns - you old fart!%.%% # back up my recipe database every Monday 30 5 * * 1 mysqldump --opt recipes > /home/ajs318/backups/recipes.sql
Running programs during system boot
When a Linux system boots there are a series of scripts that are called to start up system processes, daemons, and other programs (such as SSH servers, web servers, database programs, etc). The simplest way to add something to the boot process is to add it to /etc/rc.local, which is a script that is called automatically at the very end of the boot process. Simply write a script that does what you want and then call it from with in /etc/rc.local to ensure that your script is called at the end of the boot process.
You can also add scripts which run at different times during the boot process. The way to do this varies by Linux distribution. For Fedora, see [4] (specifically the section entitled Init Script Activation).
Apache
Apache is the leading web server available for several platforms. It is highly configurable and has a wide range of modules ready for different needs.
You can install the Apache package by running
yum install httpd
In Fedora, apache configuration files are stored under /etc/httpd/conf. The most important file is httpd.conf where you specify your preferences. Some important directives are
DocumentRoot: The path to the directory where the top level web files are going to be stored (default is /var/www/html).
IfModule: The following block would be included if specified module exists.
User: Which user apache2 will run as.
Group: Which group will have group access to default web files.
AccessFileName: The name of the access file (that specifies user names/passwords and other limitations to files/directories).
ErrorLog: Where any errors will be written.
Include: Include some other files.
LogFormat: How to write a log message.
ErrorDocument: Files to display for some HTTP errors(500,404,402 etc.).
apache2 log files are stored in /var/log/httpd. access_log shows the requests to your server and error_log reports the errors (such as missing files).
If the Alias module is loaded, you can map a directory URL to another directory in your file system.
Alias /url-dir "/mydir/in/my/server"
You can specify individual directory properties with Directory directive
<Directory directoryname> some options some permissions some others directives </Directory>
For example,
<Directory /var/www/> Options Indexes FollowSymLinks AllowOverride None Order allow,deny allow from all RedirectMatch ^/$ /apache2-default/ </Directory>
Set options for the /var/www directory. The Options directive says that Index related directions are enabled and users may put symbolic links that will be followed. No files within a directory can override these default files. Access is allowed to anybody. Note that this directory is actually the root directory of the web server.
Setting up Virtual Hosts
VirtualHost directives are used to set up virtual hosts within one web server. For example,
<VirtualHost cse330.dyndns.org> ServerAdmin webmaster@localhost ServerName cse330.dyndns.org DocumentRoot /home/www/cse330/ ErrorLog /var/log/httpd/error_log LogLevel warn CustomLog /var/log/apache2/access_log combined ServerSignature On </VirtualHost>
This configuration enables any requests that use a host name of cse330.dyndns.org will use /home/www/cse330 as the root document directory. Make sure that this directory exists and is readable by the httpd process (which runs as the apache user in Fedora).
You can add any such directives to the end of the httpd.conf file. Alternatively, and preferably, you can put this configuration in a separate file and include that file at the end of httpd.conf.
You can put the above virtual host description in a file located in a sites-enabled directory that you create (in different linux distributions, this may be created for you by default). Standard practice is to put the above configuration in a file under /etc/httpd/sites-available, and create a symbolic link to a file in the sites-enabled directory. This way you can just remove the link if you want to disable the virtual host. For example, if you have a file /etc/httpd/sites-available/mytest.conf, you add it to the enabled list like so:
cd /etc/httpd/sites-enabled ln -s ../sites-available/mytest.conf .
After any change to apache, you can tell apache to reload the configuration files by running:
/etc/init.d/httpd reload
If you are loading a new module or any other large change you will probably have to actually restart apache completely:
/etc/init.d/httpd restart
Command and File Reference
Commands
ls List file(s)
cd Change directory
cp Copy file(s)
mv Move file(s)
rm Remove file(s)
ln Create a link to a file
mkdir Create a directory
rmdir Remove a directory
chown Change the owner of a file
chgrp Change the group of a file
chmod Change the security permissions of a file
cat Display the contents of a file
less Display the contents of a file, wait for the user at each page
grep Display the lines of a file or files matching user specified string
diff Display the difference between two files
df Display free diskspace
du Display disk usage
free Display memory usage information
date Display current time and date
top Display the CPU and Memory usages of current processes
ps Display current processes
kill Terminate a running process
killall Terminate the running process matching user specified criterias
ping Ping a host
host Get the IP address of a host
passwd Change the user password
su Switch to the privileges of another user
shutdown Power off the computer
reboot Reboot the computer
clear Clear the terminal
vi Visual Editor
ifconfig Display/Configure a network device
file Show the file type
lsmod Display loaded kernel modules
insmod Install a kernel module
modprobe Load a kernel module (also load the dependencies)
adduser Add a new user
exit Exit from a shell
lpr Print a file
head Display lines at the beginning of a file
tail Display lines at the end of a file
pwd Display the name of the current directory
lsof Open files in the system
netstat Statistics related to open sockets
Directories
/var Location of frequently changing system files
/etc Common configuration files
/root Root's home directory
/home The home directories for regular users
/usr System programs and documents
/proc System resources and consumptions
/tmp Temporary files
/lib Kernel libraries
/boot Boot files
Files
Under /etc
/etc/yum.conf Configuration file for yum
/etc/yum/yum.repos.d Directory containing .repo files for online repositories
/etc/crontab System-wide crontab file
/etc/fstab Information about default partitions to be mounted
/etc/group List of groups in the system
/etc/hosts List of IP addresses with their names
/etc/inittab What to do at each run-level
/etc/inetd.conf Configuration file for some internet services (replaced by xinetd.* in most systems)
/etc/modules.conf Module information for the boot
/etc/motd Message to be seen at the login prompt
/etc/passwd User information
/etc/profile System level initial file for sh and its derivatives
/etc/shadow User passwords
Under /var
/var/log/messages System/Kernel messages
/var/log/syslog System log (mostly for Daemons)
/var/log/wtmp' User access log (binary)
/var/log/dmesg Boot-up messages
/var/log/auth.log Authorization logs
Suggested Reading
Linux System Administration Tutorial
Working with the Shell (SUSE Documentation)
Making the Transition to Linux: A Guide to the Linux Command Line Interface for Students
Additional Information
A few other topics are covered below.
Linux Kernel
What separates Linux from other Unix variants is its kernel. The kernel is the most important component of the operating system and is responsible for scheduling processes, providing access to the hardware devices, allocating memory to the programs, and so on.
The Linux kernel uses both monolithic and modular approaches. A monolithic kernel is a single program that contains all the code so any addition to kernel (such as code to access a driver) requires recompiling the code. A monolithic kernel is usually a little faster and could have a smaller size since only the absolutely necessary code is there. The modular kernel, on the other hand, enables dynamic loading and unloading of kernel code, called modules. Typical modules include device drivers. Thanks to this modular approach, Linux seldom requires a reboot after installing a new device.
LDAP
LDAP is a Lightweight Directory Access Protocol. It is commonly used for getting personal and authentication information from a central server. More information for LDAP is available on the OpenLDAP website.
Your initial LDAP database
Before you start installing LDAP, lets look at what kind of information we are going to use. You can write this information to a text file to populate your LDAP database later. The structure you decide upon is also important as you have to let the LDAP server know what that structure is.
Lets assume we are creating an LDAP service for the Babylon 5 space station using files as initial entries. At the top, we need to define an organization and then we need to describe the organizational units. Our organizational unit will be Interstellar Alliance (ISA) and our subunits will be the planets belonging to this organization (Earth and Minbar for the sake of briefness). Then we will have information about people who are citizens of these planets.
We describe ISA with
dn: o=ISA objectclass: top objectClass: organization o: ISA description: Interstellar Alliance
The organization name (o) is ISA, and this entry has a distinct name (dn) of o=ISA. It is also an instance of classes top and organization. Under this organization, we need to have entries for Earth and Minbar.
dn: ou=Earth,o=ISA ou: Earth objectClass: top objectClass: organizationalUnit description: Human
dn: ou=Minbar,o=ISA ou: Minbar objectClass: top objectClass: organizationalUnit description: Minbari
Note that, dns for child nodes contain the path to reach them.
We also need an administrator for LDAP so that we can access and modify the entries later.
dn: cn=isaadmin,o=ISA objectClass: organizationalRole cn: isaadmin description: LDAP directory administrator
Then we will have the information about people.
dn: cn=John Sheridan,ou=Earth,o=ISA ou: Earth o: ISA cn: John Sheridan objectClass: top objectClass: person
objectClass: organizationalPerson objectClass: inetOrgPerson givenname: John sn: Sheridan postalAddress: Human Sector l: Babylon 5 homeDirectory: /tmp st: Babylon 5 telephoneNumber: (800)555-1212 homePhone: 800-555-1313 facsimileTelephoneNumber: 800-555-1414 userPassword: sheridan title: Commander of Babylon 5
This entry is an instance of a class derived from person, organizationalPerson,inetOrgPerson, hence its attributes are from those classes. There are several optional attributes these classes contain that are not included in the description of this particular person. If you want to use LDAP only to provide the information about the people, this description would be sufficient. But if you need to provide authentication to other systems, you need other information too. First of all, you need to inform LDAP that this entry also contains user information by adding object classes posixAccount and shadowAccount. Furthermore, you need to the give other information such as account name, user id, the groups this person belongs to, the home directory, etc.
So, a more general entry for this person could be:
dn: cn=John Sheridan,ou=Earth,o=ISA ou: Earth o: ISA cn: John Sheridan objectClass: top objectClass: person objectClass: posixAccount objectClass: shadowAccount objectClass: organizationalPerson objectClass: inetOrgPerson givenname: John sn: Sheridan uid: starkiller postalAddress: Human Sector l: Babylon 5 uidNumber: 1025 gidNumber: 9000 homeDirectory: /tmp st: Babylon 5 telephoneNumber: (800)555-1212 homePhone: 800-555-1313 facsimileTelephoneNumber: 800-555-1414 userPassword: * title: Commander of Babylon 5
So John Sheridan has account name starkiller with UID 1025 and home directory /tmp. Notice that, we set this person's group number to 9000. But how does a client machine know a group? LDAP also serves information about groups, so you can create a group entry.
dn: cn=chargroup,o=ISA objectClass: posixGroup objectClass: top cn: chargroup userPassword: {crypt}x gidNumber: 9000
Finally, you can repeat this for other personal.
Setting up the server
In order to use LDAP, we need slapd, ldap-utils, libldap2, libldap2-dev packages.
apt-get install slapd ldap-utils libldap2 libldap2-dev
If the installation program asks for an admin password, type a password but don't worry about it much since we will create our own admin later.
slapd is an LDAP server. It has configuration files under /etc/ldap. For now, we are interested in slapd.conf. This files include some default schema that describes object classes you can use in your entities. It also describes a default LDAP directory database.
database bdb
describes a Berkley database that is going to be used (you can select other alternatives). It also has a default suffix. If you want, you modify the lines for the default database description or you can set up your own database. Basically,you need to select a suffix for your database (usually the organization's dn) and give the dn of the LDAP administrator and its password. Finally, you need to specify the permissions.
If we continue with Babylon 5 example, remember our organization had dn: o=ISA, so that will be our suffix
suffix "o=ISA"
We would also need to inform LDAP about the administrator account so that we can access LDAP and modify it.
rootdn "cn=isaadmin,o=ISA" rootpw jms_rulez
In this example, the password was left in plain text, but you can also use encrypted passwords. We need to give the administrator the full access to modify the database:
# The admin dn has full write access, everyone else # can read everything. access to * by dn="cn=isaadmin,o=ISA" write by * read
and the others can modify their own passwords:
access to attrs=userPassword,shadowLastChange by dn="cn=isaadmin,o=ISA" write by anonymous auth by self write by * none
Actually, the last bit needs to come before the administrator access since otherwise, it will overwrite the administrator's write access.
So we are now ready to use ldap. Since we have updated slapd.conf, we need to restart slapd.
/etc/init.d/slapd restart
and we need to populate the initial database:
ldapadd -f ~/babylon5.ldif -xv -D "cn=isaadmin,o=ISA" -h 127.0.0.1 -w jms_rulez
The format is
ldapadd -f LDIF_FILE_TO_BE_USED -xv -D "admin's dn' -h HOSTNAME_FOR_LDAP_SERVER -wADMIN_PASSWORD
The -x option tells LDAP to use plain authentication and -v says verbose output.
If you have problems, you can stop slapd and use
slapadd -u -l babylon5.ldif -b o=ISA -cv
to see detailed error messages. slapadd accesses your database directory directly without going through the server. If you want to remove the LDAP directory, you can directly remove everything under /var/lib/ldap/ (the path specified in slapd.conf) and the next time you start slapd, it will create initial files (but you need to repopulate).
You can verify if your LDAP is working with
ldapsearch -x -b 'o=ISA'
ldapsearch takes other parameters to let you search for specific information. In the example, we look at all the entries that have o=ISA.
Setting up the client
First you need to install the client side packages:
apt-get install ldap-utils libpam-ldap libnss-ldap nscd
Now we need to inform Linux to look at LDAP for authentication. We do that by modifying /etc/nsswitch.conf:
passwd: ldap compat group: ldap compat shadow: ldap compat
PAM is the Linux module that handles authentications which allows you to have different authentication protocols for different programs. We need to update the authentication methods to use LDAP for account information. This is done by editing files:
/etc/pam.d/common-account
account sufficient pam_ldap.so account required pam_unix.so try_first_pass
/etc/pam.d/common-auth
auth sufficient pam_ldap.so auth required pam_unix.so nullok_secure try_first_pass
/etc/pam.d/common-password
password sufficient pam_ldap.so password required pam_unix.so nullok obscure min=4 max=8 md5 try_first_pass
We also need to update /etc/ldap/ldap.conf (with your partner's information)
BASE yourbase URI ldap://yourhost rootbinddn Your admin's dn
In our example case, it will be
BASE o=ISA URI ldap://128.252.160.XXX #replace XXX with the final IP number rootbinddn cn=isaadmin,o=ISA
and then similar changes go in /etc/libnss-ldap.conf (with your partner's information)
base o=ISA host 128.252.160.xxx #replace xxx with your server's IP rootbinddn cn=isaadmin,o=ISA
Both libnss and pam_ldap get the rootbindn's password from text files so add your administrator's password there and make sure those files have 500 permissions.
/etc/libnss-ldap.secret and /etc/pam_ldap.secret
Finally you need to restart nscd
/etc/init.d/nscd restart
nscd somtimes uses a local cache which may not be updated after LDAP configuration. You could install install nscd after LDAP has been configured or disable the cache for the password file in ncsd configuration file /etc/nscd.conf
enable-cache passwd no
Now you can change the password of a user in LDAP with
password username
You can get the password file with
getent passwd
Your LDAP entries should be there.
Alternatively, you can type
getent passwd nameofauser
If you don't see anything after these commands, something is missing in your configuration. Make sure your admin password is right and URIs, bases are correct. Try to access the LDAP server by using ldapsearch:
ldapsearch -x -D 'cn=isaadmin,o=ISA' -w jms_rulez #make sure you have your parameters for -w (password) and -D (admin entity)