sort command can be used to sort contents from standard in or a file. You've probably seen and used it before in the context of pipelining.
There are tons of different ways you can specify sorts, and we'll go over the important ones.
The input to
sort should be a stream of records separated by a newline character. Each characteristic (or column) is known as the field, which is separated by some user-specified character (the delimiter). This is most often a tab, comma, or semicolon.
By default, the
sort command will sort alphabetically by the first field.
$ cat employees.txt Caelestinus, Joon Directory Photios, Roland CEO Eliseus, Meindert Secretary Pino, Derryl Assistant Gemini, Klaos Assistant $ sort employees.txt # Sort first field by alphabetical order Caelestinus, Joon Directory Eliseus, Meindert Secretary Gemini, Klaos Assistant Photios, Roland CEO Pino, Derryl Assistant
You can sort multiple files at once and have the output be a sorted file.
$ sort names1.txt names2.txt names3.txt # (names1.txt, names2.txt, names3.txt are unsorted files) Avi Bobby Brian Cat Derrick Duke Irvin JB Jen Jim Jizelle John Ragmar Shawn Telly
You can use the
-m option to merge presorted input files. The
sort command is able to easily perform this because its implementation is based on merge sort.
$ cat sortedMales.txt Blowers, Nigel Gilbertson, Collin Imhoff, Parker Peru, Colton Shisler, Odell Twine, Leonardo $ cat sortedFemales.txt Aurea, Levin Deshazo, Taryn Duong, Arminda Moorhead, Alyse Murrieta, Caroll Yen, Beata $ sort -m sortedMales.txt sortedFemales.txt Aurea, Levin Blowers, Nigel Deshazo, Taryn Duong, Arminda Gilbertson, Collin Imhoff, Parker Moorhead, Alyse Murrieta, Caroll Peru, Colton Shisler, Odell Twine, Leona
To specify the delimiter, we can use the
-t option, followed by the delimiter wrapped in single quotations.
Furthermore, we can specify the field number with the
-k option, followed by the field (column) index. With just a single integer (ie.
-k2), the sort key will begin at column 2 and extend to the end of the line. However, if we use
-k2,2 it will only sort based on the second column.
If there is a group of parameters that you'd like to sort within a field, you can do so with the decimal point. For example, say you have a date field in the 3rd column formatted by MM-DD-YYYY. It would make sense to order by year first, then month, then day, right?
$ sort -t ',' -k 3.7n -k 3.1n -k 3.4n dates.txt Tel,Aziz,12-31-1989 Ping,Sarah,09-29-1990 Het,Holm,01-01-1992 Hum,Horry,04-23-1995 Ith,Rebecca,06-12-2001
To ignore any leading blank spaces, use the
$ sort -t',' -k2,2 -b sortedMales.txt Moorhead, Alyse Duong, Arminda Murrieta, Caroll Gilbertson, Collin Peru, Colton Twine, Leonardo Aurea, Levin Blowers, Nigel Shisler, Odell Imhoff, Parker Deshazo, Taryn
Now our file is sorted by the second field (first name).
If you try to sort a text full of numbers, the sorted output may not be what you expect. For example, try sorting a list of even numbers from 1 - 10.
$ sort oneToTen.txt 10 2 4 6 8
To get the correct results, we must pass in the
-n option, which sorts by integer value.
$ sort -n oneToTen.txt 2 4 6 8 10
There are plenty more options you can check through the
man page. Here are the most frequently used ones.
fis for "folding" each letter to its corresponding lowercase letter.
-k 2would sort on the second field (aka second column).
-t ';'would separate fields with a semicolon.
After sorting, we can use
uniq to find characteristics of our file! Let's learn about that next.
In this completely revised second edition of the perennial best seller How Linux Works, author Brian Ward makes the concepts behind Linux internals accessible to anyone curious about the inner workings of the operating system. Inside, you'll find the kind of knowledge that normally comes from years of experience doing things the hard way.$ Check price
This foam seat cushion relieves lowerback pain, numbness and pressure sores by promoting healthy weight distribution, posture and spine alignment. Furthermore, it reduces pressure on the tailbone and hip bones while sitting. Perfect for sitting on the computer desk for long periods of time.$ Check price