Print all words on lines of a file in reverse order

All examples presented below work for general case where there's an arbitrary number of words on the line. The essential idea is the same everywhere - we have to read the file line by line and print the words in reverse. AWK facilitates this the best because it already has all the necessary tools for text processing done programmatically, and is most portable - it can be used with any awk derivative, and most systems have it. Python also has quite a few good utilities for string processing that allow us to do the job. It's a tool for more modern systems, I'd say. Bash, IMHO, is the least desirable approach, due to portability, potential hazards, and the amount of "trickery" that needs to be done.

AWK

$ awk '{for(i=NF;i>=1;i--) printf "%s ", $i;print ""}' input.txt                                                  
Earth Hello 
Mars Hello

The way this works is fairly simple: we're looping backwards through each word on the line, printing words separated with space - that's done by printf "%s ",$i function (for printing formatted strings) and for-loop. NF variable corresponds to number of fields. The default field separator is assumed to be space. We start by setting a throw-away variable i to the number of words, and on each iteration, decrement the variable. Thus, if there's 3 words on line, we print field $3, then $2, and $1. After the last pass, variable i becomes 0, the condition i>=1 becomes false, and the loop terminates. To prevent lines being spliced together, we insert a newline using print "". AWK code blocks {} are processed for each line in this case (if there's a matching condition in front of code block, it depends on the match for the code block to be executed or not).

Python

For those who like alternative solutions, here's python:

$ python -c "import sys;print '\n'.join([ ' '.join(line.split()[::-1]) for line in sys.stdin ])"  < input.txt     
Earth Hello
Mars Hello

The idea here is slightly different. < operator tells your current shell to redirect input.txt into python's stdin stream, and we read that line by line. Here we use list comprehension to create a list of lines - that's what the [ ' '.join(line.split()[::-1]) for line in sys.stdin ] part does. The part ' '.join(line.split()[::-1]) takes a line, splits it into list of words, reverses the list via [::-1] , and then ' '.join() creates a space-separated string out of it. We have as a result a list of larger strings. Finally, '\n'.join() makes an even larger string, with each item joined via newline.

In short, this method is basically a "break and rebuild" approach.

BASH

#!/bin/bash

while IFS= read -r line
do
     bash -c 'i=$#; while [ $i -gt 0 ];do printf "%s " ${!i}; i=$(($i-1)); done' sh $line   
     echo 
done < input.txt

And a test run:

$ ./reverse_words.sh                                                                                              
Earth Hello 
Mars Hello

Bash itself doesn't have strong text processing capabilities. What happens here is that we read the file line by line via

while IFS= read -r line
do
   # some code
done < text.txt

This is a frequent technique and is widely used in shell scripting to read output of a command or a text file line-by-line. Each line is stored into $line variable.

On the inside we have

bash -c 'i=$#; while [ $i -gt 0 ];do printf "%s " ${!i}; i=$(($i-1)); done' sh $line

Here we use bash with -c flag to run a set of commands enclosed into single-quotes. When -c is used, bash will start assigning command-line arguments into variables starting with $0. Because that $0 is traditionally used to signify a program's name, I use sh dummy variable first.

The unquoted $line will be broken down into individual items due to the behavior known as word-splitting. Word splitting is often undesirable in shell scripting, and you will often hear people say "always quote your variables, like "$foo"." In this case, however, word-splitting is desirable for processing simple text. If your text contains something like $var, it might break this approach. For this, and several other reasons, I'd say python and awk approach are better.

As for the inner code, it's also simple: the unquoted $line is split into words and is passed to the inner code for processing. We take the number of arguments $#, store it into the throw away variable i, and again - print out each item using something known as variable indirection - that's the ${!i} part (note that this is bashism - it's not available in other shells). And again, we use printf "%s " to print out each word, space-separated. Once that's done, echo will append a newline.

Essentially this approach is a mix of both awk and python. We read the file line by line, but divide and conquer each line, using several of bash's features to do the job.

A simpler variation can be done with the GNU tac command, and again playing with word splitting. tac is used to reverse lines of input stream or file, but in this case we specify -s " " to use space as separator. Thus, var will contain a newline-separated list of words in reverse order, but due to $var not being quoted, newline will be substituted with space. Trickery, and again not the most reliable, but works.

#!/bin/bash

while IFS= read -r line
do
     var=$(tac -s " " <<< "$line" )
     echo  $var
done < input.txt

Test runs:

And here's the 3 methods with arbitrary lines of input

$ cat input.txt                                                                                                   
Hello Earth end of line
Hello Mars  another end of line
abra cadabra magic
$ ./reverse_words.sh                                                                                              
line of end Earth Hello 
line of end another Mars Hello 
magic cadabra abra 
$ python -c "import sys;print '\n'.join([ ' '.join(line.split()[::-1]) for line in sys.stdin ])"  < input.txt  
line of end Earth Hello
line of end another Mars Hello
magic cadabra abra
$ awk '{for(i=NF;i>=1;i--) printf "%s ", $i;print ""}' input.txt
line of end Earth Hello 
line of end another Mars Hello 
magic cadabra abra

Extra: perl and ruby

Same idea as with python - we split each line into array of words, reverse the array, and print it out.

$ perl -lane '@r=reverse(@F); print "@r"' input.txt                           
line of end Earth Hello
line of end another Mars Hello
magic cadabra abra


$ ruby -ne 'puts $_.chomp.split().reverse.join(" ")' < input.txt                                                  
line of end Earth Hello
line of end another Mars Hello
magic cadabra abra

Just swap the words around, with awk:

awk '{print $2, $1}'

Example:

% cat bar.txt
Hello Earth
Hello Mars

% awk '{print $2, $1}' bar.txt
Earth Hello
Mars Hello

The obligatory `sed` solution

The following GNU sed program uses a loop to move each word (starting from the first) at the end of the line. More details are inserted in the code as comments.

sed -r '
    # Mark the current end of the line by appending a LF character ("\n")
    G

    # Main loop: move the first word of the line just after the LF
    # and repeat until the LF is at the beginning of the line
    :loop
    s/([^[:space:]]+)(.*\n)/\2\1 /
    t loop

    # Remove remaining spaces up to the LF and the superfluous trailing space
    s/.*\n| $//g
'

Write-only version:

sed -r 'G; :loop; s/(\S+)(.*\n)/\2\1 /; t loop; s/.*\n| $//g'

Test:

$ sed -r '...' <<< "The quick
brown fox jumps

over
the lazy dog"

... yields:

quick The 
jumps fox brown 

over 
dog lazy the

Portably (POSIXly):

sed '
  G
  :loop
     s/\([^[:space:]]\{1,\}\)\(.*\n\)/\2\1 /
  t loop
  s/ $//
  s/.*\n//'

Print all words on lines of a file in reverse order

AWK

Python

BASH

Test runs:

Extra: perl and ruby

The obligatory `sed` solution

Tags:

Bash

Text Processing

Array

Related

Recent Posts

Print all words on lines of a file in reverse order

AWK

Python

BASH

Test runs:

Extra: perl and ruby

The obligatory sed solution

Tags:

Bash

Text Processing

Array

Related

The obligatory `sed` solution