From time to time, I’m involved into a trolling conversation when any linux kiddie tells me:

Bash is really the superior shell

I totally disagree, but as I’m getting older, I don’t argue anymore.

Anyway, in this post I will expose two arguments, or I should say two reasons, why I usually use ksh93 to run my scripts.

Note I’m really talking about the engine of the script, (the shebang definition). set I’m used to the bourn shell syntax therefore I also exclude any C shell from the comparison. My $SHELL for interactivity is zsh because it’s efficient enough and it has a bunch of really cool features I won’t discuss in this post (maybe later) Read, loops, forks and efficiency… More than 10 years ago, as I was working for a project at IBM, my excellent team leader told me to refer to this book: Unix Power Tools. I did learn a lot with it. And one feature I’ve always used is the while read loop. The use case Let’s take this script as example:  1 2 3 4 5 6 7  $ cat test for i in $(seq 1 500) do echo$i | read a echo -ne "$a\r" done echo "" It simply iterate 500 times and display the counter on the screen. The result of execution Let’s execute it in different shells   1 2 3 4 5 6 7 8 9 10 11  for i in bash zsh ksh do echo "$i =>" eval $i test done bash => zsh => 500 ksh => 500 Bash is the only one which does not display the expected result. The explanation is that the shell sees a pipe and the fork the process. The assignation to the variable a is in another context and therefore, when the father wants to display $a in the current shell, the variable is empty.

Wait, but why does ksh (and zsh) do display the correct result ? Simply because ksh and zsh have noticed that the command after the pipe was a builtin, and therefore that it was un-useful to fork.

Strace to the rescue…

To prove it, let’s check for syscalls with the strace tool, and count how many clones and calls are performed:

  1 2 3 4 5 6 7 8 9 10 11 12 13 14  $for i in bash zsh ksh do echo "$i =>" strace -c $i test 2>&1 | egrep "clone|calls" done bash => % time seconds usecs/call calls errors syscall 56.05 0.067081 67 1001 clone zsh => % time seconds usecs/call calls errors syscall 71.57 0.057681 115 501 clone ksh => % time seconds usecs/call calls errors syscall 68.50 0.042059 84 500 clone quod erat demonstrandum, twice as much clone in bash thant in ksh|zsh. Efficiency Of course this as an impact on performances, because fork are expensive, let’s query the execution time:   1 2 3 4 5 6 7 8 9 10 11 12 13 14  for i in bash zsh ksh do echo "$i =>" eval time $i test done bash => bash test 0,17s user 0,86s system 95% cpu 1,079 total zsh => 500 zsh test 0,08s user 0,46s system 82% cpu 0,648 total ksh => 500 ksh test 0,07s user 0,46s system 65% cpu 0,819 total This sounds clear to me… The KSH93 Getopts unknown feature Another cool feature I’ve discovered recently is the little addon of the getopts feature. I wanted to use the getopts built in in a script. As usual, I did RTFM (because I never know when to use colon etc.). Here is the extract of the man page of ksh93 relative to the getopts function: getopts [ -a name ] optstring vname [ arg ... ] Checks arg for legal options. If arg is omitted, the positional parameters are used. An option argument begins with a + or a -. An option not beginning with + or - or the argument -- ends the options. Options beginning with + are only recognized when optstring begins with a +. optstring contains the letters that getopts recognizes. If a letter is followed by a :, that option is expected to have an argument. The options can be separated from the argument by blanks. The option -? causes getopts to generate a usage message on standard error. The -a argument can be used to specify the name to use for the usage message, which defaults to$0.

getopts places the next option letter it finds inside variable vname each time it is invoked.
The option letter will be prepended with a + when arg begins with a +.
The index of the next arg is stored in OPTIND.
The option argument, if any, gets stored in OPTARG.

A leading : in optstring causes getopts to store the letter of an invalid option in OPTARG,
and to set vname to ? for an unknown option and to : when a required option argument is missing.
Otherwise, getopts prints an error message.
The exit status is non-zero when there are no more options.

There is no way to specify any of the options :, +, -, ?, [, and ].

The option # can only be specified as the first option.


This particular sentence, in the middle of the documentation peaked my interest

The option -? causes getopts to generate a usage message on standard error.

What? We can generate usage with getopts?

Cool, any script should be documented, but any documentation should not be difficult to implement.

https://xkcd.com/1343/

I did googled and found this web page which is an extract from this book Learning the Korn Shell

An example is sometimes better than an explanation (and the book is complete on this subject)

The example

The script

I’m gonna try with a little help (from my friends)

 1 2 3 4 5 6 7 8 9  \$ ./blog.ksh --help Usage: ./blog.ksh [ options ] OPTIONS -u, --user=user to run the command as Use the name of the user you want to sudo to: The default value is owulveryck. -e, --env=environnement environnement to use (eg: dev, prod) The default value is dev. -p, --path=Execution PATH prefix of the chroot The default value is /tmp.

And let’s try with an invalid option…

 1 2 3   ./blog.ksh -t ./blog.ksh: -t: unknown option Usage: ./blog.ksh [-u user to run the command as] [-e environnement] [-p Execution PATH]

Conclusion

By now, KSH93 remains my favorite engine for shell scripts, but is sometimes replaced by ZSH.

Actually, ZSH seems as “smart” and efficient, but this getopts feature is really nice for any script aim to be distributed widely.