mirror of https://github.com/gelisam/hawk.git synced 2024-12-11 09:54:51 +03:00

Haskell text processor for the command-line

Samuel Gélineau 2a1308b5ad fix spurious extra row: lines are terminated by "\n", not separated by "\n", so treat the "\n" separator as a special case.	2014-02-28 21:12:06 -05:00
doc	fix doc and tests: ByteString must be qualified.	2014-02-09 01:04:41 -05:00
runtime/System/Console/Hawk	fix spurious extra row	2014-02-28 21:12:06 -05:00
src	split off runtime, fix crazy hClose bug	2014-02-23 11:18:17 -05:00
tests	tests were depending on cache/{modules,extentions}	2014-02-09 18:17:52 -05:00
.gitignore	ignore vim files	2013-12-21 18:34:40 +01:00
haskell-awk.cabal	split off runtime, fix crazy hClose bug	2014-02-23 11:18:17 -05:00
LICENSE	change to Apache v2 license	2013-09-04 21:54:04 +02:00
NOTICE	change to Apache v2 license	2013-09-04 21:54:04 +02:00
README.md	mention that the package name is not hawk	2014-02-01 22:57:12 -05:00
Setup.hs	add license headers	2013-09-04 21:58:51 +02:00

Hawk

Transform text from the command-line using Haskell expressions. Similar to awk, but using Haskell as the text-processing language.

Examples

On Unix systems, the file /etc/passwd keeps track of every registered user. Each entry describes a single user in a simple colon-separated format. For example:

root:x:0:0:root:/root:/bin/bash

The first field is the username. We can use Hawk to list all usernames as follows:

> cat /etc/passwd | hawk -d: -m 'head'
root

The -d option tells Hawk to use : as the word delimiter, causing the first line to be interpreted as ["root", "x", "0", "0", "root", "/root", "/bin/bash"]. The -m option tells Hawk to map a function over each line of the input. In this case, the function head extracts the first word of the line, which happens to be the username.
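To make the splitting and mapping concrete, here is a rough model in plain Haskell of what the command above computes. It is only a sketch: the names splitOn and mapFields are hypothetical and are not part of Hawk, and it assumes plain String input, which may differ from the types Hawk uses internally.

```haskell
-- Split a string on a single-character delimiter, keeping empty fields.
-- (Hypothetical helper, not Hawk's own implementation.)
splitOn :: Char -> String -> [String]
splitOn d s = case break (== d) s of
  (field, [])       -> [field]
  (field, _ : rest) -> field : splitOn d rest

-- Rough model of `hawk -d<delim> -m f`: split every input line into
-- words, apply f to each line's word list, and print one result per line.
mapFields :: Char -> ([String] -> String) -> String -> String
mapFields d f = unlines . map (f . splitOn d) . lines
```

Loaded into GHCi, `mapFields ':' head` applied to the sample entry followed by a newline yields the username followed by a newline.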

We could of course have achieved identical results by using awk instead of Hawk:

> cat /etc/passwd | awk -F: '{print $1}'
root

While Hawk and awk have similar use cases, the philosophy behind the two is very different. Awk uses a specialized language designed to concisely express many text transformations, while Hawk uses the general-purpose language Haskell, which is also known for its concision. Many standard command-line tools can be approximated with short Haskell expressions.
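As an illustration of that last point, the following sketch approximates a few familiar tools as functions over a list of input lines. The function names are made up for this example, and the definitions are plain Haskell rather than Hawk invocations; they only suggest the kind of expression one might write.

```haskell
import Data.List (group, isInfixOf, sort)

-- head -n N: keep the first n lines.
headN :: Int -> [String] -> [String]
headN = take

-- tac: print the lines in reverse order.
tac :: [String] -> [String]
tac = reverse

-- grep -F PATTERN: keep the lines containing a fixed substring.
grepFixed :: String -> [String] -> [String]
grepFixed pat = filter (pat `isInfixOf`)

-- wc -l: count the lines.
wcLines :: [String] -> Int
wcLines = length

-- sort | uniq: sort the lines, then keep one copy of each distinct line.
sortUniq :: [String] -> [String]
sortUniq = concatMap (take 1) . group . sort
```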

Another important difference is that while awk one-liners are self-contained, Hawk encourages the use of libraries and user-defined functions. By adding function definitions, module imports, and language pragmas to Hawk's user-configurable prelude file, those functions, libraries, and language extensions become available to Hawk one-liners. For instance, we could add a takeLast function that extracts the last n elements of a list, and use it to (inefficiently) approximate tail:

> echo 'takeLast n = reverse . take n . reverse' >> ~/.hawk/prelude.hs
> seq 0 100 | hawk -a 'takeLast 3'
98
99
100
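The same definition can be tried outside Hawk. The sketch below repeats takeLast with an explicit type signature and adds a variant, takeLast', that avoids reversing the list twice; takeLast' is an illustrative alternative and does not come from the Hawk documentation.

```haskell
-- The prelude definition from above, with an explicit type signature.
takeLast :: Int -> [a] -> [a]
takeLast n = reverse . take n . reverse

-- Illustrative alternative: drop everything except the last n elements.
-- When n exceeds the list length, the drop count is negative and the
-- whole list is returned, matching takeLast.
takeLast' :: Int -> [a] -> [a]
takeLast' n xs = drop (length xs - n) xs
```

In GHCi, `takeLast 3 (map show [0 .. 100 :: Int])` evaluates to `["98","99","100"]`, the same three lines printed by the command above.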

For more details, see the documentation.

Installation

To install the stable version, run cabal install haskell-awk (not cabal install hawk, which is an unrelated package) and add ~/.cabal/bin (or your sandbox's bin folder) to your PATH. You should then be ready to use Hawk:

> hawk '[1..3]'
1
2
3

To install the development version, clone this repository and use cabal install or cabal-dev install to compile Hawk and its dependencies. Cabal installs the binary to ~/.cabal/bin/hawk, while cabal-dev installs it to ./cabal-dev/bin/hawk. The first run creates a default configuration in ~/.hawk/prelude.hs if that file doesn't already exist.