.. _sect-readline: ********************************** Example: Minimal Readline Bindings ********************************** In this section, we'll see how to create bindings for a C library (the `GNU Readline `_ library) in Idris, and make them available in a package. We'll only create the most minimal bindings, but nevertheless they demonstrate some of the trickier problems in creating bindings to a C library, in that they need to handle memory allocation of ``String``. You can find the example in full in the Idris 2 source repository, in `samples/FFI-readline `_. As a minimal example, this can be used as a starting point for other C library bindings. We are going to provide bindings to the following functions in the Readline API, available via ``#include ``: :: char* readline (const char *prompt); void add_history(const char *string); Additionally, we are going to support tab completion, which in the Readline API is achieved by setting a global variable to a callback function (see Section :ref:`sect-callbacks`) which explains how to handle the completion: :: typedef char *rl_compentry_func_t (const char *, int); rl_compentry_func_t * rl_completion_entry_function; A completion function takes a ``String``, which is the text to complete, and an ``Int``, which is the number of times it has asked for a completion so far. In Idris, this could be a function ``complete : String -> Int -> IO String``. So, for example, if the text so far is ``"id"``, and the possible completions are ``idiomatic`` and ``idris``, then ``complete "id" 0`` would produce the string ``"idiomatic"`` and ``complete "id" 1`` would produce ``"idris"``. We will define *glue* functions in a C file ``idris_readline.c``, which compiles to a shared object ``libidrisreadline``, so we write a function for locating the C functions: .. code-block:: idris rlib : String -> String rlib fn = "C:" ++ fn ++ ",libidrisreadline" Each of the foreign bindings will have a ``%foreign`` specifier which locates functions via ``rlib``. Basic behaviour: Reading input, and history ------------------------------------------- We can start by writing a binding for ``readline`` directly. It's interactive, so needs to return a ``PrimIO``: .. code-block:: idris %foreign (rlib "readline") prim_readline : String -> PrimIO String Then, we can write an ``IO`` wrapper: .. code-block:: idris readline : String -> IO String readline prompt = primIO $ readline prompt Unfortunately, this isn't quite good enough! The C ``readline`` function returns a ``NULL`` string if there is no input due to encountering an end of file. So, we need to handle that - if we don't, we'll get a crash on encountering end of file (remember: it's the Idris programmer's responsibility to give an appropriate type to the C binding!) Instead, we need to use a ``Ptr`` to say that it might be a ``NULL`` pointer (see Section :ref:`sect-ffi-string`): .. code-block:: idris %foreign (rlib "readline") prim_readline : String -> PrimIO (Ptr String) We also need to provide a way to check whether the returned ``Ptr String`` is ``NULL``. To do so, we'll write some glue code to convert back and forth between ``Ptr String`` and ``String``, in a file ``idris_readline.c`` and a corresponding header ``idris_readline.h``. In ``idris_readline.h`` we have: :: int isNullString(void* str); // return 0 if a string in NULL, non zero otherwise char* getString(void* str); // turn a non-NULL Ptr String into a String (assuming not NULL) void* mkString(char* str); // turn a String into a Ptr String void* nullString(); // create a new NULL String Correspondingly, in ``idris_readline.c``: :: int isNullString(void* str) { return str == NULL; } char* getString(void* str) { return (char*)str; } void* mkString(char* str) { return (void*)str; } void* nullString() { return NULL; } Now, we can use ``prim_readline`` as follows, with a safe API, checking whether the result it returns is ``NULL`` or a concrete ``String``: .. code-block:: idris %foreign (rlib "isNullString") prim_isNullString : Ptr String -> Int export isNullString : Ptr String -> Bool isNullString str = if prim_isNullString str == 0 then False else True export readline : String -> IO (Maybe String) readline s = do mstr <- primIO $ prim_readline s if isNullString mstr then pure $ Nothing else pure $ Just (getString mstr) We'll need ``nullString`` and ``mkString`` later, for dealing with completions. Once we've read a string, we'll want to add it to the input history. We can provide a binding to ``add_history`` as follows: .. code-block:: idris %foreign (rlib "add_history") prim_add_history : String -> PrimIO () export addHistory : String -> IO () addHistory s = primIO $ prim_add_history s In this case, since Idris is in control of the ``String``, we know it's not going to be ``NULL``, so we can add it directly. A small ``readline`` program that reads input, and echoes it, recording input history for non-empty inputs, can be written as follows: .. code-block:: idris echoLoop : IO () echoLoop = do Just x <- readline "> " | Nothing => putStrLn "EOF" putStrLn ("Read: " ++ x) when (x /= "") $ addHistory x if x /= "quit" then echoLoop else putStrLn "Done" This gives us command history, and command line editing, but Readline becomes much more useful when we add tab completion. The default tab completion, which is available even in the small example above, is to tab complete file names in the current working directory. But for any realistic application, we probably want to tab complete other commands, such as function names, references to local data, or anything that is appropriate for the application. Completions ----------- Readline has a large API, with several ways of supporting tab completion, typically involving setting a global variable to an appropriate completion function. We'll use the following: :: typedef char *rl_compentry_func_t (const char *, int); rl_compentry_func_t * rl_completion_entry_function; The completion function takes the prefix of the completion, and the number of times it has been called so far on this prefix, and returns the next completion, or ``NULL`` if there are no more completions. An Idris equivalent would therefore have the following type: .. code-block:: idris setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO () The function returns ``Nothing`` if there are no more completions, or ``Just str`` for some ``str`` if there is another one for the current input. We might hope that it's a matter of defining a function to assign the completion function... :: void idrisrl_setCompletion(rl_compentry_func_t* fn) { rl_completion_entry_function = fn; } ...then defining the Idris binding, which needs to take into account that the Readline library expects ``NULL`` when there are no more completions: .. code-block:: idris %foreign (rlib "idrisrl_setCompletion") prim_setCompletion : (String -> Int -> PrimIO (Ptr String)) -> PrimIO () export setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO () setCompletionFn fn = primIO $ prim_setCompletion $ \s, i => toPrim $ do mstr <- fn s i case mstr of Nothing => pure nullString // need to return a Ptr String to readline! Just str => pure (mkString str) So, we turn ``Nothing`` into ``nullString`` and ``Just str`` into ``mkString str``. Unfortunately, this doesn't quite work. To see what goes wrong, let's try it for the most basic completion function that returns one completion no matter what the input: .. code-block:: idris testComplete : String -> Int -> IO (Maybe String) testComplete text 0 = pure $ Just "hamster" testComplete text st = pure Nothing We'll try this in a small modification of ``echoLoop`` above, setting a completion function first: .. code-block :: idris main : IO () main = do setCompletionFn testComplete echoLoop We see that there is a problem when we try running it, and hitting TAB before entering anything: :: Main> :exec main > free(): invalid pointer The Idris code which sets up the completion is fine, but there is a problem with the memory allocation in the C glue code. This problem arises because we haven't thought carefully enough about which parts of our program are responsible for allocating and freeing strings. When Idris calls a foreign function that returns a string, it copies the string to the Idris heap and frees it immediately. But, if the foreign library also frees the string, it ends up being freed twice. This is what's happening here: the callback passed to ``prim_setCompletion`` frees the string and puts it onto the Idris heap, but Readline also frees the string returned by ``prim_setCompletion`` once it has processed it. We can solve this problem by writing a wrapper for the completion function which reallocates the string, and using that in ``idrisrl_setCompletion`` instead. :: rl_compentry_func_t* my_compentry; char* compentry_wrapper(const char* text, int i) { char* res = my_compentry(text, i); // my_compentry is an Idris function, so res is on the Idris heap, // and freed on return if (res != NULL) { char* comp = malloc(strlen(res)+1); // comp is passed back to readline, which frees it when // it is finished with it strcpy(comp, res); return comp; } else { return NULL; } } void idrisrl_setCompletion(rl_compentry_func_t* fn) { rl_completion_entry_function = compentry_wrapper; my_compentry = fn; // fn is an Idris function, called by compentry_wrapper } So, we define the completion function in C, which calls the Idris completion function then makes sure the string returned by the Idris function is copied to the C heap. We now have a primitive API that covers the most fundamental features of the readline API: .. code-block:: idris readline : String -> IO (Maybe String) addHistory : String -> IO () setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO ()