2020-05-20 13:23:04 +03:00
|
|
|
.. _sect-readline:
|
|
|
|
|
|
|
|
**********************************
|
|
|
|
Example: Minimal Readline Bindings
|
|
|
|
**********************************
|
|
|
|
|
|
|
|
In this section, we'll see how to create bindings for a C library (the `GNU
|
|
|
|
Readline <https://tiswww.case.edu/php/chet/readline/rltop.html>`_ library) in
|
|
|
|
Idris, and make them available in a package. We'll only create the most minimal
|
|
|
|
bindings, but nevertheless they demonstrate some of the trickier problems in
|
|
|
|
creating bindings to a C library, in that they need to handle memory allocation
|
|
|
|
of ``String``.
|
|
|
|
|
2020-07-21 14:30:33 +03:00
|
|
|
You can find the example in full in the Idris 2 source repository,
|
2020-05-20 13:23:04 +03:00
|
|
|
in `samples/FFI-readline
|
|
|
|
<https://github.com/edwinb/Idris2/tree/master/samples/FFI-readline>`_. As a
|
|
|
|
minimal example, this can be used as a starting point for other C library
|
|
|
|
bindings.
|
|
|
|
|
|
|
|
We are going to provide bindings to the following functions in the Readline
|
|
|
|
API, available via ``#include <readline/readline.h>``:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
char* readline (const char *prompt);
|
|
|
|
void add_history(const char *string);
|
|
|
|
|
|
|
|
Additionally, we are going to support tab completion, which in the Readline
|
|
|
|
API is achieved by setting a global variable to a callback function
|
|
|
|
(see Section :ref:`sect-callbacks`) which explains how to handle the
|
|
|
|
completion:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
typedef char *rl_compentry_func_t (const char *, int);
|
|
|
|
rl_compentry_func_t * rl_completion_entry_function;
|
|
|
|
|
|
|
|
A completion function takes a ``String``, which is the text to complete, and
|
|
|
|
an ``Int``, which is the number of times it has asked for a completion so far.
|
|
|
|
In Idris, this could be a function ``complete : String -> Int -> IO String``.
|
|
|
|
So, for example, if the text so far is ``"id"``, and the possible completions
|
|
|
|
are ``idiomatic`` and ``idris``, then ``complete "id" 0`` would produce the
|
|
|
|
string ``"idiomatic"`` and ``complete "id" 1`` would produce ``"idris"``.
|
|
|
|
|
|
|
|
We will define *glue* functions in a C file ``idris_readline.c``, which compiles
|
|
|
|
to a shared object ``libidrisreadline``, so we write a function for locating
|
|
|
|
the C functions:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
rlib : String -> String
|
|
|
|
rlib fn = "C:" ++ fn ++ ",libidrisreadline"
|
|
|
|
|
|
|
|
Each of the foreign bindings will have a ``%foreign`` specifier which locates
|
|
|
|
functions via ``rlib``.
|
|
|
|
|
|
|
|
Basic behaviour: Reading input, and history
|
|
|
|
-------------------------------------------
|
|
|
|
|
|
|
|
We can start by writing a binding for ``readline`` directly. It's interactive,
|
|
|
|
so needs to return a ``PrimIO``:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
%foreign (rlib "readline")
|
2020-07-21 14:30:33 +03:00
|
|
|
prim__readline : String -> PrimIO String
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
Then, we can write an ``IO`` wrapper:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
readline : String -> IO String
|
|
|
|
readline prompt = primIO $ readline prompt
|
|
|
|
|
|
|
|
Unfortunately, this isn't quite good enough! The C ``readline`` function
|
|
|
|
returns a ``NULL`` string if there is no input due to encountering an
|
|
|
|
end of file. So, we need to handle that - if we don't, we'll get a crash
|
|
|
|
on encountering end of file (remember: it's the Idris programmer's responsibility
|
|
|
|
to give an appropriate type to the C binding!)
|
|
|
|
|
|
|
|
Instead, we need to use a ``Ptr`` to say that it might be a ``NULL``
|
|
|
|
pointer (see Section :ref:`sect-ffi-string`):
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
%foreign (rlib "readline")
|
2020-07-21 14:30:33 +03:00
|
|
|
prim__readline : String -> PrimIO (Ptr String)
|
2020-05-20 13:23:04 +03:00
|
|
|
|
2020-07-21 14:30:33 +03:00
|
|
|
We also need to provide a way to check whether the returned ``Ptr String`` is
|
2020-05-20 13:23:04 +03:00
|
|
|
``NULL``. To do so, we'll write some glue code to convert back and forth
|
|
|
|
between ``Ptr String`` and ``String``, in a file ``idris_readline.c`` and a
|
|
|
|
corresponding header ``idris_readline.h``. In ``idris_readline.h`` we have:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
int isNullString(void* str); // return 0 if a string in NULL, non zero otherwise
|
|
|
|
char* getString(void* str); // turn a non-NULL Ptr String into a String (assuming not NULL)
|
|
|
|
void* mkString(char* str); // turn a String into a Ptr String
|
|
|
|
void* nullString(); // create a new NULL String
|
|
|
|
|
|
|
|
Correspondingly, in ``idris_readline.c``:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
int isNullString(void* str) {
|
|
|
|
return str == NULL;
|
|
|
|
}
|
|
|
|
|
|
|
|
char* getString(void* str) {
|
|
|
|
return (char*)str;
|
|
|
|
}
|
|
|
|
|
|
|
|
void* mkString(char* str) {
|
|
|
|
return (void*)str;
|
|
|
|
}
|
|
|
|
|
|
|
|
void* nullString() {
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
|
2020-07-21 14:30:33 +03:00
|
|
|
Now, we can use ``prim__readline`` as follows, with a safe API, checking
|
2020-05-20 13:23:04 +03:00
|
|
|
whether the result it returns is ``NULL`` or a concrete ``String``:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
%foreign (rlib "isNullString")
|
2020-07-21 14:30:33 +03:00
|
|
|
prim__isNullString : Ptr String -> Int
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
export
|
|
|
|
isNullString : Ptr String -> Bool
|
2020-07-21 14:30:33 +03:00
|
|
|
isNullString str = if prim__isNullString str == 0 then False else True
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
export
|
|
|
|
readline : String -> IO (Maybe String)
|
|
|
|
readline s
|
2020-07-21 14:30:33 +03:00
|
|
|
= do mstr <- primIO $ prim__readline s
|
2020-05-20 13:23:04 +03:00
|
|
|
if isNullString mstr
|
|
|
|
then pure $ Nothing
|
|
|
|
else pure $ Just (getString mstr)
|
|
|
|
|
|
|
|
We'll need ``nullString`` and ``mkString`` later, for dealing with completions.
|
|
|
|
|
|
|
|
Once we've read a string, we'll want to add it to the input history. We can
|
|
|
|
provide a binding to ``add_history`` as follows:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
%foreign (rlib "add_history")
|
2020-07-21 14:30:33 +03:00
|
|
|
prim__add_history : String -> PrimIO ()
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
export
|
|
|
|
addHistory : String -> IO ()
|
2020-07-21 14:30:33 +03:00
|
|
|
addHistory s = primIO $ prim__add_history s
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
In this case, since Idris is in control of the ``String``, we know it's not
|
|
|
|
going to be ``NULL``, so we can add it directly.
|
|
|
|
|
|
|
|
A small ``readline`` program that reads input, and echoes it, recording input
|
|
|
|
history for non-empty inputs, can be written as follows:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
echoLoop : IO ()
|
2020-07-21 14:30:33 +03:00
|
|
|
echoLoop
|
2020-05-20 13:23:04 +03:00
|
|
|
= do Just x <- readline "> "
|
|
|
|
| Nothing => putStrLn "EOF"
|
|
|
|
putStrLn ("Read: " ++ x)
|
|
|
|
when (x /= "") $ addHistory x
|
|
|
|
if x /= "quit"
|
|
|
|
then echoLoop
|
|
|
|
else putStrLn "Done"
|
|
|
|
|
|
|
|
This gives us command history, and command line editing, but Readline becomes
|
|
|
|
much more useful when we add tab completion. The default tab completion, which
|
|
|
|
is available even in the small example above, is to tab complete file names
|
|
|
|
in the current working directory. But for any realistic application, we probably
|
|
|
|
want to tab complete other commands, such as function names, references to
|
|
|
|
local data, or anything that is appropriate for the application.
|
|
|
|
|
|
|
|
Completions
|
|
|
|
-----------
|
|
|
|
|
|
|
|
Readline has a large API, with several ways of supporting tab completion,
|
|
|
|
typically involving setting a global variable to an appropriate completion
|
|
|
|
function. We'll use the following:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
typedef char *rl_compentry_func_t (const char *, int);
|
|
|
|
rl_compentry_func_t * rl_completion_entry_function;
|
|
|
|
|
|
|
|
The completion function takes the prefix of the completion, and the number
|
|
|
|
of times it has been called so far on this prefix, and returns the next
|
|
|
|
completion, or ``NULL`` if there are no more completions. An Idris equivalent
|
|
|
|
would therefore have the following type:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO ()
|
|
|
|
|
|
|
|
The function returns ``Nothing`` if there are no more completions, or
|
|
|
|
``Just str`` for some ``str`` if there is another one for the current
|
|
|
|
input.
|
|
|
|
|
|
|
|
We might hope that it's a matter of defining a function to assign the
|
|
|
|
completion function...
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
void idrisrl_setCompletion(rl_compentry_func_t* fn) {
|
|
|
|
rl_completion_entry_function = fn;
|
|
|
|
}
|
|
|
|
|
|
|
|
...then defining the Idris binding, which needs to take into account that
|
|
|
|
the Readline library expects ``NULL`` when there are no more completions:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
%foreign (rlib "idrisrl_setCompletion")
|
2020-07-21 14:30:33 +03:00
|
|
|
prim__setCompletion : (String -> Int -> PrimIO (Ptr String)) -> PrimIO ()
|
2020-05-20 13:23:04 +03:00
|
|
|
|
|
|
|
export
|
|
|
|
setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO ()
|
|
|
|
setCompletionFn fn
|
2020-07-21 14:30:33 +03:00
|
|
|
= primIO $ prim__setCompletion $ \s, i => toPrim $
|
2020-05-20 13:23:04 +03:00
|
|
|
do mstr <- fn s i
|
|
|
|
case mstr of
|
|
|
|
Nothing => pure nullString // need to return a Ptr String to readline!
|
|
|
|
Just str => pure (mkString str)
|
|
|
|
|
|
|
|
So, we turn ``Nothing`` into ``nullString`` and ``Just str`` into ``mkString
|
|
|
|
str``. Unfortunately, this doesn't quite work. To see what goes wrong, let's
|
|
|
|
try it for the most basic completion function that returns one completion no
|
|
|
|
matter what the input:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
testComplete : String -> Int -> IO (Maybe String)
|
|
|
|
testComplete text 0 = pure $ Just "hamster"
|
|
|
|
testComplete text st = pure Nothing
|
|
|
|
|
|
|
|
We'll try this in a small modification of ``echoLoop`` above, setting a
|
|
|
|
completion function first:
|
|
|
|
|
|
|
|
.. code-block :: idris
|
|
|
|
|
|
|
|
main : IO ()
|
|
|
|
main = do setCompletionFn testComplete
|
|
|
|
echoLoop
|
|
|
|
|
|
|
|
We see that there is a problem when we try running it, and hitting TAB before
|
|
|
|
entering anything:
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
Main> :exec main
|
|
|
|
> free(): invalid pointer
|
|
|
|
|
|
|
|
The Idris code which sets up the completion is fine, but there is a problem
|
|
|
|
with the memory allocation in the C glue code.
|
|
|
|
|
|
|
|
This problem arises because we haven't thought carefully enough about which
|
|
|
|
parts of our program are responsible for allocating and freeing strings.
|
|
|
|
When Idris calls a foreign function that returns a string, it copies the
|
|
|
|
string to the Idris heap and frees it immediately. But, if the foreign
|
|
|
|
library also frees the string, it ends up being freed twice. This is what's
|
2020-07-21 14:30:33 +03:00
|
|
|
happening here: the callback passed to ``prim__setCompletion`` frees the string
|
2020-05-20 13:23:04 +03:00
|
|
|
and puts it onto the Idris heap, but Readline also frees the string returned
|
2020-07-21 14:30:33 +03:00
|
|
|
by ``prim__setCompletion`` once it has processed it. We can solve this
|
2020-05-20 13:23:04 +03:00
|
|
|
problem by writing a wrapper for the completion function which reallocates
|
|
|
|
the string, and using that in ``idrisrl_setCompletion`` instead.
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
rl_compentry_func_t* my_compentry;
|
|
|
|
|
|
|
|
char* compentry_wrapper(const char* text, int i) {
|
|
|
|
char* res = my_compentry(text, i); // my_compentry is an Idris function, so res is on the Idris heap,
|
|
|
|
// and freed on return
|
|
|
|
if (res != NULL) {
|
|
|
|
char* comp = malloc(strlen(res)+1); // comp is passed back to readline, which frees it when
|
|
|
|
// it is finished with it
|
|
|
|
strcpy(comp, res);
|
|
|
|
return comp;
|
|
|
|
}
|
|
|
|
else {
|
|
|
|
return NULL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
void idrisrl_setCompletion(rl_compentry_func_t* fn) {
|
|
|
|
rl_completion_entry_function = compentry_wrapper;
|
|
|
|
my_compentry = fn; // fn is an Idris function, called by compentry_wrapper
|
|
|
|
}
|
|
|
|
|
|
|
|
So, we define the completion function in C, which calls the Idris completion
|
|
|
|
function then makes sure the string returned by the Idris function is copied
|
|
|
|
to the C heap.
|
|
|
|
|
|
|
|
We now have a primitive API that covers the most fundamental features of the
|
|
|
|
readline API:
|
|
|
|
|
|
|
|
.. code-block:: idris
|
|
|
|
|
|
|
|
readline : String -> IO (Maybe String)
|
|
|
|
addHistory : String -> IO ()
|
|
|
|
setCompletionFn : (String -> Int -> IO (Maybe String)) -> IO ()
|