streamly/dev/container-api.md

# Data Containers

Containers are persistent containers of data e.g. files, arrays, maps. For
consistency we use similar API for all such containers where possible. Usually
the API names are relative to the current module e.g. toBytes in an array
module means converting the array to bytes, it sounds intuitive if read
qualified with the module name e.g. Array.toBytes. This doc lists some
conventions and guidelines to be followed.

## Bytes and Chunks

APIs that deal with streams of Word8 may have "bytes" in their names
while the ones dealing with "Array Word8" have "chunks" in their
names. Usually the default is bytes when there is no explicit mention in
the name.

## Unfolds and Folds

* Unfolds are named as "read" or with a "read" prefix (e.g. readChunks).
* Folds are named as "write" or with a "write" prefix (e.g. writeChunks).

## To and from Stream

### From and Put

* Immutable construction from some external source is named with a "from"
  prefix (e.g.  fromList).
* Mutation of an existing container uses a "put" prefix instead of "from". When
  an API uses an existing container to write to and does not return a newly
  constructed container then use "put".
* "from" vs "put": "from" assumes creation of a new object, it may fail if the
  object being created already exists (e.g. the file exists), it may not take a
  lock as it assumes immutability. "put" may create a new object or overwrite
  an existing one, it may take a lock for writing as it assumes mutability.

### To and Get

* Converting the complete object to an external representation is prefixed with
  "to" (e.g. toBytes).
* For mutable objects "get" APIs may be used instead of "to" APIs.
* "to" vs "get": "to" assumes immutable object so does not have to take a lock.
  "get" assumes mutable object so may take a lock.

### Append

* Use "append" prefix for appending data at the end of a mutable container

## With additional config

Sometimes we need to modify the behavior of a combinator using some additional
config. For example, combinators to read/write using a specified size of
buffer. For such cases we apply the "With" suffix to standard combinator names:

* readWith
* readChunksWith
* toBytesWith
* getBytesWith

## Random Access (Arrays)

### Single elements

* getIndex (for arrays)
* putIndex (for mutable arrays)

### Ranges

* ...FromTo (e.g. readFromTo, readChunksFromTo, toBytesFromTo, getBytesFromTo)
* ...FromThenTo
* ...Indices

FromThenTo vs FromStepN:

We could use `fromThenTo` or `fromStepN` style APIs for stepwise
enumeration. The first one uses absolute numbers whereas the second one
uses relative positioning. We prefer FromThenTo style where possible for
the following reasons:

* It is the style used in base lists, and we are already using it in Enumerable
  type class.
* Both have their pros and cons. In `fromThenTo`, (1) it may be
  non-intuitive whether `To` is inclusive, (2) specifying an empty range
  is a bit awkward (e.g. fromThenTo 1 1 0). In `fromStepN`, if we know
  the first and the last element then we will have to compute the offset
  correctly. Ideally, when we know the elements then former is more
  suitable while if we know the count then the latter is more suitable.

Note that in case of floating point numbers `FromStep` style may have an
advantage over `FromThen` style. Here is a quote from the documentation of
`enumerateFromThenNum`:

```
Note that in the strange world of floating point numbers, using
@enumerateFromThenNum (from, from + 1)@ is almost exactly the same as
@enumerateFromStepNum (from, 1) but not precisely the same. Because @(from
+ 1) - from@ is not exactly 1, it may lose some precision, the loss may
also be aggregated in each step, if you want that precision then use
'enumerateFromStepNum' instead.
```

### Appending

* append (for mutable arrays)

## Key-Value Store Access (Maps)

* getKey
* findKey (test existence)
* createKey (insert new key for mutable maps)
* putKey (insert or update for mutable maps)
* updateKey (update existing or fail)
* deleteKey  (delete existing or fail)
* destroyKey (delete existing or not)

## Points to Consider

The fold and unfold APIs can be used to express the to/from stream APIs. So we
may not need both, it may just add to more APIs being proliferated.

We may need an "append" style fold as well? What would "stream" append
operations be called then?

We may need locked version of "write" folds for mutable containers for
concurrent access.
Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00			`# Data Containers`

			`Containers are persistent containers of data e.g. files, arrays, maps. For`
			`consistency we use similar API for all such containers where possible. Usually`
			`the API names are relative to the current module e.g. toBytes in an array`
			`module means converting the array to bytes, it sounds intuitive if read`
			`qualified with the module name e.g. Array.toBytes. This doc lists some`
			`conventions and guidelines to be followed.`

Add a section about byte/chunk stream naming 2021-07-27 13:01:48 +03:00			`## Bytes and Chunks`

			`APIs that deal with streams of Word8 may have "bytes" in their names`
			`while the ones dealing with "Array Word8" have "chunks" in their`
			`names. Usually the default is bytes when there is no explicit mention in`
			`the name.`

Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00			`## Unfolds and Folds`

			`* Unfolds are named as "read" or with a "read" prefix (e.g. readChunks).`
			`* Folds are named as "write" or with a "write" prefix (e.g. writeChunks).`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`## To and from Stream`

			`### From and Put`
Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00
			`* Immutable construction from some external source is named with a "from"`
			`prefix (e.g. fromList).`
			`* Mutation of an existing container uses a "put" prefix instead of "from". When`
			`an API uses an existing container to write to and does not return a newly`
			`constructed container then use "put".`
			`* "from" vs "put": "from" assumes creation of a new object, it may fail if the`
			`object being created already exists (e.g. the file exists), it may not take a`
			`lock as it assumes immutability. "put" may create a new object or overwrite`
			`an existing one, it may take a lock for writing as it assumes mutability.`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`### To and Get`
Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00
			`* Converting the complete object to an external representation is prefixed with`
			`"to" (e.g. toBytes).`
			`* For mutable objects "get" APIs may be used instead of "to" APIs.`
			`* "to" vs "get": "to" assumes immutable object so does not have to take a lock.`
			`"get" assumes mutable object so may take a lock.`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`### Append`
Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00
			`* Use "append" prefix for appending data at the end of a mutable container`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`## With additional config`

			`Sometimes we need to modify the behavior of a combinator using some additional`
			`config. For example, combinators to read/write using a specified size of`
			`buffer. For such cases we apply the "With" suffix to standard combinator names:`

			`* readWith`
			`* readChunksWith`
			`* toBytesWith`
			`* getBytesWith`

Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00			`## Random Access (Arrays)`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`### Single elements`

Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00			`* getIndex (for arrays)`
			`* putIndex (for mutable arrays)`
Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00
			`### Ranges`

			`* ...FromTo (e.g. readFromTo, readChunksFromTo, toBytesFromTo, getBytesFromTo)`
			`* ...FromThenTo`
			`* ...Indices`

Add fromThenTo vs fromStepN pros/cons 2021-07-30 18:11:34 +03:00			`FromThenTo vs FromStepN:`

			We could use `fromThenTo` or `fromStepN` style APIs for stepwise
			`enumeration. The first one uses absolute numbers whereas the second one`
			`uses relative positioning. We prefer FromThenTo style where possible for`
			`the following reasons:`
Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00
			`* It is the style used in base lists, and we are already using it in Enumerable`
			`type class.`
Add fromThenTo vs fromStepN pros/cons 2021-07-30 18:11:34 +03:00			* Both have their pros and cons. In `fromThenTo`, (1) it may be
			non-intuitive whether `To` is inclusive, (2) specifying an empty range
			is a bit awkward (e.g. fromThenTo 1 1 0). In `fromStepN`, if we know
			`the first and the last element then we will have to compute the offset`
			`correctly. Ideally, when we know the elements then former is more`
			`suitable while if we know the count then the latter is more suitable.`
Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00
Express enumerations in a more modular way Instead of implementing each function individually build them on basic building blocks. Also named them correctly. 2021-08-22 07:05:23 +03:00			Note that in case of floating point numbers `FromStep` style may have an
Refactor, rename, reorg code, update docs * Add enumerateFromStepNum as a common function to implement two others * Rename, Bounded suffix means the API has a Bounded constraint * Move some functions in relevant sections * Update some docs 2021-08-21 23:40:02 +03:00			advantage over `FromThen` style. Here is a quote from the documentation of
			`enumerateFromThenNum`:

			```
			`Note that in the strange world of floating point numbers, using`
			`@enumerateFromThenNum (from, from + 1)@ is almost exactly the same as`
			`@enumerateFromStepNum (from, 1) but not precisely the same. Because @(from`
			`+ 1) - from@ is not exactly 1, it may lose some precision, the loss may`
			`also be aggregated in each step, if you want that precision then use`
			`'enumerateFromStepNum' instead.`
			```

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`### Appending`

Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00			`* append (for mutable arrays)`

Update "With" (config) and "FromTo" (range) variants 2021-07-30 16:08:05 +03:00			`## Key-Value Store Access (Maps)`
Split stdio ops into a Console.Stdio module Change the FileSystem.Handle from* APIs to put* Mutable write APIs would use a "put" prefix to make the name more intuitive. 2021-04-22 15:54:07 +03:00
			`* getKey`
			`* findKey (test existence)`
			`* createKey (insert new key for mutable maps)`
			`* putKey (insert or update for mutable maps)`
			`* updateKey (update existing or fail)`
			`* deleteKey (delete existing or fail)`
			`* destroyKey (delete existing or not)`

			`## Points to Consider`

			`The fold and unfold APIs can be used to express the to/from stream APIs. So we`
			`may not need both, it may just add to more APIs being proliferated.`

			`We may need an "append" style fold as well? What would "stream" append`
			`operations be called then?`

			`We may need locked version of "write" folds for mutable containers for`
			`concurrent access.`