sq/README.md

# sq: swiss army knife for data

`sq` is a command line tool that provides `jq`-style access to
structured data sources such as SQL databases,
or document formats like CSV or Excel.

`sq` can perform cross-source joins,
execute database-native SQL, and output to a multitude of formats including JSON,
Excel, CSV, HTML, Markdown and XML, or insert directly to a SQL database.
`sq` can also inspect sources to view metadata about the source structure (tables,
columns, size) and has commands for common database operations such as copying
or dropping tables.


## Install

For other installation options, see [here](https://github.com/neilotoole/sq/wiki/Home#Install).

It is strongly advised to install [shell completion](#shell-completion).

### macOS

```shell script
brew tap neilotoole/sq && brew install sq
```


### Windows

```
scoop bucket add sq https://github.com/neilotoole/sq
scoop install sq
```


### Linux

#### apt

```shell script
curl -fsSLO https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.deb && sudo apt install -y ./sq-linux-amd64.deb && rm ./sq-linux-amd64.deb
```

#### rpm

```shell script
sudo rpm -i https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.rpm
```

#### yum

```shell script
yum localinstall -y https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.rpm
```

## Shell completion

Shell completion is available for `bash`, `zsh`, `fish`, and `powershell`.

Execute `sq completion --help` for installation instructions.

## Quickstart

Use `sq help` to see command help. The [tutorial](https://github.com/neilotoole/sq/wiki/Tutorial) is the best place to start.
The [cookbook](https://github.com/neilotoole/sq/wiki/Cookbook) has recipes for common actions.

The major concept is: `sq` operates on data sources, which are treated as SQL databases (even if the source is really a CSV or XLSX file etc).

In a nutshell, you `sq add` a source (giving it a `handle`), and then execute commands against the source.


### Sources

Initially there are no sources.

```sh
$ sq ls


```

Let's add a source. First we'll add a SQLite database, but this could also be Postgres,
SQL Server, Excel, etc. Download the sample DB, and `sq add` the source. We
use `-h` to specify a _handle_ to use.

```sh
$ wget https://sq.io/testdata/sakila.db

$ sq add ./sakila.db -h @sakila_sl3
@sakila_sl3  sqlite3  sakila.db

$ sq ls -v
HANDLE       DRIVER   LOCATION                 OPTIONS
@sakila_sl3* sqlite3  sqlite3:/root/sakila.db

$ sq ping @sakila_sl3
@sakila_sl3  1ms  pong

$ sq src
@sakila_sl3  sqlite3  sakila.db
```

The `sq ping` command simply pings the source to verify that it's available.

`sq src` lists the _active source_, which in our case is `@sakila_sl3`. You can change the active source using `sq src @other_src`. When there's an active source specified, you can usually omit the handle from `sq` commands. Thus you could instead do:

```sh
$ sq ping
@sakila_sl3  1ms  pong
```

### Query

Fundamentally, `sq` is for querying data. Using our jq-style syntax:

```sh
$ sq '.actor | .actor_id < 100 | .[0:3]'
actor_id  first_name  last_name     last_update
1         PENELOPE    GUINESS       2020-02-15T06:59:28Z
2         NICK        WAHLBERG      2020-02-15T06:59:28Z
3         ED          CHASE         2020-02-15T06:59:28Z
```


The above query selected some rows from the `actor` table. You could also use native SQL, e.g.:

```sh
$ sq sql 'SELECT * FROM actor WHERE actor_id < 100 LIMIT 3'
actor_id  first_name  last_name  last_update
1         PENELOPE    GUINESS    2020-02-15T06:59:28Z
2         NICK        WAHLBERG   2020-02-15T06:59:28Z
3         ED          CHASE      2020-02-15T06:59:28Z
```

But we're flying a bit blind here: how did we know about the `actor` table?

### Inspect

`sq inspect` is your friend (output abbreviated):

```sh
$ sq inspect
HANDLE          DRIVER   NAME       FQ NAME         SIZE   TABLES  LOCATION
@sakila_sl3     sqlite3  sakila.db  sakila.db/main  5.6MB  21      sqlite3:///root/sakila.db

TABLE                   ROWS   TYPE   SIZE  NUM COLS  COL NAMES                                                                          COL TYPES
actor                   200    table  -     4         actor_id, first_name, last_name, last_update                                       numeric, VARCHAR(45), VARCHAR(45), TIMESTAMP
address                 603    table  -     8         address_id, address, address2, district, city_id, postal_code, phone, last_update  int, VARCHAR(50), VARCHAR(50), VARCHAR(20), INT, VARCHAR(10), VARCHAR(20), TIMESTAMP
category                16     table  -     3         category_id, name, last_update
```

Use `--json` (`-j`) to output in JSON (output abbreviated):

```shell
$ sq inspect -j
{
  "handle": "@sakila_sl3",
  "name": "sakila.db",
  "driver": "sqlite3",
  "db_version": "3.31.1",
  "location": "sqlite3:///root/sakila.db",
  "size": 5828608,
  "tables": [
    {
      "name": "actor",
      "table_type": "table",
      "row_count": 200,
      "columns": [
        {
          "name": "actor_id",
          "position": 0,
          "primary_key": true,
          "base_type": "numeric",
          "column_type": "numeric",
          "kind": "decimal",
          "nullable": false
        }
```

Combine `sq inspect` with [jq](https://stedolan.github.io/jq/) for some useful capabilities. Here's how to [list](https://github.com/neilotoole/sq/wiki/Cookbook#list-name-of-each-table-in-a-source) all the table names in the active source:

```sh
$ sq inspect -j | jq -r '.tables[] | .name'
actor
address
category
city
country
customer
[...]
```

And here's how you could [export](https://github.com/neilotoole/sq/wiki/Cookbook#export-all-tables-to-csv) each table to a CSV file:

```sh
$ sq inspect -j | jq -r '.tables[] | .name' | xargs -I % sq .% --csv --output %.csv
$ ls
actor.csv     city.csv	    customer_list.csv  film_category.csv  inventory.csv  rental.csv		     staff.csv
address.csv   country.csv   film.csv	       film_list.csv	  language.csv	 sales_by_film_category.csv  staff_list.csv
category.csv  customer.csv  film_actor.csv     film_text.csv	  payment.csv	 sales_by_store.csv	     store.csv
```

Note that you can also inspect an individual table:

```sh
$ sq inspect @sakila_sl3.actor
TABLE  ROWS  TYPE   SIZE  NUM COLS  COL NAMES                                     COL TYPES
actor  200   table  -     4         actor_id, first_name, last_name, last_update  numeric, VARCHAR(45), VARCHAR(45), TIMESTAMP

```

### Insert Output Into Database Source

`sq` query results can be output in various formats (JSON, XML, CSV, etc), and can also be "outputted" as an *insert* into database sources.

That is, you can use `sq` to insert results from a Postgres query into a MySQL table, or copy an Excel worksheet into a SQLite table, or a push a CSV file into a SQL Server table etc.

> **Note:** If you want to copy a table inside the same (database) source, use `sq tbl copy` instead, which uses the database's native table copy functionality.

For this example, we'll insert an Excel worksheet into our `@sakila_sl3` SQLite database. First, we download the XLSX file, and `sq add` it as a source.

```sh
$ wget https://sq.io/testdata/xl_demo.xlsx

$ sq add ./xl_demo.xlsx --opts header=true
@xl_demo_xlsx  xlsx  xl_demo.xlsx

$ sq @xl_demo_xlsx.person
uid  username    email                  address_id
1    neilotoole  neilotoole@apache.org  1
2    ksoze       kaiser@soze.org        2
3    kubla       kubla@khan.mn          NULL
[...]
```

Now, execute the same query, but this time `sq` inserts the results into a new table (`person`) in `@sakila_sl3`:

```shell
$ sq @xl_demo_xlsx.person --insert @sakila_sl3.person
Inserted 7 rows into @sakila_sl3.person

$ sq inspect @sakila_sl3.person
TABLE   ROWS  TYPE   SIZE  NUM COLS  COL NAMES                         COL TYPES
person  7     table  -     4         uid, username, email, address_id  INTEGER, TEXT, TEXT, INTEGER

$ sq @sakila_sl3.person
uid  username    email                  address_id
1    neilotoole  neilotoole@apache.org  1
2    ksoze       kaiser@soze.org        2
3    kubla       kubla@khan.mn          NULL
[...]
```

### Cross-Source Join

`sq` has rudimentary support for cross-source joins. That is, you can join an Excel worksheet with a CSV file, or Postgres table, etc.

> **Note:** The current mechanism for these joins is highly naive: `sq` copies the joined table from each source to a "scratch database" (SQLite by default), and then performs the JOIN using the scratch database's SQL interface. Thus, performance is abysmal for larger tables. There are massive optimizations to be made, but none have been implemented yet.

See the [tutorial](https://github.com/neilotoole/sq/wiki/Tutorial#join) for further details, but given an Excel source `@xl_demo` and a CSV source `@csv_demo`, you can do:

```sh
$ sq '@csv_demo.data, @xl_demo.address | join(.D == .address_id) | .C, .city'
C                      city
neilotoole@apache.org  Washington
kaiser@soze.org        Ulan Bator
nikola@tesla.rs        Washington
augustus@caesar.org    Ulan Bator
plato@athens.gr        Washington
```


### Table Commands

`sq` provides several handy commands for working with tables. Note that these commands work directly against SQL database sources, using their native SQL commands.

```sh
$ sq tbl copy .actor .actor_copy
Copied table: @sakila_sl3.actor --> @sakila_sl3.actor_copy (200 rows copied)

$ sq tbl truncate .actor_copy
Truncated 200 rows from @sakila_sl3.actor_copy

$ sq tbl drop .actor_copy
Dropped table @sakila_sl3.actor_copy
```


### UNIX Pipes

For file-based sources (such as CSV or XLSX), you can `sq add` the source file, but you can also pipe it:

```shell
$ cat ./example.xlsx | sq .Sheet1
```

Similarly, you can inspect:

```shell
$ cat ./example.xlsx | sq inspect
```


## Data Source Drivers
`sq` knows how to deal with a data source type via a _driver_ implementation. To view the installed/supported drivers:

```sh
$ sq drivers
DRIVER     DESCRIPTION                            USER-DEFINED  DOC
sqlite3    SQLite                                 false         https://github.com/mattn/go-sqlite3
postgres   PostgreSQL                             false         https://github.com/jackc/pgx
sqlserver  Microsoft SQL Server                   false         https://github.com/denisenkom/go-mssqldb
mysql      MySQL                                  false         https://github.com/go-sql-driver/mysql
csv        Comma-Separated Values                 false         https://en.wikipedia.org/wiki/Comma-separated_values
tsv        Tab-Separated Values                   false         https://en.wikipedia.org/wiki/Tab-separated_values
json       JSON                                   false         https://en.wikipedia.org/wiki/JSON
jsona      JSON Array: LF-delimited JSON arrays   false         https://en.wikipedia.org/wiki/JSON
jsonl      JSON Lines: LF-delimited JSON objects  false         https://en.wikipedia.org/wiki/JSON_streaming#Line-delimited_JSON
xlsx       Microsoft Excel XLSX                   false         https://en.wikipedia.org/wiki/Microsoft_Excel
```


## Output Formats
`sq` has many output formats:

- `--table`: Text/Table
- `--json`: JSON
- `--jsona`: JSON Array
- `--jsonl`: JSON Lines
- `--csv` / `--tsv` : CSV / TSV
- `--xlsx`: XLSX (Microsoft Excel)
- `--html`: HTML
- `--xml`: XML
- `--markdown`: Markdown
- `--raw`: Raw (bytes)


## Acknowledgements

- Much inspiration is owed to [jq](https://stedolan.github.io/jq/).
- See [`go.mod`](https://github.com/neilotoole/sq/blob/master/go.mod) for a list of third-party packages.
- Additionally, `sq` incorporates modified versions of:
    - [`olekukonko/tablewriter`](https://github.com/olekukonko/tablewriter)
    - [`segmentio/encoding`](https://github.com/segmentio/encoding) for JSON encoding.
- The [_Sakila_](https://dev.mysql.com/doc/sakila/en/) example databases were lifted from [jOOQ](https://github.com/jooq/jooq), which in turn owe their heritage to earlier work on Sakila.

## Similar / Related / Noteworthy Projects

- [usql](https://github.com/xo/usql)
- [textql](https://github.com/dinedal/textql)
- [golang-migrate](https://github.com/golang-migrate/migrate)
- [octosql](https://github.com/cube2222/octosql)
- [rq](https://github.com/dflemstr/rq)
codebase refactor 2020-08-06 20:58:47 +03:00			`# sq: swiss army knife for data`
tidy up 2016-10-17 07:14:01 +03:00
README update 2021-01-02 09:31:52 +03:00			`sq` is a command line tool that provides `jq`-style access to
README update 2020-10-20 18:18:56 +03:00			`structured data sources such as SQL databases,`
doc update 2021-01-04 07:45:04 +03:00			`or document formats like CSV or Excel.`
README update 2021-01-02 09:31:30 +03:00
doc update 2021-01-04 07:44:09 +03:00			`sq` can perform cross-source joins,
Minor tidying of README and .goreleaser.yml 2020-08-06 21:37:33 +03:00			`execute database-native SQL, and output to a multitude of formats including JSON,`
doc update 2021-01-04 07:47:13 +03:00			`Excel, CSV, HTML, Markdown and XML, or insert directly to a SQL database.`
doc update 2021-01-04 07:56:50 +03:00			`sq` can also inspect sources to view metadata about the source structure (tables,
minor doc tidy (#63) 2020-08-19 23:46:04 +03:00			`columns, size) and has commands for common database operations such as copying`
Minor tidying of README and .goreleaser.yml 2020-08-06 21:37:33 +03:00			`or dropping tables.`


docs update 2021-01-04 03:40:32 +03:00			`## Install`
tidy up 2016-10-17 07:14:01 +03:00
docs update 2021-01-04 03:40:32 +03:00			`For other installation options, see [here](https://github.com/neilotoole/sq/wiki/Home#Install).`
tidy up 2016-10-17 07:14:01 +03:00
Cobra upgrade: includes shell completion work (#81) Addressed #80 2021-02-22 10:37:00 +03:00			`It is strongly advised to install [shell completion](#shell-completion).`
doc update 2021-01-04 07:46:25 +03:00
docs update 2021-01-04 03:40:32 +03:00			`### macOS`
tidy up 2016-10-17 07:14:01 +03:00
docs update 2021-01-04 03:40:32 +03:00			```shell script
			`brew tap neilotoole/sq && brew install sq`
			```

doc update 2021-01-04 07:46:25 +03:00
docs update 2021-01-04 03:40:32 +03:00			`### Windows`

			```
			`scoop bucket add sq https://github.com/neilotoole/sq`
			`scoop install sq`
			```

doc update 2021-01-04 07:46:25 +03:00
docs update 2021-01-04 03:40:32 +03:00			`### Linux`

doc update 2021-01-04 09:28:03 +03:00			`#### apt`
doc update 2021-01-04 09:27:13 +03:00
docs update 2021-01-04 03:40:32 +03:00			```shell script
			`curl -fsSLO https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.deb && sudo apt install -y ./sq-linux-amd64.deb && rm ./sq-linux-amd64.deb`
			```
tidy up 2016-10-17 07:14:01 +03:00
doc update 2021-01-04 09:28:03 +03:00			`#### rpm`
tidy up 2016-10-17 07:14:01 +03:00
codebase refactor 2020-08-06 20:58:47 +03:00			```shell script
docs update 2021-01-04 03:40:32 +03:00			`sudo rpm -i https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.rpm`
codebase refactor 2020-08-06 20:58:47 +03:00			```
tidy up 2016-10-17 07:14:01 +03:00
doc update 2021-01-04 09:28:03 +03:00			`#### yum`
tidy up 2016-10-17 07:14:01 +03:00
codebase refactor 2020-08-06 20:58:47 +03:00			```shell script
docs update 2021-01-04 03:40:32 +03:00			`yum localinstall -y https://github.com/neilotoole/sq/releases/latest/download/sq-linux-amd64.rpm`
			```

Cobra upgrade: includes shell completion work (#81) Addressed #80 2021-02-22 10:37:00 +03:00			`## Shell completion`

			Shell completion is available for `bash`, `zsh`, `fish`, and `powershell`.

			Execute `sq completion --help` for installation instructions.
doc update 2021-01-04 07:46:25 +03:00
docs update 2021-01-04 03:40:32 +03:00			`## Quickstart`

doc update 2021-01-04 09:32:13 +03:00			Use `sq help` to see command help. The [tutorial](https://github.com/neilotoole/sq/wiki/Tutorial) is the best place to start.
doc update 2021-01-04 10:15:14 +03:00			`The [cookbook](https://github.com/neilotoole/sq/wiki/Cookbook) has recipes for common actions.`
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 09:32:13 +03:00			The major concept is: `sq` operates on data sources, which are treated as SQL databases (even if the source is really a CSV or XLSX file etc).

doc update 2021-01-04 09:32:55 +03:00			In a nutshell, you `sq add` a source (giving it a `handle`), and then execute commands against the source.
docs update 2021-01-04 03:40:32 +03:00

			`### Sources`

			`Initially there are no sources.`

			```sh
			`$ sq ls`

doc update 2021-01-04 09:33:56 +03:00
docs update 2021-01-04 03:40:32 +03:00			```

doc update 2021-01-04 07:44:09 +03:00			`Let's add a source. First we'll add a SQLite database, but this could also be Postgres,`
doc update 2021-01-04 09:36:15 +03:00			SQL Server, Excel, etc. Download the sample DB, and `sq add` the source. We
doc update 2021-01-04 09:16:45 +03:00			use `-h` to specify a _handle_ to use.
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ wget https://sq.io/testdata/sakila.db`

doc update 2021-01-04 07:44:09 +03:00			`$ sq add ./sakila.db -h @sakila_sl3`
			`@sakila_sl3 sqlite3 sakila.db`
docs update 2021-01-04 03:40:32 +03:00
			`$ sq ls -v`
doc update 2021-01-04 07:56:50 +03:00			`HANDLE DRIVER LOCATION OPTIONS`
			`@sakila_sl3* sqlite3 sqlite3:/root/sakila.db`
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 07:44:09 +03:00			`$ sq ping @sakila_sl3`
			`@sakila_sl3 1ms pong`
docs update 2021-01-04 03:40:32 +03:00
			`$ sq src`
doc update 2021-01-04 07:44:09 +03:00			`@sakila_sl3 sqlite3 sakila.db`
docs update 2021-01-04 03:40:32 +03:00			```

doc update 2021-01-04 07:44:09 +03:00			The `sq ping` command simply pings the source to verify that it's available.
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 09:36:15 +03:00			`sq src` lists the _active source_, which in our case is `@sakila_sl3`. You can change the active source using `sq src @other_src`. When there's an active source specified, you can usually omit the handle from `sq` commands. Thus you could instead do:
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ sq ping`
doc update 2021-01-04 07:44:09 +03:00			`@sakila_sl3 1ms pong`
docs update 2021-01-04 03:40:32 +03:00			```

			`### Query`

doc update 2021-01-04 08:09:29 +03:00			Fundamentally, `sq` is for querying data. Using our jq-style syntax:
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ sq '.actor \| .actor_id < 100 \| .[0:3]'`
			`actor_id first_name last_name last_update`
			`1 PENELOPE GUINESS 2020-02-15T06:59:28Z`
			`2 NICK WAHLBERG 2020-02-15T06:59:28Z`
			`3 ED CHASE 2020-02-15T06:59:28Z`
			```


doc update 2021-01-04 07:41:36 +03:00			The above query selected some rows from the `actor` table. You could also use native SQL, e.g.:
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ sq sql 'SELECT * FROM actor WHERE actor_id < 100 LIMIT 3'`
			`actor_id first_name last_name last_update`
			`1 PENELOPE GUINESS 2020-02-15T06:59:28Z`
			`2 NICK WAHLBERG 2020-02-15T06:59:28Z`
			`3 ED CHASE 2020-02-15T06:59:28Z`
			```

			But we're flying a bit blind here: how did we know about the `actor` table?

			`### Inspect`

			`sq inspect` is your friend (output abbreviated):

			```sh
doc update 2021-01-04 08:12:00 +03:00			`$ sq inspect`
docs update 2021-01-04 03:40:32 +03:00			`HANDLE DRIVER NAME FQ NAME SIZE TABLES LOCATION`
doc update 2021-01-04 07:44:09 +03:00			`@sakila_sl3 sqlite3 sakila.db sakila.db/main 5.6MB 21 sqlite3:///root/sakila.db`
docs update 2021-01-04 03:40:32 +03:00
			`TABLE ROWS TYPE SIZE NUM COLS COL NAMES COL TYPES`
			`actor 200 table - 4 actor_id, first_name, last_name, last_update numeric, VARCHAR(45), VARCHAR(45), TIMESTAMP`
			`address 603 table - 8 address_id, address, address2, district, city_id, postal_code, phone, last_update int, VARCHAR(50), VARCHAR(50), VARCHAR(20), INT, VARCHAR(10), VARCHAR(20), TIMESTAMP`
			`category 16 table - 3 category_id, name, last_update`
			```

doc update 2021-01-04 10:35:00 +03:00			Use `--json` (`-j`) to output in JSON (output abbreviated):
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 08:11:30 +03:00			```shell
			`$ sq inspect -j`
docs update 2021-01-04 03:40:32 +03:00			`{`
doc update 2021-01-04 07:44:09 +03:00			`"handle": "@sakila_sl3",`
docs update 2021-01-04 03:40:32 +03:00			`"name": "sakila.db",`
			`"driver": "sqlite3",`
			`"db_version": "3.31.1",`
			`"location": "sqlite3:///root/sakila.db",`
			`"size": 5828608,`
			`"tables": [`
			`{`
			`"name": "actor",`
			`"table_type": "table",`
			`"row_count": 200,`
			`"columns": [`
			`{`
			`"name": "actor_id",`
			`"position": 0,`
			`"primary_key": true,`
			`"base_type": "numeric",`
			`"column_type": "numeric",`
			`"kind": "decimal",`
			`"nullable": false`
			`}`
			```

doc update 2021-01-04 07:56:50 +03:00			Combine `sq inspect` with [jq](https://stedolan.github.io/jq/) for some useful capabilities. Here's how to [list](https://github.com/neilotoole/sq/wiki/Cookbook#list-name-of-each-table-in-a-source) all the table names in the active source:
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ sq inspect -j \| jq -r '.tables[] \| .name'`
			`actor`
			`address`
			`category`
			`city`
			`country`
			`customer`
			`[...]`
			```

			`And here's how you could [export](https://github.com/neilotoole/sq/wiki/Cookbook#export-all-tables-to-csv) each table to a CSV file:`

			```sh
			`$ sq inspect -j \| jq -r '.tables[] \| .name' \| xargs -I % sq .% --csv --output %.csv`
			`$ ls`
			`actor.csv city.csv customer_list.csv film_category.csv inventory.csv rental.csv staff.csv`
			`address.csv country.csv film.csv film_list.csv language.csv sales_by_film_category.csv staff_list.csv`
			`category.csv customer.csv film_actor.csv film_text.csv payment.csv sales_by_store.csv store.csv`
			```

			`Note that you can also inspect an individual table:`

			```sh
doc update 2021-01-04 07:44:09 +03:00			`$ sq inspect @sakila_sl3.actor`
docs update 2021-01-04 03:40:32 +03:00			`TABLE ROWS TYPE SIZE NUM COLS COL NAMES COL TYPES`
			`actor 200 table - 4 actor_id, first_name, last_name, last_update numeric, VARCHAR(45), VARCHAR(45), TIMESTAMP`

			```

doc update 2021-01-04 08:07:12 +03:00			`### Insert Output Into Database Source`
doc update 2021-01-04 07:41:36 +03:00
			`sq` query results can be output in various formats (JSON, XML, CSV, etc), and can also be "outputted" as an insert into database sources.

doc update 2021-01-04 08:21:22 +03:00			That is, you can use `sq` to insert results from a Postgres query into a MySQL table, or copy an Excel worksheet into a SQLite table, or a push a CSV file into a SQL Server table etc.
doc update 2021-01-04 07:41:36 +03:00
			> Note: If you want to copy a table inside the same (database) source, use `sq tbl copy` instead, which uses the database's native table copy functionality.

doc update 2021-01-04 09:38:33 +03:00			For this example, we'll insert an Excel worksheet into our `@sakila_sl3` SQLite database. First, we download the XLSX file, and `sq add` it as a source.
doc update 2021-01-04 07:41:36 +03:00
			```sh
			`$ wget https://sq.io/testdata/xl_demo.xlsx`

			`$ sq add ./xl_demo.xlsx --opts header=true`
			`@xl_demo_xlsx xlsx xl_demo.xlsx`

			`$ sq @xl_demo_xlsx.person`
			`uid username email address_id`
			`1 neilotoole neilotoole@apache.org 1`
			`2 ksoze kaiser@soze.org 2`
			`3 kubla kubla@khan.mn NULL`
			`[...]`
			```

doc update 2021-01-04 09:39:34 +03:00			Now, execute the same query, but this time `sq` inserts the results into a new table (`person`) in `@sakila_sl3`:
doc update 2021-01-04 07:41:36 +03:00
doc update 2021-01-04 07:44:09 +03:00			```shell
			`$ sq @xl_demo_xlsx.person --insert @sakila_sl3.person`
			`Inserted 7 rows into @sakila_sl3.person`
doc update 2021-01-04 07:41:36 +03:00
doc update 2021-01-04 07:44:09 +03:00			`$ sq inspect @sakila_sl3.person`
			`TABLE ROWS TYPE SIZE NUM COLS COL NAMES COL TYPES`
			`person 7 table - 4 uid, username, email, address_id INTEGER, TEXT, TEXT, INTEGER`
doc update 2021-01-04 07:41:36 +03:00
doc update 2021-01-04 07:44:09 +03:00			`$ sq @sakila_sl3.person`
			`uid username email address_id`
			`1 neilotoole neilotoole@apache.org 1`
			`2 ksoze kaiser@soze.org 2`
			`3 kubla kubla@khan.mn NULL`
			`[...]`
			```
doc update 2021-01-04 07:41:36 +03:00
			`### Cross-Source Join`
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 07:56:50 +03:00			`sq` has rudimentary support for cross-source joins. That is, you can join an Excel worksheet with a CSV file, or Postgres table, etc.
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 09:41:39 +03:00			> Note: The current mechanism for these joins is highly naive: `sq` copies the joined table from each source to a "scratch database" (SQLite by default), and then performs the JOIN using the scratch database's SQL interface. Thus, performance is abysmal for larger tables. There are massive optimizations to be made, but none have been implemented yet.
docs update 2021-01-04 03:40:32 +03:00
			See the [tutorial](https://github.com/neilotoole/sq/wiki/Tutorial#join) for further details, but given an Excel source `@xl_demo` and a CSV source `@csv_demo`, you can do:

			```sh
			`$ sq '@csv_demo.data, @xl_demo.address \| join(.D == .address_id) \| .C, .city'`
			`C city`
			`neilotoole@apache.org Washington`
			`kaiser@soze.org Ulan Bator`
			`nikola@tesla.rs Washington`
			`augustus@caesar.org Ulan Bator`
			`plato@athens.gr Washington`
			```


doc update 2021-01-04 07:44:09 +03:00			`### Table Commands`
docs update 2021-01-04 03:40:32 +03:00
			`sq` provides several handy commands for working with tables. Note that these commands work directly against SQL database sources, using their native SQL commands.

			```sh
			`$ sq tbl copy .actor .actor_copy`
doc update 2021-01-04 07:44:09 +03:00			`Copied table: @sakila_sl3.actor --> @sakila_sl3.actor_copy (200 rows copied)`
docs update 2021-01-04 03:40:32 +03:00
			`$ sq tbl truncate .actor_copy`
doc update 2021-01-04 07:44:09 +03:00			`Truncated 200 rows from @sakila_sl3.actor_copy`
docs update 2021-01-04 03:40:32 +03:00
			`$ sq tbl drop .actor_copy`
doc update 2021-01-04 07:44:09 +03:00			`Dropped table @sakila_sl3.actor_copy`
tidy up 2016-10-17 07:14:01 +03:00			```
cleaning up build system (#38) 2016-10-21 19:14:48 +03:00
codebase refactor 2020-08-06 20:58:47 +03:00
cleaning up build system (#38) 2016-10-21 19:14:48 +03:00
docs update 2021-01-04 03:40:32 +03:00			`### UNIX Pipes`

doc update 2021-01-04 10:39:15 +03:00			For file-based sources (such as CSV or XLSX), you can `sq add` the source file, but you can also pipe it:
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 10:39:15 +03:00			```shell
doc update 2021-01-04 10:39:43 +03:00			`$ cat ./example.xlsx \| sq .Sheet1`
doc update 2021-01-04 10:39:15 +03:00			```

doc update 2021-01-04 10:40:05 +03:00			`Similarly, you can inspect:`
doc update 2021-01-04 10:39:15 +03:00
			```shell
doc update 2021-01-04 10:39:43 +03:00			`$ cat ./example.xlsx \| sq inspect`
doc update 2021-01-04 10:39:15 +03:00			```
docs update 2021-01-04 03:40:32 +03:00

			`## Data Source Drivers`
doc update 2021-01-04 10:41:40 +03:00			`sq` knows how to deal with a data source type via a _driver_ implementation. To view the installed/supported drivers:
docs update 2021-01-04 03:40:32 +03:00
			```sh
			`$ sq drivers`
			`DRIVER DESCRIPTION USER-DEFINED DOC`
			`sqlite3 SQLite false https://github.com/mattn/go-sqlite3`
			`postgres PostgreSQL false https://github.com/jackc/pgx`
			`sqlserver Microsoft SQL Server false https://github.com/denisenkom/go-mssqldb`
			`mysql MySQL false https://github.com/go-sql-driver/mysql`
			`csv Comma-Separated Values false https://en.wikipedia.org/wiki/Comma-separated_values`
			`tsv Tab-Separated Values false https://en.wikipedia.org/wiki/Tab-separated_values`
			`json JSON false https://en.wikipedia.org/wiki/JSON`
			`jsona JSON Array: LF-delimited JSON arrays false https://en.wikipedia.org/wiki/JSON`
			`jsonl JSON Lines: LF-delimited JSON objects false https://en.wikipedia.org/wiki/JSON_streaming#Line-delimited_JSON`
			`xlsx Microsoft Excel XLSX false https://en.wikipedia.org/wiki/Microsoft_Excel`
			```

doc update 2021-01-04 07:44:09 +03:00
docs update 2021-01-04 03:40:32 +03:00			`## Output Formats`
doc update 2021-01-04 10:40:51 +03:00			`sq` has many output formats:
docs update 2021-01-04 03:40:32 +03:00
doc update 2021-01-04 15:07:44 +03:00			- `--table`: Text/Table
docs update 2021-01-04 03:40:32 +03:00			- `--json`: JSON
			- `--jsona`: JSON Array
			- `--jsonl`: JSON Lines
			- `--csv` / `--tsv` : CSV / TSV
			- `--xlsx`: XLSX (Microsoft Excel)
			- `--html`: HTML
			- `--xml`: XML
			- `--markdown`: Markdown
			- `--raw`: Raw (bytes)
doc update 2021-01-04 07:44:09 +03:00
cleaning up build system (#38) 2016-10-21 19:14:48 +03:00
codebase refactor 2020-08-06 20:58:47 +03:00			`## Acknowledgements`
cleaning up build system (#38) 2016-10-21 19:14:48 +03:00
Minor tidying of README and .goreleaser.yml 2020-08-06 21:37:33 +03:00			`- Much inspiration is owed to [jq](https://stedolan.github.io/jq/).`
			- See [`go.mod`](https://github.com/neilotoole/sq/blob/master/go.mod) for a list of third-party packages.
			- Additionally, `sq` incorporates modified versions of:
			- [`olekukonko/tablewriter`](https://github.com/olekukonko/tablewriter)
			- [`segmentio/encoding`](https://github.com/segmentio/encoding) for JSON encoding.
docs update 2021-01-04 03:40:32 +03:00			`- The [_Sakila_](https://dev.mysql.com/doc/sakila/en/) example databases were lifted from [jOOQ](https://github.com/jooq/jooq), which in turn owe their heritage to earlier work on Sakila.`

			`## Similar / Related / Noteworthy Projects`

			`- [usql](https://github.com/xo/usql)`
			`- [textql](https://github.com/dinedal/textql)`
			`- [golang-migrate](https://github.com/golang-migrate/migrate)`
			`- [octosql](https://github.com/cube2222/octosql)`
			`- [rq](https://github.com/dflemstr/rq)`
Minor tidying of README and .goreleaser.yml 2020-08-06 21:37:33 +03:00
cleaning up build system (#38) 2016-10-21 19:14:48 +03:00