id,node_id,name,full_name,private,owner,html_url,description,fork,created_at,updated_at,pushed_at,homepage,size,stargazers_count,watchers_count,language,has_issues,has_projects,has_downloads,has_wiki,has_pages,forks_count,archived,disabled,open_issues_count,license,topics,forks,open_issues,watchers,default_branch,permissions,temp_clone_token,organization,network_count,subscribers_count,readme,readme_html,allow_forking,visibility,is_template,template_repository,web_commit_signoff_required,has_discussions
107914493,MDEwOlJlcG9zaXRvcnkxMDc5MTQ0OTM=,datasette,simonw/datasette,0,9599,https://github.com/simonw/datasette,An open source multi-tool for exploring and publishing data,0,2017-10-23T00:39:03Z,2022-11-15T23:16:27Z,2022-11-16T03:47:14Z,https://datasette.io,5770,6628,6628,Python,1,0,1,1,0,463,0,0,435,apache-2.0,"[""asgi"", ""automatic-api"", ""csv"", ""datasets"", ""datasette"", ""datasette-io"", ""docker"", ""json"", ""python"", ""sql"", ""sqlite""]",463,435,6628,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,463,97,,,1,public,0,,0,1
110509816,MDEwOlJlcG9zaXRvcnkxMTA1MDk4MTY=,csvs-to-sqlite,simonw/csvs-to-sqlite,0,9599,https://github.com/simonw/csvs-to-sqlite,Convert CSV files into a SQLite database,0,2017-11-13T06:38:21Z,2021-11-18T16:33:39Z,2021-11-18T16:35:33Z,,138,655,655,Python,1,1,1,1,0,50,0,0,34,apache-2.0,"[""click"", ""csv"", ""datasette"", ""datasette-io"", ""datasette-tool"", ""pandas"", ""python"", ""sqlite""]",50,34,655,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,50,17,"# csvs-to-sqlite
[](https://pypi.org/project/csvs-to-sqlite/)
[](https://github.com/simonw/csvs-to-sqlite/releases)
[](https://github.com/simonw/csvs-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/csvs-to-sqlite/blob/main/LICENSE)
Convert CSV files into a SQLite database. Browse and publish that SQLite database with [Datasette](https://github.com/simonw/datasette).
Basic usage:
csvs-to-sqlite myfile.csv mydatabase.db
This will create a new SQLite database called `mydatabase.db` containing a
single table, `myfile`, containing the CSV content.
You can provide multiple CSV files:
csvs-to-sqlite one.csv two.csv bundle.db
The `bundle.db` database will contain two tables, `one` and `two`.
This means you can use wildcards:
csvs-to-sqlite ~/Downloads/*.csv my-downloads.db
If you pass a path to one or more directories, the script will recursively
search those directories for CSV files and create tables for each one.
csvs-to-sqlite ~/path/to/directory all-my-csvs.db
## Handling TSV (tab-separated values)
You can use the `-s` option to specify a different delimiter. If you want
to use a tab character you'll need to apply shell escaping like so:
csvs-to-sqlite my-file.tsv my-file.db -s $'\t'
## Refactoring columns into separate lookup tables
Let's say you have a CSV file that looks like this:
county,precinct,office,district,party,candidate,votes
Clark,1,President,,REP,John R. Kasich,5
Clark,2,President,,REP,John R. Kasich,0
Clark,3,President,,REP,John R. Kasich,7
([Real example taken from the Open Elections project](https://github.com/openelections/openelections-data-sd/blob/master/2016/20160607__sd__primary__clark__precinct.csv))
You can now convert selected columns into separate lookup tables using the new
`--extract-column` option (shortname: `-c`) - for example:
csvs-to-sqlite openelections-data-*/*.csv \
-c county:County:name \
-c precinct:Precinct:name \
-c office -c district -c party -c candidate \
openelections.db
The format is as follows:
column_name:optional_table_name:optional_table_value_column_name
If you just specify the column name e.g. `-c office`, the following table will
be created:
CREATE TABLE ""office"" (
""id"" INTEGER PRIMARY KEY,
""value"" TEXT
);
If you specify all three options, e.g. `-c precinct:Precinct:name` the table
will look like this:
CREATE TABLE ""Precinct"" (
""id"" INTEGER PRIMARY KEY,
""name"" TEXT
);
The original tables will be created like this:
CREATE TABLE ""ca__primary__san_francisco__precinct"" (
""county"" INTEGER,
""precinct"" INTEGER,
""office"" INTEGER,
""district"" INTEGER,
""party"" INTEGER,
""candidate"" INTEGER,
""votes"" INTEGER,
FOREIGN KEY (county) REFERENCES County(id),
FOREIGN KEY (party) REFERENCES party(id),
FOREIGN KEY (precinct) REFERENCES Precinct(id),
FOREIGN KEY (office) REFERENCES office(id),
FOREIGN KEY (candidate) REFERENCES candidate(id)
);
They will be populated with IDs that reference the new derived tables.
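Once the import has finished you can join those integer IDs back to their values. A quick sketch (not part of the tool itself, assuming the `openelections.db` example above):

```python
import sqlite3

# Join the foreign key columns created by --extract-column back to the
# County and Precinct lookup tables (schema as shown above).
conn = sqlite3.connect('openelections.db')
rows = conn.execute('''
    select County.name, Precinct.name, votes
    from [ca__primary__san_francisco__precinct]
        join County on county = County.id
        join Precinct on precinct = Precinct.id
    limit 5
''').fetchall()
print(rows)
```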
## Installation
$ pip install csvs-to-sqlite
`csvs-to-sqlite` now requires Python 3. If you are running Python 2 you can install the last version to support Python 2:
$ pip install csvs-to-sqlite==0.9.2
## csvs-to-sqlite --help
```
Usage: csvs-to-sqlite [OPTIONS] PATHS... DBNAME
PATHS: paths to individual .csv files or to directories containing .csvs
DBNAME: name of the SQLite database file to create
Options:
-s, --separator TEXT Field separator in input .csv
-q, --quoting INTEGER Control field quoting behavior per csv.QUOTE_*
constants. Use one of QUOTE_MINIMAL (0),
QUOTE_ALL (1), QUOTE_NONNUMERIC (2) or
QUOTE_NONE (3).
--skip-errors Skip lines with too many fields instead of
stopping the import
--replace-tables Replace tables if they already exist
-t, --table TEXT Table to use (instead of using CSV filename)
-c, --extract-column TEXT One or more columns to 'extract' into a
separate lookup table. If you pass a simple
column name that column will be replaced with
integer foreign key references to a new table
of that name. You can customize the name of
the table like so: state:States:state_name
This will pull unique values from the 'state'
column and use them to populate a new 'States'
table, with an id column primary key and a
state_name column containing the strings from
the original column.
-d, --date TEXT One or more columns to parse into ISO
formatted dates
-dt, --datetime TEXT One or more columns to parse into ISO
formatted datetimes
-df, --datetime-format TEXT One or more custom date format strings to try
when parsing dates/datetimes
-pk, --primary-key TEXT One or more columns to use as the primary key
-f, --fts TEXT One or more columns to use to populate a full-
text index
-i, --index TEXT Add index on this column (or a compound index
with -i col1,col2)
--shape TEXT Custom shape for the DB table - format is
csvcol:dbcol(TYPE),...
--filename-column TEXT Add a column with this name and populate with
CSV file name
--fixed-column <TEXT TEXT>... Populate column with a fixed string
--fixed-column-int <TEXT INTEGER>...
Populate column with a fixed integer
--fixed-column-float <TEXT FLOAT>...
Populate column with a fixed float
--no-index-fks Skip adding index to foreign key columns
created using --extract-column (default is to
add them)
--no-fulltext-fks Skip adding full-text index on values
extracted using --extract-column (default is
to add them)
--just-strings Import all columns as text strings by default
(and, if specified, still obey --shape,
--date/datetime, and --datetime-format)
--version Show the version and exit.
--help Show this message and exit.
```
","
csvs-to-sqlite
Convert CSV files into a SQLite database. Browse and publish that SQLite database with Datasette.
Basic usage:
csvs-to-sqlite myfile.csv mydatabase.db
This will create a new SQLite database called mydatabase.db containing a
single table, myfile, containing the CSV content.
You can provide multiple CSV files:
csvs-to-sqlite one.csv two.csv bundle.db
The bundle.db database will contain two tables, one and two.
This means you can use wildcards:
csvs-to-sqlite ~/Downloads/*.csv my-downloads.db
If you pass a path to one or more directories, the script will recursively
search those directories for CSV files and create tables for each one.
csvs-to-sqlite ~/path/to/directory all-my-csvs.db
Handling TSV (tab-separated values)
You can use the -s option to specify a different delimiter. If you want
to use a tab character you'll need to apply shell escaping like so:
csvs-to-sqlite my-file.tsv my-file.db -s $'\t'
Refactoring columns into separate lookup tables
Let's say you have a CSV file that looks like this:
county,precinct,office,district,party,candidate,votes
Clark,1,President,,REP,John R. Kasich,5
Clark,2,President,,REP,John R. Kasich,0
Clark,3,President,,REP,John R. Kasich,7
They will be populated with IDs that reference the new derived tables.
Installation
$ pip install csvs-to-sqlite
csvs-to-sqlite now requires Python 3. If you are running Python 2 you can install the last version to support Python 2:
$ pip install csvs-to-sqlite==0.9.2
csvs-to-sqlite --help
Usage: csvs-to-sqlite [OPTIONS] PATHS... DBNAME
PATHS: paths to individual .csv files or to directories containing .csvs
DBNAME: name of the SQLite database file to create
Options:
-s, --separator TEXT Field separator in input .csv
-q, --quoting INTEGER Control field quoting behavior per csv.QUOTE_*
constants. Use one of QUOTE_MINIMAL (0),
QUOTE_ALL (1), QUOTE_NONNUMERIC (2) or
QUOTE_NONE (3).
--skip-errors Skip lines with too many fields instead of
stopping the import
--replace-tables Replace tables if they already exist
-t, --table TEXT Table to use (instead of using CSV filename)
-c, --extract-column TEXT One or more columns to 'extract' into a
separate lookup table. If you pass a simple
column name that column will be replaced with
integer foreign key references to a new table
of that name. You can customize the name of
the table like so: state:States:state_name
This will pull unique values from the 'state'
column and use them to populate a new 'States'
table, with an id column primary key and a
state_name column containing the strings from
the original column.
-d, --date TEXT One or more columns to parse into ISO
formatted dates
-dt, --datetime TEXT One or more columns to parse into ISO
formatted datetimes
-df, --datetime-format TEXT One or more custom date format strings to try
when parsing dates/datetimes
-pk, --primary-key TEXT One or more columns to use as the primary key
-f, --fts TEXT One or more columns to use to populate a full-
text index
-i, --index TEXT Add index on this column (or a compound index
with -i col1,col2)
--shape TEXT Custom shape for the DB table - format is
csvcol:dbcol(TYPE),...
--filename-column TEXT Add a column with this name and populate with
CSV file name
--fixed-column <TEXT TEXT>... Populate column with a fixed string
--fixed-column-int <TEXT INTEGER>...
Populate column with a fixed integer
--fixed-column-float <TEXT FLOAT>...
Populate column with a fixed float
--no-index-fks Skip adding index to foreign key columns
created using --extract-column (default is to
add them)
--no-fulltext-fks Skip adding full-text index on values
extracted using --extract-column (default is
to add them)
--just-strings Import all columns as text strings by default
(and, if specified, still obey --shape,
--date/datetime, and --datetime-format)
--version Show the version and exit.
--help Show this message and exit.
Run datasette install datasette-cluster-map to add this plugin to your Datasette virtual environment. Datasette will automatically load the plugin if it is installed in this way.
If you are deploying using the datasette publish command you can use the --install option:
If any of your tables have a latitude and longitude column, a map will be automatically displayed.
Configuration
If your columns are called something else you can configure the column names using plugin configuration in a metadata.json file. For example, if all of your columns are called xlat and xlng you can create a metadata.json file like this:
{
""title"": ""Regular metadata keys can go here too"",
""plugins"": {
""datasette-cluster-map"": {
""latitude_column"": ""xlat"",
""longitude_column"": ""xlng""
}
}
}
Then run Datasette like this:
datasette mydata.db -m metadata.json
This will configure the required column names for every database loaded by that Datasette instance.
If you want to customize the column names for just one table in one database, you can do something like this:
You can also use a custom SQL query to rename those columns to latitude and longitude, for example:
select *,
""Capture Latitude"" as latitude,
""Capture Longitude"" as longitude
from [USGS_WC_eartag_deployments_2009-2011]
The map defaults to being displayed above the main results table on the page. You can use the ""container"" plugin setting to provide a CSS selector indicating an element that the map should be appended to instead.
Custom tile layers
You can customize the tile layer used by the maps using the tile_layer and tile_layer_options configuration settings. For example, to use the Stamen Watercolor tiles you can use these settings:
The marker popup defaults to displaying the data for the underlying database row.
You can customize this by including a popup column in your results containing JSON that defines a more useful popup.
The JSON in the popup column should look something like this:
{
""image"": ""https://niche-museums.imgix.net/dodgems.heic?w=800&h=400&fit=crop"",
""alt"": ""Dingles Fairground Heritage Centre"",
""title"": ""Dingles Fairground Heritage Centre"",
""description"": ""Home of the National Fairground Collection, Dingles has over 45,000 indoor square feet of vintage fairground rides... and you can go on them! Highlights include the last complete surviving and opera"",
""link"": ""/browse/museums/26""
}
Each of these columns is optional.
title is the title to show at the top of the popup
image is the URL to an image to display in the popup
alt is the alt attribute to use for that image
description is a longer string of text to use as a description
link is a URL that the marker content should link to
You can use the SQLite json_object() function to construct this data dynamically as part of your SQL query. Here's an example:
select json_object(
'image', photo_url ||'?w=800&h=400&fit=crop',
'title', name,
'description', substr(description, 0, 200),
'link', '/browse/museums/'|| id
) as popup,
latitude, longitude from museums
where id in (26, 27) order by id
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-cluster-map
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
138669673,MDEwOlJlcG9zaXRvcnkxMzg2Njk2NzM=,datasette-vega,simonw/datasette-vega,0,9599,https://github.com/simonw/datasette-vega,Datasette plugin for visualizing data using Vega,0,2018-06-26T01:40:54Z,2021-12-10T22:20:46Z,2021-12-10T22:20:43Z,,59,42,42,JavaScript,1,1,1,1,0,2,0,0,31,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""plugin"", ""react"", ""vega""]",2,31,42,master,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,2,2,"# datasette-vega
[](https://pypi.org/project/datasette-vega/)
[](https://github.com/simonw/datasette-vega/blob/master/LICENSE)
A [Datasette](https://github.com/simonw/datasette) plugin that provides tools
for generating charts using [Vega](https://vega.github.io/).

Try out the latest master build as a live demo at https://datasette-vega-latest.datasette.io/ or try the latest release installed as a plugin at https://fivethirtyeight.datasettes.com/
To add this to your Datasette installation, install the plugin like so:
pip install datasette-vega
The plugin will then add itself to every Datasette table view.
If you are publishing data using the `datasette publish` command, you can
include this plugin like so:
datasette publish now mydatabase.db --install=datasette-vega
","
datasette-vega
A Datasette plugin that provides tools
for generating charts using Vega.
To add this to your Datasette installation, install the plugin like so:
pip install datasette-vega
The plugin will then add itself to every Datasette table view.
If you are publishing data using the datasette publish command, you can
include this plugin like so:
datasette publish now mydatabase.db --install=datasette-vega
",1,public,0,,,
140912432,MDEwOlJlcG9zaXRvcnkxNDA5MTI0MzI=,sqlite-utils,simonw/sqlite-utils,0,9599,https://github.com/simonw/sqlite-utils,Python CLI utility and library for manipulating SQLite databases,0,2018-07-14T03:21:46Z,2022-11-15T18:12:16Z,2022-11-15T15:53:38Z,https://sqlite-utils.datasette.io,1437,1029,1029,Python,1,1,1,1,0,79,0,0,72,apache-2.0,"[""cli"", ""click"", ""datasette"", ""datasette-io"", ""datasette-tool"", ""python"", ""sqlite"", ""sqlite-database""]",79,72,1029,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,79,16,"# sqlite-utils
[](https://pypi.org/project/sqlite-utils/)
[](https://sqlite-utils.datasette.io/en/stable/changelog.html)
[](https://pypi.org/project/sqlite-utils/)
[](https://github.com/simonw/sqlite-utils/actions?query=workflow%3ATest)
[](http://sqlite-utils.datasette.io/en/stable/?badge=stable)
[](https://codecov.io/gh/simonw/sqlite-utils)
[](https://github.com/simonw/sqlite-utils/blob/main/LICENSE)
[](https://discord.gg/Ass7bCAMDw)
Python CLI utility and library for manipulating SQLite databases.
## Some feature highlights
- [Pipe JSON](https://sqlite-utils.datasette.io/en/stable/cli.html#inserting-json-data) (or [CSV or TSV](https://sqlite-utils.datasette.io/en/stable/cli.html#inserting-csv-or-tsv-data)) directly into a new SQLite database file, automatically creating a table with the appropriate schema
- [Run in-memory SQL queries](https://sqlite-utils.datasette.io/en/stable/cli.html#querying-data-directly-using-an-in-memory-database), including joins, directly against data in CSV, TSV or JSON files and view the results
- [Configure SQLite full-text search](https://sqlite-utils.datasette.io/en/stable/cli.html#configuring-full-text-search) against your database tables and run search queries against them, ordered by relevance
- Run [transformations against your tables](https://sqlite-utils.datasette.io/en/stable/cli.html#transforming-tables) to make schema changes that SQLite `ALTER TABLE` does not directly support, such as changing the type of a column
- [Extract columns](https://sqlite-utils.datasette.io/en/stable/cli.html#extracting-columns-into-a-separate-table) into separate tables to better normalize your existing data
Read more on my blog, in this series of posts on [New features in sqlite-utils](https://simonwillison.net/series/sqlite-utils-features/) and other [entries tagged sqliteutils](https://simonwillison.net/tags/sqliteutils/).
## Installation
pip install sqlite-utils
Or if you use [Homebrew](https://brew.sh/) for macOS:
brew install sqlite-utils
## Using as a CLI tool
Now you can do things with the CLI utility like this:
$ sqlite-utils memory dogs.csv ""select * from t""
[{""id"": 1, ""age"": 4, ""name"": ""Cleo""},
{""id"": 2, ""age"": 2, ""name"": ""Pancakes""}]
$ sqlite-utils insert dogs.db dogs dogs.csv --csv
[####################################] 100%
$ sqlite-utils tables dogs.db --counts
[{""table"": ""dogs"", ""count"": 2}]
$ sqlite-utils dogs.db ""select id, name from dogs""
[{""id"": 1, ""name"": ""Cleo""},
{""id"": 2, ""name"": ""Pancakes""}]
$ sqlite-utils dogs.db ""select * from dogs"" --csv
id,age,name
1,4,Cleo
2,2,Pancakes
$ sqlite-utils dogs.db ""select * from dogs"" --table
id age name
---- ----- --------
1 4 Cleo
2 2 Pancakes
You can import JSON data into a new database table like this:
$ curl https://api.github.com/repos/simonw/sqlite-utils/releases \
| sqlite-utils insert releases.db releases - --pk id
Or for data in a CSV file:
$ sqlite-utils insert dogs.db dogs dogs.csv --csv
`sqlite-utils memory` lets you import CSV or JSON data into an in-memory database and run SQL queries against it in a single command:
$ cat dogs.csv | sqlite-utils memory - ""select name, age from stdin""
See the [full CLI documentation](https://sqlite-utils.datasette.io/en/stable/cli.html) for comprehensive coverage of many more commands.
## Using as a library
You can also `import sqlite_utils` and use it as a Python library like this:
```python
import sqlite_utils
db = sqlite_utils.Database(""demo_database.db"")
# This line creates a ""dogs"" table if one does not already exist:
db[""dogs""].insert_all([
{""id"": 1, ""age"": 4, ""name"": ""Cleo""},
{""id"": 2, ""age"": 2, ""name"": ""Pancakes""}
], pk=""id"")
```
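The table transformations mentioned in the feature list above are also available from the library. A small sketch (my own example, assuming the `demo_database.db` file created above):

```python
import sqlite_utils

db = sqlite_utils.Database(""demo_database.db"")
# transform() rebuilds the table to apply schema changes that a plain
# ALTER TABLE cannot, such as changing the type of the ""age"" column:
db[""dogs""].transform(types={""age"": float})
print(db[""dogs""].schema)
```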
Check out the [full library documentation](https://sqlite-utils.datasette.io/en/stable/python-api.html) for everything else you can do with the Python library.
## Related projects
* [Datasette](https://datasette.io/): A tool for exploring and publishing data
* [csvs-to-sqlite](https://github.com/simonw/csvs-to-sqlite): Convert CSV files into a SQLite database
* [db-to-sqlite](https://github.com/simonw/db-to-sqlite): CLI tool for exporting a MySQL or PostgreSQL database as a SQLite file
* [dogsheep](https://dogsheep.github.io/): A family of tools for personal analytics, built on top of `sqlite-utils`
","
sqlite-utils
Python CLI utility and library for manipulating SQLite databases.
Some feature highlights
Pipe JSON (or CSV or TSV) directly into a new SQLite database file, automatically creating a table with the appropriate schema
Run in-memory SQL queries, including joins, directly against data in CSV, TSV or JSON files and view the results
Run transformations against your tables to make schema changes that SQLite ALTER TABLE does not directly support, such as changing the type of a column
Extract columns into separate tables to better normalize your existing data
You can also import sqlite_utils and use it as a Python library like this:
import sqlite_utils

db = sqlite_utils.Database(""demo_database.db"")
# This line creates a ""dogs"" table if one does not already exist:
db[""dogs""].insert_all([
{""id"": 1, ""age"": 4, ""name"": ""Cleo""},
{""id"": 2, ""age"": 2, ""name"": ""Pancakes""}
], pk=""id"")
Datasette: A tool for exploring and publishing data
csvs-to-sqlite: Convert CSV files into a SQLite database
db-to-sqlite: CLI tool for exporting a MySQL or PostgreSQL database as a SQLite file
dogsheep: A family of tools for personal analytics, built on top of sqlite-utils
",1,public,0,,0,0
142967347,MDEwOlJlcG9zaXRvcnkxNDI5NjczNDc=,datasette-json-html,simonw/datasette-json-html,0,9599,https://github.com/simonw/datasette-json-html,Datasette plugin for rendering HTML based on JSON values,0,2018-07-31T05:41:39Z,2022-03-15T04:54:15Z,2022-03-22T01:43:59Z,,46,19,19,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""plugin""]",1,0,19,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,4,"# datasette-json-html
[](https://pypi.org/project/datasette-json-html/)
[](https://github.com/simonw/datasette-json-html/releases)
[](https://github.com/simonw/datasette-json-html/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-json-html/blob/main/LICENSE)
Datasette plugin for rendering HTML based on JSON values, using the [render_cell plugin hook](https://docs.datasette.io/en/stable/plugin_hooks.html#render-cell-value-column-table-database-datasette).
This plugin looks for cell values that match a very specific JSON format and converts them into HTML when they are rendered by the Datasette interface.
## Links
{
""href"": ""https://simonwillison.net/"",
""label"": ""Simon Willison""
}
Will be rendered as an `<a>` link:
Simon Willison
You can set a tooltip on the link using a `""title""` key:
{
""href"": ""https://simonwillison.net/"",
""label"": ""Simon Willison"",
""title"": ""My blog""
}
Produces:
Simon Willison
You can also include a description, which will be displayed below the link. If descriptions include newlines they will be converted to `<br>` elements:
select json_object(
""href"", ""https://simonwillison.net/"",
""label"", ""Simon Willison"",
""description"", ""This can contain"" || x'0a' || ""newlines""
)
Produces:
Simon Willison This can contain newlines
* [Literal JSON link demo](https://datasette-json-html.datasette.io/demo?sql=select+%27%7B%0D%0A++++%22href%22%3A+%22https%3A%2F%2Fsimonwillison.net%2F%22%2C%0D%0A++++%22label%22%3A+%22Simon+Willison%22%2C%0D%0A++++%22title%22%3A+%22My+blog%22%0D%0A%7D%27)
## List of links
[
{
""href"": ""https://simonwillison.net/"",
""label"": ""Simon Willison""
},
{
""href"": ""https://github.com/simonw/datasette"",
""label"": ""Datasette""
}
]
Will be rendered as a comma-separated list of `<a>` links:
Simon Willison,
Datasette
The `href` property must begin with `https://` or `http://` or `/`, to avoid potential XSS injection attacks (for example URLs that begin with `javascript:`).
Lists of links cannot include `""description""` keys.
* [Literal list of links demo](https://datasette-json-html.datasette.io/demo?sql=select+%27%5B%0D%0A++++%7B%0D%0A++++++++%22href%22%3A+%22https%3A%2F%2Fsimonwillison.net%2F%22%2C%0D%0A++++++++%22label%22%3A+%22Simon+Willison%22%0D%0A++++%7D%2C%0D%0A++++%7B%0D%0A++++++++%22href%22%3A+%22https%3A%2F%2Fgithub.com%2Fsimonw%2Fdatasette%22%2C%0D%0A++++++++%22label%22%3A+%22Datasette%22%0D%0A++++%7D%0D%0A%5D%27)
## Images
The image tag is more complex. The most basic version looks like this:
{
""img_src"": ""https://placekitten.com/200/300""
}
This will render as:
But you can also include one or more of `alt`, `caption`, `width` and `href`.
If you include width or alt, they will be added as attributes:
{
""img_src"": ""https://placekitten.com/200/300"",
""alt"": ""Kitten"",
""width"": 200
}
Produces:
* [Literal image demo](https://datasette-json-html.datasette.io/demo?sql=select+%27%7B%0D%0A++++%22img_src%22%3A+%22https%3A%2F%2Fplacekitten.com%2F200%2F300%22%2C%0D%0A++++%22alt%22%3A+%22Kitten%22%2C%0D%0A++++%22width%22%3A+200%0D%0A%7D%27)
The `href` key will cause the image to be wrapped in a link:
{
""img_src"": ""https://placekitten.com/200/300"",
""href"": ""http://www.example.com""
}
Produces:
The `caption` key wraps everything in a fancy figure/figcaption block:
{
""img_src"": ""https://placekitten.com/200/300"",
""caption"": ""Kitten caption""
}
Produces:
Kitten caption
## Preformatted text
You can use `{""pre"": ""text""}` to render text in a `<pre>` HTML tag:
{
""pre"": ""This\nhas\nnewlines""
}
Produces:
This
has
newlines
If the value attached to the `""pre""` key is itself a JSON object, that JSON will be pretty-printed:
{
""pre"": {
""this"": {
""object"": [""is"", ""nested""]
}
}
}
Produces:
{
"this": {
"object": [
"is",
"nested"
]
}
}
* [Preformatted text with JSON demo](https://datasette-json-html.datasette.io/demo?sql=select+%27%7B%0D%0A++++%22pre%22%3A+%7B%0D%0A++++++++%22this%22%3A+%7B%0D%0A++++++++++++%22object%22%3A+%5B%22is%22%2C+%22nested%22%5D%0D%0A++++++++%7D%0D%0A++++%7D%0D%0A%7D%27)
* [Preformatted text demo showing the Mandelbrot Set](https://datasette-json-html.datasette.io/demo?sql=WITH+RECURSIVE%0D%0A++xaxis%28x%29+AS+%28VALUES%28-2.0%29+UNION+ALL+SELECT+x%2B0.05+FROM+xaxis+WHERE+x%3C1.2%29%2C%0D%0A++yaxis%28y%29+AS+%28VALUES%28-1.0%29+UNION+ALL+SELECT+y%2B0.1+FROM+yaxis+WHERE+y%3C1.0%29%2C%0D%0A++m%28iter%2C+cx%2C+cy%2C+x%2C+y%29+AS+%28%0D%0A++++SELECT+0%2C+x%2C+y%2C+0.0%2C+0.0+FROM+xaxis%2C+yaxis%0D%0A++++UNION+ALL%0D%0A++++SELECT+iter%2B1%2C+cx%2C+cy%2C+x*x-y*y+%2B+cx%2C+2.0*x*y+%2B+cy+FROM+m+%0D%0A+++++WHERE+%28x*x+%2B+y*y%29+%3C+4.0+AND+iter%3C28%0D%0A++%29%2C%0D%0A++m2%28iter%2C+cx%2C+cy%29+AS+%28%0D%0A++++SELECT+max%28iter%29%2C+cx%2C+cy+FROM+m+GROUP+BY+cx%2C+cy%0D%0A++%29%2C%0D%0A++a%28t%29+AS+%28%0D%0A++++SELECT+group_concat%28+substr%28%27+.%2B*%23%27%2C+1%2Bmin%28iter%2F7%2C4%29%2C+1%29%2C+%27%27%29+%0D%0A++++FROM+m2+GROUP+BY+cy%0D%0A++%29%0D%0ASELECT+json_object%28%27pre%27%2C+group_concat%28rtrim%28t%29%2Cx%270a%27%29%29+FROM+a%3B) using [this example](https://www.sqlite.org/lang_with.html#outlandish_recursive_query_examples) from the SQLite documentation
## Using these with SQLite JSON functions
The most powerful way to make use of this plugin is in conjunction with SQLite's [JSON functions](https://www.sqlite.org/json1.html). For example:
select json_object(
""href"", ""https://simonwillison.net/"",
""label"", ""Simon Willison""
);
* [json_object() link demo](https://datasette-json-html.datasette.io/demo?sql=select+json_object%28%0D%0A++++%22href%22%2C+%22https%3A%2F%2Fsimonwillison.net%2F%22%2C%0D%0A++++%22label%22%2C+%22Simon+Willison%22%0D%0A%29%3B)
You can use these functions to construct JSON objects that work with the plugin from data in a table:
select id, json_object(
""href"", url, ""label"", text
) from mytable;
* [Demo that builds links against a table](https://datasette-json-html.datasette.io/demo?sql=select+json_object%28%22href%22%2C+url%2C+%22label%22%2C+package%2C+%22title%22%2C+package+%7C%7C+%22+%22+%7C%7C+url%29+as+package+from+packages)
The `json_group_array()` function is an aggregate function similar to `group_concat()` - it allows you to construct lists of JSON objects in conjunction with a `GROUP BY` clause.
This means you can use it to construct dynamic lists of links, for example:
select
substr(package, 0, 12) as prefix,
json_group_array(
json_object(
""href"", url,
""label"", package
)
) as package_links
from packages
group by prefix
* [Demo of json_group_array()](https://datasette-json-html.datasette.io/demo?sql=select%0D%0A++++substr%28package%2C+0%2C+12%29+as+prefix%2C%0D%0A++++json_group_array%28%0D%0A++++++++json_object%28%0D%0A++++++++++++%22href%22%2C+url%2C%0D%0A++++++++++++%22label%22%2C+package%0D%0A++++++++%29%0D%0A++++%29+as+package_links%0D%0Afrom+packages%0D%0Agroup+by+prefix)
## The `urllib_quote_plus()` SQL function
Since this plugin is designed to be used with SQL that constructs the underlying JSON structure, it is likely you will need to construct dynamic URLs from results returned by a SQL query.
This plugin registers a custom SQLite function called `urllib_quote_plus()` to help you do that. It lets you use Python's [urllib.parse.quote\_plus() function](https://docs.python.org/3/library/urllib.parse.html#urllib.parse.quote_plus) from within a SQL query.
Here's an example of how you might use it:
select id, json_object(
""href"",
""/mydatabase/other_table?_search="" || urllib_quote_plus(text),
""label"", text
) from mytable;
","
datasette-json-html
Datasette plugin for rendering HTML based on JSON values, using the render_cell plugin hook.
This plugin looks for cell values that match a very specific JSON format and converts them into HTML when they are rendered by the Datasette interface.
The json_group_array() function is an aggregate function similar to group_concat() - it allows you to construct lists of JSON objects in conjunction with a GROUP BY clause.
This means you can use it to construct dynamic lists of links, for example:
select
substr(package, 0, 12) as prefix,
json_group_array(
json_object(
""href"", url,
""label"", package
)
) as package_links
from packages
group by prefix
Since this plugin is designed to be used with SQL that constructs the underlying JSON structure, it is likely you will need to construct dynamic URLs from results returned by a SQL query.
This plugin registers a custom SQLite function called urllib_quote_plus() to help you do that. It lets you use Python's urllib.parse.quote_plus() function from within a SQL query.
Here's an example of how you might use it:
select id, json_object(
""href"",
""/mydatabase/other_table?_search="" || urllib_quote_plus(text),
""label"", text
) from mytable;
",1,public,0,,,
163790822,MDEwOlJlcG9zaXRvcnkxNjM3OTA4MjI=,datasette-sqlite-fts4,simonw/datasette-sqlite-fts4,0,9599,https://github.com/simonw/datasette-sqlite-fts4,Datasette plugin that adds custom SQL functions for working with SQLite FTS4,0,2019-01-02T03:40:41Z,2022-07-31T16:33:25Z,2022-07-31T14:46:26Z,https://datasette.io/plugins/datasette-sqlite-fts4,14,3,3,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""plugin""]",1,0,3,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,2,"# datasette-sqlite-fts4
[](https://pypi.org/project/datasette-sqlite-fts4/)
[](https://github.com/simonw/datasette-sqlite-fts4/releases)
[](https://github.com/simonw/datasette-sqlite-fts4/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-sqlite-fts4/blob/main/LICENSE)
Datasette plugin that exposes the custom SQL functions from [sqlite-fts4](https://github.com/simonw/sqlite-fts4).
[Interactive demo](https://datasette-sqlite-fts4.datasette.io/24ways-fts4?sql=select%0D%0A++++json_object%28%0D%0A++++++++""label""%2C+articles.title%2C+""href""%2C+articles.url%0D%0A++++%29+as+article%2C%0D%0A++++articles.author%2C%0D%0A++++rank_score%28matchinfo%28articles_fts%2C+""pcx""%29%29+as+score%2C%0D%0A++++rank_bm25%28matchinfo%28articles_fts%2C+""pcnalx""%29%29+as+bm25%2C%0D%0A++++json_object%28%0D%0A++++++++""pre""%2C+annotate_matchinfo%28matchinfo%28articles_fts%2C+""pcxnalyb""%29%2C+""pcxnalyb""%29%0D%0A++++%29+as+annotated_matchinfo%2C%0D%0A++++matchinfo%28articles_fts%2C+""pcxnalyb""%29+as+matchinfo%2C%0D%0A++++decode_matchinfo%28matchinfo%28articles_fts%2C+""pcxnalyb""%29%29+as+decoded_matchinfo%0D%0Afrom%0D%0A++++articles_fts+join+articles+on+articles.rowid+%3D+articles_fts.rowid%0D%0Awhere%0D%0A++++articles_fts+match+%3Asearch%0D%0Aorder+by+bm25&search=jquery+maps). Read [Exploring search relevance algorithms with SQLite](https://simonwillison.net/2019/Jan/7/exploring-search-relevance-algorithms-sqlite/) for further details on this project.
## Installation
pip install datasette-sqlite-fts4
If you are deploying a database using `datasette publish` you can include this plugin using the `--install` option:
datasette publish now mydb.db --install=datasette-sqlite-fts4
","
datasette-sqlite-fts4
Datasette plugin that exposes the custom SQL functions from sqlite-fts4.
If you are deploying a database using datasette publish you can include this plugin using the --install option:
datasette publish now mydb.db --install=datasette-sqlite-fts4
",1,public,0,,0,
166159072,MDEwOlJlcG9zaXRvcnkxNjYxNTkwNzI=,db-to-sqlite,simonw/db-to-sqlite,0,9599,https://github.com/simonw/db-to-sqlite,CLI tool for exporting tables or queries from any SQL database to a SQLite file,0,2019-01-17T04:16:48Z,2021-06-11T22:52:12Z,2021-06-11T22:55:56Z,,77,226,226,Python,1,1,1,1,0,12,0,0,2,apache-2.0,"[""sqlalchemy"", ""sqlite"", ""datasette"", ""datasette-io"", ""datasette-tool""]",12,2,226,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,12,4,"# db-to-sqlite
[](https://pypi.python.org/pypi/db-to-sqlite)
[](https://github.com/simonw/db-to-sqlite/releases)
[](https://github.com/simonw/db-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/db-to-sqlite/blob/main/LICENSE)
CLI tool for exporting tables or queries from any SQL database to a SQLite file.
## Installation
Install from PyPI like so:
pip install db-to-sqlite
If you want to use it with MySQL, you can install the extra dependency like this:
pip install 'db-to-sqlite[mysql]'
Installing the `mysqlclient` library on OS X can be tricky - I've found [this recipe](https://gist.github.com/simonw/90ac0afd204cd0d6d9c3135c3888d116) to work (run that before installing `db-to-sqlite`).
For PostgreSQL, use this:
pip install 'db-to-sqlite[postgresql]'
## Usage
Usage: db-to-sqlite [OPTIONS] CONNECTION PATH
Load data from any database into SQLite.
PATH is a path to the SQLite file to create, e.g. /tmp/my_database.db
CONNECTION is a SQLAlchemy connection string, for example:
postgresql://localhost/my_database
postgresql://username:passwd@localhost/my_database
mysql://root@localhost/my_database
mysql://username:passwd@localhost/my_database
More: https://docs.sqlalchemy.org/en/13/core/engines.html#database-urls
Options:
--version Show the version and exit.
--all Detect and copy all tables
--table TEXT Specific tables to copy
--skip TEXT When using --all skip these tables
--redact TEXT... (table, column) pairs to redact with ***
--sql TEXT Optional SQL query to run
--output TEXT Table in which to save --sql query results
--pk TEXT Optional column to use as a primary key
--index-fks / --no-index-fks Should foreign keys have indexes? Default on
-p, --progress Show progress bar
--postgres-schema TEXT PostgreSQL schema to use
--help Show this message and exit.
For example, to save the content of the `blog_entry` table from a PostgreSQL database to a local file called `blog.db` you could do this:
db-to-sqlite ""postgresql://localhost/myblog"" blog.db \
--table=blog_entry
You can specify `--table` more than once.
You can also save the data from all of your tables, effectively creating a SQLite copy of your entire database. Any foreign key relationships will be detected and added to the SQLite database. For example:
db-to-sqlite ""postgresql://localhost/myblog"" blog.db \
--all
When running `--all` you can specify tables to skip using `--skip`:
db-to-sqlite ""postgresql://localhost/myblog"" blog.db \
--all \
--skip=django_migrations
If you want to save the results of a custom SQL query, do this:
db-to-sqlite ""postgresql://localhost/myblog"" output.db \
--output=query_results \
--sql=""select id, title, created from blog_entry"" \
--pk=id
The `--output` option specifies the table that should contain the results of the query.
## Using db-to-sqlite with PostgreSQL schemas
If the tables you want to copy from your PostgreSQL database aren't in the default schema, you can specify an alternate one with the `--postgres-schema` option:
db-to-sqlite ""postgresql://localhost/myblog"" blog.db \
--all \
--postgres-schema my_schema
## Using db-to-sqlite with Heroku Postgres
If you run an application on [Heroku](https://www.heroku.com/) using their [Postgres database product](https://www.heroku.com/postgres), you can use the `heroku config` command to access a compatible connection string:
$ heroku config --app myappname | grep HEROKU_POSTG
HEROKU_POSTGRESQL_OLIVE_URL: postgres://username:password@ec2-xxx-xxx-xxx-x.compute-1.amazonaws.com:5432/dbname
You can pass this to `db-to-sqlite` to create a local SQLite database with the data from your Heroku instance.
You can even do this using a bash one-liner:
$ db-to-sqlite $(heroku config --app myappname | grep HEROKU_POSTG | cut -d: -f 2-) \
/tmp/heroku.db --all -p
1/23: django_migrations
...
17/23: blog_blogmark
[####################################] 100%
...
## Related projects
* [Datasette](https://github.com/simonw/datasette): A tool for exploring and publishing data. Works great with SQLite files generated using `db-to-sqlite`.
* [sqlite-utils](https://github.com/simonw/sqlite-utils): Python CLI utility and library for manipulating SQLite databases.
* [csvs-to-sqlite](https://github.com/simonw/csvs-to-sqlite): Convert CSV files into a SQLite database.
## Development
To set up this tool locally, first check out the code. Then create a new virtual environment:
cd db-to-sqlite
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
This will skip tests against MySQL or PostgreSQL if you do not have their additional dependencies installed.
You can install those extra dependencies like so:
pip install -e '.[test_mysql,test_postgresql]'
You can alternatively use `pip install psycopg2-binary` if you cannot install the `psycopg2` dependency used by the `test_postgresql` extra.
See [Running a MySQL server using Homebrew](https://til.simonwillison.net/homebrew/mysql-homebrew) for tips on running the tests against MySQL on macOS, including how to install the `mysqlclient` dependency.
The PostgreSQL and MySQL tests default to expecting to run against servers on localhost. You can use environment variables to point them at different test database servers:
- `MYSQL_TEST_DB_CONNECTION` - defaults to `mysql://root@localhost/test_db_to_sqlite`
- `POSTGRESQL_TEST_DB_CONNECTION` - defaults to `postgresql://localhost/test_db_to_sqlite`
The database you indicate in the environment variable - `test_db_to_sqlite` by default - will be deleted and recreated on every test run.
","
db-to-sqlite
CLI tool for exporting tables or queries from any SQL database to a SQLite file.
Installation
Install from PyPI like so:
pip install db-to-sqlite
If you want to use it with MySQL, you can install the extra dependency like this:
pip install 'db-to-sqlite[mysql]'
Installing the mysqlclient library on OS X can be tricky - I've found this recipe to work (run that before installing db-to-sqlite).
For PostgreSQL, use this:
pip install 'db-to-sqlite[postgresql]'
Usage
Usage: db-to-sqlite [OPTIONS] CONNECTION PATH
Load data from any database into SQLite.
PATH is a path to the SQLite file to create, e.g. /tmp/my_database.db
CONNECTION is a SQLAlchemy connection string, for example:
postgresql://localhost/my_database
postgresql://username:passwd@localhost/my_database
mysql://root@localhost/my_database
mysql://username:passwd@localhost/my_database
More: https://docs.sqlalchemy.org/en/13/core/engines.html#database-urls
Options:
--version Show the version and exit.
--all Detect and copy all tables
--table TEXT Specific tables to copy
--skip TEXT When using --all skip these tables
--redact TEXT... (table, column) pairs to redact with ***
--sql TEXT Optional SQL query to run
--output TEXT Table in which to save --sql query results
--pk TEXT Optional column to use as a primary key
--index-fks / --no-index-fks Should foreign keys have indexes? Default on
-p, --progress Show progress bar
--postgres-schema TEXT PostgreSQL schema to use
--help Show this message and exit.
For example, to save the content of the blog_entry table from a PostgreSQL database to a local file called blog.db you could do this:
You can also save the data from all of your tables, effectively creating a SQLite copy of your entire database. Any foreign key relationships will be detected and added to the SQLite database. For example:
If you want to save the results of a custom SQL query, do this:
db-to-sqlite ""postgresql://localhost/myblog"" output.db \
--output=query_results \
--sql=""select id, title, created from blog_entry"" \
--pk=id
The --output option specifies the table that should contain the results of the query.
Using db-to-sqlite with PostgreSQL schemas
If the tables you want to copy from your PostgreSQL database aren't in the default schema, you can specify an alternate one with the --postgres-schema option:
If you run an application on Heroku using their Postgres database product, you can use the heroku config command to access a compatible connection string:
Datasette: A tool for exploring and publishing data. Works great with SQLite files generated using db-to-sqlite.
sqlite-utils: Python CLI utility and library for manipulating SQLite databases.
csvs-to-sqlite: Convert CSV files into a SQLite database.
Development
To set up this tool locally, first check out the code. Then create a new virtual environment:
cd db-to-sqlite
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
This will skip tests against MySQL or PostgreSQL if you do not have their additional dependencies installed.
You can install those extra dependencies like so:
pip install -e '.[test_mysql,test_postgresql]'
You can alternatively use pip install psycopg2-binary if you cannot install the psycopg2 dependency used by the test_postgresql extra.
See Running a MySQL server using Homebrew for tips on running the tests against MySQL on macOS, including how to install the mysqlclient dependency.
The PostgreSQL and MySQL tests default to expecting to run against servers on localhost. You can use environment variables to point them at different test database servers:
MYSQL_TEST_DB_CONNECTION - defaults to mysql://root@localhost/test_db_to_sqlite
POSTGRESQL_TEST_DB_CONNECTION - defaults to postgresql://localhost/test_db_to_sqlite
The database you indicate in the environment variable - test_db_to_sqlite by default - will be deleted and recreated on every test run.
",,,,,,
167730071,MDEwOlJlcG9zaXRvcnkxNjc3MzAwNzE=,datasette-pretty-json,simonw/datasette-pretty-json,0,9599,https://github.com/simonw/datasette-pretty-json,Datasette plugin that pretty-prints any column values that are valid JSON objects or arrays,0,2019-01-26T19:30:43Z,2022-09-24T06:13:11Z,2022-09-28T21:06:31Z,,14,8,8,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""json""]",0,1,8,master,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-pretty-json
[](https://pypi.org/project/datasette-pretty-json/)
[](https://github.com/simonw/datasette-pretty-json/releases)
[](https://github.com/simonw/datasette-pretty-json/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-pretty-json/blob/main/LICENSE)
[Datasette](https://github.com/simonw/datasette) plugin that pretty-prints any column values that are valid JSON objects or arrays.
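As an illustration (my own example, not from this repository), the plugin targets text columns containing a JSON object or array, for example one created like this with Python's sqlite3 module:

```python
import json
import sqlite3

# A hypothetical table with a text column holding JSON - values like this
# are what datasette-pretty-json pretty-prints in the Datasette interface.
conn = sqlite3.connect('example.db')
conn.execute('create table if not exists events (id integer primary key, payload text)')
conn.execute(
    'insert into events (id, payload) values (?, ?)',
    (1, json.dumps({'tags': ['a', 'b'], 'count': 2})),
)
conn.commit()
```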
You may also be interested in [datasette-json-html](https://github.com/simonw/datasette-json-html).
","
datasette-pretty-json
Datasette plugin that pretty-prints any column values that are valid JSON objects or arrays.
",1,public,0,,0,
167759846,MDEwOlJlcG9zaXRvcnkxNjc3NTk4NDY=,markdown-to-sqlite,simonw/markdown-to-sqlite,0,9599,https://github.com/simonw/markdown-to-sqlite,CLI tool for loading markdown files into a SQLite database,0,2019-01-27T02:04:54Z,2022-05-13T18:09:26Z,2022-05-13T18:09:22Z,,13,49,49,Python,1,1,1,1,0,2,0,0,2,apache-2.0,"[""datasette-io"", ""datasette-tool"", ""markdown"", ""sqlite"", ""yaml""]",2,2,49,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,2,3,"# markdown-to-sqlite
[](https://pypi.python.org/pypi/markdown-to-sqlite)
[](https://github.com/simonw/markdown-to-sqlite/releases)
[](https://github.com/simonw/markdown-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/markdown-to-sqlite/blob/main/LICENSE)
CLI tool for loading markdown files into a SQLite database.
YAML embedded in the markdown files will be used to populate additional columns.
Usage: markdown-to-sqlite [OPTIONS] DBNAME TABLE PATHS...
For example:
$ markdown-to-sqlite docs.db documents file1.md file2.md
## Breaking change
Prior to version 1.0 this argument order was different - markdown files were listed before the database and table.
","
markdown-to-sqlite
CLI tool for loading markdown files into a SQLite database.
YAML embedded in the markdown files will be used to populate additional columns.
Prior to version 1.0 this argument order was different - markdown files were listed before the database and table.
",1,public,0,,,
168474970,MDEwOlJlcG9zaXRvcnkxNjg0NzQ5NzA=,dbf-to-sqlite,simonw/dbf-to-sqlite,0,9599,https://github.com/simonw/dbf-to-sqlite,"CLI tool for converting DBF files (dBase, FoxPro etc) to SQLite",0,2019-01-31T06:30:46Z,2021-03-23T01:29:41Z,2020-02-16T00:41:20Z,,8,25,25,Python,1,1,1,1,0,8,0,0,3,apache-2.0,"[""sqlite"", ""foxpro"", ""dbf"", ""dbase"", ""datasette-io"", ""datasette-tool""]",8,3,25,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,8,2,"# dbf-to-sqlite
[](https://pypi.python.org/pypi/dbf-to-sqlite)
[](https://travis-ci.com/simonw/dbf-to-sqlite)
[](https://github.com/simonw/dbf-to-sqlite/blob/master/LICENSE)
CLI tool for converting DBF files (dBase, FoxPro etc) to SQLite.
## Installation
pip install dbf-to-sqlite
## Usage
$ dbf-to-sqlite --help
Usage: dbf-to-sqlite [OPTIONS] DBF_PATHS... SQLITE_DB
Convert DBF files (dBase, FoxPro etc) to SQLite
https://github.com/simonw/dbf-to-sqlite
Options:
--version Show the version and exit.
--table TEXT Table name to use (only valid for single files)
-v, --verbose Show what's going on
--help Show this message and exit.
Example usage:
$ dbf-to-sqlite *.DBF database.db
This will create a new SQLite database called `database.db` containing one table for each of the `DBF` files in the current directory.
Looking for DBF files to try this out on? Try downloading the [Himalayan Database](http://himalayandatabase.com/) of all expeditions that have climbed in the Nepal Himalaya.
","
dbf-to-sqlite
CLI tool for converting DBF files (dBase, FoxPro etc) to SQLite.
Installation
pip install dbf-to-sqlite
Usage
$ dbf-to-sqlite --help
Usage: dbf-to-sqlite [OPTIONS] DBF_PATHS... SQLITE_DB
Convert DBF files (dBase, FoxPro etc) to SQLite
https://github.com/simonw/dbf-to-sqlite
Options:
--version Show the version and exit.
--table TEXT Table name to use (only valid for single files)
-v, --verbose Show what's going on
--help Show this message and exit.
Example usage:
$ dbf-to-sqlite *.DBF database.db
This will create a new SQLite database called database.db containing one table for each of the DBF files in the current directory.
Looking for DBF files to try this out on? Try downloading the Himalayan Database of all expeditions that have climbed in the Nepal Himalaya.
",,,,,,
174715153,MDEwOlJlcG9zaXRvcnkxNzQ3MTUxNTM=,datasette-jellyfish,simonw/datasette-jellyfish,0,9599,https://github.com/simonw/datasette-jellyfish,Datasette plugin adding SQL functions for fuzzy text matching powered by Jellyfish,0,2019-03-09T16:02:01Z,2021-02-06T02:33:49Z,2021-02-06T02:34:18Z,https://datasette.io/plugins/datasette-jellyfish,15,9,9,Python,1,1,1,1,0,2,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",2,0,9,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,2,1,"# datasette-jellyfish
[](https://pypi.org/project/datasette-jellyfish/)
[](https://github.com/simonw/datasette-jellyfish/releases)
[](https://github.com/simonw/datasette-jellyfish/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-jellyfish/blob/main/LICENSE)
Datasette plugin that adds custom SQL functions for fuzzy string matching, built on top of the [Jellyfish](https://github.com/jamesturk/jellyfish) Python library by James Turk and Michael Stephens.
Interactive demos:
* [soundex, metaphone, nysiis, match_rating_codex comparison](https://latest-with-plugins.datasette.io/fixtures?sql=SELECT%0D%0A++++soundex%28%3As%29%2C+%0D%0A++++metaphone%28%3As%29%2C+%0D%0A++++nysiis%28%3As%29%2C+%0D%0A++++match_rating_codex%28%3As%29&s=demo).
* [distance functions comparison](https://latest-with-plugins.datasette.io/fixtures?sql=SELECT%0D%0A++++levenshtein_distance%28%3As1%2C+%3As2%29%2C%0D%0A++++damerau_levenshtein_distance%28%3As1%2C+%3As2%29%2C%0D%0A++++hamming_distance%28%3As1%2C+%3As2%29%2C%0D%0A++++jaro_similarity%28%3As1%2C+%3As2%29%2C%0D%0A++++jaro_winkler_similarity%28%3As1%2C+%3As2%29%2C%0D%0A++++match_rating_comparison%28%3As1%2C+%3As2%29%3B&s1=barrack+obama&s2=barrack+h+obama)
Examples:
SELECT soundex(""hello"");
-- Outputs H400
SELECT metaphone(""hello"");
-- Outputs HL
SELECT nysiis(""hello"");
-- Outputs HAL
SELECT match_rating_codex(""hello"");
-- Outputs HLL
SELECT porter_stem(""running"");
-- Outputs run
SELECT levenshtein_distance(""hello"", ""hello world"");
-- Outputs 6
SELECT damerau_levenshtein_distance(""hello"", ""hello world"");
-- Outputs 6
SELECT hamming_distance(""hello"", ""hello world"");
-- Outputs 6
SELECT jaro_similarity(""hello"", ""hello world"");
-- Outputs 0.8181818181818182
SELECT jaro_winkler_similarity(""hello"", ""hello world"");
-- Outputs 0.890909090909091
SELECT match_rating_comparison(""hello"", ""helloo"");
-- Outputs 1
See [the Jellyfish documentation](https://jellyfish.readthedocs.io/en/latest/) for an explanation of each of these functions.","
datasette-jellyfish
Datasette plugin that adds custom SQL functions for fuzzy string matching, built on top of the Jellyfish Python library by James Turk and Michael Stephens.
",,,,,,
175321497,MDEwOlJlcG9zaXRvcnkxNzUzMjE0OTc=,csv-diff,simonw/csv-diff,0,9599,https://github.com/simonw/csv-diff,Python CLI tool and library for diffing CSV and JSON files,0,2019-03-13T01:11:26Z,2022-07-29T20:01:02Z,2022-07-29T20:00:59Z,,34,198,198,Python,1,1,1,1,0,29,0,0,18,apache-2.0,"[""click"", ""csv"", ""datasette-io"", ""datasette-tool"", ""diff"", ""git-scraping""]",29,18,198,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,29,7,"# csv-diff
[](https://pypi.org/project/csv-diff/)
[](https://github.com/simonw/csv-diff/releases)
[](https://github.com/simonw/csv-diff/actions?query=workflow%3ATest)
[](https://github.com/simonw/csv-diff/blob/main/LICENSE)
Tool for viewing the difference between two CSV, TSV or JSON files. See [Generating a commit log for San Francisco’s official list of trees](https://simonwillison.net/2019/Mar/13/tree-history/) (and the [sf-tree-history repo commit log](https://github.com/simonw/sf-tree-history/commits)) for background information on this project.
## Installation
pip install csv-diff
## Usage
Consider two CSV files:
`one.csv`
id,name,age
1,Cleo,4
2,Pancakes,2
`two.csv`
id,name,age
1,Cleo,5
3,Bailey,1
`csv-diff` can show a human-readable summary of differences between the files:
$ csv-diff one.csv two.csv --key=id
1 row changed, 1 row added, 1 row removed
1 row changed
Row 1
age: ""4"" => ""5""
1 row added
id: 3
name: Bailey
age: 1
1 row removed
id: 2
name: Pancakes
age: 2
The `--key=id` option means that the `id` column should be treated as the unique key, to identify which records have changed.
The tool will automatically detect if your files are comma- or tab-separated. You can over-ride this automatic detection and force the tool to use a specific format using `--format=tsv` or `--format=csv`.
You can also feed it JSON files, provided they are a JSON array of objects where each object has the same keys. Use `--format=json` if your input files are JSON.
Use `--show-unchanged` to include full details of the unchanged values for rows with at least one change in the diff output:
% csv-diff one.csv two.csv --key=id --show-unchanged
1 row changed
id: 1
age: ""4"" => ""5""
Unchanged:
name: ""Cleo""
You can use the `--json` option to get a machine-readable difference:
$ csv-diff one.csv two.csv --key=id --json
{
""added"": [
{
""id"": ""3"",
""name"": ""Bailey"",
""age"": ""1""
}
],
""removed"": [
{
""id"": ""2"",
""name"": ""Pancakes"",
""age"": ""2""
}
],
""changed"": [
{
""key"": ""1"",
""changes"": {
""age"": [
""4"",
""5""
]
}
}
],
""columns_added"": [],
""columns_removed"": []
}
## As a Python library
You can also import the Python library into your own code like so:
from csv_diff import load_csv, compare
diff = compare(
load_csv(open(""one.csv""), key=""id""),
load_csv(open(""two.csv""), key=""id"")
)
`diff` will now contain the same data structure as the output in the `--json` example above.
If the columns in the CSV have changed, those added or removed columns will be ignored when calculating changes made to specific rows.
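As a rough sketch (assuming the `one.csv` and `two.csv` files shown earlier), you can work with that data structure directly:

```python
from csv_diff import load_csv, compare

diff = compare(
    load_csv(open(""one.csv""), key=""id""),
    load_csv(open(""two.csv""), key=""id"")
)
# diff has the same shape as the --json output shown above
for row in diff[""changed""]:
    print(row[""key""], row[""changes""])
print(""added:"", len(diff[""added""]), ""removed:"", len(diff[""removed""]))
```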
## As a Docker container
### Build the image
$ docker build -t csvdiff .
### Run the container
$ docker run --rm -v $(pwd):/files csvdiff
Suppose the current directory contains two CSV files: one.csv and two.csv
$ docker run --rm -v $(pwd):/files csvdiff one.csv two.csv
## Alternatives
- [csvdiff](https://github.com/aswinkarthik/csvdiff) is a ""fast diff tool for comparing CSV files"" - you may get better results from this than from `csv-diff` against larger files.
","
The --key=id option means that the id column should be treated as the unique key, to identify which records have changed.
The tool will automatically detect if your files are comma- or tab-separated. You can over-ride this automatic detection and force the tool to use a specific format using --format=tsv or --format=csv.
You can also feed it JSON files, provided they are a JSON array of objects where each object has the same keys. Use --format=json if your input files are JSON.
Use --show-unchanged to include full details of the unchanged values for rows with at least one change in the diff output:
diff will now contain the same data structure as the output in the --json example above.
If the columns in the CSV have changed, those added or removed columns will be ignored when calculating changes made to specific rows.
As a Docker container
Build the image
$ docker build -t csvdiff .
Run the container
$ docker run --rm -v $(pwd):/files csvdiff
Suppose the current directory contains two CSV files: one.csv and two.csv
$ docker run --rm -v $(pwd):/files csvdiff one.csv two.csv
Alternatives
csvdiff is a ""fast diff tool for comparing CSV files"" - you may get better results from this than from csv-diff against larger files.
",1,public,0,,0,
175550127,MDEwOlJlcG9zaXRvcnkxNzU1NTAxMjc=,yaml-to-sqlite,simonw/yaml-to-sqlite,0,9599,https://github.com/simonw/yaml-to-sqlite,Utility for converting YAML files to SQLite,0,2019-03-14T04:49:08Z,2021-06-13T09:04:40Z,2021-06-13T04:45:52Z,,19,36,36,Python,1,1,1,1,0,2,0,0,0,apache-2.0,"[""yaml"", ""sqlite"", ""datasette-io"", ""datasette-tool""]",2,0,36,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,2,1,"# yaml-to-sqlite
[](https://pypi.org/project/yaml-to-sqlite/)
[](https://github.com/simonw/yaml-to-sqlite/releases)
[](https://github.com/simonw/yaml-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/yaml-to-sqlite/blob/main/LICENSE)
Load the contents of a YAML file into a SQLite database table.
```
$ yaml-to-sqlite --help
Usage: yaml-to-sqlite [OPTIONS] DB_PATH TABLE YAML_FILE
Convert YAML files to SQLite
Options:
--version Show the version and exit.
--pk TEXT Column to use as a primary key
--single-column TEXT If YAML file is a list of values, populate this column
--help Show this message and exit.
```
## Usage
Given a `news.yml` file containing the following:
```yaml
- date: 2021-06-05
body: |-
[Datasette 0.57](https://docs.datasette.io/en/stable/changelog.html#v0-57) is out with an important security patch.
- date: 2021-05-10
body: |-
[Django SQL Dashboard](https://simonwillison.net/2021/May/10/django-sql-dashboard/) is a new tool that brings a useful authenticated subset of Datasette to Django projects that are built on top of PostgreSQL.
```
Running this command:
```bash
$ yaml-to-sqlite news.db stories news.yml
```
Will create a database file with this schema:
```bash
$ sqlite-utils schema news.db
CREATE TABLE [stories] (
[date] TEXT,
[body] TEXT
);
```
The `--pk` option can be used to set a column as the primary key for the table:
```bash
$ yaml-to-sqlite news.db stories news.yml --pk date
$ sqlite-utils schema news.db
CREATE TABLE [stories] (
[date] TEXT PRIMARY KEY,
[body] TEXT
);
```
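Under the hood this is roughly equivalent to loading the YAML with PyYAML and inserting the records using [sqlite-utils](https://sqlite-utils.datasette.io/) - a rough sketch of the idea, not the tool's actual implementation:
```python
import sqlite_utils
import yaml

# Roughly what ""yaml-to-sqlite news.db stories news.yml --pk date"" does (illustrative only)
records = yaml.safe_load(open(""news.yml""))
db = sqlite_utils.Database(""news.db"")
db[""stories""].insert_all(records, pk=""date"")
```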
## Single column YAML lists
The `--single-column` option can be used when the YAML file is a list of values, for example a file called `dogs.yml` containing the following:
```yaml
- Cleo
- Pancakes
- Nixie
```
Running this command:
```bash
$ yaml-to-sqlite dogs.db dogs dogs.yml --single-column=name
```
Will create a single `dogs` table with a single `name` column that is the primary key:
```bash
$ sqlite-utils schema dogs.db
CREATE TABLE [dogs] (
[name] TEXT PRIMARY KEY
);
$ sqlite-utils dogs.db 'select * from dogs' -t
name
--------
Cleo
Pancakes
Nixie
```
","
",,,,,,
189321671,MDEwOlJlcG9zaXRvcnkxODkzMjE2NzE=,datasette-jq,simonw/datasette-jq,0,9599,https://github.com/simonw/datasette-jq,Datasette plugin that adds a custom SQL function for executing jq expressions against JSON values,0,2019-05-30T01:06:31Z,2020-12-24T17:35:27Z,2020-04-09T05:43:43Z,,11,10,10,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""jq"", ""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,10,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,2,"# datasette-jq
[](https://pypi.org/project/datasette-jq/)
[](https://circleci.com/gh/simonw/datasette-jq)
[](https://github.com/simonw/datasette-jq/blob/master/LICENSE)
Datasette plugin that adds custom SQL functions for executing [jq](https://stedolan.github.io/jq/) expressions against JSON values.
Install this plugin in the same environment as Datasette to enable the `jq()` SQL function.
Usage:
select jq(
column_with_json,
""{top_3: .classifiers[:3], v: .version}""
)
See [the jq manual](https://stedolan.github.io/jq/manual/#Basicfilters) for full details of supported expression syntax.
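As an illustration, if `column_with_json` held the hypothetical value `{""classifiers"": [""a"", ""b"", ""c"", ""d""], ""version"": ""1.0""}`, the example query above would return `{""top_3"": [""a"", ""b"", ""c""], ""v"": ""1.0""}`.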
## Interactive demo
You can try this plugin out at [datasette-jq-demo.datasette.io](https://datasette-jq-demo.datasette.io/)
Sample query:
select package, ""https://pypi.org/project/"" || package || ""/"" as url,
jq(info, ""{summary: .info.summary, author: .info.author, versions: .releases|keys|reverse}"")
from packages
[Try this query out](https://datasette-jq-demo.datasette.io/demo?sql=select+package%2C+%22https%3A%2F%2Fpypi.org%2Fproject%2F%22+%7C%7C+package+%7C%7C+%22%2F%22+as+url%2C%0D%0Ajq%28info%2C+%22%7Bsummary%3A+.info.summary%2C+author%3A+.info.author%2C+versions%3A+.releases%7Ckeys%7Creverse%7D%22%29%0D%0Afrom+packages) in the interactive demo.
","
",,,,,,
190950781,MDEwOlJlcG9zaXRvcnkxOTA5NTA3ODE=,datasette-bplist,simonw/datasette-bplist,0,9599,https://github.com/simonw/datasette-bplist,Datasette plugin for working with Apple's binary plist format,0,2019-06-09T01:15:01Z,2021-06-07T18:05:00Z,2019-06-09T01:17:19Z,,7,9,9,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""bplist"", ""datasette"", ""datasette-plugin"", ""datasette-io""]",0,1,9,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,0,"# datasette-bplist
[](https://pypi.org/project/datasette-bplist/)
[](https://circleci.com/gh/simonw/datasette-bplist)
[](https://github.com/simonw/datasette-bplist/blob/master/LICENSE)
Datasette plugin for working with Apple's [binary plist](https://en.wikipedia.org/wiki/Property_list) format.
This plugin adds two features: a display hook and a SQL function.
The display hook will detect any database values that are encoded using the binary plist format. It will decode them, convert them into JSON and display them pretty-printed in the Datasette UI.
The SQL function `bplist_to_json(value)` can be used inside a SQL query to convert a binary plist value into a JSON string. This can then be used with SQLite's `json_extract()` function or with the [datasette-jq](https://github.com/simonw/datasette-jq) plugin to further analyze that data as part of a SQL query.
Install this plugin in the same environment as Datasette to enable this new functionality:
pip install datasette-bplist
## Trying it out
If you use a Mac, you already have plenty of SQLite databases that contain binary plist data.
One example is the database that powers the Apple Photos app.
This database tends to be locked, so you will need to create a copy of the database in order to run queries against it:
cp ~/Pictures/Photos\ Library.photoslibrary/database/photos.db /tmp/photos.db
The database also makes use of custom SQLite extensions which prevent it from opening in Datasette.
You can work around this by exporting the data that you want to experiment with into a new SQLite file.
I recommend trying this plugin against the `RKMaster_dataNote` table, which contains plist-encoded EXIF metadata about the photos you have taken.
You can export that table into a fresh database like so:
sqlite3 /tmp/photos.db "".dump RKMaster_dataNote"" | sqlite3 /tmp/exif.db
Now run `datasette /tmp/exif.db` and you can start trying out the plugin.
## Using the bplist_to_json() SQL function
Once you have the `exif.db` demo working, you can try the `bplist_to_json()` SQL function.
Here's a query that shows the camera lenses you have used the most often to take photos:
select
json_extract(
bplist_to_json(value),
""$.{Exif}.LensModel""
) as lens,
count(*) as n
from RKMaster_dataNote
group by lens
order by n desc;
If you have a large number of photos this query can take a long time to execute, so you may need to increase the SQL time limit enforced by Datasette like so:
$ datasette /tmp/exif.db \
--config sql_time_limit_ms:10000
Here's another query, showing the time at which you took every photo in your library which is classified as a screenshot:
select
attachedToId,
json_extract(
bplist_to_json(value),
""$.{Exif}.DateTimeOriginal""
)
from RKMaster_dataNote
where
json_extract(
bplist_to_json(value),
""$.{Exif}.UserComment""
) = ""Screenshot""
And if you install the [datasette-cluster-map](https://github.com/simonw/datasette-cluster-map) plugin, this query will show you a map of your most recent 1000 photos:
select
*,
json_extract(
bplist_to_json(value),
""$.{GPS}.Latitude""
) as latitude,
-json_extract(
bplist_to_json(value),
""$.{GPS}.Longitude""
) as longitude,
json_extract(
bplist_to_json(value),
""$.{Exif}.DateTimeOriginal""
) as datetime
from
RKMaster_dataNote
where
latitude is not null
order by
attachedToId desc
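The `bplist_to_json()` function is only available inside Datasette. If you want to poke at a raw binary plist value from your own Python code, the standard library's `plistlib` module can decode it - a rough sketch, not the plugin's implementation:

    import json
    import plistlib
    import sqlite3

    conn = sqlite3.connect(""/tmp/exif.db"")
    value = conn.execute(
        ""select value from RKMaster_dataNote limit 1""
    ).fetchone()[0]

    # Decode the binary plist bytes and pretty-print the result as JSON
    decoded = plistlib.loads(value)
    print(json.dumps(decoded, indent=2, default=str))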
","
",,,,,,
191022928,MDEwOlJlcG9zaXRvcnkxOTEwMjI5Mjg=,datasette-render-binary,simonw/datasette-render-binary,0,9599,https://github.com/simonw/datasette-render-binary,Datasette plugin for rendering binary data,0,2019-06-09T15:25:52Z,2021-06-02T09:29:20Z,2019-06-13T16:14:31Z,,62,7,7,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,1,7,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-render-binary
[](https://pypi.org/project/datasette-render-binary/)
[](https://circleci.com/gh/simonw/datasette-render-binary)
[](https://github.com/simonw/datasette-render-binary/blob/master/LICENSE)
Datasette plugin for rendering binary data.
Install this plugin in the same environment as Datasette to enable this new functionality:
pip install datasette-render-binary
Binary data in cells will now be rendered as a mixture of characters and octets.

","
",,,,,,
195087137,MDEwOlJlcG9zaXRvcnkxOTUwODcxMzc=,datasette-auth-github,simonw/datasette-auth-github,0,9599,https://github.com/simonw/datasette-auth-github,Datasette plugin that authenticates users against GitHub,0,2019-07-03T16:02:53Z,2021-06-03T11:42:54Z,2021-02-25T06:40:17Z,https://datasette-auth-github-demo.datasette.io/,119,34,34,Python,1,1,1,1,0,4,0,0,3,apache-2.0,"[""asgi"", ""datasette"", ""datasette-plugin"", ""datasette-io""]",4,3,34,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,4,1,"# datasette-auth-github
[](https://pypi.org/project/datasette-auth-github/)
[](https://github.com/simonw/datasette-auth-github/releases)
[](https://github.com/simonw/datasette-auth-github/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-auth-github/blob/main/LICENSE)
Datasette plugin that authenticates users against GitHub.
- [Setup instructions](#setup-instructions)
- [The authenticated actor](#the-authenticated-actor)
- [Restricting access to specific users](#restricting-access-to-specific-users)
- [Restricting access to specific GitHub organizations or teams](#restricting-access-to-specific-github-organizations-or-teams)
- [What to do if a user is removed from an organization or team](#what-to-do-if-a-user-is-removed-from-an-organization-or-team)
## Setup instructions
* Install the plugin: `datasette install datasette-auth-github`
* Create a GitHub OAuth app: https://github.com/settings/applications/new
* Set the Authorization callback URL to `http://127.0.0.1:8001/-/github-auth-callback`
* Create a `metadata.json` file with the following structure:
```json
{
""title"": ""datasette-auth-github demo"",
""plugins"": {
""datasette-auth-github"": {
""client_id"": {""$env"": ""GITHUB_CLIENT_ID""},
""client_secret"": {""$env"": ""GITHUB_CLIENT_SECRET""}
}
}
}
```
Now you can start Datasette like this, passing in the secrets as environment variables:
$ GITHUB_CLIENT_ID=XXX GITHUB_CLIENT_SECRET=YYY datasette \
fixtures.db -m metadata.json
Note that hard-coding secrets in `metadata.json` is a bad idea as they will be visible to anyone who can navigate to `/-/metadata`. Instead, we use Datasette's mechanism for [adding secret plugin configuration options](https://docs.datasette.io/en/stable/plugins.html#secret-configuration-values).
By default anonymous users will still be able to interact with Datasette. If you wish all users to have to sign in with a GitHub account first, add this to your ``metadata.json``:
```json
{
""allow"": {
""id"": ""*""
},
""plugins"": {
""datasette-auth-github"": {
""..."": ""...""
}
}
}
```
## The authenticated actor
Visit `/-/actor` when signed in to see the shape of the authenticated actor. It should look something like this:
```json
{
""actor"": {
""display"": ""simonw"",
""gh_id"": ""9599"",
""gh_name"": ""Simon Willison"",
""gh_login"": ""simonw"",
""gh_email"": ""..."",
""gh_orgs"": [
""dogsheep"",
""datasette-project""
],
""gh_teams"": [
""dogsheep/test""
]
}
}
```
The `gh_orgs` and `gh_teams` properties will only be present if you used `load_teams` or `load_orgs`, documented below.
## Restricting access to specific users
You can use Datasette's [permissions mechanism](https://docs.datasette.io/en/stable/authentication.html) to specify which user or users are allowed to access your instance. Here's how to restrict access to just GitHub user `simonw`:
```json
{
""allow"": {
""gh_login"": ""simonw""
},
""plugins"": {
""datasette-auth-github"": {
""..."": ""...""
}
}
}
```
This `""allow""` block can be positioned at the database, table or query level instead: see [Configuring permissions in metadata.json](https://docs.datasette.io/en/stable/authentication.html#configuring-permissions-in-metadata-json) for details.
Note that GitHub allows users to change their username, and it is possible for other people to claim old usernames. If you are concerned that your users may change their usernames you can key the allow blocks against GitHub user IDs instead, which do not change:
```json
{
""allow"": {
""gh_id"": ""9599""
}
}
```
## Restricting access to specific GitHub organizations or teams
You can also restrict access to users who are members of a specific GitHub organization.
You'll need to configure the plugin to check if the user is a member of that organization when they first sign in. You can do that using the `""load_orgs""` plugin configuration option.
Then you can use `""allow"": {""gh_orgs"": [...]}` to specify which organizations are allowed access.
```json
{
""plugins"": {
""datasette-auth-github"": {
""..."": ""..."",
""load_orgs"": [""your-organization""]
}
},
""allow"": {
""gh_orgs"": ""your-organization""
}
}
```
If your organization is [arranged into teams](https://help.github.com/en/articles/organizing-members-into-teams) you can restrict access to a specific team like this:
```json
{
""plugins"": {
""datasette-auth-github"": {
""..."": ""..."",
""load_teams"": [
""your-organization/staff"",
""your-organization/engineering"",
]
}
},
""allows"": {
""gh_team"": ""your-organization/engineering""
}
}
```
## What to do if a user is removed from an organization or team
A user's organization and team memberships are checked once, when they first sign in. Those teams and organizations are then persisted in the user's signed `ds_actor` cookie.
This means that if a user is removed from an organization or team but still has a Datasette cookie, they will still be able to access that Datasette instance.
You can remedy this by rotating the `DATASETTE_SECRET` environment variable any time you make changes to your GitHub organization members.
Changing this value will cause all of your existing users to be signed out, by invalidating their cookies. When they sign back in again their new memberships will be recorded in a new cookie.
See [Configuring the secret](https://docs.datasette.io/en/stable/settings.html?highlight=secret#configuring-the-secret) in the Datasette documentation for more details.
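One way to generate a fresh random secret - the approach suggested in the Datasette documentation - is with Python's `secrets` module:

    $ python3 -c 'import secrets; print(secrets.token_hex(32))'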
","
",,,,,,
195145678,MDEwOlJlcG9zaXRvcnkxOTUxNDU2Nzg=,sqlite-diffable,simonw/sqlite-diffable,0,9599,https://github.com/simonw/sqlite-diffable,Tools for dumping/loading a SQLite database to diffable directory structure,0,2019-07-04T00:58:46Z,2022-07-12T17:00:19Z,2022-08-18T22:49:29Z,,30,42,42,Python,1,1,1,1,0,3,0,0,3,apache-2.0,"[""datasette-io"", ""datasette-tool"", ""sqlite""]",3,3,42,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,3,1,"# sqlite-diffable
[](https://pypi.org/project/sqlite-diffable/)
[](https://github.com/simonw/sqlite-diffable/releases)
[](https://github.com/simonw/sqlite-diffable/blob/main/LICENSE)
Tools for dumping/loading a SQLite database to a diffable directory structure
## Installation
pip install sqlite-diffable
## Demo
The repository at [simonw/simonwillisonblog-backup](https://github.com/simonw/simonwillisonblog-backup) contains a backup of the database on my blog, https://simonwillison.net/ - created using this tool.
## Dumping a database
Given a SQLite database called `fixtures.db` containing a table `facetable`, the following will dump out that table to the `dump/` directory:
sqlite-diffable dump fixtures.db dump/ facetable
To dump out every table in that database, use `--all`:
sqlite-diffable dump fixtures.db dump/ --all
## Loading a database
To load a previously dumped database, run the following:
sqlite-diffable load restored.db dump/
This will show an error if any of the tables that are being restored already exist in the database file.
You can replace those tables (dropping them before restoring them) using the `--replace` option:
sqlite-diffable load restored.db dump/ --replace
## Converting to JSON objects
Table rows are stored in the `.ndjson` files as newline-delimited JSON arrays, like this:
```
[""a"", ""a"", ""a-a"", 63, null, 0.7364712141640124, ""$null""]
[""a"", ""b"", ""a-b"", 51, null, 0.6020187290499803, ""$null""]
```
Sometimes it can be more convenient to work with a list of JSON objects.
The `sqlite-diffable objects` command can read a `.ndjson` file and its accompanying `.metadata.json` file and output JSON objects to standard output:
sqlite-diffable objects fixtures.db dump/sortable.ndjson
The output of that command looks something like this:
```
{""pk1"": ""a"", ""pk2"": ""a"", ""content"": ""a-a"", ""sortable"": 63, ""sortable_with_nulls"": null, ""sortable_with_nulls_2"": 0.7364712141640124, ""text"": ""$null""}
{""pk1"": ""a"", ""pk2"": ""b"", ""content"": ""a-b"", ""sortable"": 51, ""sortable_with_nulls"": null, ""sortable_with_nulls_2"": 0.6020187290499803, ""text"": ""$null""}
```
Add `-o` to write that output to a file:
sqlite-diffable objects fixtures.db dump/sortable.ndjson -o output.txt
Add `--array` to output a JSON array of objects, as opposed to a newline-delimited file:
sqlite-diffable objects fixtures.db dump/sortable.ndjson --array
Output:
```
[
{""pk1"": ""a"", ""pk2"": ""a"", ""content"": ""a-a"", ""sortable"": 63, ""sortable_with_nulls"": null, ""sortable_with_nulls_2"": 0.7364712141640124, ""text"": ""$null""},
{""pk1"": ""a"", ""pk2"": ""b"", ""content"": ""a-b"", ""sortable"": 51, ""sortable_with_nulls"": null, ""sortable_with_nulls_2"": 0.6020187290499803, ""text"": ""$null""}
]
```
## Storage format
Each table is represented as two files. The first, `table_name.metadata.json`, contains metadata describing the structure of the table. For a table called `redirects_redirect` that file might look like this:
```json
{
""name"": ""redirects_redirect"",
""columns"": [
""id"",
""domain"",
""path"",
""target"",
""created""
],
""schema"": ""CREATE TABLE [redirects_redirect] (\n [id] INTEGER PRIMARY KEY,\n [domain] TEXT,\n [path] TEXT,\n [target] TEXT,\n [created] TEXT\n)""
}
```
It is an object with three keys: `name` is the name of the table, `columns` is an array of column names and `schema` is the SQL schema text used for that table.
The second file, `table_name.ndjson`, contains [newline-delimited JSON](http://ndjson.org/) for every row in the table. Each row is represented as a JSON array with items corresponding to each of the columns defined in the metadata.
That file for the `redirects_redirect.ndjson` table might look like this:
```
[1, ""feeds.simonwillison.net"", ""swn-everything"", ""https://simonwillison.net/atom/everything/"", ""2017-10-01T21:11:36.440537+00:00""]
[2, ""feeds.simonwillison.net"", ""swn-entries"", ""https://simonwillison.net/atom/entries/"", ""2017-10-01T21:12:32.478849+00:00""]
[3, ""feeds.simonwillison.net"", ""swn-links"", ""https://simonwillison.net/atom/links/"", ""2017-10-01T21:12:54.820729+00:00""]
```
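Reading this format back in your own code is straightforward. Here is a rough sketch (not the implementation of the `objects` command itself) that combines the two files into dictionaries:
```python
import json

def iter_objects(ndjson_path):
    # Column names live in the accompanying .metadata.json file
    metadata_path = ndjson_path.replace("".ndjson"", "".metadata.json"")
    columns = json.load(open(metadata_path))[""columns""]
    with open(ndjson_path) as fp:
        for line in fp:
            # Each line is a JSON array in the same order as the columns
            yield dict(zip(columns, json.loads(line)))

for row in iter_objects(""dump/redirects_redirect.ndjson""):
    print(row)
```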
","
",1,public,0,,0,
195696804,MDEwOlJlcG9zaXRvcnkxOTU2OTY4MDQ=,datasette-cors,simonw/datasette-cors,0,9599,https://github.com/simonw/datasette-cors,Datasette plugin for configuring CORS headers,0,2019-07-07T21:03:11Z,2021-02-27T00:31:13Z,2019-07-11T04:40:57Z,,11,9,9,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,1,9,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,3,"# datasette-cors
[](https://pypi.org/project/datasette-cors/)
[](https://circleci.com/gh/simonw/datasette-cors)
[](https://github.com/simonw/datasette-cors/blob/master/LICENSE)
Datasette plugin for configuring CORS headers, based on https://github.com/simonw/asgi-cors
You can use this plugin to allow JavaScript running on a whitelisted set of domains to make `fetch()` calls to the JSON API provided by your Datasette instance.
## Installation
pip install datasette-cors
## Configuration
You need to add some configuration to your Datasette `metadata.json` file for this plugin to take effect.
To whitelist specific domains, use this:
```json
{
""plugins"": {
""datasette-cors"": {
""hosts"": [""https://www.example.com""]
}
}
}
```
You can also whitelist patterns like this:
```json
{
""plugins"": {
""datasette-cors"": {
""host_wildcards"": [""https://*.example.com""]
}
}
}
```
## Testing it
To test this plugin out, run it locally by saving one of the above examples as `metadata.json` and running this:
$ datasette --memory -m metadata.json
Now visit https://www.example.com/ in your browser, open the browser developer console and paste in the following:
```javascript
fetch(""http://127.0.0.1:8001/:memory:.json?sql=select+sqlite_version%28%29"").then(r => r.json()).then(console.log)
```
If the plugin is running correctly, you will see the JSON response output to the console.
","
",,,,,,
197882382,MDEwOlJlcG9zaXRvcnkxOTc4ODIzODI=,healthkit-to-sqlite,dogsheep/healthkit-to-sqlite,0,53015001,https://github.com/dogsheep/healthkit-to-sqlite,Convert an Apple Healthkit export zip to a SQLite database,0,2019-07-20T05:03:12Z,2021-08-20T00:55:34Z,2021-08-20T00:56:17Z,https://datasette.io/tools/healthkit-to-sqlite,29,91,91,Python,1,1,1,1,0,4,0,0,8,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool"", ""dogsheep"", ""healthkit"", ""sqlite""]",4,8,91,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,4,3,"# healthkit-to-sqlite
[](https://pypi.org/project/healthkit-to-sqlite/)
[](https://github.com/dogsheep/healthkit-to-sqlite/releases)
[](https://github.com/dogsheep/healthkit-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/healthkit-to-sqlite/blob/main/LICENSE)
Convert an Apple Healthkit export zip to a SQLite database
## How to install
$ pip install healthkit-to-sqlite
## How to use
First you need to export your Apple HealthKit data.
1. On your iPhone, open the ""Health"" app
2. Click the profile icon in the top right
3. Click ""Export Health Data"" at the bottom of that page
4. Save the resulting file somewhere you can access it, or AirDrop it directly to your laptop.
Now you can convert the resulting `export.zip` file to SQLite like so:
$ healthkit-to-sqlite export.zip healthkit.db
A progress bar will be displayed. You can disable this using `--silent`.
```
Importing from HealthKit [#-------------] 5% 00:01:33
```
You can explore the resulting data using [Datasette](https://datasette.readthedocs.io/) like this:
$ datasette healthkit.db
","
",,,,,,
205429375,MDEwOlJlcG9zaXRvcnkyMDU0MjkzNzU=,swarm-to-sqlite,dogsheep/swarm-to-sqlite,0,53015001,https://github.com/dogsheep/swarm-to-sqlite,Create a SQLite database containing your checkin history from Foursquare Swarm,0,2019-08-30T17:37:29Z,2021-02-22T07:58:39Z,2021-01-18T04:36:03Z,,49,37,37,Python,1,1,1,1,0,1,0,0,1,apache-2.0,"[""sqlite"", ""foursquare"", ""swarm"", ""foursquare-api"", ""datasette"", ""dogsheep"", ""datasette-io"", ""datasette-tool""]",1,1,37,main,"{""admin"": false, ""push"": false, ""pull"": false}",,53015001,1,3,"# swarm-to-sqlite
[](https://pypi.org/project/swarm-to-sqlite/)
[](https://github.com/dogsheep/swarm-to-sqlite/releases)
[](https://github.com/dogsheep/swarm-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/swarm-to-sqlite/blob/main/LICENSE)
Create a SQLite database containing your checkin history from Foursquare Swarm.
## How to install
$ pip install swarm-to-sqlite
## Usage
You will need to first obtain a valid OAuth token for your Foursquare account. You can do so using this tool: https://your-foursquare-oauth-token.glitch.me/
The simplest usage is to provide the name of the database file you wish to write to. The tool will prompt you to paste in your token, and will then download your checkins and store them in the specified database file.
$ swarm-to-sqlite checkins.db
Please provide your Foursquare OAuth token:
Importing 3699 checkins [#########-----------------------] 27% 00:02:31
You can also pass the token as a command-line option:
$ swarm-to-sqlite checkins.db --token=XXX
Or as an environment variable:
$ export FOURSQUARE_TOKEN=XXX
$ swarm-to-sqlite checkins.db
To retrieve just checkins within the past X hours, days or weeks, use the `--since=` option. For example, to pull only checkins that happened within the last 10 days use:
$ swarm-to-sqlite checkins.db --token=XXX --since=10d
Use `2w` for two weeks, `10h` for ten hours, `3d` for three days.
In addition to saving the checkins to a database, you can also write them to a JSON file using the `--save` option:
$ swarm-to-sqlite checkins.db --save=checkins.json
Having done this, you can re-import checkins directly from that file (rather than making API calls to fetch data from Foursquare) like this:
$ swarm-to-sqlite checkins.db --load=checkins.json
## Using with Datasette
The SQLite database produced by this tool is designed to be browsed using [Datasette](https://datasette.io/).
You can install the [datasette-cluster-map](https://datasette.io/plugins/datasette-cluster-map) plugin to view your checkins on a map.
","
",,,,,,
206156866,MDEwOlJlcG9zaXRvcnkyMDYxNTY4NjY=,twitter-to-sqlite,dogsheep/twitter-to-sqlite,0,53015001,https://github.com/dogsheep/twitter-to-sqlite,Save data from Twitter to a SQLite database,0,2019-09-03T19:30:08Z,2021-12-26T18:08:43Z,2021-12-26T18:08:40Z,,298,269,269,Python,1,1,1,1,0,13,0,0,10,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool"", ""dogsheep"", ""sqlite"", ""twitter"", ""twitter-api""]",13,10,269,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,13,5,"# twitter-to-sqlite
[](https://pypi.org/project/twitter-to-sqlite/)
[](https://github.com/dogsheep/twitter-to-sqlite/releases)
[](https://github.com/dogsheep/twitter-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/twitter-to-sqlite/blob/main/LICENSE)
Save data from Twitter to a SQLite database.
**This tool currently uses Twitter API v1**. You may be unable to use it if you do not have an API key for that version of the API.
- [How to install](#how-to-install)
- [Authentication](#authentication)
- [Retrieving tweets by specific accounts](#retrieving-tweets-by-specific-accounts)
- [Retrieve user profiles in bulk](#retrieve-user-profiles-in-bulk)
- [Retrieve tweets in bulk](#retrieve-tweets-in-bulk)
- [Retrieving Twitter followers](#retrieving-twitter-followers)
- [Retrieving friends](#retrieving-friends)
- [Retrieving favorited tweets](#retrieving-favorited-tweets)
- [Retrieving Twitter lists](#retrieving-twitter-lists)
- [Retrieving Twitter list memberships](#retrieving-twitter-list-memberships)
- [Retrieving just follower and friend IDs](#retrieving-just-follower-and-friend-ids)
- [Retrieving tweets from your home timeline](#retrieving-tweets-from-your-home-timeline)
- [Retrieving your mentions](#retrieving-your-mentions)
- [Providing input from a SQL query with --sql and --attach](#providing-input-from-a-sql-query-with---sql-and---attach)
- [Running searches](#running-searches)
- [Capturing tweets in real-time with track and follow](#capturing-tweets-in-real-time-with-track-and-follow)
* [track](#track)
* [follow](#follow)
- [Importing data from your Twitter archive](#importing-data-from-your-twitter-archive)
- [Design notes](#design-notes)
## How to install
$ pip install twitter-to-sqlite
## Authentication
First, you will need to create a Twitter application at https://developer.twitter.com/en/apps. You may need to apply for a Twitter developer account - if so, you may find this [example of an application email](https://raw.githubusercontent.com/dogsheep/twitter-to-sqlite/main/email.png), which has been approved in the past, useful.
Once you have created your application, navigate to the ""Keys and tokens"" page and make note of the following:
* Your API key
* Your API secret key
* Your access token
* Your access token secret
You will need to save all four of these values to a JSON file in order to use this tool.
You can create that JSON file by running the following command and pasting in the values at the prompts:
$ twitter-to-sqlite auth
Create an app here: https://developer.twitter.com/en/apps
Then navigate to 'Keys and tokens' and paste in the following:
API key: xxx
API secret key: xxx
Access token: xxx
Access token secret: xxx
This will create a file called `auth.json` in your current directory containing the required values. To save the file at a different path or filename, use the `--auth=myauth.json` option.
## Retrieving tweets by specific accounts
The `user-timeline` command retrieves all of the tweets posted by the specified user accounts. It defaults to the account belonging to the authenticated user:
$ twitter-to-sqlite user-timeline twitter.db
Importing tweets [#####-------------------------------] 2799/17780 00:01:39
All of these commands assume that there is an `auth.json` file in the current directory. You can provide the path to your `auth.json` file using `-a`:
$ twitter-to-sqlite user-timeline twitter.db -a /path/to/auth.json
To load tweets for other users, pass their screen names as arguments:
$ twitter-to-sqlite user-timeline twitter.db cleopaws nichemuseums
Twitter's API only returns up to around 3,200 tweets for most user accounts, but you may find that it returns all available tweets for your own user account.
You can pass numeric Twitter user IDs instead of screen names using the `--ids` parameter.
You can use `--since` to retrieve every tweet since the last time you imported for that user, or `--since_id=xxx` to retrieve every tweet since a specific tweet ID.
This command also accepts `--sql` and `--attach` options, documented below.
## Retrieve user profiles in bulk
If you have a list of Twitter screen names (or user IDs) you can bulk fetch their fully inflated Twitter profiles using the `users-lookup` command:
$ twitter-to-sqlite users-lookup users.db simonw cleopaws
You can pass user IDs instead using the `--ids` option:
$ twitter-to-sqlite users-lookup users.db 12497 3166449535 --ids
This command also accepts `--sql` and `--attach` options, documented below.
## Retrieve tweets in bulk
If you have a list of tweet IDs you can bulk fetch them using the `statuses-lookup` command:
$ twitter-to-sqlite statuses-lookup tweets.db 1122154819815239680 1122154178493575169
The `--sql` and `--attach` options are supported.
Here's a recipe to retrieve any tweets that existing tweets are in-reply-to which have not yet been stored in your database:
$ twitter-to-sqlite statuses-lookup tweets.db \
--sql='
select in_reply_to_status_id
from tweets
where in_reply_to_status_id is not null' \
--skip-existing
The `--skip-existing` option means that tweets that have already been stored in the database will not be fetched again.
## Retrieving Twitter followers
The `followers` command retrieves details of every follower of the specified accounts. You can use it to retrieve your own followers, or you can pass one or more screen names to pull the followers for other accounts.
The following command pulls your followers and saves them in a SQLite database file called `twitter.db`:
$ twitter-to-sqlite followers twitter.db
This command is **extremely slow**, because Twitter imposes a rate limit of no more than one request per minute to this endpoint! If you are running it against an account with thousands of followers you should expect this to take several hours.
To retrieve followers for another account, use:
$ twitter-to-sqlite followers twitter.db cleopaws
This command also accepts the `--ids`, `--sql` and `--attach` options.
See [Analyzing my Twitter followers with Datasette](https://simonwillison.net/2018/Jan/28/analyzing-my-twitter-followers/) for the original inspiration for this command.
## Retrieving friends
The `friends` command works like the `followers` command, but retrieves the specified (or currently authenticated) user's friends - defined as accounts that the user is following.
$ twitter-to-sqlite friends twitter.db
It takes the same options as the `followers` command.
## Retrieving favorited tweets
The `favorites` command retrieves tweets that have been favorited by a specified user. Called without any extra arguments it retrieves tweets favorited by the currently authenticated user:
$ twitter-to-sqlite favorites faves.db
You can also use the `--screen_name` or `--user_id` arguments to retrieve favorite tweets for another user:
$ twitter-to-sqlite favorites faves-obama.db --screen_name=BarackObama
Use the `--stop_after=xxx` argument to retrieve only the most recent number of favorites, e.g. to get the authenticated user's 50 most recent favorites:
$ twitter-to-sqlite favorites faves.db --stop_after=50
## Retrieving Twitter lists
The `lists` command retrieves all of the lists belonging to one or more users.
$ twitter-to-sqlite lists lists.db simonw dogsheep
This command also accepts the `--sql` and `--attach` and `--ids` options.
To additionally fetch the list of members for each list, use `--members`.
## Retrieving Twitter list memberships
The `list-members` command can be used to retrieve details of one or more Twitter lists, including all of their members.
$ twitter-to-sqlite list-members members.db simonw/the-good-place
You can pass multiple `screen_name/list_slug` identifiers.
If you know the numeric IDs of the lists instead, you can use `--ids`:
$ twitter-to-sqlite list-members members.db 927913322841653248 --ids
## Retrieving just follower and friend IDs
It's also possible to retrieve just the numeric Twitter IDs of the accounts that specific users are following (""friends"" in Twitter's API terminology) or followed-by:
$ twitter-to-sqlite followers-ids members.db simonw cleopaws
This will populate the `following` table with `followed_id`/`follower_id` pairs for the two specified accounts, listing every account ID that is following either of those two accounts.
$ twitter-to-sqlite friends-ids members.db simonw cleopaws
This will do the same thing but pull the IDs that those accounts are following.
Both of these commands also support `--sql` and `--attach` as an alternative to passing screen names as direct command-line arguments. You can use `--ids` to process the inputs as user IDs rather than screen names.
The underlying Twitter APIs have a rate limit of 15 requests every 15 minutes - though they do return up to 5,000 IDs in each call. By default both of these subcommands will wait for 61 seconds between API calls in order to stay within the rate limit - you can adjust this behaviour down to just one second delay if you know you will not be making many calls using `--sleep=1`.
## Retrieving tweets from your home timeline
The `home-timeline` command retrieves up to 800 tweets from the home timeline of the authenticated user - generally this means tweets from people you follow.
$ twitter-to-sqlite home-timeline twitter.db
Importing timeline [#################--------] 591/800 00:01:14
The tweets are stored in the `tweets` table, and a record is added to the `timeline_tweets` table noting that this tweet came in due to being spotted in the timeline of your user.
You can use `--since` to retrieve just tweets that have been posted since the last time this command was run, or `--since_id=xxx` to explicitly pass in a tweet ID to use as the last position.
You can then view your timeline in Datasette using the following URL:
`/tweets/tweets?_where=id+in+(select+tweet+from+[timeline_tweets])&_sort_desc=id&_facet=user`
This will filter your tweets table to just tweets that appear in your timeline, ordered by most recent first and use faceting to show you which users are responsible for the most tweets.
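That URL is roughly equivalent to the following SQL query (the `_facet=user` part adds faceting on top in the Datasette UI):

    select * from tweets
    where id in (select tweet from [timeline_tweets])
    order by id desc;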
## Retrieving your mentions
The `mentions-timeline` command works like `home-timeline` except it retrieves tweets that mention the authenticated user's account. It records the user account that was mentioned in a `mentions_tweets` table.
It supports `--since` and `--since_id` in the same way as `home-timeline` does.
## Providing input from a SQL query with --sql and --attach
This option is available for some subcommands - run `twitter-to-sqlite command-name --help` to check.
You can provide Twitter screen names (or user IDs or tweet IDs) directly as command-line arguments, or you can provide those screen names or IDs by executing a SQL query.
For example: consider a SQLite database with an `attendees` table listing names and Twitter accounts - something like this:
| First | Last | Twitter |
|---------|------------|--------------|
| Simon | Willison | simonw |
| Avril | Lavigne | AvrilLavigne |
You can run the `users-lookup` command to pull the Twitter profile of every user listed in that database by loading the screen names using a `--sql` query:
$ twitter-to-sqlite users-lookup my.db --sql=""select Twitter from attendees""
If your database table contains Twitter IDs, you can select those IDs and pass the `--ids` argument. For example, to fetch the profiles of users who have had their user IDs inserted into the `following` table using the `twitter-to-sqlite friends-ids` command:
$ twitter-to-sqlite users-lookup my.db --sql=""select follower_id from following"" --ids
Or to avoid re-fetching users that have already been fetched:
$ twitter-to-sqlite users-lookup my.db \
--sql=""select followed_id from following where followed_id not in (
select id from users)"" --ids
If your data lives in a separate database file you can attach it using `--attach`. For example, consider the attendees example above but the data lives in an `attendees.db` file, and you want to fetch the user profiles into a `tweets.db` file. You could do that like this:
$ twitter-to-sqlite users-lookup tweets.db \
--attach=attendees.db \
--sql=""select Twitter from attendees.attendees""
The filename (without the extension) will be used as the database alias within SQLite. If you want a different alias for some reason you can specify that with a colon like this:
$ twitter-to-sqlite users-lookup tweets.db \
--attach=foo:attendees.db \
--sql=""select Twitter from foo.attendees""
## Running searches
The `search` command runs a search against the Twitter [standard search API](https://developer.twitter.com/en/docs/tweets/search/api-reference/get-search-tweets).
$ twitter-to-sqlite search tweets.db ""dogsheep""
This will import up to around 320 tweets that match that search term into the `tweets` table. It will also create a record in the `search_runs` table recording that the search took place, and many-to-many records in the `search_runs_tweets` table recording which tweets were seen for that search at that time.
You can use the `--since` parameter to check for previous search runs with the same arguments and only retrieve tweets that were posted since the last retrieved matching tweet.
The following additional options for `search` are supported:
* `--geocode`: `latitude,longitude,radius` where radius is a number followed by mi or km
* `--lang`: ISO 639-1 language code e.g. `en` or `es`
* `--locale`: Locale: only `ja` is currently effective
* `--result_type`: `mixed`, `recent` or `popular`. Defaults to `mixed`
* `--count`: Number of results per page, defaults to the maximum of 100
* `--stop_after`: Stop after this many results
* `--since_id`: Pull tweets since this Tweet ID. You probably want to use `--since` instead of this.
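For example, a hypothetical search for recent English-language tweets, stopping after 200 results:

    $ twitter-to-sqlite search tweets.db ""dogsheep"" \
        --lang=en --result_type=recent --stop_after=200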
## Capturing tweets in real-time with track and follow
This functionality is **experimental**. Please [file bug reports](https://github.com/dogsheep/twitter-to-sqlite/issues) if you find any!
Twitter provides a real-time API which can be used to subscribe to tweets as they happen. `twitter-to-sqlite` can use this API to continually update a SQLite database with tweets matching certain keywords, or referencing specific users.
### track
To track keywords, use the `track` command:
$ twitter-to-sqlite track tweets.db kakapo
This command will continue to run until you hit Ctrl+C. It will capture any tweets mentioning the keyword [kakapo](https://en.wikipedia.org/wiki/Kakapo) and store them in the `tweets.db` database file.
You can pass multiple keywords as a space separated list. This will capture tweets matching either of those keywords:
$ twitter-to-sqlite track tweets.db kakapo raccoon
You can enclose phrases in quotes to search for tweets matching both of those keywords:
$ twitter-to-sqlite track tweets.db 'trash panda'
See [the Twitter track documentation](https://developer.twitter.com/en/docs/tweets/filter-realtime/guides/basic-stream-parameters#track) for advanced tips on using this command.
Add the `--verbose` option to see matching tweets (in their verbose JSON form) displayed to the terminal as they are captured:
$ twitter-to-sqlite track tweets.db raccoon --verbose
### follow
The `follow` command will capture all tweets that are relevant to one or more specific Twitter users.
$ twitter-to-sqlite follow tweets.db nytimes
This includes tweets by those users, tweets that reply to or quote those users and retweets by those users. See [the Twitter follow documentation](https://developer.twitter.com/en/docs/tweets/filter-realtime/guides/basic-stream-parameters#follow) for full details.
The command accepts one or more screen names.
You can feed it numeric Twitter user IDs instead of screen names by using the `--ids` flag.
The command also supports the `--sql` and `--attach` options, and the `--verbose` option for displaying tweets as they are captured.
Here's how to start following tweets from every user ID currently represented as being followed in the `following` table (populated using the `friends-ids` command):
$ twitter-to-sqlite follow tweets.db \
--sql=""select distinct followed_id from following"" \
--ids
## Importing data from your Twitter archive
You can request an archive of your Twitter data by [following these instructions](https://help.twitter.com/en/managing-your-account/how-to-download-your-twitter-archive).
Twitter will send you a link to download a `.zip` file. You can import the contents of that file into a set of tables in a new database file called `archive.db` (each table beginning with the `archive_` prefix) using the `import` command:
$ twitter-to-sqlite import archive.db ~/Downloads/twitter-2019-06-25-b31f2.zip
This command does not populate any of the regular tables, since Twitter's export data does not exactly match the schema returned by the Twitter API.
It will delete and recreate the corresponding `archive_*` tables every time you run it. If this is not what you want, run the command against a new SQLite database file name rather than running it against one that already exists.
If you have already decompressed your archive, you can run this against the directory that you decompressed it to:
$ twitter-to-sqlite import archive.db ~/Downloads/twitter-2019-06-25-b31f2/
You can also run it against one or more specific files within that folder. For example, to import just the follower.js and following.js files:
$ twitter-to-sqlite import archive.db \
~/Downloads/twitter-2019-06-25-b31f2/follower.js \
~/Downloads/twitter-2019-06-25-b31f2/following.js
You may want to use other commands to populate tables based on data from the archive. For example, to retrieve full API versions of each of the tweets you have favourited in your archive, you could run the following:
$ twitter-to-sqlite statuses-lookup archive.db \
--sql='select tweetId from archive_like' \
--skip-existing
If you want these imported tweets to then be reflected in the `favorited_by` table, you can do so by applying the following SQL query:
$ sqlite3 archive.db
SQLite version 3.22.0 2018-01-22 18:45:57
Enter "".help"" for usage hints.
sqlite> INSERT OR IGNORE INTO favorited_by (tweet, user)
...> SELECT tweetId, 'YOUR_TWITTER_ID' FROM archive_like;
Replace YOUR_TWITTER_ID with your numeric Twitter ID. If you don't know that ID you can find it out by running the following:
$ twitter-to-sqlite fetch \
""https://api.twitter.com/1.1/account/verify_credentials.json"" \
| grep '""id""' | head -n 1
## Design notes
* Tweet IDs are stored as integers, to afford sorting by ID in a sensible way
* While we configure foreign key relationships between tables, we do not ask SQLite to enforce them. This is used by the `following` table to allow the `followers-ids` and `friends-ids` commands to populate it with user IDs even if the user accounts themselves are not yet present in the `users` table.
","
twitter-to-sqlite
Save data from Twitter to a SQLite database.
This tool currently uses Twitter API v1. You may be unable to use it if you do not have an API key for that version of the API.
Once you have created your application, navigate to the ""Keys and tokens"" page and make note of the following:
Your API key
Your API secret key
Your access token
Your access token secret
You will need to save all four of these values to a JSON file in order to use this tool.
You can create that JSON file by running the following command and pasting in the values at the prompts:
$ twitter-to-sqlite auth
Create an app here: https://developer.twitter.com/en/apps
Then navigate to 'Keys and tokens' and paste in the following:
API key: xxx
API secret key: xxx
Access token: xxx
Access token secret: xxx
This will create a file called auth.json in your current directory containing the required values. To save the file at a different path or filename, use the --auth=myauth.json option.
Retrieving tweets by specific accounts
The user-timeline command retrieves all of the tweets posted by the specified user accounts. It defaults to the account belonging to the authenticated user:
Twitter's API only returns up to around 3,200 tweets for most user accounts, but you may find that it returns all available tweets for your own user account.
You can pass numeric Twitter user IDs instead of screen names using the --ids parameter.
You can use --since to retrieve every tweet since the last time you imported for that user, or --since_id=xxx to retrieve every tweet since a specific tweet ID.
This command also accepts --sql and --attach options, documented below.
Retrieve user profiles in bulk
If you have a list of Twitter screen names (or user IDs) you can bulk fetch their fully inflated Twitter profiles using the users-lookup command:
Here's a recipe to retrieve any tweets that existing tweets are in-reply-to which have not yet been stored in your database:
$ twitter-to-sqlite statuses-lookup tweets.db \
--sql='
select in_reply_to_status_id
from tweets
where in_reply_to_status_id is not null' \
--skip-existing
The --skip-existing option means that tweets that have already been stored in the database will not be fetched again.
Retrieving Twitter followers
The followers command retrieves details of every follower of the specified accounts. You can use it to retrieve your own followers, or you can pass one or more screen names to pull the followers for other accounts.
The following command pulls your followers and saves them in a SQLite database file called twitter.db:
$ twitter-to-sqlite followers twitter.db
This command is extremely slow, because Twitter impose a rate limit of no more than one request per minute to this endpoint! If you are running it against an account with thousands of followers you should expect this to take several hours.
To retrieve followers for another account, use:
$ twitter-to-sqlite followers twitter.db cleopaws
This command also accepts the --ids, --sql and --attach options.
The friends command works like the followers command, but retrieves the specified (or currently authenticated) user's friends - defined as accounts that the user is following.
$ twitter-to-sqlite friends twitter.db
It takes the same options as the followers command.
Retrieving favorited tweets
The favorites command retrieves tweets that have been favorited by a specified user. Called without any extra arguments it retrieves tweets favorited by the currently authenticated user:
$ twitter-to-sqlite favorites faves.db
You can also use the --screen_name or --user_id arguments to retrieve favorite tweets for another user:
It's also possible to retrieve just the numeric Twitter IDs of the accounts that specific users are following (""friends"" in Twitter's API terminology) or followed-by:
This will populate the following table with followed_id/follower_id pairs for the two specified accounts, listing every account ID that is following either of those two accounts.
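The friends-ids command works the same way - again a sketch with the same example accounts:
$ twitter-to-sqlite friends-ids twitter.db simonw cleopaws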
This will do the same thing but pull the IDs that those accounts are following.
Both of these commands also support --sql and --attach as an alternative to passing screen names as direct command-line arguments. You can use --ids to process the inputs as user IDs rather than screen names.
The underlying Twitter APIs have a rate limit of 15 requests every 15 minutes - though they do return up to 5,000 IDs in each call. By default both of these subcommands will wait for 61 seconds between API calls in order to stay within the rate limit - if you know you will not be making many calls you can reduce that delay to just one second using --sleep=1.
Retrieving tweets from your home timeline
The home-timeline command retrieves up to 800 tweets from the home timeline of the authenticated user - generally this means tweets from people you follow.
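A typical invocation, with timeline.db as an example database filename:
$ twitter-to-sqlite home-timeline timeline.db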
The tweets are stored in the tweets table, and a record is added to the timeline_tweets table noting that this tweet came in due to being spotted in the timeline of your user.
You can use --since to retrieve just tweets that have been posted since the last time this command was run, or --since_id=xxx to explicitly pass in a tweet ID to use as the last position.
You can then view your timeline in Datasette by filtering your tweets table to just the tweets recorded in your timeline_tweets table, ordering by most recent first, and using faceting to see which users are responsible for the most tweets.
Retrieving your mentions
The mentions-timeline command works like home-timeline except it retrieves tweets that mention the authenticated user's account. It records the user account that was mentioned in a mentions_tweets table.
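A typical invocation, reusing the twitter.db example filename:
$ twitter-to-sqlite mentions-timeline twitter.db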
It supports --since and --since_id in the same way as home-timeline does.
Providing input from a SQL query with --sql and --attach
This option is available for some subcommands - run twitter-to-sqlite command-name --help to check.
You can provide Twitter screen names (or user IDs or tweet IDs) directly as command-line arguments, or you can provide those screen names or IDs by executing a SQL query.
For example: consider a SQLite database with an attendees table listing names and Twitter accounts - something like this:
First    Last       Twitter
Simon    Willison   simonw
Avril    Lavigne    AvrilLavigne
You can run the users-lookup command to pull the Twitter profile of every user listed in that database by loading the screen names using a --sql query:
$ twitter-to-sqlite users-lookup my.db --sql=""select Twitter from attendees""
If your database table contains Twitter IDs, you can select those IDs and pass the --ids argument. For example, to fetch the profiles of users who have had their user IDs inserted into the following table using the twitter-to-sqlite friends-ids command:
$ twitter-to-sqlite users-lookup my.db --sql=""select follower_id from following"" --ids
Or to avoid re-fetching users that have already been fetched:
$ twitter-to-sqlite users-lookup my.db \
--sql=""select followed_id from following where followed_id not in (
select id from users)"" --ids
If your data lives in a separate database file you can attach it using --attach. For example, consider the attendees example above but the data lives in an attendees.db file, and you want to fetch the user profiles into a tweets.db file. You could do that like this:
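A sketch of how that might look - the attached database is referenced using an alias derived from its filename, so the attendees table is addressed as attendees.attendees:
$ twitter-to-sqlite users-lookup tweets.db \
    --attach=attendees.db \
    --sql=""select Twitter from attendees.attendees""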
The filename (without the extension) will be used as the database alias within SQLite. If you want a different alias for some reason you can specify that with a colon like this:
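For example, to use people as the alias instead (a sketch of the syntax described above):
$ twitter-to-sqlite users-lookup tweets.db \
    --attach=people:attendees.db \
    --sql=""select Twitter from people.attendees""
Running searches
The search command saves tweets matching a search term. A typical invocation might look like this, with tweets.db as an example filename and datasette as the search term:
$ twitter-to-sqlite search tweets.db datasette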
This will import up to around 320 tweets that match that search term into the tweets table. It will also create a record in the search_runs table recording that the search took place, and many-to-many records in the search_runs_tweets table recording which tweets were seen for that search at that time.
You can use the --since parameter to check for previous search runs with the same arguments and only retrieve tweets that were posted since the last retrieved matching tweet.
The following additional options for search are supported:
--geocode: latitude,longitude,radius where radius is a number followed by mi or km
--lang: ISO 639-1 language code e.g. en or es
--locale: Locale: only ja is currently effective
--result_type: mixed, recent or popular. Defaults to mixed
--count: Number of results per page, defaults to the maximum of 100
--stop_after: Stop after this many results
--since_id: Pull tweets since this Tweet ID. You probably want to use --since instead of this.
Capturing tweets in real-time with track and follow
This functionality is experimental. Please file bug reports if you find any!
Twitter provides a real-time API which can be used to subscribe to tweets as they happen. twitter-to-sqlite can use this API to continually update a SQLite database with tweets matching certain keywords, or referencing specific users.
track
To track keywords, use the track command:
$ twitter-to-sqlite track tweets.db kakapo
This command will continue to run until you hit Ctrl+C. It will capture any tweets mentioning the keyword kakapo and store them in the tweets.db database file.
You can pass multiple keywords as a space separated list. This will capture tweets matching either of those keywords:
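For example, to track both of the example keywords kakapo and raccoon at once:
$ twitter-to-sqlite track tweets.db kakapo raccoon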
The follow command will capture all tweets that are relevant to one or more specific Twitter users.
$ twitter-to-sqlite follow tweets.db nytimes
This includes tweets by those users, tweets that reply to or quote those users, and retweets by those users. See the Twitter follow documentation for full details.
The command accepts one or more screen names.
You can feed it numeric Twitter user IDs instead of screen names by using the --ids flag.
The command also supports the --sql and --attach options, and the --verbose option for displaying tweets as they are captured.
Here's how to start following tweets from every user ID currently represented as being followed in the following table (populated using the friends-ids command):
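A sketch of that invocation, combining the --sql and --ids options described above:
$ twitter-to-sqlite follow tweets.db \
    --sql=""select followed_id from following"" \
    --ids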
Importing data from your Twitter archive
If you request your Twitter archive, Twitter will send you a link to download a .zip file. You can import the contents of that file into a set of tables in a new database file called archive.db (each table beginning with the archive_ prefix) using the import command:
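A sketch of the invocation - the path to the downloaded zip file is just an example:
$ twitter-to-sqlite import archive.db ~/Downloads/twitter-archive.zip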
This command does not populate any of the regular tables, since Twitter's export data does not exactly match the schema returned by the Twitter API.
It will delete and recreate the corresponding archive_* tables every time you run it. If this is not what you want, run the command against a new SQLite database file name rather than running it against one that already exists.
If you have already decompressed your archive, you can run this against the directory that you decompressed it to:
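For example, again with a stand-in path:
$ twitter-to-sqlite import archive.db ~/Downloads/twitter-archive/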
You may want to use other commands to populate tables based on data from the archive. For example, to retrieve full API versions of each of the tweets you have favourited in your archive, you could run the following:
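One way to do that is with the statuses-lookup command shown earlier, reading the tweet IDs out of the archive_like table - a sketch:
$ twitter-to-sqlite statuses-lookup archive.db \
    --sql=""select tweetId from archive_like"" \
    --skip-existing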
If you want these imported tweets to then be reflected in the favorited_by table, you can do so by applying the following SQL query:
$ sqlite3 archive.db
SQLite version 3.22.0 2018-01-22 18:45:57
Enter "".help"" for usage hints.
sqlite> INSERT OR IGNORE INTO favorited_by (tweet, user)
...> SELECT tweetId, 'YOUR_TWITTER_ID' FROM archive_like;
<Ctrl+D>
Replace YOUR_TWITTER_ID with your numeric Twitter ID. If you don't know that ID you can find it out by running the following:
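One option - a sketch that assumes your own profile has already been fetched into the users table of one of your databases, for example by the followers command - is to query for it with the sqlite3 shell:
$ sqlite3 twitter.db ""select id from users where screen_name = 'yourname'""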
Design notes
Tweet IDs are stored as integers, to afford sorting by ID in a sensible way.
While we configure foreign key relationships between tables, we do not ask SQLite to enforce them. This allows the followers-ids and friends-ids commands to populate the following table with user IDs even if those user accounts are not yet present in the users table.
",1,public,0,,,
206202864,MDEwOlJlcG9zaXRvcnkyMDYyMDI4NjQ=,inaturalist-to-sqlite,dogsheep/inaturalist-to-sqlite,0,53015001,https://github.com/dogsheep/inaturalist-to-sqlite,Create a SQLite database containing your observation history from iNaturalist,0,2019-09-04T01:21:21Z,2020-12-19T05:18:38Z,2020-10-22T00:08:58Z,,17,2,2,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""sqlite"", ""inaturalist"", ""datasette"", ""dogsheep"", ""datasette-io"", ""datasette-tool""]",0,0,2,master,"{""admin"": false, ""push"": false, ""pull"": false}",,53015001,0,1,"# inaturalist-to-sqlite
[](https://pypi.org/project/inaturalist-to-sqlite/)
[](https://circleci.com/gh/dogsheep/inaturalist-to-sqlite)
[](https://github.com/dogsheep/inaturalist-to-sqlite/blob/master/LICENSE)
Create a SQLite database containing your observation history from [iNaturalist](https://www.inaturalist.org/).
## How to install
$ pip install inaturalist-to-sqlite
## Usage
$ inaturalist-to-sqlite inaturalist.db yourusername
(Or try `simonw` if you don't yet have an iNaturalist account)
This will import all of your iNaturalist observations into a SQLite database called `inaturalist.db`.","
inaturalist-to-sqlite
Create a SQLite database containing your observation history from iNaturalist.
(Or try simonw if you don't yet have an iNaturalist account)
This will import all of your iNaturalist observations into a SQLite database called inaturalist.db.
",,,,,,
206649770,MDEwOlJlcG9zaXRvcnkyMDY2NDk3NzA=,google-takeout-to-sqlite,dogsheep/google-takeout-to-sqlite,0,53015001,https://github.com/dogsheep/google-takeout-to-sqlite,Save data from Google Takeout to a SQLite database,0,2019-09-05T20:15:15Z,2021-06-08T15:31:47Z,2021-02-24T00:34:55Z,,14,51,51,Python,1,1,1,1,0,4,0,0,6,apache-2.0,"[""google"", ""sqlite"", ""datasette"", ""dogsheep"", ""datasette-io"", ""datasette-tool""]",4,6,51,master,"{""admin"": false, ""push"": false, ""pull"": false}",,53015001,4,3,"# google-takeout-to-sqlite
[](https://pypi.org/project/google-takeout-to-sqlite/)
[](https://circleci.com/gh/dogsheep/google-takeout-to-sqlite)
[](https://github.com/dogsheep/google-takeout-to-sqlite/blob/master/LICENSE)
Save data from Google Takeout to a SQLite database.
## How to install
$ pip install google-takeout-to-sqlite
Request your Google data from https://takeout.google.com/ - wait for the email and download the zip file.
This tool only supports a subset of the available options. More will be added over time.
## My Activity
You can request the ""My Activity"" export and then import it with the following command:
$ google-takeout-to-sqlite my-activity takeout.db ~/Downloads/takeout-20190530.zip
This will create a database file called `takeout.db` if one does not already exist.
## Location History
Your location history records latitude, longitude and timestamp for where Google has tracked your location. You can import it using this command:
$ google-takeout-to-sqlite location-history takeout.db ~/Downloads/takeout-20190530.zip
## Browsing your data with Datasette
Once you have imported Google data into a SQLite database file you can browse your data using [Datasette](https://github.com/simonw/datasette). Install Datasette like so:
$ pip install datasette
Now browse your data by running this and then visiting `http://localhost:8001/`
$ datasette takeout.db
Install the [datasette-cluster-map](https://github.com/simonw/datasette-cluster-map) plugin to see your location history on a map:
$ pip install datasette-cluster-map
","
google-takeout-to-sqlite
Save data from Google Takeout to a SQLite database.
",,,,,,
207052882,MDEwOlJlcG9zaXRvcnkyMDcwNTI4ODI=,github-to-sqlite,dogsheep/github-to-sqlite,0,53015001,https://github.com/dogsheep/github-to-sqlite,Save data from GitHub to a SQLite database,0,2019-09-08T02:50:28Z,2022-09-20T04:36:37Z,2022-09-28T21:07:54Z,https://github-to-sqlite.dogsheep.net/,143,235,235,Python,1,1,1,1,0,32,0,0,20,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool"", ""dogsheep"", ""github-api"", ""sqlite""]",32,20,235,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,32,6,"# github-to-sqlite
[](https://pypi.org/project/github-to-sqlite/)
[](https://github.com/dogsheep/github-to-sqlite/releases)
[](https://github.com/dogsheep/github-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/github-to-sqlite/blob/main/LICENSE)
Save data from GitHub to a SQLite database.
- [Demo](#demo)
- [How to install](#how-to-install)
- [Authentication](#authentication)
- [Fetching issues for a repository](#fetching-issues-for-a-repository)
- [Fetching pull requests for a repository](#fetching-pull-requests-for-a-repository)
- [Fetching issue comments for a repository](#fetching-issue-comments-for-a-repository)
- [Fetching commits for a repository](#fetching-commits-for-a-repository)
- [Fetching releases for a repository](#fetching-releases-for-a-repository)
- [Fetching tags for a repository](#fetching-tags-for-a-repository)
- [Fetching contributors to a repository](#fetching-contributors-to-a-repository)
- [Fetching repos belonging to a user or organization](#fetching-repos-belonging-to-a-user-or-organization)
- [Fetching specific repositories](#fetching-specific-repositories)
- [Fetching repos that have been starred by a user](#fetching-repos-that-have-been-starred-by-a-user)
- [Fetching users that have starred specific repos](#fetching-users-that-have-starred-specific-repos)
- [Fetching GitHub Actions workflows](#fetching-github-actions-workflows)
- [Scraping dependents for a repository](#scraping-dependents-for-a-repository)
- [Fetching emojis](#fetching-emojis)
- [Making authenticated API calls](#making-authenticated-api-calls)
## Demo
https://github-to-sqlite.dogsheep.net/ hosts a [Datasette](https://datasette.io/) demo of a database created by [running this tool](https://github.com/dogsheep/github-to-sqlite/blob/main/.github/workflows/deploy-demo.yml#L40-L60) against all of the repositories in the [Dogsheep GitHub organization](https://github.com/dogsheep), plus the [datasette](https://github.com/simonw/datasette) and [sqlite-utils](https://github.com/simonw/sqlite-utils) repositories.
## How to install
$ pip install github-to-sqlite
## Authentication
Create a GitHub personal access token: https://github.com/settings/tokens
Run this command and paste in your new token:
$ github-to-sqlite auth
This will create a file called `auth.json` in your current directory containing the required value. To save the file at a different path or filename, use the `--auth=myauth.json` option.
As an alternative to using an `auth.json` file you can add your access token to an environment variable called `GITHUB_TOKEN`.
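For example, in a bash-style shell (a sketch - `xxx` stands in for your real token):
$ export GITHUB_TOKEN=xxx
$ github-to-sqlite repos github.db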
## Fetching issues for a repository
The `issues` command retrieves all of the issues belonging to a specified repository.
$ github-to-sqlite issues github.db simonw/datasette
If an `auth.json` file is present it will use the token from that file. It works without authentication for public repositories but you should be aware that GitHub have strict IP-based rate limits for unauthenticated requests.
You can point to a different location of `auth.json` using `-a`:
$ github-to-sqlite issues github.db simonw/datasette -a /path/to/auth.json
You can use the `--issue` option one or more times to load specific issues:
$ github-to-sqlite issues github.db simonw/datasette --issue=1
Example: [issues table](https://github-to-sqlite.dogsheep.net/github/issues)
## Fetching pull requests for a repository
While pull requests are a type of issue, you will get more information on pull requests by pulling them separately. For example, whether a pull request has been merged and when.
Following the API of issues, the `pull-requests` command retrieves all of the pull requests belonging to a specified repository.
$ github-to-sqlite pull-requests github.db simonw/datasette
You can use the `--pull-request` option one or more times to load specific pull requests:
$ github-to-sqlite pull-requests github.db simonw/datasette --pull-request=81
Note that the `merged_by` column on the `pull_requests` table will only be populated for pull requests that are loaded using the `--pull-request` option - the GitHub API does not return this field for pull requests that are loaded in bulk.
Example: [pull_requests table](https://github-to-sqlite.dogsheep.net/github/pull_requests)
## Fetching issue comments for a repository
The `issue-comments` command retrieves all of the comments on all of the issues in a repository.
It is recommended you run `issues` first, so that each imported comment can have a foreign key pointing to its issue.
$ github-to-sqlite issues github.db simonw/datasette
$ github-to-sqlite issue-comments github.db simonw/datasette
You can use the `--issue` option to only load comments for a specific issue within that repository, for example:
$ github-to-sqlite issue-comments github.db simonw/datasette --issue=1
Example: [issue_comments table](https://github-to-sqlite.dogsheep.net/github/issue_comments)
## Fetching commits for a repository
The `commits` command retrieves details of all of the commits for one or more repositories. It currently fetches the sha, commit message and author and committer details - it does not retrieve the full commit body.
$ github-to-sqlite commits github.db simonw/datasette simonw/sqlite-utils
The command accepts one or more repositories.
By default it will stop as soon as it sees a commit that has previously been retrieved. You can force it to retrieve all commits (including those that have been previously inserted) using `--all`.
Example: [commits table](https://github-to-sqlite.dogsheep.net/github/commits)
## Fetching releases for a repository
The `releases` command retrieves the releases for one or more repositories.
$ github-to-sqlite releases github.db simonw/datasette simonw/sqlite-utils
The command accepts one or more repositories.
Example: [releases table](https://github-to-sqlite.dogsheep.net/github/releases)
## Fetching tags for a repository
The `tags` command retrieves all of the tags for one or more repositories.
$ github-to-sqlite tags github.db simonw/datasette simonw/sqlite-utils
Example: [tags table](https://github-to-sqlite.dogsheep.net/github/tags)
## Fetching contributors to a repository
The `contributors` command retrieves details of all of the contributors for one or more repositories.
$ github-to-sqlite contributors github.db simonw/datasette simonw/sqlite-utils
The command accepts one or more repositories. It populates a `contributors` table, with foreign keys to `repos` and `users` and a `contributions` table listing the number of commits to that repository for each contributor.
Example: [contributors table](https://github-to-sqlite.dogsheep.net/github/contributors)
## Fetching repos belonging to a user or organization
The `repos` command fetches repos belonging to a user or organization.
Without any other arguments, this command will fetch all repos that the currently authenticated user owns, collaborates on or can access via one of their organizations:
$ github-to-sqlite repos github.db
To fetch repos belonging to a specific user or organization, provide their username as an argument:
$ github-to-sqlite repos github.db dogsheep # organization
$ github-to-sqlite repos github.db simonw # user
You can pass more than one username to fetch for multiple users or organizations at once:
$ github-to-sqlite repos github.db simonw dogsheep
Add the `--readme` option to save the README for the repo in a column called `readme`. Add `--readme-html` to save the HTML rendered version of the README into a column called `readme_html`.
Example: [repos table](https://github-to-sqlite.dogsheep.net/github/repos)
## Fetching specific repositories
You can use `-r` with the `repos` command one or more times to fetch just specific repositories.
$ github-to-sqlite repos github.db -r simonw/datasette -r dogsheep/github-to-sqlite
## Fetching repos that have been starred by a user
The `starred` command fetches the repos that have been starred by a user.
$ github-to-sqlite starred github.db simonw
If you are using an `auth.json` file you can omit the username to retrieve the starred repos for the authenticated user.
Example: [stars table](https://github-to-sqlite.dogsheep.net/github/stars)
## Fetching users that have starred specific repos
The `stargazers` command fetches the users that have starred the specified repos.
$ github-to-sqlite stargazers github.db simonw/datasette dogsheep/github-to-sqlite
You can specify one or more repositories using `owner/repo` syntax.
Users fetched using this command will be inserted into the `users` table. Many-to-many records showing which repository they starred will be added to the `stars` table.
## Fetching GitHub Actions workflows
The `workflows` command fetches the YAML workflow configurations from each repository's `.github/workflows` directory and parses them to populate `workflows`, `jobs` and `steps` tables.
$ github-to-sqlite workflows github.db simonw/datasette dogsheep/github-to-sqlite
You can specify one or more repositories using `owner/repo` syntax.
Example: [workflows table](https://github-to-sqlite.dogsheep.net/github/workflows), [jobs table](https://github-to-sqlite.dogsheep.net/github/jobs), [steps table](https://github-to-sqlite.dogsheep.net/github/steps)
## Scraping dependents for a repository
The GitHub dependency graph can show other GitHub projects that depend on a specific repo, for example [simonw/datasette/network/dependents](https://github.com/simonw/datasette/network/dependents).
This data is not yet available through the GitHub API. The `scrape-dependents` command scrapes those pages and uses the GitHub API to load full versions of the dependent repositories.
$ github-to-sqlite scrape-dependents github.db simonw/datasette
The command accepts one or more repositories.
Add `-v` for verbose output.
Example: [dependents table](https://github-to-sqlite.dogsheep.net/github/dependents?_sort_desc=first_seen_utc)
## Fetching emojis
You can fetch a list of every emoji supported by GitHub using the `emojis` command:
$ github-to-sqlite emojis github.db
This will create a table called `emojis` with a primary key `name` and a `url` column.
If you add the `--fetch` option the command will also fetch the binary content of the images and place them in an `image` column:
$ github-to-sqlite emojis emojis.db -f
[########----------------------------] 397/1799 22% 00:03:43
You can then use the [datasette-render-images](https://github.com/simonw/datasette-render-images) plugin to browse them visually.
Example: [emojis table](https://github-to-sqlite.dogsheep.net/github/emojis)
## Making authenticated API calls
The `github-to-sqlite get` command provides a convenient shortcut for making authenticated calls to the API. Once you have created your `auth.json` file (or set a `GITHUB_TOKEN` environment variable) you can use it like this:
$ github-to-sqlite get https://api.github.com/gists
This will make an authenticated call to the URL you provide and pretty-print the resulting JSON to the console.
You can omit the `https://api.github.com/` prefix, for example:
$ github-to-sqlite get /gists
Many GitHub APIs are [paginated using the HTTP Link header](https://docs.github.com/en/rest/guides/traversing-with-pagination). You can follow this pagination and output a list of all of the resulting items using `--paginate`:
$ github-to-sqlite get /users/simonw/repos --paginate
You can output newline-delimited JSON for each item using `--nl`. This can be useful for streaming items into another tool.
$ github-to-sqlite get /users/simonw/repos --nl
","
This will create a file called auth.json in your current directory containing the required value. To save the file at a different path or filename, use the --auth=myauth.json option.
As an alternative to using an auth.json file you can add your access token to an environment variable called GITHUB_TOKEN.
Fetching issues for a repository
The issues command retrieves all of the issues belonging to a specified repository.
If an auth.json file is present it will use the token from that file. It works without authentication for public repositories but you should be aware that GitHub have strict IP-based rate limits for unauthenticated requests.
You can point to a different location of auth.json using -a:
$ github-to-sqlite issues github.db simonw/datasette -a /path/to/auth.json
You can use the --issue option one or more times to load specific issues:
While pull requests are a type of issue, you will get more information on pull requests by pulling them separately. For example, whether a pull request has been merged and when.
Following the API of issues, the pull-requests command retrieves all of the pull requests belonging to a specified repository.
Note that the merged_by column on the pull_requests table will only be populated for pull requests that are loaded using the --pull-request option - the GitHub API does not return this field for pull requests that are loaded in bulk.
The commits command retrieves details of all of the commits for one or more repositories. It currently fetches the sha, commit message and author and committer details - it does not retrieve the full commit body.
By default it will stop as soon as it sees a commit that has previously been retrieved. You can force it to retrieve all commits (including those that have been previously inserted) using --all.
The command accepts one or more repositories. It populates a contributors table, with foreign keys to repos and users and a contributions table listing the number of commits to that repository for each contributor.
Fetching repos belonging to a user or organization
The repos command fetches repos belonging to a user or organization.
Without any other arguments, this command will fetch all repos that the currently authenticated user owns, collaborates on or can access via one of their organizations:
$ github-to-sqlite repos github.db
To fetch repos belonging to a specific user or organization, provide their username as an argument:
Add the --readme option to save the README for the repo in a column called readme. Add --readme-html to save the HTML rendered version of the README into a column called readme_html.
You can specify one or more repositories using owner/repo syntax.
Users fetched using this command will be inserted into the users table. Many-to-many records showing which repository they starred will be added to the stars table.
Fetching GitHub Actions workflows
The workflows command fetches the YAML workflow configurations from each repository's .github/workflows directory and parses them to populate workflows, jobs and steps tables.
This data is not yet available through the GitHub API. The scrape-dependents command scrapes those pages and uses the GitHub API to load full versions of the dependent repositories.
The github-to-sqlite get command provides a convenient shortcut for making authenticated calls to the API. Once you have created your auth.json file (or set a GITHUB_TOKEN environment variable) you can use it like this:
$ github-to-sqlite get https://api.github.com/gists
This will make an authenticated call to the URL you provide and pretty-print the resulting JSON to the console.
You can omit the https://api.github.com/ prefix, for example:
$ github-to-sqlite get /gists
Many GitHub APIs are paginated using the HTTP Link header. You can follow this pagination and output a list of all of the resulting items using --paginate:
$ github-to-sqlite get /users/simonw/repos --paginate
You can output newline-delimited JSON for each item using --nl. This can be useful for streaming items into another tool.
$ github-to-sqlite get /users/simonw/repos --nl
",1,public,0,,0,
207630174,MDEwOlJlcG9zaXRvcnkyMDc2MzAxNzQ=,datasette-rure,simonw/datasette-rure,0,9599,https://github.com/simonw/datasette-rure,Datasette plugin that adds a custom SQL function for executing matches using the Rust regular expression engine,0,2019-09-10T18:09:33Z,2020-12-04T04:26:53Z,2019-09-11T22:59:38Z,,19,4,4,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""sqlite"", ""regular-expressions"", ""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,4,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-rure
[](https://pypi.org/project/datasette-rure/)
[](https://circleci.com/gh/simonw/datasette-rure)
[](https://github.com/simonw/datasette-rure/blob/master/LICENSE)
Datasette plugin that adds a custom SQL function for executing matches using the Rust regular expression engine
Install this plugin in the same environment as Datasette to enable the `regexp()` SQL function.
$ pip install datasette-rure
The plugin is built on top of the [rure-python](https://github.com/davidblewett/rure-python) library by David Blewett.
## regexp() to test regular expressions
You can test if a value matches a regular expression like this:
select regexp('hi.*there', 'hi there')
-- returns 1
select regexp('not.*there', 'hi there')
-- returns 0
You can also use SQLite's custom syntax to run matches:
select 'hi there' REGEXP 'hi.*there'
-- returns 1
This means you can select rows based on regular expression matches - for example, to select every article where the title begins with an E or an F:
select * from articles where title REGEXP '^[EF]'
Try this out: [REGEXP interactive demo](https://datasette-rure-demo.datasette.io/24ways?sql=select+*+from+articles+where+title+REGEXP+%27%5E%5BEF%5D%27)
## regexp_match() to extract groups
You can extract captured subsets of a pattern using `regexp_match()`.
select regexp_match('.*( and .*)', title) as n from articles where n is not null
-- Returns the ' and X' component of any matching titles, e.g.
-- and Recognition
-- and Transitions Their Place
-- etc
This will return the first parenthesis match when called with two arguments. You can call it with three arguments to indicate which match you would like to extract:
select regexp_match('.*(and)(.*)', title, 2) as n from articles where n is not null
The function will return `null` for invalid inputs e.g. a pattern without capture groups.
Try this out: [regexp_match() interactive demo](https://datasette-rure-demo.datasette.io/24ways?sql=select+%27WHY+%27+%7C%7C+regexp_match%28%27Why+%28.*%29%27%2C+title%29+as+t+from+articles+where+t+is+not+null)
## regexp_matches() to extract multiple matches at once
The `regexp_matches()` function can be used to extract multiple patterns from a single string. The result is returned as a JSON array, which can then be further processed using SQLite's [JSON functions](https://www.sqlite.org/json1.html).
The first argument is a regular expression with named capture groups. The second argument is the string to be matched.
select regexp_matches(
'hello (?P<name>\w+) the (?P<species>\w+)',
'hello bob the dog, hello maggie the cat, hello tarquin the otter'
)
This will return a list of JSON objects, each one representing the named captures from the original regular expression:
[
{""name"": ""bob"", ""species"": ""dog""},
{""name"": ""maggie"", ""species"": ""cat""},
{""name"": ""tarquin"", ""species"": ""otter""}
]
Try this out: [regexp_matches() interactive demo](https://datasette-rure-demo.datasette.io/24ways?sql=select+regexp_matches%28%0D%0A++++%27hello+%28%3FP%3Cname%3E%5Cw%2B%29+the+%28%3FP%3Cspecies%3E%5Cw%2B%29%27%2C%0D%0A++++%27hello+bob+the+dog%2C+hello+maggie+the+cat%2C+hello+tarquin+the+otter%27%0D%0A%29)
","
datasette-rure
Datasette plugin that adds a custom SQL function for executing matches using the Rust regular expression engine
Install this plugin in the same environment as Datasette to enable the regexp() SQL function.
$ pip install datasette-rure
The plugin is built on top of the rure-python library by David Blewett.
regexp() to test regular expressions
You can test if a value matches a regular expression like this:
You can extract captured subsets of a pattern using regexp_match().
select regexp_match('.*( and .*)', title) as n from articles where n is not null
-- Returns the ' and X' component of any matching titles, e.g.
-- and Recognition
-- and Transitions Their Place
-- etc
This will return the first parenthesis match when called with two arguments. You can call it with three arguments to indicate which match you would like to extract:
select regexp_match('.*(and)(.*)', title, 2) as n from articles where n is not null
The function will return null for invalid inputs e.g. a pattern without capture groups.
regexp_matches() to extract multiple matches at once
The regexp_matches() function can be used to extract multiple patterns from a single string. The result is returned as a JSON array, which can then be further processed using SQLite's JSON functions.
The first argument is a regular expression with named capture groups. The second argument is the string to be matched.
select regexp_matches(
'hello (?P<name>\w+) the (?P<species>\w+)',
'hello bob the dog, hello maggie the cat, hello tarquin the otter'
)
This will return a list of JSON objects, each one representing the named captures from the original regular expression:
",,,,,,
209091256,MDEwOlJlcG9zaXRvcnkyMDkwOTEyNTY=,datasette-atom,simonw/datasette-atom,0,9599,https://github.com/simonw/datasette-atom,Datasette plugin that adds a .atom output format,0,2019-09-17T15:31:01Z,2021-03-26T02:06:51Z,2021-01-24T23:59:36Z,,47,10,10,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,10,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,2,"# datasette-atom
[](https://pypi.org/project/datasette-atom/)
[](https://github.com/simonw/datasette-atom/releases)
[](https://github.com/simonw/datasette-atom/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-atom/blob/main/LICENSE)
Datasette plugin that adds support for generating [Atom feeds](https://validator.w3.org/feed/docs/atom.html) with the results of a SQL query.
## Installation
Install this plugin in the same environment as Datasette to enable the `.atom` output extension.
$ pip install datasette-atom
## Usage
To create an Atom feed you need to define a custom SQL query that returns a required set of columns:
* `atom_id` - a unique ID for each row. [This article](https://web.archive.org/web/20080211143232/http://diveintomark.org/archives/2004/05/28/howto-atom-id) has suggestions about ways to create these IDs.
* `atom_title` - a title for that row.
* `atom_updated` - an [RFC 3339](http://www.faqs.org/rfcs/rfc3339.html) timestamp representing the last time the entry was modified in a significant way. This can usually be the time that the row was created.
The following columns are optional:
* `atom_content` - content that should be shown in the feed. This will be treated as a regular string, so any embedded HTML tags will be escaped when they are displayed.
* `atom_content_html` - content that should be shown in the feed. This will be treated as an HTML string, and will be sanitized using [Bleach](https://github.com/mozilla/bleach) to ensure it does not have any malicious code in it before being returned as part of a `<content type=""html"">` Atom element. If both are provided, this will be used in place of `atom_content`.
* `atom_link` - a URL that should be used as the link that the feed entry points to.
* `atom_author_name` - the name of the author of the entry. If you provide this you can also provide `atom_author_uri` and `atom_author_email` with a URL and e-mail address for that author.
A query that returns these columns can then be returned as an Atom feed by adding the `.atom` extension.
## Example
Here is an example SQL query which generates an Atom feed for new entries on [www.niche-museums.com](https://www.niche-museums.com/):
```sql
select
'tag:niche-museums.com,' || substr(created, 0, 11) || ':' || id as atom_id,
name as atom_title,
created as atom_updated,
'https://www.niche-museums.com/browse/museums/' || id as atom_link,
coalesce(
'<img src=""' || photo_url || '?w=800&h=400&fit=crop&auto=compress"">',
''
) || '<p>' || description || '</p>' as atom_content_html
from
museums
order by
created desc
limit
15
```
You can try this query by [pasting it in here](https://www.niche-museums.com/browse) - then click the `.atom` link to see it as an Atom feed.
## Using a canned query
Datasette's [canned query mechanism](https://docs.datasette.io/en/stable/sql_queries.html#canned-queries) is a useful way to configure feeds. If a canned query definition has a `title` that will be used as the title of the Atom feed.
Here's an example, defined using a `metadata.yaml` file:
```yaml
databases:
browse:
queries:
feed:
title: Niche Museums
sql: |-
select
'tag:niche-museums.com,' || substr(created, 0, 11) || ':' || id as atom_id,
name as atom_title,
created as atom_updated,
'https://www.niche-museums.com/browse/museums/' || id as atom_link,
coalesce(
'<img src=""' || photo_url || '?w=800&h=400&fit=crop&auto=compress"">',
''
) || '<p>' || description || '</p>' as atom_content_html
from
museums
order by
created desc
limit
15
```
## Disabling HTML filtering
The HTML allow-list used by Bleach for the `atom_content_html` column can be found in the `clean(html)` function at the bottom of [datasette_atom/__init__.py](https://github.com/simonw/datasette-atom/blob/main/datasette_atom/__init__.py).
You can disable Bleach entirely for Atom feeds generated using a canned query. You should only do this if you are certain that no user-provided HTML could be included in that value.
Here's how to do that in `metadata.json`:
```json
{
""plugins"": {
""datasette-atom"": {
""allow_unsafe_html_in_canned_queries"": true
}
}
}
```
Setting this to `true` will disable Bleach filtering for all canned queries across all databases.
You can disable Bleach filtering just for a specific list of canned queries like so:
```json
{
""plugins"": {
""datasette-atom"": {
""allow_unsafe_html_in_canned_queries"": {
""museums"": [""latest"", ""moderation""]
}
}
}
}
```
This will disable Bleach just for the canned queries called `latest` and `moderation` in the `museums.db` database.
","
datasette-atom
Datasette plugin that adds support for generating Atom feeds with the results of a SQL query.
Installation
Install this plugin in the same environment as Datasette to enable the .atom output extension.
$ pip install datasette-atom
Usage
To create an Atom feed you need to define a custom SQL query that returns a required set of columns:
atom_id - a unique ID for each row. This article has suggestions about ways to create these IDs.
atom_title - a title for that row.
atom_updated - an RFC 3339 timestamp representing the last time the entry was modified in a significant way. This can usually be the time that the row was created.
The following columns are optional:
atom_content - content that should be shown in the feed. This will be treated as a regular string, so any embedded HTML tags will be escaped when they are displayed.
atom_content_html - content that should be shown in the feed. This will be treated as an HTML string, and will be sanitized using Bleach to ensure it does not have any malicious code in it before being returned as part of a <content type=""html""> Atom element. If both are provided, this will be used in place of atom_content.
atom_link - a URL that should be used as the link that the feed entry points to.
atom_author_name - the name of the author of the entry. If you provide this you can also provide atom_author_uri and atom_author_email with a URL and e-mail address for that author.
A query that returns these columns can then be returned as an Atom feed by adding the .atom extension.
Example
Here is an example SQL query which generates an Atom feed for new entries on www.niche-museums.com:
select
'tag:niche-museums.com,' || substr(created, 0, 11) || ':' || id as atom_id,
name as atom_title,
created as atom_updated,
'https://www.niche-museums.com/browse/museums/' || id as atom_link,
coalesce(
'<img src=""' || photo_url || '?w=800&h=400&fit=crop&auto=compress"">',
''
) || '<p>' || description || '</p>' as atom_content_html
from
museums
order by
created desc
limit
15
You can try this query by pasting it in here - then click the .atom link to see it as an Atom feed.
Using a canned query
Datasette's canned query mechanism is a useful way to configure feeds. If a canned query definition has a title that will be used as the title of the Atom feed.
Here's an example, defined using a metadata.yaml file:
databases:
browse:
queries:
feed:
title: Niche Museums
sql: |-
  select 'tag:niche-museums.com,' || substr(created, 0, 11) || ':' || id as atom_id, name as atom_title, created as atom_updated, 'https://www.niche-museums.com/browse/museums/' || id as atom_link, coalesce( '<img src=""' || photo_url || '?w=800&h=400&fit=crop&auto=compress"">', '' ) || '<p>' || description || '</p>' as atom_content_html from museums order by created desc limit 15
Disabling HTML filtering
The HTML allow-list used by Bleach for the atom_content_html column can be found in the clean(html) function at the bottom of datasette_atom/__init__.py.
You can disable Bleach entirely for Atom feeds generated using a canned query. You should only do this if you are certain that no user-provided HTML could be included in that value.
This will disable Bleach just for the canned queries called latest and moderation in the museums.db database.
",,,,,,
209590345,MDEwOlJlcG9zaXRvcnkyMDk1OTAzNDU=,genome-to-sqlite,dogsheep/genome-to-sqlite,0,53015001,https://github.com/dogsheep/genome-to-sqlite,Import your genome into a SQLite database,0,2019-09-19T15:38:39Z,2021-01-18T19:39:48Z,2019-09-19T15:41:17Z,,9,13,13,Python,1,1,1,1,0,0,0,0,2,apache-2.0,"[""genetics"", ""sqlite"", ""23andme"", ""personal-analytics"", ""datasette"", ""dogsheep"", ""datasette-io"", ""datasette-tool""]",0,2,13,master,"{""admin"": false, ""push"": false, ""pull"": false}",,53015001,0,2,"# genome-to-sqlite
[](https://pypi.org/project/genome-to-sqlite/)
[](https://circleci.com/gh/dogsheep/genome-to-sqlite)
[](https://github.com/dogsheep/genome-to-sqlite/blob/master/LICENSE)
Import your genome into a SQLite database.
## How to install
$ pip install genome-to-sqlite
## How to use
First, export your genome. This tool has only been tested against 23andMe so far. You can request an export of your genome from https://you.23andme.com/tools/data/download/
Now you can convert the resulting `export.zip` file to SQLite like so:
$ genome-to-sqlite export.zip genome.db
A progress bar will be displayed. You can disable this using `--silent`.
```
Importing genome [#----------------] 5% 00:01:33
```
You can explore the resulting data using [Datasette](https://datasette.readthedocs.io/) like this:
$ datasette genome.db --config facet_time_limit_ms:1000
Bumping up the facet time limit is useful in order to enable faceting by chromosome:
http://127.0.0.1:8001/genome/genome?_facet=chromosome&_sort=position
","
",,,,,,
213286752,MDEwOlJlcG9zaXRvcnkyMTMyODY3NTI=,pocket-to-sqlite,dogsheep/pocket-to-sqlite,0,53015001,https://github.com/dogsheep/pocket-to-sqlite,Create a SQLite database containing data from your Pocket account,0,2019-10-07T03:24:14Z,2022-08-21T21:11:59Z,2022-08-22T16:21:34Z,,20,63,63,Python,1,1,1,1,0,3,0,0,5,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool"", ""dogsheep"", ""pocket"", ""pocket-api"", ""sqlite""]",3,5,63,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,3,4,"# pocket-to-sqlite
[](https://pypi.org/project/pocket-to-sqlite/)
[](https://github.com/dogsheep/pocket-to-sqlite/releases)
[](https://github.com/dogsheep/pocket-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/pocket-to-sqlite/blob/main/LICENSE)
Create a SQLite database containing data from your [Pocket](https://getpocket.com/) account.
## How to install
$ pip install pocket-to-sqlite
## Usage
You will need to first obtain a valid OAuth token for your Pocket account. You can do this by running the `auth` command and following the prompts:
$ pocket-to-sqlite auth
Visit this page and sign in with your Pocket account:
https://getpocket.com/auth/author...
Once you have signed in there, hit <enter> to continue
Authentication tokens written to auth.json
Now you can fetch all of your items from Pocket like this:
$ pocket-to-sqlite fetch pocket.db
The first time you run this command it will fetch all of your items, and display a progress bar while it does it.
On subsequent runs it will only fetch new items.
You can force it to fetch everything from the beginning again using `--all`. Use `--silent` to disable the progress bar.
## Using with Datasette
The SQLite database produced by this tool is designed to be browsed using [Datasette](https://datasette.readthedocs.io/). Use the [datasette-render-timestamps](https://github.com/simonw/datasette-render-timestamps) plugin to improve the display of the timestamp values.
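For example - a sketch that assumes you fetched your items into `pocket.db` as above:
$ pip install datasette datasette-render-timestamps
$ datasette pocket.db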
","
pocket-to-sqlite
Create a SQLite database containing data from your Pocket account.
How to install
$ pip install pocket-to-sqlite
Usage
You will need to first obtain a valid OAuth token for your Pocket account. You can do this by running the auth command and following the prompts:
$ pocket-to-sqlite auth
Visit this page and sign in with your Pocket account:
https://getpocket.com/auth/author...
Once you have signed in there, hit <enter> to continue
Authentication tokens written to auth.json
Now you can fetch all of your items from Pocket like this:
$ pocket-to-sqlite fetch pocket.db
The first time you run this command it will fetch all of your items, and display a progress bar while it does it.
On subsequent runs it will only fetch new items.
You can force it to fetch everything from the beginning again using --all. Use --silent to disable the progress bar.
Using with Datasette
The SQLite database produced by this tool is designed to be browsed using Datasette. Use the datasette-render-timestamps plugin to improve the display of the timestamp values.
",1,public,0,,0,
214299267,MDEwOlJlcG9zaXRvcnkyMTQyOTkyNjc=,datasette-render-timestamps,simonw/datasette-render-timestamps,0,9599,https://github.com/simonw/datasette-render-timestamps,Datasette plugin for rendering timestamps,0,2019-10-10T22:50:50Z,2020-10-17T11:09:42Z,2020-03-22T17:57:17Z,,17,4,4,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",1,0,4,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,1,2,"# datasette-render-timestamps
[](https://pypi.org/project/datasette-render-timestamps/)
[](https://circleci.com/gh/simonw/datasette-render-timestamps)
[](https://github.com/simonw/datasette-render-timestamps/blob/master/LICENSE)
Datasette plugin for rendering timestamps.
## Installation
Install this plugin in the same environment as Datasette to enable this new functionality:
pip install datasette-render-timestamps
The plugin will then look out for integer numbers that are likely to be timestamps - anything that would be a number of seconds from 5 years ago to 5 years in the future.
These will then be rendered in a more readable format.
## Configuration
You can disable automatic column detection in favour of explicitly listing the columns that you would like to render using [plugin configuration](https://datasette.readthedocs.io/en/stable/plugins.html#plugin-configuration) in a `metadata.json` file.
Add a `""datasette-render-timestamps""` configuration block and use a `""columns""` key to list the columns you would like to treat as timestamp values:
```json
{
""plugins"": {
""datasette-render-timestamps"": {
""columns"": [""created"", ""updated""]
}
}
}
```
This will cause any `created` or `updated` columns in any table to be treated as timestamps and rendered.
Save this to `metadata.json` and run datasette with the `--metadata` flag to load this configuration:
datasette serve mydata.db --metadata metadata.json
To disable automatic timestamp detection entirely, you can use `""columns"": []`.
This configuration block can be used at the top level, or it can be applied just to specific databases or tables. Here's how to apply it to just the `entries` table in the `news.db` database:
```json
{
""databases"": {
""news"": {
""tables"": {
""entries"": {
""plugins"": {
""datasette-render-timestamps"": {
""columns"": [""created"", ""updated""]
}
}
}
}
}
}
}
```
And here's how to apply it to every `created` column in every table in the `news.db` database:
```json
{
""databases"": {
""news"": {
""plugins"": {
""datasette-render-timestamps"": {
""columns"": [""created"", ""updated""]
}
}
}
}
}
```
### Customizing the date format
The default format is `%B %d, %Y - %H:%M:%S UTC` which renders for example: `October 10, 2019 - 07:18:29 UTC`. If you want another format, the date format can be customized using plugin configuration. Any format string supported by [strftime](http://strftime.org/) may be used. For example:
```json
{
""plugins"": {
""datasette-render-timestamps"": {
""format"": ""%Y-%m-%d-%H:%M:%S""
}
}
}
```
","
datasette-render-timestamps
Datasette plugin for rendering timestamps.
Installation
Install this plugin in the same environment as Datasette to enable this new functionality:
pip install datasette-render-timestamps
The plugin will then look out for integer numbers that are likely to be timestamps - anything that would be a number of seconds from 5 years ago to 5 years in the future.
These will then be rendered in a more readable format.
Configuration
You can disable automatic column detection in favour of explicitly listing the columns that you would like to render using plugin configuration in a metadata.json file.
Add a ""datasette-render-timestamps"" configuration block and use a ""columns"" key to list the columns you would like to treat as timestamp values:
To disable automatic timestamp detection entirely, you can use ""columns"": [].
This configuration block can be used at the top level, or it can be applied just to specific databases or tables. Here's how to apply it to just the entries table in the news.db database:
The default format is %B %d, %Y - %H:%M:%S UTC which renders for example: October 10, 2019 - 07:18:29 UTC. If you want another format, the date format can be customized using plugin configuration. Any format string supported by strftime may be used. For example:
",,,,,,
217216787,MDEwOlJlcG9zaXRvcnkyMTcyMTY3ODc=,datasette-haversine,simonw/datasette-haversine,0,9599,https://github.com/simonw/datasette-haversine,Datasette plugin that adds a custom SQL function for haversine distances,0,2019-10-24T05:16:14Z,2021-07-28T20:13:38Z,2021-07-28T20:14:24Z,,8,1,1,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,1,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-haversine
[](https://pypi.org/project/datasette-haversine/)
[](https://github.com/simonw/datasette-haversine/releases)
[](https://github.com/simonw/datasette-haversine/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-haversine/blob/main/LICENSE)
Datasette plugin that adds a custom SQL function for haversine distances
Install this plugin in the same environment as Datasette to enable the `haversine()` SQL function.
$ pip install datasette-haversine
The plugin is built on top of the [haversine](https://github.com/mapado/haversine) library.
## haversine() to calculate distances
```sql
select haversine(lat1, lon1, lat2, lon2);
```
This will return the distance in kilometers between the point defined by `(lat1, lon1)` and the point defined by `(lat2, lon2)`.
## Custom units
By default `haversine()` returns results in km. You can pass an optional third argument to get results in a different unit:
- `ft` for feet
- `m` for meters
- `in` for inches
- `mi` for miles
- `nmi` for nautical miles
- `km` for kilometers (the default)
```sql
select haversine(lat1, lon1, lat2, lon2, 'mi');
```
","
datasette-haversine
Datasette plugin that adds a custom SQL function for haversine distances
Install this plugin in the same environment as Datasette to enable the haversine() SQL function.
$ pip install datasette-haversine
The plugin is built on top of the haversine library.
haversine() to calculate distances
select haversine(lat1, lon1, lat2, lon2);
This will return the distance in kilometers between the point defined by (lat1, lon1) and the point defined by (lat2, lon2).
Custom units
By default haversine() returns results in km. You can pass an optional third argument to get results in a different unit:
ft for feet
m for meters
in for inches
mi for miles
nmi for nautical miles
km for kilometers (the default)
select haversine(lat1, lon1, lat2, lon2, 'mi');
",,,,,,
219372133,MDEwOlJlcG9zaXRvcnkyMTkzNzIxMzM=,sqlite-transform,simonw/sqlite-transform,0,9599,https://github.com/simonw/sqlite-transform,Tool for running transformations on columns in a SQLite database,0,2019-11-03T22:07:53Z,2021-08-02T22:06:23Z,2021-08-02T22:07:57Z,,64,29,29,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""sqlite"", ""datasette-io"", ""datasette-tool""]",1,0,29,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,1,1,"# sqlite-transform

[](https://pypi.org/project/sqlite-transform/)
[](https://github.com/simonw/sqlite-transform/releases)
[](https://github.com/simonw/sqlite-transform/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/sqlite-transform/blob/main/LICENSE)
Tool for running transformations on columns in a SQLite database.
> **:warning: This tool is no longer maintained**
>
> I added a new tool to [sqlite-utils](https://sqlite-utils.datasette.io/) called [sqlite-utils convert](https://sqlite-utils.datasette.io/en/stable/cli.html#converting-data-in-columns) which provides a super-set of the functionality originally provided here. `sqlite-transform` is no longer maintained, and I recommend switching to using `sqlite-utils convert` instead.
## How to install
pip install sqlite-transform
## parsedate and parsedatetime
These subcommands will run all values in the specified column through `dateutil.parser.parse()` and replace them with the result, formatted as an ISO timestamp or ISO date.
For example, if a row in the database has an `opened` column which contains `10/10/2019 08:10:00 PM`, running the following command:
sqlite-transform parsedatetime my.db mytable opened
Will result in that value being replaced by `2019-10-10T20:10:00`.
Using the `parsedate` subcommand here would result in `2019-10-10` instead.
In the case of ambiguous dates such as `03/04/05` these commands both default to assuming American-style `mm/dd/yy` format. You can pass `--dayfirst` to specify that the day should be assumed to be first, or `--yearfirst` for the year.
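For example, to treat `03/04/05` as the 3rd of April 2005 rather than March 4th (a sketch using the flag described above):
sqlite-transform parsedate my.db mytable opened --dayfirst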
## jsonsplit
The `jsonsplit` subcommand takes columns that contain a comma-separated list, for example a `tags` column containing records like `""trees,park,dogs""` and converts it into a JSON array `[""trees"", ""park"", ""dogs""]`.
This is useful for taking advantage of Datasette's [Facet by JSON array](https://docs.datasette.io/en/stable/facets.html#facet-by-json-array) feature.
sqlite-transform jsonsplit my.db mytable tags
It defaults to splitting on commas, but you can specify a different delimiter character using the `--delimiter` option, for example:
sqlite-transform jsonsplit \
my.db mytable tags --delimiter ';'
Values within the array will be treated as strings, so a column containing `123,552,775` will be converted into the JSON array `[""123"", ""552"", ""775""]`.
You can specify a different type for these values using `--type int` or `--type float`, for example:
sqlite-transform jsonsplit \
my.db mytable tags --type int
This will result in that column being converted into `[123, 552, 775]`.
## lambda for executing your own code
The `lambda` subcommand lets you specify Python code which will be executed against the column.
Here's how to convert a column to uppercase:
sqlite-transform lambda my.db mytable mycolumn --code='str(value).upper()'
The code you provide will be compiled into a function that takes `value` as a single argument. You can break your function body into multiple lines, provided the last line is a `return` statement:
sqlite-transform lambda my.db mytable mycolumn --code='value = str(value)
return value.upper()'
You can also specify Python modules that should be imported and made available to your code using one or more `--import` options:
sqlite-transform lambda my.db mytable mycolumn \
--code='""\n"".join(textwrap.wrap(value, 10))' \
--import=textwrap
The `--dry-run` option will output a preview of the transformation against the first ten rows, without modifying the database.
## Saving the result to a separate column
Each of these commands accepts optional `--output` and `--output-type` options. These can be used to save the result of the transformation to a separate column, which will be created if the column does not already exist.
To save the result of `jsonsplit` to a new column called `json_tags`, use the following:
sqlite-transform jsonsplit my.db mytable tags \
--output json_tags
The type of the created column defaults to `text`, but a different column type can be specified using `--output-type`. This example will create a new floating point column called `float_id` with a copy of each item's ID increased by 0.5:
sqlite-transform lambda my.db mytable id \
--code 'float(value) + 0.5' \
--output float_id \
--output-type float
You can drop the original column at the end of the operation by adding `--drop`.
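For example, this variation of the `float_id` command above would also remove the original `id` column once the transformation completes:

    sqlite-transform lambda my.db mytable id \
        --code 'float(value) + 0.5' \
        --output float_id \
        --output-type float \
        --drop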
## Splitting a column into multiple columns
Sometimes you may wish to convert a single column into multiple derived columns. For example, you may have a `location` column containing `latitude,longitude` values which you wish to split out into separate `latitude` and `longitude` columns.
You can achieve this using the `--multi` option to `sqlite-transform lambda`. This option expects your `--code` function to return a Python dictionary: new columns will be created and populated for each of the keys in that dictionary.
For the `latitude,longitude` example you would use the following:
sqlite-transform lambda demo.db places location \
--code 'return {
""latitude"": float(value.split("","")[0]),
""longitude"": float(value.split("","")[1]),
}' --multi
The type of the returned values will be taken into account when creating the new columns. In this example, the resulting database schema will look like this:
```sql
CREATE TABLE [places] (
[location] TEXT,
[latitude] FLOAT,
[longitude] FLOAT
);
```
The code function can also return `None`, in which case its output will be ignored.
You can drop the original column at the end of the operation by adding `--drop`.
## Disabling the progress bar
By default each command will show a progress bar. Pass `-s` or `--silent` to hide that progress bar.
","
",,,,,,
220716822,MDEwOlJlcG9zaXRvcnkyMjA3MTY4MjI=,datasette-render-markdown,simonw/datasette-render-markdown,0,9599,https://github.com/simonw/datasette-render-markdown,Datasette plugin for rendering Markdown,0,2019-11-09T23:28:31Z,2022-05-26T04:58:56Z,2022-07-18T19:35:10Z,,57,11,11,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""markdown""]",0,1,11,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-render-markdown
[](https://pypi.org/project/datasette-render-markdown/)
[](https://github.com/simonw/datasette-render-markdown/releases)
[](https://github.com/simonw/datasette-render-markdown/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-render-markdown/blob/main/LICENSE)
Datasette plugin for rendering Markdown.
## Installation
Install this plugin in the same environment as Datasette to enable this new functionality:
$ pip install datasette-render-markdown
## Usage
You can explicitly list the columns you would like to treat as Markdown using [plugin configuration](https://datasette.readthedocs.io/en/stable/plugins.html#plugin-configuration) in a `metadata.json` file.
Add a `""datasette-render-markdown""` configuration block and use a `""columns""` key to list the columns you would like to treat as Markdown values:
```json
{
""plugins"": {
""datasette-render-markdown"": {
""columns"": [""body""]
}
}
}
```
This will cause any `body` column in any table to be treated as markdown and safely rendered using [Python-Markdown](https://python-markdown.github.io/). The resulting HTML is then run through [Bleach](https://bleach.readthedocs.io/) to avoid the risk of XSS security problems.
Save this to `metadata.json` and run Datasette with the `--metadata` flag to load this configuration:
$ datasette serve mydata.db --metadata metadata.json
The configuration block can be used at the top level, or it can be applied just to specific databases or tables. Here's how to apply it to just the `entries` table in the `news.db` database:
```json
{
""databases"": {
""news"": {
""tables"": {
""entries"": {
""plugins"": {
""datasette-render-markdown"": {
""columns"": [""body""]
}
}
}
}
}
}
}
```
And here's how to apply it to every `body` column in every table in the `news.db` database:
```json
{
""databases"": {
""news"": {
""plugins"": {
""datasette-render-markdown"": {
""columns"": [""body""]
}
}
}
}
}
```
## Columns that match a naming convention
This plugin can also render markdown in any columns that match a specific naming convention.
By default, columns that have a name ending in `_markdown` will be rendered.
You can try this out using the following query:
```sql
select '# Hello there
* This is a list
* of items
[And a link](https://github.com/simonw/datasette-render-markdown).'
as demo_markdown
```
You can configure a different list of wildcard patterns using the `""patterns""` configuration key. Here's how to render columns that end in either `_markdown` or `_md`:
```json
{
""plugins"": {
""datasette-render-markdown"": {
""patterns"": [""*_markdown"", ""*_md""]
}
}
}
```
To disable wildcard column matching entirely, set `""patterns"": []` in your plugin metadata configuration.
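For example, the following configuration turns off pattern-based matching entirely:

```json
{
    ""plugins"": {
        ""datasette-render-markdown"": {
            ""patterns"": []
        }
    }
}
```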
## Markdown extensions
The [Python-Markdown library](https://python-markdown.github.io/) that powers this plugin supports extensions, both [bundled](https://python-markdown.github.io/extensions/) and [third-party](https://github.com/Python-Markdown/markdown/wiki/Third-Party-Extensions). These can be used to enable additional Markdown features such as [table support](https://python-markdown.github.io/extensions/tables/).
You can configure support for extensions using the `""extensions""` key in your plugin metadata configuration.
Since extensions may introduce new HTML tags, you will also need to add those tags to the list of tags that are allowed by the [Bleach](https://bleach.readthedocs.io/) sanitizer. You can do that using the `""extra_tags""` key, and you can whitelist additional HTML attributes using `""extra_attrs""`. See [the Bleach documentation](https://bleach.readthedocs.io/en/latest/clean.html#allowed-tags-tags) for more information on this.
Here's how to enable support for [Markdown tables](https://python-markdown.github.io/extensions/tables/):
```json
{
""plugins"": {
""datasette-render-markdown"": {
""extensions"": [""tables""],
""extra_tags"": [""table"", ""thead"", ""tr"", ""th"", ""td"", ""tbody""]
}
}
}
```
### GitHub-Flavored Markdown
Enabling [GitHub-Flavored Markdown](https://help.github.com/en/github/writing-on-github) (useful if you are working with data imported from GitHub using [github-to-sqlite](https://github.com/dogsheep/github-to-sqlite)) is a little more complicated.
First, you will need to install the [py-gfm](https://py-gfm.readthedocs.io) package:
$ pip install py-gfm
Note that `py-gfm` has [a bug](https://github.com/Zopieux/py-gfm/issues/13) that causes it to pin to `Markdown<3.0` - so if you are using it you should install it _before_ installing `datasette-render-markdown` to ensure you get a compatible version of that dependency.
Now you can configure it like this. Note that the extension name is `mdx_gfm:GithubFlavoredMarkdownExtension` and you need to whitelist several extra HTML tags and attributes:
```json
{
""plugins"": {
""datasette-render-markdown"": {
""extra_tags"": [
""hr"",
""br"",
""details"",
""summary"",
""input""
],
""extra_attrs"": {
""input"": [
""type"",
""disabled"",
""checked""
]
},
""extensions"": [
""mdx_gfm:GithubFlavoredMarkdownExtension""
]
}
}
}
```
The extra `input` tag, along with its `type`, `disabled` and `checked` attributes, is needed to support rendering checkboxes in issue descriptions.
## Markdown in templates
The plugin also adds a new template function: `render_markdown(value)`. You can use this in your templates like so:
```html+jinja
{{ render_markdown(""""""
# This is markdown
* One
* Two
* Three
"""""") }}
```
You can load additional extensions and whitelist tags by passing extra arguments to the function like this:
```html+jinja
{{ render_markdown(""""""
## Markdown table
First Header | Second Header
------------- | -------------
Content Cell | Content Cell
Content Cell | Content Cell
"""""", extensions=[""tables""],
extra_tags=[""table"", ""thead"", ""tr"", ""th"", ""td"", ""tbody""])) }}
```
","
",1,public,0,,0,
221802296,MDEwOlJlcG9zaXRvcnkyMjE4MDIyOTY=,datasette-template-sql,simonw/datasette-template-sql,0,9599,https://github.com/simonw/datasette-template-sql,Datasette plugin for executing SQL queries from templates,0,2019-11-14T23:05:34Z,2021-05-18T17:58:47Z,2021-05-18T17:58:44Z,https://datasette.io/plugins/datasette-template-sql,23,6,6,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,1,6,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-template-sql
[](https://pypi.org/project/datasette-template-sql/)
[](https://github.com/simonw/datasette-template-sql/releases)
[](https://github.com/simonw/datasette-template-sql/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-template-sql/blob/main/LICENSE)
Datasette plugin for executing SQL queries from templates.
## Examples
[datasette.io](https://datasette.io/) uses this plugin extensively with [custom page templates](https://docs.datasette.io/en/stable/custom_templates.html#custom-pages), check out [simonw/datasette.io](https://github.com/simonw/datasette.io) to see how it works.
[www.niche-museums.com](https://www.niche-museums.com/) uses this plugin to run a custom themed website on top of Datasette. The full source code for the site [is here](https://github.com/simonw/museums) - see also [niche-museums.com, powered by Datasette](https://simonwillison.net/2019/Nov/25/niche-museums/).
[simonw/til](https://github.com/simonw/til) is another simple example, described in [Using a self-rewriting README powered by GitHub Actions to track TILs](https://simonwillison.net/2020/Apr/20/self-rewriting-readme/).
## Installation
Run this command to install the plugin in the same environment as Datasette:
$ pip install datasette-template-sql
## Usage
This plugin makes a new function, `sql(sql_query)`, available to your Datasette templates.
You can use it like this:
```html+jinja
{% for row in sql(""select 1 + 1 as two, 2 * 4 as eight"") %}
{% for key in row.keys() %}
{{ key }}: {{ row[key] }}<br>
{% endfor %}
{% endfor %}
```
The plugin will execute SQL against the current database for the page in `database.html`, `table.html` and `row.html` templates. If a template does not have a current database (`index.html` for example) the query will execute against the first attached database.
### Queries with arguments
You can construct a SQL query using `?` or `:name` parameter syntax by passing a list or dictionary as a second argument:
```html+jinja
{% for row in sql(""select distinct topic from til order by topic"") %}
{{ row.topic }}
{% for til in sql(""select * from til where topic = ?"", [row.topic]) %}
{% endfor %}
```
Here's the same example using the `:topic` style of parameters:
```html+jinja
{% for row in sql(""select distinct topic from til order by topic"") %}
{{ row.topic }}
{% for til in sql(""select * from til where topic = :topic"", {""topic"": row.topic}) %}
{% endfor %}
```
### Querying a different database
You can pass an optional `database=` argument to specify a named database to use for the query. For example, if you have attached a `news.db` database you could use this:
```html+jinja
{% for article in sql(
""select headline, date, summary from articles order by date desc limit 5"",
database=""news""
) %}
    <h3>{{ article.headline }}</h3>
    <p class=""date"">{{ article.date }}</p>
    <p>{{ article.summary }}</p>
{% endfor %}
```
","
",,,,,,
228485806,MDEwOlJlcG9zaXRvcnkyMjg0ODU4MDY=,datasette-configure-asgi,simonw/datasette-configure-asgi,0,9599,https://github.com/simonw/datasette-configure-asgi,Datasette plugin for configuring arbitrary ASGI middleware,0,2019-12-16T22:17:10Z,2020-08-25T15:54:32Z,2019-12-16T22:19:49Z,,6,1,1,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""asgi"", ""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,1,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-configure-asgi
[](https://pypi.org/project/datasette-configure-asgi/)
[](https://circleci.com/gh/simonw/datasette-configure-asgi)
[](https://github.com/simonw/datasette-configure-asgi/blob/master/LICENSE)
Datasette plugin for configuring arbitrary ASGI middleware
## Installation
pip install datasette-configure-asgi
## Usage
This plugin only takes effect if your `metadata.json` file contains relevant top-level plugin configuration in a `""datasette-configure-asgi""` configuration key.
For example, to wrap your Datasette instance in the `asgi-log-to-sqlite` middleware configured to write logs to `/tmp/log.db` you would use the following:
```json
{
""plugins"": {
""datasette-configure-asgi"": [
{
""class"": ""asgi_log_to_sqlite.AsgiLogToSqlite"",
""args"": {
""file"": ""/tmp/log.db""
}
}
]
}
}
```
The `""datasette-configure-asgi""` key should be a list of JSON objects. Each object should have a `""class""` key indicating the class to be used, and an optional `""args""` key providing any necessary arguments to be passed to that class constructor.
## Plugin structure
This plugin can be used to wrap your Datasette instance in any ASGI middleware that conforms to the following structure:
```python
class SomeAsgiMiddleware:
def __init__(self, app, arg1, arg2):
self.app = app
self.arg1 = arg1
self.arg2 = arg2
async def __call__(self, scope, receive, send):
start = time.time()
await self.app(scope, receive, send)
end = time.time()
print(""Time taken: {}"".format(end - start))
```
So the middleware is a class with a constructor which takes the wrapped application as a first argument, `app`, followed by further named arguments to configure the middleware. It provides an `async def __call__(self, scope, receive, send)` method to implement the middleware's behavior.
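For example, a hypothetical `metadata.json` entry for the `SomeAsgiMiddleware` class sketched above might look like this (the `my_package` module path and the argument values are placeholders):

```json
{
    ""plugins"": {
        ""datasette-configure-asgi"": [
            {
                ""class"": ""my_package.SomeAsgiMiddleware"",
                ""args"": {
                    ""arg1"": ""first value"",
                    ""arg2"": ""second value""
                }
            }
        ]
    }
}
```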
","
",,,,,,
234825790,MDEwOlJlcG9zaXRvcnkyMzQ4MjU3OTA=,datasette-upload-csvs,simonw/datasette-upload-csvs,0,9599,https://github.com/simonw/datasette-upload-csvs,Datasette plugin for uploading CSV files and converting them to database tables,0,2020-01-19T02:07:05Z,2022-07-03T20:58:20Z,2022-09-09T16:23:59Z,https://datasette.io/plugins/datasette-upload-csvs,58,9,9,Python,1,1,1,1,0,1,0,0,4,apache-2.0,"[""csvs"", ""datasette"", ""datasette-io"", ""datasette-plugin""]",1,4,9,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,2,"# datasette-upload-csvs
[](https://pypi.org/project/datasette-upload-csvs/)
[](https://github.com/simonw/datasette-upload-csvs/releases)
[](https://github.com/simonw/datasette-upload-csvs/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-upload-csvs/blob/main/LICENSE)
Datasette plugin for uploading CSV files and converting them to database tables
## Installation
datasette install datasette-upload-csvs
## Usage
The plugin adds an interface at `/-/upload-csvs` for uploading a CSV file and using it to create a new database table.
By default only [the root actor](https://datasette.readthedocs.io/en/stable/authentication.html#using-the-root-actor) can access the page - so you'll need to run Datasette with the `--root` option and click on the link shown in the terminal to sign in and access the page.
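For example, with a hypothetical `mydata.db` database file:

    datasette mydata.db --root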
The `upload-csvs` permission governs access. You can use permission plugins such as [datasette-permissions-sql](https://github.com/simonw/datasette-permissions-sql) to grant additional access to the write interface.
","
",1,public,0,,0,
236110759,MDEwOlJlcG9zaXRvcnkyMzYxMTA3NTk=,datasette-auth-existing-cookies,simonw/datasette-auth-existing-cookies,0,9599,https://github.com/simonw/datasette-auth-existing-cookies,Datasette plugin that authenticates users based on existing domain cookies,0,2020-01-25T01:20:31Z,2022-05-28T01:50:15Z,2022-05-30T17:10:11Z,,54,3,3,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin""]",1,0,3,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,3,"# datasette-auth-existing-cookies
[](https://pypi.org/project/datasette-auth-existing-cookies/)
[](https://github.com/simonw/datasette-auth-existing-cookies/releases)
[](https://github.com/simonw/datasette-auth-existing-cookies/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-auth-existing-cookies/blob/master/LICENSE)
Datasette plugin that authenticates users based on existing domain cookies.
## When to use this
This plugin allows you to build custom authentication for Datasette when you are hosting a Datasette instance on the same domain as another, authenticated website.
Consider a website on `www.example.com` which supports user authentication.
You could run Datasette on `data.example.com` in a way that lets it see cookies that were set for the `.example.com` domain.
Using this plugin, you could build an API endpoint at `www.example.com/user-from-cookies` which returns a JSON object representing the currently signed-in user, based on their cookies.
The plugin running on `data.example.com` will then make the `actor` available to the rest of Datasette based on the response from that API.
Read about [Datasette's authentication and permissions system](https://docs.datasette.io/en/stable/authentication.html) for more on how actors and permissions work.
## Configuration
This plugin requires some configuration in the Datasette [metadata.json file](https://datasette.readthedocs.io/en/stable/plugins.html#plugin-configuration).
The following configuration options are supported:
- `api_url`: this is the API endpoint that Datasette should call with the user's cookies in order to identify the logged in user.
- `cookies`: optional. A list of cookie names that should be passed through to the API endpoint - if left blank, the default is to send all cookies.
- `ttl`: optional. By default Datasette will make a request to the API endpoint for every HTTP request received by Datasette itself. A `ttl` value of 5 will cause Datasette to cache the actor associated with the user's cookies for 5 seconds, reducing that API traffic.
- `headers`: an optional list of other headers to forward to the API endpoint as query string parameters.
Here is an example that uses all four of these settings:
```json
{
""plugins"": {
""datasette-auth-existing-cookies"": {
""api_url"": ""http://www.example.com/user-from-cookies"",
""cookies"": [""sessionid""],
""headers"": [""host""],
""ttl"": 10
}
}
}
```
With this configuration any hit to a Datasette hosted at `data.example.com` will result in the following request being made to the `http://www.example.com/user-from-cookies` API endpoint:
```
GET http://www.example.com/user-from-cookies?host=data.example.com
Cookie: sessionid=abc123
```
That API is expected to return a JSON object representing the current user:
```json
{
""id"": 1,
""name"": ""Barry""
}
```
Since `ttl` is set to 10 that actor will be cached for ten seconds against that exact combination of cookies and headers. When that cache expires another hit will be made to the API.
When deciding on a TTL value, take into account that users who lose access to the core site - maybe because their session expires, or their account is disabled - will still be able to access the Datasette instance until that cache expires.
","
",1,public,0,,,
236867027,MDEwOlJlcG9zaXRvcnkyMzY4NjcwMjc=,datasette-sentry,simonw/datasette-sentry,0,9599,https://github.com/simonw/datasette-sentry,Datasette plugin for configuring Sentry,0,2020-01-28T23:41:27Z,2022-07-18T20:28:25Z,2022-10-06T22:31:29Z,,26,6,6,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""sentry""]",0,0,6,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-sentry
[](https://pypi.org/project/datasette-sentry/)
[](https://github.com/simonw/datasette-sentry/releases)
[](https://github.com/simonw/datasette-sentry/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-sentry/blob/main/LICENSE)
Datasette plugin for configuring Sentry for error reporting
## Installation
pip install datasette-sentry
## Usage
This plugin only takes effect if your `metadata.json` file contains relevant top-level plugin configuration in a `""datasette-sentry""` configuration key.
You will need a Sentry DSN - see their [Getting Started instructions](https://docs.sentry.io/error-reporting/quickstart/?platform=python).
Add it to `metadata.json` like this:
```json
{
""plugins"": {
""datasette-sentry"": {
""dsn"": ""https://KEY@sentry.io/PROJECTID""
}
}
}
```
Settings in `metadata.json` are visible to anyone who visits the `/-/metadata` URL so this is a good place to take advantage of Datasette's [secret configuration values](https://datasette.readthedocs.io/en/stable/plugins.html#secret-configuration-values), in which case your configuration will look more like this:
```json
{
""plugins"": {
""datasette-sentry"": {
""dsn"": {
""$env"": ""SENTRY_DSN""
}
}
}
}
```
Then make a `SENTRY_DSN` environment variable available to Datasette.
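For example, one way to do that when starting Datasette, using the placeholder DSN from above and a hypothetical `mydata.db` file:

    SENTRY_DSN=https://KEY@sentry.io/PROJECTID datasette mydata.db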
## Configuration
In addition to the `dsn` setting, you can also configure the Sentry [sample rate](https://docs.sentry.io/platforms/python/configuration/sampling/) by setting `sample_rate` to a floating point number between 0 and 1.
For example, to capture 25% of errors you would do this:
```json
{
""plugins"": {
""datasette-sentry"": {
""dsn"": {
""$env"": ""SENTRY_DSN""
},
""sample_rate"": 0.25
}
}
}
```
","
",1,public,0,,0,
237321267,MDEwOlJlcG9zaXRvcnkyMzczMjEyNjc=,geojson-to-sqlite,simonw/geojson-to-sqlite,0,9599,https://github.com/simonw/geojson-to-sqlite,CLI tool for converting GeoJSON files to SQLite (with SpatiaLite),0,2020-01-30T22:51:05Z,2022-03-05T00:40:56Z,2022-04-13T23:39:25Z,,117,34,34,Python,1,1,1,1,0,3,0,0,4,apache-2.0,"[""datasette-io"", ""datasette-tool"", ""geojson"", ""gis"", ""sqlite""]",3,4,34,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,3,3,"# geojson-to-sqlite
[](https://pypi.org/project/geojson-to-sqlite/)
[](https://github.com/simonw/geojson-to-sqlite/releases)
[](https://github.com/simonw/geojson-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/geojson-to-sqlite/blob/main/LICENSE)
CLI tool for converting GeoJSON to SQLite (optionally with SpatiaLite)
[RFC 7946: The GeoJSON Format](https://tools.ietf.org/html/rfc7946)
## How to install
$ pip install geojson-to-sqlite
## How to use
You can run this tool against a GeoJSON file like so:
$ geojson-to-sqlite my.db features features.geojson
This will load all of the features from the `features.geojson` file into a table called `features`.
Each row will have a `geometry` column containing the feature geometry, and columns for each of the keys found in any `properties` attached to those features. (To bundle all properties into a single JSON object, use the `--properties` flag.)
The table will be created the first time you run the command.
On subsequent runs you can use the `--alter` option to add any new columns that are missing from the table.
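For example, a second import run that adds any newly encountered property columns to the existing `features` table might look like this:

    $ geojson-to-sqlite my.db features features.geojson --alter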
You can pass more than one GeoJSON file, in which case the contents of all of the files will be inserted into the same table.
If your features have an `""id""` property it will be used as the primary key for the table. You can also use `--pk=PROPERTY` with the name of a different property to use that as the primary key instead. If you don't want to use the `""id""` as the primary key (maybe it contains duplicate values) you can use `--pk ''` to specify no primary key.
Specifying a primary key will also allow you to upsert data into existing rows instead of inserting new rows.
If no primary key is specified, a SQLite `rowid` column will be used.
You can use `-` as the filename to import from standard input. For example:
$ curl https://eric.clst.org/assets/wiki/uploads/Stuff/gz_2010_us_040_00_20m.json \
| geojson-to-sqlite my.db states - --pk GEO_ID
## Using with SpatiaLite
By default, the `geometry` column will contain JSON.
If you have installed the [SpatiaLite](https://www.gaia-gis.it/fossil/libspatialite/index) module for SQLite you can instead import the geometry into a geospatially indexed column.
You can do this using the `--spatialite` option, like so:
$ geojson-to-sqlite my.db features features.geojson --spatialite
The tool will search for the SpatiaLite module in the following locations:
- `/usr/lib/x86_64-linux-gnu/mod_spatialite.so`
- `/usr/local/lib/mod_spatialite.dylib`
If you have installed the module in another location, you can use the `--spatialite_mod=xxx` option to specify where:
$ geojson-to-sqlite my.db features features.geojson \
--spatialite_mod=/usr/lib/mod_spatialite.dylib
You can create a SpatiaLite spatial index on the `geometry` column using the `--spatial-index` option:
$ geojson-to-sqlite my.db features features.geojson --spatial-index
Using this option implies `--spatialite` so you do not need to add that.
## Streaming large datasets
For large datasets, consider using newline-delimited JSON to stream features into the database without loading the entire feature collection into memory.
For example, to load a day of earthquake reports from USGS:
$ geojson-to-sqlite quakes.db quakes tests/quakes.ndjson \
--nl --pk=id --spatialite
When using newline-delimited JSON, tables will also be created from the first feature, instead of guessing types based on the first 100 features.
If you want to use a larger subset of your data to guess column types (for example, if some fields are inconsistent) you can use [fiona](https://fiona.readthedocs.io/en/latest/cli.html) to collect features into a single collection.
$ head tests/quakes.ndjson | fio collect | \
geojson-to-sqlite quakes.db quakes - --spatialite
This will take the first 10 lines from `tests/quakes.ndjson`, pass them to `fio collect`, which turns them into a single feature collection, and pass that, in turn, to `geojson-to-sqlite`.
## Using this with Datasette
Databases created using this tool can be explored and published using [Datasette](https://datasette.readthedocs.io/).
The Datasette documentation includes a section on [how to use it to browse SpatiaLite databases](https://datasette.readthedocs.io/en/stable/spatialite.html).
The [datasette-leaflet-geojson](https://datasette.io/plugins/datasette-leaflet-geojson) plugin can be used to visualize columns containing GeoJSON geometries on a [Leaflet](https://leafletjs.com/) map.
If you are using SpatiaLite you will need to output the geometry as GeoJSON in order for that plugin to work. You can do that using the SpatiaLite `AsGeoJSON()` function - something like this:
```sql
select rowid, AsGeoJSON(geometry) from mytable limit 10
```
The [datasette-geojson-map](https://datasette.io/plugins/datasette-geojson-map) is an alternative plugin which will automatically render SpatiaLite geometries as a Leaflet map on the corresponding table page, without needing you to call `AsGeoJSON(geometry)`.
","
",1,public,0,,,
238339412,MDEwOlJlcG9zaXRvcnkyMzgzMzk0MTI=,datasette-debug-asgi,simonw/datasette-debug-asgi,0,9599,https://github.com/simonw/datasette-debug-asgi,Datasette plugin for dumping out the ASGI scope,0,2020-02-05T00:57:09Z,2021-08-17T23:40:02Z,2021-08-17T23:41:03Z,https://datasette.io/plugins/datasette-debug-asgi,16,1,1,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""asgi"", ""datasette-io"", ""datasette-plugin""]",0,0,1,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-debug-asgi
[](https://pypi.org/project/datasette-debug-asgi/)
[](https://github.com/simonw/datasette-debug-asgi/releases)
[](https://github.com/simonw/datasette-debug-asgi/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-debug-asgi/blob/main/LICENSE)
Datasette plugin for dumping out the ASGI scope.
Adds a new URL at `/-/asgi-scope` which shows the current ASGI scope. Demo here: https://datasette.io/-/asgi-scope
## Installation
pip install datasette-debug-asgi
## Usage
Visit `/-/asgi-scope` to see debug output showing the ASGI scope.
You can add query string parameters such as `/-/asgi-scope?q=hello`.
You can also add extra path components such as `/-/asgi-scope/more/path/here`.
","
",,,,,,
240815938,MDEwOlJlcG9zaXRvcnkyNDA4MTU5Mzg=,shapefile-to-sqlite,simonw/shapefile-to-sqlite,0,9599,https://github.com/simonw/shapefile-to-sqlite,Load shapefiles into a SQLite (optionally SpatiaLite) database,0,2020-02-16T01:55:29Z,2021-03-26T08:39:43Z,2020-08-23T06:00:41Z,,54,15,15,Python,1,1,1,1,0,0,0,0,3,apache-2.0,"[""sqlite"", ""gis"", ""spatialite"", ""shapefiles"", ""datasette"", ""datasette-io"", ""datasette-tool""]",0,3,15,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# shapefile-to-sqlite
[](https://pypi.org/project/shapefile-to-sqlite/)
[](https://circleci.com/gh/simonw/shapefile-to-sqlite)
[](https://github.com/simonw/shapefile-to-sqlite/blob/main/LICENSE)
Load shapefiles into a SQLite (optionally SpatiaLite) database.
Project background: [Things I learned about shapefiles building shapefile-to-sqlite](https://simonwillison.net/2020/Feb/19/shapefile-to-sqlite/)
## How to install
$ pip install shapefile-to-sqlite
## How to use
You can run this tool against a shapefile file like so:
$ shapefile-to-sqlite my.db features.shp
This will load the geometries as GeoJSON in a text column.
## Using with SpatiaLite
If you have [SpatiaLite](https://www.gaia-gis.it/fossil/libspatialite/index) available you can load them as SpatiaLite geometries like this:
$ shapefile-to-sqlite my.db features.shp --spatialite
The data will be loaded into a table called `features` - based on the name of the shapefile. You can specify an alternative table name using `--table`:
$ shapefile-to-sqlite my.db features.shp --table=places --spatialite
The tool will search for the SpatiaLite module in the following locations:
- `/usr/lib/x86_64-linux-gnu/mod_spatialite.so`
- `/usr/local/lib/mod_spatialite.dylib`
If you have installed the module in another location, you can use the `--spatialite_mod=xxx` option to specify where:
$ shapefile-to-sqlite my.db features.shp \
--spatialite_mod=/usr/lib/mod_spatialite.dylib
You can use the `--spatial-index` option to create a spatial index on the `geometry` column:
$ shapefile-to-sqlite my.db features.shp --spatial-index
You can omit `--spatialite` if you use either `--spatialite_mod` or `--spatial-index`.
## Projections
By default, this tool will attempt to convert geometries in the shapefile to the WGS 84 projection, for best conformance with the [GeoJSON specification](https://tools.ietf.org/html/rfc7946).
If you want it to leave the data in whatever projection was used by the shapefile, use the `--crs=keep` option.
You can convert the data to another output projection by passing it to the `--crs` option. For example, to convert to [EPSG:2227](https://epsg.io/2227) (California zone 3) use `--crs=epsg:2227`.
The full list of formats accepted by the `--crs` option is [documented here](https://pyproj4.github.io/pyproj/stable/api/crs.html#pyproj.crs.CRS.__init__).
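For example, to load `features.shp` reprojected to EPSG:2227:

    $ shapefile-to-sqlite my.db features.shp --crs=epsg:2227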
## Extracting columns
If your data contains columns with a small number of heavily duplicated values - the names of specific agencies responsible for parcels of land for example - you can extract those columns into separate lookup tables referenced by foreign keys using the `-c` option:
$ shapefile-to-sqlite my.db features.shp -c agency
This will create an `agency` table with `id` and `name` columns, and will create the `agency` column in your main table as an integer foreign key reference to that table.
The `-c` option can be used multiple times.
[CPAD_2020a_Units](https://calands.datasettes.com/calands/CPAD_2020a_Units) is an example of a table created using the `-c` option.
","
",,,,,,
242260583,MDEwOlJlcG9zaXRvcnkyNDIyNjA1ODM=,datasette-mask-columns,simonw/datasette-mask-columns,0,9599,https://github.com/simonw/datasette-mask-columns,Datasette plugin that masks specified database columns,0,2020-02-22T01:29:16Z,2021-06-10T19:50:37Z,2021-06-10T19:51:02Z,https://datasette.io/plugins/datasette-mask-columns,15,2,2,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,2,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-mask-columns
[](https://pypi.org/project/datasette-mask-columns/)
[](https://github.com/simonw/datasette-mask-columns/releases)
[](https://github.com/simonw/datasette-mask-columns/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-mask-columns/blob/main/LICENSE)
Datasette plugin that masks specified database columns
## Installation
pip install datasette-mask-columns
This depends on plugin hook changes in a not-yet released branch of Datasette. See [issue #678](https://github.com/simonw/datasette/issues/678) for details.
## Usage
In your `metadata.json` file add a section like this describing the database and table in which you wish to mask columns:
```json
{
""databases"": {
""my-database"": {
""plugins"": {
""datasette-mask-columns"": {
""users"": [""password""]
}
}
}
}
}
```
All SQL queries against the `users` table in `my-database.db` will now return `null` for the `password` column, no matter what value that column actually holds.
The table page for `users` will display the text `REDACTED` in the masked column. This visual hint will only be available on the table page; it will not display this text for arbitrary queries against the table.
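For example, with the configuration above, a query along these lines (the `id` column is hypothetical) would show `null` for every `password` value:

```sql
select id, password from users limit 10
```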
","
",,,,,,
243710733,MDEwOlJlcG9zaXRvcnkyNDM3MTA3MzM=,datasette-ics,simonw/datasette-ics,0,9599,https://github.com/simonw/datasette-ics,Datasette plugin for outputting iCalendar files,0,2020-02-28T08:11:01Z,2022-07-07T14:11:49Z,2022-07-12T02:08:10Z,https://datasette.io/plugins/datasette-ics,34,13,13,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""icalendar"", ""ics""]",0,0,13,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-ics
[](https://pypi.org/project/datasette-ics/)
[](https://github.com/simonw/datasette-ics/releases)
[](https://github.com/simonw/datasette-ics/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-ics/blob/main/LICENSE)
Datasette plugin that adds support for generating [iCalendar .ics files](https://tools.ietf.org/html/rfc5545) with the results of a SQL query.
## Installation
Install this plugin in the same environment as Datasette to enable the `.ics` output extension.
$ pip install datasette-ics
## Usage
To create an iCalendar file you need to define a custom SQL query that returns a required set of columns:
* `event_name` - the short name for the event
* `event_dtstart` - when the event starts
The following columns are optional:
* `event_dtend` - when the event ends
* `event_duration` - the duration of the event (use instead of `dtend`)
* `event_description` - a longer description of the event
* `event_uid` - a globally unique identifier for this event
* `event_tzid` - the timezone for the event, e.g. `America/Chicago`
A query that returns these columns can then be returned as an ics feed by adding the `.ics` extension.
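For example, a minimal query against a hypothetical `events` table with `title` and `start` columns could look like this - requesting the `.ics` version of the query's URL would then serve it as a calendar feed:

```sql
select
  title as event_name,
  start as event_dtstart
from
  events
order by
  start
```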
## Demo
[This SQL query](https://www.rockybeaches.com/data?sql=with+inner+as+(%0D%0A++select%0D%0A++++datetime%2C%0D%0A++++substr(datetime%2C+0%2C+11)+as+date%2C%0D%0A++++mllw_feet%2C%0D%0A++++lag(mllw_feet)+over+win+as+previous_mllw_feet%2C%0D%0A++++lead(mllw_feet)+over+win+as+next_mllw_feet%0D%0A++from%0D%0A++++tide_predictions%0D%0A++where%0D%0A++++station_id+%3D+%3Astation_id%0D%0A++++and+datetime+%3E%3D+date()%0D%0A++++window+win+as+(%0D%0A++++++order+by%0D%0A++++++++datetime%0D%0A++++)%0D%0A++order+by%0D%0A++++datetime%0D%0A)%2C%0D%0Alowest_tide_per_day+as+(%0D%0A++select%0D%0A++++date%2C%0D%0A++++datetime%2C%0D%0A++++mllw_feet%0D%0A++from%0D%0A++++inner%0D%0A++where%0D%0A++++mllw_feet+%3C%3D+previous_mllw_feet%0D%0A++++and+mllw_feet+%3C%3D+next_mllw_feet%0D%0A)%0D%0Aselect%0D%0A++min(datetime)+as+event_dtstart%2C%0D%0A++%27Low+tide%3A+%27+||+mllw_feet+||+%27+feet%27+as+event_name%2C%0D%0A++%27America%2FLos_Angeles%27+as+event_tzid%0D%0Afrom%0D%0A++lowest_tide_per_day%0D%0Agroup+by%0D%0A++date%0D%0Aorder+by%0D%0A++date&station_id=9414131) calculates the lowest tide per day at Pillar Point in Half Moon Bay, California.
Since the query returns `event_name`, `event_dtstart` and `event_tzid` columns it produces [this ICS feed](https://www.rockybeaches.com/data.ics?sql=with+inner+as+(%0D%0A++select%0D%0A++++datetime%2C%0D%0A++++substr(datetime%2C+0%2C+11)+as+date%2C%0D%0A++++mllw_feet%2C%0D%0A++++lag(mllw_feet)+over+win+as+previous_mllw_feet%2C%0D%0A++++lead(mllw_feet)+over+win+as+next_mllw_feet%0D%0A++from%0D%0A++++tide_predictions%0D%0A++where%0D%0A++++station_id+%3D+%3Astation_id%0D%0A++++and+datetime+%3E%3D+date()%0D%0A++++window+win+as+(%0D%0A++++++order+by%0D%0A++++++++datetime%0D%0A++++)%0D%0A++order+by%0D%0A++++datetime%0D%0A)%2C%0D%0Alowest_tide_per_day+as+(%0D%0A++select%0D%0A++++date%2C%0D%0A++++datetime%2C%0D%0A++++mllw_feet%0D%0A++from%0D%0A++++inner%0D%0A++where%0D%0A++++mllw_feet+%3C%3D+previous_mllw_feet%0D%0A++++and+mllw_feet+%3C%3D+next_mllw_feet%0D%0A)%0D%0Aselect%0D%0A++min(datetime)+as+event_dtstart%2C%0D%0A++%27Low+tide%3A+%27+||+mllw_feet+||+%27+feet%27+as+event_name%2C%0D%0A++%27America%2FLos_Angeles%27+as+event_tzid%0D%0Afrom%0D%0A++lowest_tide_per_day%0D%0Agroup+by%0D%0A++date%0D%0Aorder+by%0D%0A++date&station_id=9414131). If you subscribe to that in a calendar application such as Apple Calendar you get something that looks like this:

## Using a canned query
Datasette's [canned query mechanism](https://datasette.readthedocs.io/en/stable/sql_queries.html#canned-queries) can be used to configure calendars. If a canned query definition has a `title` that will be used as the title of the calendar.
Here's an example, defined using a `metadata.yaml` file:
```yaml
databases:
mydatabase:
queries:
calendar:
title: My Calendar
sql: |-
select
title as event_name,
start as event_dtstart,
description as event_description
from
events
order by
start
limit
100
```
This will result in a calendar feed at `http://localhost:8001/mydatabase/calendar.ics`
","
datasette-ics
Datasette plugin that adds support for generating iCalendar .ics files with the results of a SQL query.
Installation
Install this plugin in the same environment as Datasette to enable the .ics output extension.
$ pip install datasette-ics
Usage
To create an iCalendar file you need to define a custom SQL query that returns a required set of columns:
event_name - the short name for the event
event_dtstart - when the event starts
The following columns are optional:
event_dtend - when the event ends
event_duration - the duration of the event (use instead of dtend)
event_description - a longer description of the event
event_uid - a globally unique identifier for this event
event_tzid - the timezone for the event, e.g. America/Chicago
A query that returns these columns can then be returned as an ics feed by adding the .ics extension.
Demo
This SQL query calculates the lowest tide per day at Pillar Point in Half Moon Bay, California.
Since the query returns event_name, event_dtstart and event_tzid columns it produces this ICS feed. If you subscribe to that in a calendar application such as Apple Calendar you get something that looks like this:
Using a canned query
Datasette's canned query mechanism can be used to configure calendars. If a canned query definition has a title that will be used as the title of the calendar.
Here's an example, defined using a metadata.yaml file:
databases:
mydatabase:
queries:
calendar:
title: My Calendar
sql: |-
  select
    title as event_name,
    start as event_dtstart,
    description as event_description
  from
    events
  order by
    start
  limit
    100
This will result in a calendar feed at http://localhost:8001/mydatabase/calendar.ics
",1,public,0,,0,
243887036,MDEwOlJlcG9zaXRvcnkyNDM4ODcwMzY=,datasette-configure-fts,simonw/datasette-configure-fts,0,9599,https://github.com/simonw/datasette-configure-fts,Datasette plugin for enabling full-text search against selected table columns,0,2020-02-29T01:50:57Z,2020-11-01T02:59:12Z,2020-11-01T02:59:10Z,,42,2,2,Python,1,1,1,1,0,0,0,0,2,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,2,2,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-configure-fts
[](https://pypi.org/project/datasette-configure-fts/)
[](https://github.com/simonw/datasette-configure-fts/releases)
[](https://github.com/simonw/datasette-configure-fts/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-configure-fts/blob/main/LICENSE)
Datasette plugin for enabling full-text search against selected table columns
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-configure-fts
## Usage
Having installed the plugin, visit `/-/configure-fts` on your Datasette instance to configure FTS for tables on attached writable databases.
Any time you have permission to configure FTS for a table a menu item will appear in the table actions menu on the table page.
By default only [the root actor](https://datasette.readthedocs.io/en/stable/authentication.html#using-the-root-actor) can access the page - so you'll need to run Datasette with the `--root` option and click on the link shown in the terminal to sign in and access the page.
The `configure-fts` permission governs access. You can use permission plugins such as [datasette-permissions-sql](https://github.com/simonw/datasette-permissions-sql) to grant additional access to the write interface.
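For example, a quick way to try the plugin locally against a writable database is to sign in as root - a sketch, where `mydatabase.db` is a hypothetical file:

    $ datasette mydatabase.db --root

Then follow the sign-in link printed in the terminal and visit `/-/configure-fts`.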
","
datasette-configure-fts
Datasette plugin for enabling full-text search against selected table columns
Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-configure-fts
Usage
Having installed the plugin, visit /-/configure-fts on your Datasette instance to configure FTS for tables on attached writable databases.
Any time you have permission to configure FTS for a table a menu item will appear in the table actions menu on the table page.
By default only the root actor can access the page - so you'll need to run Datasette with the --root option and click on the link shown in the terminal to sign in and access the page.
The configure-fts permission governs access. You can use permission plugins such as datasette-permissions-sql to grant additional access to the write interface.
",,,,,,
245670670,MDEwOlJlcG9zaXRvcnkyNDU2NzA2NzA=,fec-to-sqlite,simonw/fec-to-sqlite,0,9599,https://github.com/simonw/fec-to-sqlite,Save FEC campaign finance data to a SQLite database,0,2020-03-07T16:52:49Z,2020-12-19T05:09:05Z,2020-03-07T18:21:48Z,,16,8,8,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""sqlite"", ""fec"", ""datasette"", ""datasette-io"", ""datasette-tool""]",0,1,8,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,2,"# fec-to-sqlite
[](https://pypi.org/project/fec-to-sqlite/)
[](https://circleci.com/gh/simonw/fec-to-sqlite)
[](https://github.com/simonw/fec-to-sqlite/blob/master/LICENSE)
Create a SQLite database using FEC campaign contributions data.
This tool builds on [fecfile](https://github.com/esonderegger/) by Evan Sonderegger.
## How to install
$ pip install fec-to-sqlite
## Usage
$ fec-to-sqlite filings filings.db 1146148
This fetches the filing with ID `1146148` and stores it in tables in a SQLite database called `filings.db`. It will create any tables it needs.
You can pass more than one filing ID, separated by spaces.
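For example, to fetch two filings into the same database - a sketch, where the second ID is hypothetical:

    $ fec-to-sqlite filings filings.db 1146148 1146149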
","
fec-to-sqlite
Create a SQLite database using FEC campaign contributions data.
This fetches the filing with ID 1146148 and stores it in tables in a SQLite database called filings.db. It will create any tables it needs.
You can pass more than one filing ID, separated by spaces.
",,,,,,
245856731,MDEwOlJlcG9zaXRvcnkyNDU4NTY3MzE=,datasette-search-all,simonw/datasette-search-all,0,9599,https://github.com/simonw/datasette-search-all,Datasette plugin for searching all searchable tables at once,0,2020-03-08T17:21:54Z,2021-12-19T04:06:49Z,2022-10-05T01:53:33Z,,186,6,6,Python,1,1,1,1,0,2,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""search""]",2,0,6,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,2,2,"# datasette-search-all
[](https://pypi.org/project/datasette-search-all/)
[](https://github.com/simonw/datasette-search-all/releases)
[](https://github.com/simonw/datasette-search-all/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-search-all/blob/main/LICENSE)
Datasette plugin for searching all searchable tables at once.
## Installation
Install the plugin in the same Python environment as Datasette:
pip install datasette-search-all
## Background
See [datasette-search-all: a new plugin for searching multiple Datasette tables at once](https://simonwillison.net/2020/Mar/9/datasette-search-all/) for background on this project. You can try the plugin out at https://fara.datasettes.com/
## Usage
This plugin only works if at least one of the tables connected to your Datasette instance has been configured for SQLite's full-text search.
The [Datasette search documentation](https://docs.datasette.io/en/stable/full_text_search.html) includes details on how to enable full-text search for a table.
You can also use the following tools:
* [sqlite-utils](https://sqlite-utils.datasette.io/en/stable/cli.html#configuring-full-text-search) includes a command-line tool for enabling full-text search.
* [datasette-enable-fts](https://github.com/simonw/datasette-enable-fts) is a Datasette plugin that adds a web interface for enabling search for specific columns.
If the plugin detects at least one searchable table it will add a search form to the homepage.
You can also navigate to `/-/search` on your Datasette instance to use the search interface directly.
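For example, `sqlite-utils` can enable FTS on a table before you start Datasette - a sketch, where the database, table and column names are hypothetical:

    sqlite-utils enable-fts mydatabase.db documents title body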
## Screenshot

","
datasette-search-all
Datasette plugin for searching all searchable tables at once.
Installation
Install the plugin in the same Python environment as Datasette:
sqlite-utils includes a command-line tool for enabling full-text search.
datasette-enable-fts is a Datasette plugin that adds a web interface for enabling search for specific columns.
If the plugin detects at least one searchable table it will add a search form to the homepage.
You can also navigate to /-/search on your Datasette instance to use the search interface directly.
Screenshot
",1,public,0,,0,
246108561,MDEwOlJlcG9zaXRvcnkyNDYxMDg1NjE=,datasette-column-inspect,simonw/datasette-column-inspect,0,9599,https://github.com/simonw/datasette-column-inspect,Experimental plugin that adds a column inspector,0,2020-03-09T18:11:00Z,2020-12-09T21:46:10Z,2020-12-09T21:47:38Z,,15,1,1,HTML,1,1,1,1,0,0,0,0,3,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,3,1,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-column-inspect
[](https://pypi.org/project/datasette-column-inspect/)
[](https://github.com/simonw/datasette-column-inspect/releases)
[](https://github.com/simonw/datasette-column-inspect/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-column-inspect/blob/main/LICENSE)
Highly experimental Datasette plugin for inspecting columns.
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-column-inspect
## Usage
This plugin adds an icon to each column on the table page which opens an inspection side panel.
","
datasette-column-inspect
Highly experimental Datasette plugin for inspecting columns.
Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-column-inspect
Usage
This plugin adds an icon to each column on the table page which opens an inspection side panel.
",,,,,,
247527438,MDEwOlJlcG9zaXRvcnkyNDc1Mjc0Mzg=,datasette-edit-schema,simonw/datasette-edit-schema,0,9599,https://github.com/simonw/datasette-edit-schema,Datasette plugin for modifying table schemas,0,2020-03-15T18:34:06Z,2022-07-01T22:20:25Z,2022-08-22T22:45:58Z,,133,6,6,JavaScript,1,1,1,1,0,0,0,0,10,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin""]",0,10,6,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-edit-schema
[](https://pypi.org/project/datasette-edit-schema/)
[](https://github.com/simonw/datasette-edit-schema/releases)
[](https://github.com/simonw/datasette-edit-schema/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-edit-schema/blob/master/LICENSE)
Datasette plugin for modifying table schemas
## Features
* Add new columns to a table
* Rename columns in a table
* Modify the type of columns in a table
* Re-order the columns in a table
* Rename a table
* Delete a table
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-edit-schema
## Usage
Navigate to `/-/edit-schema/dbname/tablename` on your Datasette instance to edit a specific table.
Use `/-/edit-schema/dbname` to create a new table in a specific database.
By default only [the root actor](https://datasette.readthedocs.io/en/stable/authentication.html#using-the-root-actor) can access the page - so you'll need to run Datasette with the `--root` option and click on the link shown in the terminal to sign in and access the page.
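For example, to try the schema editor locally - a sketch, where `data.db` and `mytable` are hypothetical names:

    $ datasette data.db --root

Then follow the sign-in link printed in the terminal and visit `/-/edit-schema/data/mytable`.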
## Permissions
The `edit-schema` permission governs access. You can use permission plugins such as [datasette-permissions-sql](https://github.com/simonw/datasette-permissions-sql) to grant additional access to the write interface.
These permission checks will call the `permission_allowed()` plugin hook with three arguments:
- `action` will be the string `""edit-schema""`
- `actor` will be the currently authenticated actor - usually a dictionary
- `resource` will be the string name of the database
## Screenshot

## Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-edit-schema
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-edit-schema
Datasette plugin for modifying table schemas
Features
Add new columns to a table
Rename columns in a table
Modify the type of columns in a table
Re-order the columns in a table
Rename a table
Delete a table
Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-edit-schema
Usage
Navigate to /-/edit-schema/dbname/tablename on your Datasette instance to edit a specific table.
Use /-/edit-schema/dbname to create a new table in a specific database.
By default only the root actor can access the page - so you'll need to run Datasette with the --root option and click on the link shown in the terminal to sign in and access the page.
Permissions
The edit-schema permission governs access. You can use permission plugins such as datasette-permissions-sql to grant additional access to the write interface.
These permission checks will call the permission_allowed() plugin hook with three arguments:
action will be the string ""edit-schema""
actor will be the currently authenticated actor - usually a dictionary
resource will be the string name of the database
Screenshot
Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-edit-schema
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,0,
248385299,MDEwOlJlcG9zaXRvcnkyNDgzODUyOTk=,datasette-publish-fly,simonw/datasette-publish-fly,0,9599,https://github.com/simonw/datasette-publish-fly,Datasette plugin for publishing data using Fly,0,2020-03-19T01:47:01Z,2022-09-29T22:28:45Z,2022-09-29T17:25:15Z,,50,10,10,Python,1,1,1,1,0,3,0,0,4,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""fly""]",3,4,10,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,3,3,"# datasette-publish-fly
[](https://pypi.org/project/datasette-publish-fly/)
[](https://github.com/simonw/datasette-publish-fly/releases)
[](https://github.com/simonw/datasette-publish-fly/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-publish-fly/blob/main/LICENSE)
[Datasette](https://datasette.io/) plugin for deploying Datasette instances to [Fly.io](https://fly.io/).
Project background: [Using SQLite and Datasette with Fly Volumes](https://simonwillison.net/2022/Feb/15/fly-volumes/)
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-publish-fly
## Deploying read-only data
First, install the `flyctl` command-line tool by [following their instructions](https://fly.io/docs/getting-started/installing-flyctl/).
Run `flyctl auth signup` to create an account there, or `flyctl auth login` if you already have one.
You can now use `datasette publish fly` to publish one or more SQLite database files:
datasette publish fly my-database.db --app=""my-data-app""
The argument you pass to `--app` will be used for the URL of your application: `my-data-app.fly.dev`.
To update an application, run the publish command passing the same application name to the `--app` option.
Fly have [a free tier](https://fly.io/docs/about/pricing/#free-allowances), beyond which they will charge you monthly for each application you have live. Details of their pricing can be [found on their site](https://fly.io/docs/pricing/).
Your application will be deployed at `https://your-app-name.fly.dev/` - be aware that it may take several minutes to start working the first time you deploy it.
## Using Fly volumes for writable databases
Fly [Volumes](https://fly.io/docs/reference/volumes/) provide persistent disk storage for Fly applications. Volumes can be 1GB or more in size and the Fly free tier includes 3GB of volume space.
Datasette plugins such as [datasette-upload-csvs](https://datasette.io/plugins/datasette-upload-csvs) and [datasette-tiddlywiki](https://datasette.io/plugins/datasette-tiddlywiki) can be deployed to Fly and store their mutable data in a volume.
> :warning: **You should only run a single instance of your application** if your database accepts writes. Fly has excellent support for running multiple instances in different geographical regions, but `datasette-publish-fly` with volumes is not yet compatible with that model. You should probably [use Fly PostgreSQL instead](https://fly.io/blog/globally-distributed-postgres/).
Here's how to deploy `datasette-tiddlywiki` with authentication provided by `datasette-auth-passwords`.
First, you'll need to create a root password hash to use to sign into the instance.
You can do that by installing the plugin and running the `datasette hash-password` command, or by using [this hosted tool](https://datasette-auth-passwords-demo.datasette.io/-/password-tool).
The hash should look like `pbkdf2_sha256$...` - you'll need this for the next step.
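A minimal way to generate that hash is to install the plugin locally and run the command it adds - a sketch:

    datasette install datasette-auth-passwords
    datasette hash-password

The command prompts for a password and prints out the hash to use in the next step.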
In this example we're also deploying a read-only database called `content.db`.
Pick a name for your new application, then run the following:
datasette publish fly \
content.db \
--app your-application-name \
--create-volume 1 \
--create-db tiddlywiki \
--install datasette-auth-passwords \
--install datasette-tiddlywiki \
--plugin-secret datasette-auth-passwords root_password_hash 'pbkdf2_sha256$...'
This will create the new application, deploy the `content.db` read-only database, create a 1GB volume for that application, create a new database in that volume called `tiddlywiki.db`, then install the two plugins and configure the password you specified.
### Updating applications that use a volume
Once you have deployed an application using a volume, you can update that application without needing the `--create-volume` or `--create-db` options. To add the [datasette-graphql](https://datasette.io/plugins/datasette-graphql) plugin to your deployed application you would run the following:
datasette publish fly \
content.db \
--app your-application-name \
--install datasette-auth-passwords \
--install datasette-tiddlywiki \
--install datasette-graphql \
--plugin-secret datasette-auth-passwords root_password_hash 'pbkdf2_sha256$...'
Since the application name is the same you don't need the `--create-volume` or `--create-db` options - these are persisted automatically between deploys.
You do need to specify the full list of plugins that you want to have installed, and any plugin secrets.
You also need to include any read-only database files that are part of the instance - `content.db` in this example - otherwise the new deployment will not include them.
### Advanced volume usage
`datasette publish fly` will add a volume called `datasette` to your Fly application. You can customize the name using the `--volume-name custom_name` option.
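For example, an initial deploy that names its volume explicitly might look like this - a sketch based on the earlier tiddlywiki example, not a full command reference:

    datasette publish fly content.db \
      --app your-application-name \
      --create-volume 1 \
      --create-db tiddlywiki \
      --volume-name custom_name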
Fly can be used to scale applications to run multiple instances in multiple regions around the world. This works well with read-only Datasette, but it is not currently recommended for Datasette with volumes, since each Fly replica would need its own volume and data stored in one instance would not be visible in others.
If you want to use multiple instances with volumes you will need to switch to using the `flyctl` command directly. The `--generate-dir` option, described below, can help with this.
## Generating without deploying
Use the `--generate-dir` option to generate a directory that can be deployed to Fly rather than deploying directly:
datasette publish fly my-database.db \
--app=""my-generated-app"" \
--generate-dir /tmp/deploy-this
You can then manually deploy your generated application using the following:
cd /tmp/deploy-this
flyctl apps create my-generated-app
flyctl deploy
## datasette publish fly --help
```
Usage: datasette publish fly [OPTIONS] [FILES]...
Deploy an application to Fly that runs Datasette against the provided database
files.
Usage example:
datasette publish fly my-database.db --app=""my-data-app""
Full documentation: https://datasette.io/plugins/datasette-publish-fly
Options:
-m, --metadata FILENAME Path to JSON/YAML file containing metadata to
publish
--extra-options TEXT Extra options to pass to datasette serve
--branch TEXT Install datasette from a GitHub branch e.g.
main
--template-dir DIRECTORY Path to directory containing custom templates
--plugins-dir DIRECTORY Path to directory containing custom plugins
--static MOUNT:DIRECTORY Serve static files from this directory at
/MOUNT/...
--install TEXT Additional packages (e.g. plugins) to install
--plugin-secret <TEXT TEXT TEXT>...
Secrets to pass to plugins, e.g. --plugin-
secret datasette-auth-github client_id xxx
--version-note TEXT Additional note to show on /-/versions
--secret TEXT Secret used for signing secure values, such as
signed cookies
--title TEXT Title for metadata
--license TEXT License label for metadata
--license_url TEXT License URL for metadata
--source TEXT Source label for metadata
--source_url TEXT Source URL for metadata
--about TEXT About label for metadata
--about_url TEXT About URL for metadata
--spatialite Enable SpatialLite extension
--region TEXT Fly region to deploy to, e.g sjc - see
https://fly.io/docs/reference/regions/
--create-volume INTEGER RANGE Create and attach volume of this size in GB
[x>=1]
--create-db TEXT Names of read-write database files to create
--volume-name TEXT Volume name to use
-a, --app TEXT Name of Fly app to deploy [required]
-o, --org TEXT Name of Fly org to deploy to
--generate-dir DIRECTORY Output generated application files and stop
without deploying
--show-files Output the generated Dockerfile, metadata.json
and fly.toml
--help Show this message and exit.
```
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd datasette-publish-fly
python -m venv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
### Integration tests
The tests in `tests/test_integration.py` make actual calls to Fly to deploy a test application.
These tests are skipped by default. If you have `flyctl` installed and configured, you can run the integration tests like this:
pytest --integration -s
The `-s` option here ensures that output from the deploys will be visible to you - otherwise it can look like the tests have hung.
The tests will create applications on Fly that start with the prefix `publish-fly-temp-` and then delete them at the end of the run.
","
datasette-publish-fly
Datasette plugin for deploying Datasette instances to Fly.io.
The argument you pass to --app will be used for the URL of your application: my-data-app.fly.dev.
To update an application, run the publish command passing the same application name to the --app option.
Fly have a free tier, beyond which they will charge you monthly for each application you have live. Details of their pricing can be found on their site.
Your application will be deployed at https://your-app-name.fly.dev/ - be aware that it may take several minutes to start working the first time you deploy it.
Using Fly volumes for writable databases
Fly Volumes provide persistent disk storage for Fly applications. Volumes can be 1GB or more in size and the Fly free tier includes 3GB of volume space.
⚠️You should only run a single instance of your application if your database accepts writes. Fly has excellent support for running multiple instances in different geographical regions, but datasette-publish-fly with volumes is not yet compatible with that model. You should probably use Fly PostgreSQL instead.
Here's how to deploy datasette-tiddlywiki with authentication provided by datasette-auth-passwords.
First, you'll need to create a root password hash to use to sign into the instance.
You can do that by installing the plugin and running the datasette hash-password command, or by using this hosted tool.
The hash should look like pbkdf2_sha256$... - you'll need this for the next step.
In this example we're also deploying a read-only database called content.db.
Pick a name for your new application, then run the following:
This will create the new application, deploy the content.db read-only database, create a 1GB volume for that application, create a new database in that volume called tiddlywiki.db, then install the two plugins and configure the password you specified.
Updating applications that use a volume
Once you have deployed an application using a volume, you can update that application without needing the --create-volume or --create-db options. To add the datasette-graphql plugin to your deployed application you would run the following:
Since the application name is the same you don't need the --create-volume or --create-db options - these are persisted automatically between deploys.
You do need to specify the full list of plugins that you want to have installed, and any plugin secrets.
You also need to include any read-only database files that are part of the instance - content.db in this example - otherwise the new deployment will not include them.
Advanced volume usage
datasette publish fly will add a volume called datasette to your Fly application. You can customize the name using the --volume-name custom_name option.
Fly can be used to scale applications to run multiple instances in multiple regions around the world. This works well with read-only Datasette, but it is not currently recommended for Datasette with volumes, since each Fly replica would need its own volume and data stored in one instance would not be visible in others.
If you want to use multiple instances with volumes you will need to switch to using the flyctl command directly. The --generate-dir option, described below, can help with this.
Generating without deploying
Use the --generate-dir option to generate a directory that can be deployed to Fly rather than deploying directly:
You can then manually deploy your generated application using the following:
cd /tmp/deploy-this
flyctl apps create my-generated-app
flyctl deploy
datasette publish fly --help
Usage: datasette publish fly [OPTIONS] [FILES]...
Deploy an application to Fly that runs Datasette against the provided database
files.
Usage example:
datasette publish fly my-database.db --app=""my-data-app""
Full documentation: https://datasette.io/plugins/datasette-publish-fly
Options:
-m, --metadata FILENAME Path to JSON/YAML file containing metadata to
publish
--extra-options TEXT Extra options to pass to datasette serve
--branch TEXT Install datasette from a GitHub branch e.g.
main
--template-dir DIRECTORY Path to directory containing custom templates
--plugins-dir DIRECTORY Path to directory containing custom plugins
--static MOUNT:DIRECTORY Serve static files from this directory at
/MOUNT/...
--install TEXT Additional packages (e.g. plugins) to install
--plugin-secret <TEXT TEXT TEXT>...
Secrets to pass to plugins, e.g. --plugin-
secret datasette-auth-github client_id xxx
--version-note TEXT Additional note to show on /-/versions
--secret TEXT Secret used for signing secure values, such as
signed cookies
--title TEXT Title for metadata
--license TEXT License label for metadata
--license_url TEXT License URL for metadata
--source TEXT Source label for metadata
--source_url TEXT Source URL for metadata
--about TEXT About label for metadata
--about_url TEXT About URL for metadata
--spatialite Enable SpatialLite extension
--region TEXT Fly region to deploy to, e.g sjc - see
https://fly.io/docs/reference/regions/
--create-volume INTEGER RANGE Create and attach volume of this size in GB
[x>=1]
--create-db TEXT Names of read-write database files to create
--volume-name TEXT Volume name to use
-a, --app TEXT Name of Fly app to deploy [required]
-o, --org TEXT Name of Fly org to deploy to
--generate-dir DIRECTORY Output generated application files and stop
without deploying
--show-files Output the generated Dockerfile, metadata.json
and fly.toml
--help Show this message and exit.
Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd datasette-publish-fly
python -m venv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
Integration tests
The tests in tests/test_integration.py make actual calls to Fly to deploy a test application.
These tests are skipped by default. If you have flyctl installed and configured, you can run the integration tests like this:
pytest --integration -s
The -s option here ensures that output from the deploys will be visible to you - otherwise it can look like the tests have hung.
The tests will create applications on Fly that start with the prefix publish-fly-temp- and then delete them at the end of the run.
",1,public,0,,0,
248903544,MDEwOlJlcG9zaXRvcnkyNDg5MDM1NDQ=,hacker-news-to-sqlite,dogsheep/hacker-news-to-sqlite,0,53015001,https://github.com/dogsheep/hacker-news-to-sqlite,Create a SQLite database containing data pulled from Hacker News,0,2020-03-21T04:02:05Z,2021-06-06T22:42:00Z,2021-03-13T19:15:06Z,,19,25,25,Python,1,1,1,1,0,2,0,0,0,apache-2.0,"[""hacker-news"", ""datasette"", ""dogsheep"", ""datasette-io"", ""datasette-tool""]",2,0,25,main,"{""admin"": false, ""push"": false, ""pull"": false}",,53015001,2,1,"# hacker-news-to-sqlite
[](https://pypi.org/project/hacker-news-to-sqlite/)
[](https://github.com/dogsheep/hacker-news-to-sqlite/releases)
[](https://github.com/dogsheep/hacker-news-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/hacker-news-to-sqlite/blob/main/LICENSE)
Create a SQLite database containing data fetched from [Hacker News](https://news.ycombinator.com/).
## How to install
$ pip install hacker-news-to-sqlite
## Usage
$ hacker-news-to-sqlite user hacker-news.db your-username
Importing items: 37%|███████████ | 845/2297 [05:09<11:02, 2.19it/s]
Imports all of your Hacker News submissions and comments into a SQLite database called `hacker-news.db`.
$ hacker-news-to-sqlite trees hacker-news.db 22640038 22643218
Fetches the entire comments tree in which any of those content IDs appears.
## Browsing your data with Datasette
You can use [Datasette](https://datasette.readthedocs.org/) to browse your data. Install Datasette like this:
$ pip install datasette
Now run it against your `hacker-news.db` file like so:
$ datasette hacker-news.db
Visit `http://localhost:8001/` to search and explore your data.
You can improve the display of your data using the [datasette-render-timestamps](https://github.com/simonw/datasette-render-timestamps) and [datasette-render-html](https://github.com/simonw/datasette-render-html) plugins. Install them like this:
$ pip install datasette-render-timestamps datasette-render-html
Now save the following configuration in a file called `metadata.json`:
```json
{
""databases"": {
""hacker-news"": {
""tables"": {
""items"": {
""plugins"": {
""datasette-render-html"": {
""columns"": [
""text""
]
},
""datasette-render-timestamps"": {
""columns"": [
""time""
]
}
}
},
""users"": {
""plugins"": {
""datasette-render-timestamps"": {
""columns"": [
""created""
]
}
}
}
}
}
}
}
```
Run Datasette like this:
$ datasette -m metadata.json hacker-news.db
The timestamp columns will now be rendered as human-readable dates, and any HTML in your posts will be displayed as rendered HTML.
","
hacker-news-to-sqlite
Create a SQLite database containing data fetched from Hacker News.
The timestamp columns will now be rendered as human-readable dates, and any HTML in your posts will be displayed as rendered HTML.
",,,,,,
248999994,MDEwOlJlcG9zaXRvcnkyNDg5OTk5OTQ=,datasette-show-errors,simonw/datasette-show-errors,0,9599,https://github.com/simonw/datasette-show-errors,Datasette plugin for displaying error tracebacks,0,2020-03-21T15:06:04Z,2020-09-24T00:17:29Z,2020-09-01T00:32:23Z,,7,1,1,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""asgi"", ""datasette"", ""starlette"", ""datasette-plugin"", ""datasette-io""]",0,1,1,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,0,"# datasette-show-errors
[](https://pypi.org/project/datasette-show-errors/)
[](https://circleci.com/gh/simonw/datasette-show-errors)
[](https://github.com/simonw/datasette-show-errors/blob/master/LICENSE)
Datasette plugin for displaying error tracebacks.
**This plugin does not work with current versions of Datasette.** See [issue #2](https://github.com/simonw/datasette-show-errors/issues/2).
## Installation
pip install datasette-show-errors
## Usage
Installing the plugin will cause any internal error to be displayed with a full traceback, rather than just a generic 500 page.
Be careful not to use this in a context that might expose sensitive information.
","
datasette-show-errors
Datasette plugin for displaying error tracebacks.
This plugin does not work with current versions of Datasette. See issue #2.
Installation
pip install datasette-show-errors
Usage
Installing the plugin will cause any internal error to be displayed with a full traceback, rather than just a generic 500 page.
Be careful not to use this in a context that might expose sensitive information.
",,,,,,
253632948,MDEwOlJlcG9zaXRvcnkyNTM2MzI5NDg=,datasette-publish-vercel,simonw/datasette-publish-vercel,0,9599,https://github.com/simonw/datasette-publish-vercel,Datasette plugin for publishing data using Vercel,0,2020-04-06T22:47:13Z,2022-07-29T17:09:47Z,2022-08-24T17:43:41Z,,55,27,27,Python,1,1,1,1,0,5,0,0,17,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""vercel"", ""zeit-now""]",5,17,27,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,5,4,"# datasette-publish-vercel
[](https://pypi.org/project/datasette-publish-vercel/)
[](https://github.com/simonw/datasette-publish-vercel/releases)
[](https://github.com/simonw/datasette-publish-vercel/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-publish-vercel/blob/main/LICENSE)
Datasette plugin for publishing data using [Vercel](https://vercel.com/).
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-publish-vercel
## Usage
First, install the Vercel CLI tool by [following their instructions](https://vercel.com/download).
Run `vercel login` to login to (or create) an account.
Now you can use `datasette publish vercel` to publish your data:
datasette publish vercel my-database.db --project=my-database
The `--project` argument is required - it specifies the project name that should be used for your deployment. This will be used as part of the deployment's URL.
### Other options
* `--no-prod` deploys to the project without updating the ""production"" URL alias to point to that new deployment. Without that option all deploys go directly to production.
* `--debug` enables the Vercel CLI debug output.
* `--token` allows you to pass a Vercel authentication token, rather than needing to first run `vercel login` to configure the tool. Tokens can be created in the Vercel web dashboard under Account Settings -> Tokens.
* `--public` runs `vercel --public` to publish the application source code at `/_src` e.g. https://datasette-public.now.sh/_src and make recent logs visible at `/_logs` e.g. https://datasette-public.now.sh/_logs
* `--generate-dir` - by default this tool generates a new Vercel app in a temporary directory, deploys it and then deletes the directory. Use `--generate-dir=my-app` to output the generated application files to a new directory of your choice instead. You can then deploy it by running `vercel` in that directory.
* `--setting default_page_size 10` - use this to set Datasette settings, as described in [the documentation](https://docs.datasette.io/en/stable/settings.html). This is a replacement for the unsupported `--extra-options` option.
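For example, combining the publish command from above with a couple of settings - a sketch, with illustrative values:

    datasette publish vercel my-database.db \
      --project=my-database \
      --setting default_page_size 10 \
      --setting sql_time_limit_ms 3500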
### Full help
**Warning:** Some of these options are not yet implemented by this plugin. In particular, the following do not yet work:
* `--extra-options` - use `--setting` described above instead.
* `--plugin-secret`
* `--version-note`
```
$ datasette publish vercel --help
Usage: datasette publish vercel [OPTIONS] [FILES]...
Publish to https://vercel.com/
Options:
-m, --metadata FILENAME Path to JSON/YAML file containing metadata to publish
--extra-options TEXT Extra options to pass to datasette serve
--branch TEXT Install datasette from a GitHub branch e.g. main
--template-dir DIRECTORY Path to directory containing custom templates
--plugins-dir DIRECTORY Path to directory containing custom plugins
--static MOUNT:DIRECTORY Serve static files from this directory at /MOUNT/...
--install TEXT Additional packages (e.g. plugins) to install
--plugin-secret <TEXT TEXT TEXT>...
Secrets to pass to plugins, e.g. --plugin-secret
datasette-auth-github client_id xxx
--version-note TEXT Additional note to show on /-/versions
--secret TEXT Secret used for signing secure values, such as signed
cookies
--title TEXT Title for metadata
--license TEXT License label for metadata
--license_url TEXT License URL for metadata
--source TEXT Source label for metadata
--source_url TEXT Source URL for metadata
--about TEXT About label for metadata
--about_url TEXT About URL for metadata
--token TEXT Auth token to use for deploy
--project PROJECT Vercel project name to use [required]
--scope TEXT Optional Vercel scope (e.g. a team name)
--no-prod Don't deploy directly to production
--debug Enable Vercel CLI debug output
--public Publish source with Vercel CLI --public
--generate-dir DIRECTORY Output generated application files and stop without
deploying
--generate-vercel-json Output generated vercel.json file and stop without
deploying
--vercel-json FILENAME Custom vercel.json file to use instead of generating
one
--setting SETTING... Setting, see docs.datasette.io/en/stable/settings.html
--crossdb Enable cross-database SQL queries
--help Show this message and exit.
```
## Using a custom `vercel.json` file
If you want to add additional redirects or similar to your Vercel configuration you may want to provide a custom `vercel.json` file.
To do this, first generate a configuration file (without running a deploy) using the `--generate-vercel-json` option:
datasette publish vercel my-database.db \
--project=my-database \
--generate-vercel-json > vercel.json
You can now edit the `vercel.json` file that this creates to add your custom options.
Then run the deploy using:
datasette publish vercel my-database.db \
--project=my-database \
--vercel-json=vercel.json
## Setting a `DATASETTE_SECRET`
Datasette uses [a secret string](https://docs.datasette.io/en/stable/settings.html#configuring-the-secret) for purposes such as signing authentication cookies. This secret is reset when the server restarts, which will sign out any users who are authenticated using a signed cookie.
You can avoid this by generating a `DATASETTE_SECRET` secret string and setting that as a [Vercel environment variable](https://vercel.com/docs/concepts/projects/environment-variables). If you do this the secret will stay consistent and your users will not be signed out.
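One way to generate a suitable secret string is with Python's `secrets` module, then paste the output into the Vercel environment variable settings - a sketch:

    python3 -c 'import secrets; print(secrets.token_hex(32))'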
## Using this with GitHub Actions
This plugin can be used together with [GitHub Actions](https://github.com/features/actions) to deploy Datasette instances automatically on new pushes to a repo, or on a schedule.
The GitHub Actions runners already have the Vercel deployment tool installed. You'll need to create an API token for your account at [vercel.com/account/tokens](https://vercel.com/account/tokens), and store that as a secret in your GitHub repository called `VERCEL_TOKEN`.
Make sure your workflow has installed `datasette` and `datasette-publish-vercel` using `pip`, then add the following step to your GitHub Actions workflow:
```
- name: Deploy Datasette using Vercel
env:
VERCEL_TOKEN: ${{ secrets.VERCEL_TOKEN }}
run: |-
datasette publish vercel mydb.db \
--token $VERCEL_TOKEN \
--project my-vercel-project
```
You can see a full example of a workflow that uses Vercel in this way [in the simonw/til repository](https://github.com/simonw/til/blob/12b3f0d3679320cbeafa5df164bbc08ba703625d/.github/workflows/build.yml).
","
datasette-publish-vercel
Datasette plugin for publishing data using Vercel.
Installation
Install this plugin in the same environment as Datasette.
The --project argument is required - it specifies the project name that should be used for your deployment. This will be used as part of the deployment's URL.
Other options
--no-prod deploys to the project without updating the ""production"" URL alias to point to that new deployment. Without that option all deploys go directly to production.
--debug enables the Vercel CLI debug output.
--token allows you to pass a Vercel authentication token, rather than needing to first run vercel login to configure the tool. Tokens can be created in the Vercel web dashboard under Account Settings -> Tokens.
--generate-dir - by default this tool generates a new Vercel app in a temporary directory, deploys it and then deletes the directory. Use --generate-dir=my-app to output the generated application files to a new directory of your choice instead. You can then deploy it by running vercel in that directory.
--setting default_page_size 10 - use this to set Datasette settings, as described in the documentation. This is a replacement for the unsupported --extra-options option.
Full help
Warning: Some of these options are not yet implemented by this plugin. In particular, the following do not yet work:
--extra-options - use --setting described above instead.
--plugin-secret
--version-note
$ datasette publish vercel --help
Usage: datasette publish vercel [OPTIONS] [FILES]...
Publish to https://vercel.com/
Options:
-m, --metadata FILENAME Path to JSON/YAML file containing metadata to publish
--extra-options TEXT Extra options to pass to datasette serve
--branch TEXT Install datasette from a GitHub branch e.g. main
--template-dir DIRECTORY Path to directory containing custom templates
--plugins-dir DIRECTORY Path to directory containing custom plugins
--static MOUNT:DIRECTORY Serve static files from this directory at /MOUNT/...
--install TEXT Additional packages (e.g. plugins) to install
--plugin-secret <TEXT TEXT TEXT>...
Secrets to pass to plugins, e.g. --plugin-secret
datasette-auth-github client_id xxx
--version-note TEXT Additional note to show on /-/versions
--secret TEXT Secret used for signing secure values, such as signed
cookies
--title TEXT Title for metadata
--license TEXT License label for metadata
--license_url TEXT License URL for metadata
--source TEXT Source label for metadata
--source_url TEXT Source URL for metadata
--about TEXT About label for metadata
--about_url TEXT About URL for metadata
--token TEXT Auth token to use for deploy
--project PROJECT Vercel project name to use [required]
--scope TEXT Optional Vercel scope (e.g. a team name)
--no-prod Don't deploy directly to production
--debug Enable Vercel CLI debug output
--public Publish source with Vercel CLI --public
--generate-dir DIRECTORY Output generated application files and stop without
deploying
--generate-vercel-json Output generated vercel.json file and stop without
deploying
--vercel-json FILENAME Custom vercel.json file to use instead of generating
one
--setting SETTING... Setting, see docs.datasette.io/en/stable/settings.html
--crossdb Enable cross-database SQL queries
--help Show this message and exit.
Using a custom vercel.json file
If you want to add additional redirects or similar to your Vercel configuration you may want to provide a custom vercel.json file.
To do this, first generate a configuration file (without running a deploy) using the --generate-vercel-json option:
Datasette uses a secret string for purposes such as signing authentication cookies. This secret is reset when the server restarts, which will sign out any users who are authenticated using a signed cookie.
You can avoid this by generating a DATASETTE_SECRET secret string and setting that as a Vercel environment variable. If you do this the secret will stay consistent and your users will not be signed out.
Using this with GitHub Actions
This plugin can be used together with GitHub Actions to deploy Datasette instances automatically on new pushes to a repo, or on a schedule.
The GitHub Actions runners already have the Vercel deployment tool installed. You'll need to create an API token for your account at vercel.com/account/tokens, and store that as a secret in your GitHub repository called VERCEL_TOKEN.
Make sure your workflow has installed datasette and datasette-publish-vercel using pip, then add the following step to your GitHub Actions workflow:
",1,public,0,,0,
255460347,MDEwOlJlcG9zaXRvcnkyNTU0NjAzNDc=,datasette-clone,simonw/datasette-clone,0,9599,https://github.com/simonw/datasette-clone,Create a local copy of database files from a Datasette instance,0,2020-04-13T23:05:41Z,2021-06-08T15:33:21Z,2021-02-22T19:32:36Z,,20,2,2,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool""]",0,0,2,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-clone
[](https://pypi.org/project/datasette-clone/)
[](https://github.com/simonw/datasette-clone/releases)
[](https://github.com/simonw/datasette-clone/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-clone/blob/main/LICENSE)
Create a local copy of database files from a Datasette instance.
See [datasette-clone](https://simonwillison.net/2020/Apr/14/datasette-clone/) on my blog for background on this project.
## How to install
$ pip install datasette-clone
## Usage
This only works against Datasette instances running immutable databases (with the `-i` option). Databases published using the `datasette publish` command should be compatible with this tool.
To download copies of all `.db` files from an instance, run:
datasette-clone https://latest.datasette.io
You can provide an optional second argument to specify a directory:
datasette-clone https://latest.datasette.io /tmp/here-please
The command stores its own copy of a `databases.json` manifest and uses it to only download databases that have changed the next time you run the command.
It also stores a copy of the instance's `metadata.json` to ensure you have a copy of any source and licensing information for the downloaded databases.
If your instance is protected by an API token, you can use `--token` to provide it:
datasette-clone https://latest.datasette.io --token=xyz
For verbose output showing what the tool is doing, use `-v`.
","
datasette-clone
Create a local copy of database files from a Datasette instance.
See datasette-clone on my blog for background on this project.
How to install
$ pip install datasette-clone
Usage
This only works against Datasette instances running immutable databases (with the -i option). Databases published using the datasette publish command should be compatible with this tool.
To download copies of all .db files from an instance, run:
datasette-clone https://latest.datasette.io
You can provide an optional second argument to specify a directory:
The command stores its own copy of a databases.json manifest and uses it to only download databases that have changed the next time you run the command.
It also stores a copy of the instance's metadata.json to ensure you have a copy of any source and licensing information for the downloaded databases.
If your instance is protected by an API token, you can use --token to provide it:
For verbose output showing what the tool is doing, use -v.
",,,,,,
256834907,MDEwOlJlcG9zaXRvcnkyNTY4MzQ5MDc=,dogsheep-photos,dogsheep/dogsheep-photos,0,53015001,https://github.com/dogsheep/dogsheep-photos,Upload your photos to S3 and import metadata about them into a SQLite database,0,2020-04-18T19:22:13Z,2021-11-04T20:45:03Z,2021-11-04T20:45:00Z,,68,124,124,Python,1,1,1,1,0,7,0,0,19,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-tool"", ""dogsheep"", ""sqlite""]",7,19,124,master,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,7,10,"# dogsheep-photos
[](https://pypi.org/project/dogsheep-photos/)
[](https://github.com/dogsheep/dogsheep-photos/releases)
[](https://circleci.com/gh/dogsheep/dogsheep-photos)
[](https://github.com/dogsheep/dogsheep-photos/blob/master/LICENSE)
Save details of your photos to a SQLite database and upload them to S3.
See [Using SQL to find my best photo of a pelican according to Apple Photos](https://simonwillison.net/2020/May/21/apple-photos-sqlite/) for background information on this project.
## What these tools do
These tools are a work-in-progress mechanism for taking full ownership of your photos. The core idea is to help implement the following:
* Every photo you have taken lives in a single, private Amazon S3 bucket
* You have a single SQLite database file which stores metadata about those photos - potentially pulled from multiple different places. This may include EXIF data, Apple Photos, the results of running machine learning APIs against photos and much more besides.
* You can then use [Datasette](https://github.com/simonw/datasette) to explore your own photos.
I'm a heavy user of Apple Photos so the initial releases of this tool will have a bias towards that, but ideally I would like a subset of these tools to be useful to people no matter which core photo solution they are using.
## Installation
$ pip install dogsheep-photos
## Authentication (if using S3)
If you want to use S3 to store your photos, you will need to first create S3 credentials for a new, dedicated bucket.
You may find the [s3-credentials tool](https://github.com/simonw/s3-credentials) useful for this.
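For example, `s3-credentials` can create a dedicated bucket and matching credentials in one step - a sketch, where the bucket name is hypothetical:

    $ s3-credentials create my-dogsheep-photos-bucket --create-bucket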
Run this command and paste in your credentials. You will need three values: the name of your S3 bucket, your Access key ID and your Secret access key.
$ dogsheep-photos s3-auth
This will create a file called `auth.json` in your current directory containing the required values. To save the file at a different path or filename, use the `--auth=myauth.json` option.
## Uploading photos
Run this command to upload every photo in a specific directory to your S3 bucket:
$ dogsheep-photos upload photos.db \
~/Pictures/Photos\ Library.photoslibrary/original
The command will only upload photos that have not yet been uploaded, based on their sha256 hash.
`photos.db` will be created with an `uploads` table containing details of which files were uploaded.
To see what the command would do without uploading any files, use the `--dry-run` option.
The sha256 hash of the photo contents will be used as the name of the file in the bucket, with an extension matching the type of file. This is an implementation of the [Content addressable storage](https://en.wikipedia.org/wiki/Content-addressable_storage) pattern.
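For instance, since the bucket key for a photo is its SHA-256 digest, you can compute the expected key for any file yourself - a sketch, with a hypothetical filename:

    $ shasum -a 256 ~/Pictures/example.jpg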
## Importing Apple Photos metadata
The `apple-photos` command imports metadata from your Apple Photos library.
$ dogsheep-photos apple-photos photos.db
Imported metadata includes places, people, albums, quality scores and machine learning labels for the photo contents.
## Creating a subset database
You can create a new, subset database of photos using the `create-subset` command.
This is useful for creating a shareable SQLite database that only contains metadata for a selected set of photos.
Since photo metadata contains latitude and longitude you may not want to share a database that includes photos taken at your home address.
`create-subset` takes three arguments: an existing database file created using the `apple-photos` command, the name of the new, shareable database file you would like to create and a SQL query that returns the `sha256` hash values of the photos you would like to include in that database.
For example, here's how to create a shareable database of just the photos that have been added to albums containing the word ""Public"":
$ dogsheep-photos create-subset \
photos.db \
public.db \
""select sha256 from apple_photos where albums like '%Public%'""
## Serving photos locally with datasette-media
If you don't want to upload your photos to S3 but you still want to browse them using Datasette you can do so using the [datasette-media](https://github.com/simonw/datasette-media) plugin. This plugin adds the ability to serve images and other static files directly from disk, configured using a SQL query.
To use it, first install Datasette and the plugin:
$ pip install datasette datasette-media
If any of your photos are `.HEIC` images taken by an iPhone you should also install the optional `pyheif` dependency:
$ pip install pyheif
Now create a `metadata.yaml` file configuring the plugin:
```yaml
plugins:
datasette-media:
thumbnail:
sql: |-
select path as filepath, 200 as resize_height from apple_photos where uuid = :key
large:
sql: |-
select path as filepath, 1024 as resize_height from apple_photos where uuid = :key
```
This will configure two URL endpoints - one for 200 pixel high thumbnails and one for 1024 pixel high larger images.
Create your `photos.db` database using the `apple-photos` command, then run Datasette like this:
$ datasette -m metadata.yaml
Your photos will be served on URLs that look like this:
http://127.0.0.1:8001/-/media/thumbnail/F4469918-13F3-43D8-9EC1-734C0E6B60AD
http://127.0.0.1:8001/-/media/large/F4469918-13F3-43D8-9EC1-734C0E6B60AD
You can find the UUIDs for use in these URLs by running `select uuid from photos_with_apple_metadata`.
### Displaying images using datasette-json-html
If you are using `datasette-media` to serve photos you can include images directly in Datasette query results using the [datasette-json-html](https://github.com/simonw/datasette-json-html) plugin.
Run `pip install datasette-json-html` to install the plugin, then use queries like this to view your images:
```sql
select
json_object(
'img_src',
'/-/media/thumbnail/' || uuid
) as photo,
uuid,
date
from
apple_photos
order by
date desc
limit 10;
```
The `photo` column returned by this query should render as image tags that display the correct images.
### Displaying images using custom template pages
Datasette's [custom pages](https://datasette.readthedocs.io/en/stable/custom_templates.html#custom-pages) feature lets you create custom pages for a Datasette instance by dropping HTML templates into a `templates/pages` directory and then running Datasette using `datasette --template-dir=templates/`.
You can combine that ability with the [datasette-template-sql](https://github.com/simonw/datasette-template-sql) plugin to create custom template pages that directly display photos served by `datasette-media`.
Install the plugin using `pip install datasette-template-sql`.
Create a `templates/pages` folder and add the following files:
`recent-photos.html`
```html+jinja
<h1>Recent photos</h1>
<div>
{% for photo in sql(""select * from apple_photos order by date desc limit 20"") %}
  <img src=""/-/media/thumbnail/{{ photo.uuid }}"">
{% endfor %}
</div>
```
`random-photos.html`
```html+jinja
<h1>Random photos</h1>
<div>
{% for photo in sql(""with foo as (select * from apple_photos order by date desc limit 5000) select * from foo order by random() limit 20"") %}
  <img src=""/-/media/thumbnail/{{ photo.uuid }}"">
{% endfor %}
</div>
```
Now run Datasette like this:
$ datasette photos.db -m metadata.yaml --template-dir=templates/
Visiting `http://localhost:8001/recent-photos` will display 20 recent photos. Visiting `http://localhost:8001/random-photos` will display 20 photos randomly selected from your 5,000 most recent.
","
dogsheep-photos
Save details of your photos to a SQLite database and upload them to S3.
These tools are a work-in-progress mechanism for taking full ownership of your photos. The core idea is to help implement the following:
Every photo you have taken lives in a single, private Amazon S3 bucket
You have a single SQLite database file which stores metadata about those photos - potentially pulled from multiple different places. This may include EXIF data, Apple Photos, the results of running machine learning APIs against photos and much more besides.
You can then use Datasette to explore your own photos.
I'm a heavy user of Apple Photos so the initial releases of this tool will have a bias towards that, but ideally I would like a subset of these tools to be useful to people no matter which core photo solution they are using.
Installation
$ pip install dogsheep-photos
Authentication (if using S3)
If you want to use S3 to store your photos, you will need to first create S3 credentials for a new, dedicated bucket.
Run this command and paste in your credentials. You will need three values: the name of your S3 bucket, your Access key ID and your Secret access key.
$ dogsheep-photos s3-auth
This will create a file called auth.json in your current directory containing the required values. To save the file at a different path or filename, use the --auth=myauth.json option.
Uploading photos
Run this command to upload every photo in a specific directory to your S3 bucket:
The command will only upload photos that have not yet been uploaded, based on their sha256 hash.
photos.db will be created with an uploads table containing details of which files were uploaded.
To see what the command would do without uploading any files, use the --dry-run option.
The sha256 hash of the photo contents will be used as the name of the file in the bucket, with an extension matching the type of file. This is an implementation of the Content addressable storage pattern.
Importing Apple Photos metadata
The apple-photos command imports metadata from your Apple Photos library.
$ dogsheep-photos apple-photos photos.db
Imported metadata includes places, people, albums, quality scores and machine learning labels for the photo contents.
Creating a subset database
You can create a new, subset database of photos using the create-subset command.
This is useful for creating a shareable SQLite database that only contains metadata for a selected set of photos.
Since photo metadata contains latitude and longitude you may not want to share a database that includes photos taken at your home address.
create-subset takes three arguments: an existing database file created using the apple-photos command, the name of the new, shareable database file you would like to create and a SQL query that returns the sha256 hash values of the photos you would like to include in that database.
For example, here's how to create a shareable database of just the photos that have been added to albums containing the word ""Public"":
$ dogsheep-photos create-subset \
photos.db \
public.db \
""select sha256 from apple_photos where albums like '%Public%'""
Serving photos locally with datasette-media
If you don't want to upload your photos to S3 but you still want to browse them using Datasette you can do so using the datasette-media plugin. This plugin adds the ability to serve images and other static files directly from disk, configured using a SQL query.
To use it, first install Datasette and the plugin:
$ pip install datasette datasette-media
If any of your photos are .HEIC images taken by an iPhone you should also install the optional pyheif dependency:
$ pip install pyheif
Now create a metadata.yaml file configuring the plugin:
plugins:
  datasette-media:
    thumbnail:
      sql: |-
        select path as filepath, 200 as resize_height from apple_photos where uuid = :key
    large:
      sql: |-
        select path as filepath, 1024 as resize_height from apple_photos where uuid = :key
This will configure two URL endpoints - one for 200 pixel high thumbnails and one for 1024 pixel high larger images.
Create your photos.db database using the apple-photos command, then run Datasette like this:
$ datasette -m metadata.yaml
Your photos will be served on URLs that look like this:
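Assuming Datasette's default port of 8001 and the thumbnail and large endpoints configured above (UUID here is a placeholder for a real photo UUID):
http://localhost:8001/-/media/thumbnail/UUID
http://localhost:8001/-/media/large/UUID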
You can find the UUIDs for use in these URLs by running select uuid from photos_with_apple_metadata.
Displaying images using datasette-json-html
If you are using datasette-media to serve photos you can include images directly in Datasette query results using the datasette-json-html plugin.
Run pip install datasette-json-html to install the plugin, then use queries like this to view your images:
select
json_object(
'img_src',
'/-/media/thumbnail/'|| uuid
) as photo,
uuid,
date
from
apple_photos
order by
date desc
limit 10;
The photo column returned by this query should render as image tags that display the correct images.
Displaying images using custom template pages
Datasette's custom pages feature lets you create custom pages for a Datasette instance by dropping HTML templates into a templates/pages directory and then running Datasette using datasette --template-dir=templates/.
You can combine that ability with the datasette-template-sql plugin to create custom template pages that directly display photos served by datasette-media.
Install the plugin using pip install datasette-template-sql.
Create a templates/pages folder and add the following files:
recent-photos.html
<h1>Recent photos</h1>
<div>
{% for photo in sql(""select * from apple_photos order by date desc limit 20"") %}
<img src=""/-/media/photo/{{ photo['uuid'] }}"">
{% endfor %}
</div>
random-photos.html
<h1>Random photos</h1>
<div>
{% for photo in sql(""with foo as (select * from apple_photos order by date desc limit 5000) select * from foo order by random() limit 20"") %}
<img src=""/-/media/photo/{{ photo['uuid'] }}"">
{% endfor %}
</div>
Visiting http://localhost:8001/recent-photos will display 20 recent photos. Visiting http://localhost:8001/random-photos will display 20 photos randomly selected from your 5,000 most recent.
",1,public,0,,,
261634807,MDEwOlJlcG9zaXRvcnkyNjE2MzQ4MDc=,datasette-media,simonw/datasette-media,0,9599,https://github.com/simonw/datasette-media,Datasette plugin for serving media based on a SQL query,0,2020-05-06T02:42:57Z,2021-05-03T05:04:39Z,2020-07-30T23:39:29Z,,70,11,11,Python,1,1,1,1,0,0,0,0,8,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,8,11,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-media
[](https://pypi.org/project/datasette-media/)
[](https://github.com/simonw/datasette-media/releases)
[](https://circleci.com/gh/simonw/datasette-media)
[](https://github.com/simonw/datasette-media/blob/master/LICENSE)
Datasette plugin for serving media based on a SQL query.
Use this when you have a database table containing references to files on disk - or binary content stored in BLOB columns - that you would like to be able to serve to your users.
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-media
### HEIC image support
Modern iPhones save their photos using the [HEIC image format](https://en.wikipedia.org/wiki/High_Efficiency_Image_File_Format). Processing these images requires an additional dependency, [pyheif](https://pypi.org/project/pyheif/). You can include this dependency by running:
$ pip install datasette-media[heif]
## Usage
You can use this plugin to configure Datasette to serve static media based on SQL queries to an underlying database table.
Media will be served from URLs that start with `/-/media/`. The full URL to each media asset will look like this:
/-/media/type-of-media/media-key
`type-of-media` will correspond to a configured SQL query, and might be something like `photo`. `media-key` will be an identifier that is used as part of the underlying SQL query to find which file should be served.
### Serving static files from disk
The following ``metadata.json`` configuration will cause this plugin to serve files from disk, based on queries to a database table called `apple_photos`.
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key""
}
}
}
}
```
A request to `/-/media/photo/CF972D33-5324-44F2-8DAE-22CB3182CD31` will execute the following SQL query:
```sql
select filepath from apple_photos where uuid=:key
```
The value from the URL - in this case `CF972D33-5324-44F2-8DAE-22CB3182CD31` - will be passed as the `:key` parameter to the query.
The query returns a `filepath` value that has been read from the table. The plugin will then read that file from disk and serve it in response to the request.
SQL queries default to running against the first connected database. You can specify a different database to execute the query against using `""database"": ""name_of_db""`. To execute against `photos.db`, use this:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key"",
""database"": ""photos""
}
}
}
}
```
See [dogsheep-photos](https://github.com/dogsheep/dogsheep-photos) for an example of an application that can benefit from this plugin.
### Serving binary content from BLOB columns
If your SQL query returns a `content` column, this will be served directly to the user:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select thumbnail as content from photos where uuid=:key"",
""database"": ""thumbs""
}
}
}
}
```
You can also return a `content_type` column which will be used as the `Content-Type` header served to the user:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select body as content, 'text/html;charset=utf-8' as content_type from documents where id=:key"",
""database"": ""documents""
}
}
}
}
```
If you do not specify a `content_type` the default of `application/octet-stream` will be used.
### Serving content proxied from a URL
To serve content that is itself fetched from elsewhere, return a `content_url` column. This can be particularly useful when combined with the ability to resize images (described in the next section).
```json
{
""plugins"": {
""datasette-media"": {
""photos"": {
""sql"": ""select photo_url as content_url from photos where id=:key"",
""database"": ""photos"",
""enable_transform"": true
}
}
}
}
```
Now you can access resized versions of images from that URL like so:
/-/media/photos/13?w=200
### Setting a download file name
The `content_filename` column can be returned to force browsers to download the content using a specific file name.
```json
{
""plugins"": {
""datasette-media"": {
""hello"": {
""sql"": ""select 'Hello ' || :key as content, 'hello.txt' as content_filename""
}
}
}
}
```
Visiting `/-/media/hello/Groot` will cause your browser to download a file called `hello.txt` containing the text `Hello Groot`.
### Resizing or transforming images
Your SQL query can specify that an image should be resized and/or converted to another format by returning additional columns. All three are optional.
* `resize_width` - the width to resize the image to
* `resize_height` - the height to resize the image to
* `output_format` - the output format to use (e.g. `jpeg` or `png`) - any output format [supported by Pillow](https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html) is allowed here.
If you specify one but not the other of `resize_width` or `resize_height` the unspecified one will be calculated automatically to maintain the aspect ratio of the image.
Here's an example configuration that will resize all images to be JPEGs that are 200 pixels in height:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath, 200 as resize_height, 'jpeg' as output_format from apple_photos where uuid=:key"",
""database"": ""photos""
}
}
}
}
```
If you enable the `enable_transform` configuration option you can instead specify transform parameters at runtime using querystring parameters. For example:
- `/-/media/photo/CF972D33?w=200` to resize to a fixed width
- `/-/media/photo/CF972D33?h=200` to resize to a fixed height
- `/-/media/photo/CF972D33?format=jpeg` to convert to JPEG
That option is added like so:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key"",
""database"": ""photos"",
""enable_transform"": true
}
}
}
}
```
The maximum allowed height or width is 4000 pixels. You can change this limit using the `""max_width_height""` option:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key"",
""database"": ""photos"",
""enable_transform"": true,
""max_width_height"": 1000
}
}
}
}
```
## Configuration
In addition to the different named content types, the following special plugin configuration setting is available:
- `transform_threads` - number of threads to use for running transformations (e.g. resizing). Defaults to 4.
This can be used like this:
```json
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key"",
""database"": ""photos""
},
""transform_threads"": 8
}
}
}
```
","
datasette-media
Datasette plugin for serving media based on a SQL query.
Use this when you have a database table containing references to files on disk - or binary content stored in BLOB columns - that you would like to be able to serve to your users.
Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-media
HEIC image support
Modern iPhones save their photos using the HEIC image format. Processing these images requires an additional dependency, pyheif. You can include this dependency by running:
$ pip install datasette-media[heif]
Usage
You can use this plugin to configure Datasette to serve static media based on SQL queries to an underlying database table.
Media will be served from URLs that start with /-/media/. The full URL to each media asset will look like this:
/-/media/type-of-media/media-key
type-of-media will correspond to a configured SQL query, and might be something like photo. media-key will be an identifier that is used as part of the underlying SQL query to find which file should be served.
Serving static files from disk
The following metadata.json configuration will cause this plugin to serve files from disk, based on queries to a database table called apple_photos.
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key""
}
}
}
}
A request to /-/media/photo/CF972D33-5324-44F2-8DAE-22CB3182CD31 will execute the following SQL query:
select filepath from apple_photos where uuid=:key
The value from the URL - in this case CF972D33-5324-44F2-8DAE-22CB3182CD31 - will be passed as the :key parameter to the query.
The query returns a filepath value that has been read from the table. The plugin will then read that file from disk and serve it in response to the request.
SQL queries default to running against the first connected database. You can specify a different database to execute the query against using ""database"": ""name_of_db"". To execute against photos.db, use this:
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath from apple_photos where uuid=:key"",
""database"": ""photos""
}
}
}
}
See dogsheep-photos for an example of an application that can benefit from this plugin.
Serving binary content from BLOB columns
If your SQL query returns a content column, this will be served directly to the user:
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select thumbnail as content from photos where uuid=:key"",
""database"": ""thumbs""
}
}
}
}
You can also return a content_type column which will be used as the Content-Type header served to the user:
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select body as content, 'text/html;charset=utf-8' as content_type from documents where id=:key"",
""database"": ""documents""
}
}
}
}
If you do not specify a content_type the default of application/octet-stream will be used.
Serving content proxied from a URL
To serve content that is itself fetched from elsewhere, return a content_url column. This can be particularly useful when combined with the ability to resize images (described in the next section).
{
""plugins"": {
""datasette-media"": {
""photos"": {
""sql"": ""select photo_url as content_url from photos where id=:key"",
""database"": ""photos"",
""enable_transform"": true
}
}
}
}
Now you can access resized versions of images from that URL like so:
/-/media/photos/13?w=200
Setting a download file name
The content_filename column can be returned to force browsers to download the content using a specific file name.
Visiting /-/media/hello/Groot will cause your browser to download a file called hello.txt containing the text Hello Groot.
Resizing or transforming images
Your SQL query can specify that an image should be resized and/or converted to another format by returning additional columns. All three are optional.
resize_width - the width to resize the image to
resize_height - the height to resize the image to
output_format - the output format to use (e.g. jpeg or png) - any output format supported by Pillow is allowed here.
If you specify one but not the other of resize_width or resize_height the unspecified one will be calculated automatically to maintain the aspect ratio of the image.
Here's an example configuration that will resize all images to be JPEGs that are 200 pixels in height:
{
""plugins"": {
""datasette-media"": {
""photo"": {
""sql"": ""select filepath, 200 as resize_height, 'jpeg' as output_format from apple_photos where uuid=:key"",
""database"": ""photos""
}
}
}
}
If you enable the enable_transform configuration option you can instead specify transform parameters at runtime using querystring parameters. For example:
/-/media/photo/CF972D33?w=200 to resize to a fixed width
/-/media/photo/CF972D33?h=200 to resize to a fixed height
/-/media/photo/CF972D33?format=jpeg to convert to JPEG
",,,,,,
271408895,MDEwOlJlcG9zaXRvcnkyNzE0MDg4OTU=,datasette-permissions-sql,simonw/datasette-permissions-sql,0,9599,https://github.com/simonw/datasette-permissions-sql,Datasette plugin for configuring permission checks using SQL queries,0,2020-06-10T23:48:13Z,2020-06-12T07:06:12Z,2020-06-12T07:06:15Z,,25,0,0,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""datasette"", ""datasette-plugin"", ""datasette-io""]",0,0,0,master,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-permissions-sql
[](https://pypi.org/project/datasette-permissions-sql/)
[](https://circleci.com/gh/simonw/datasette-permissions-sql)
[](https://github.com/simonw/datasette-permissions-sql/blob/master/LICENSE)
Datasette plugin for configuring permission checks using SQL queries
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-permissions-sql
## Usage
First, read up on how Datasette's [authentication and permissions system](https://datasette.readthedocs.io/en/latest/authentication.html) works.
This plugin lets you define rules containing SQL queries that are executed to see if the currently authenticated actor has permission to perform certain actions.
Consider a canned query which authenticated users should only be able to execute if a row in the `users` table says that they are a member of staff.
That `users` table in the `mydatabase.db` database could look like this:
| id | username | is_staff |
|--|--------|--------|
| 1 | cleopaws | 0 |
| 2 | simon | 1 |
Authenticated users have an `actor` that looks like this:
```json
{
""id"": 2,
""username"": ""simon""
}
```
To configure the canned query to only be executable by staff users, add the following to your `metadata.json`:
```json
{
""plugins"": {
""datasette-permissions-sql"": [
{
""action"": ""view-query"",
""resource"": [""mydatabase"", ""promote_to_staff""],
""sql"": ""SELECT * FROM users WHERE is_staff = 1 AND id = :actor_id""
}
]
},
""databases"": {
""mydatabase"": {
""queries"": {
""promote_to_staff"": {
""sql"": ""UPDATE users SET is is_staff=1 WHERE id=:id"",
""write"": true
}
}
}
}
}
```
The `""datasette-permissions-sql""` key is a list of rules. Each of those rules has the following shape:
```json
{
""action"": ""name-of-action"",
""resource"": [""resource identifier to run this on""],
""sql"": ""SQL query to execute"",
""database"": ""mydatabase""
}
```
Both `""action""` and `""resource""` are optional. If present, the SQL query will only be executed on permission checks that match the action and, if present, the resource indicators.
`""database""` is also optional: it specifies the named database that the query should be executed against. If it is not present the first connected database will be used.
The Datasette documentation includes a [list of built-in permissions](https://datasette.readthedocs.io/en/stable/authentication.html#built-in-permissions) that you might want to use here.
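For example, a rule with no ""resource"" at all will be consulted for every check of the named action - a minimal sketch, assuming a hypothetical admins table with an id column:
```json
{
    ""action"": ""view-instance"",
    ""sql"": ""SELECT * FROM admins WHERE id = :actor_id""
}
```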
### The SQL query
If the SQL query returns any rows the action will be allowed. If it returns no rows, the plugin hook will return `False` and deny access to that action.
The SQL query is called with a number of named parameters. You can use any of these as part of the query.
The list of parameters is as follows:
* `action` - the action, e.g. `""view-database""`
* `resource_1` - the first component of the resource, if one was passed
* `resource_2` - the second component of the resource, if available
* `actor_*` - a parameter for every key on the actor. Usually `actor_id` is present.
If any rows are returned, the permission check passes. If no rows are returned the check fails.
Another example table, this time granting explicit access to individual tables. Consider a table called `table_access` that looks like this:
| user_id | database | table |
| - | - | - |
| 1 | mydb | dogs |
| 2 | mydb | dogs |
| 1 | mydb | cats |
The following SQL query would grant users 1 and 2 access to the `dogs` table in the `mydb.db` database - but would forbid user 2 from accessing the `cats` table:
```sql
SELECT
*
FROM
table_access
WHERE
user_id = :actor_id
AND ""database"" = :resource_1
AND ""table"" = :resource_2
```
In a `metadata.yaml` configuration file that would look like this:
```yaml
databases:
mydb:
allow_sql: {}
plugins:
datasette-permissions-sql:
- action: view-table
sql: |-
SELECT
*
FROM
table_access
WHERE
user_id = :actor_id
AND ""database"" = :resource_1
AND ""table"" = :resource_2
```
We're using `allow_sql: {}` here to disable arbitrary SQL queries. This prevents users from running `select * from cats` directly to work around the permissions limits.
### Fallback mode
The default behaviour of this plugin is to take full control of specified permissions. The SQL query will directly control if the user is allowed or denied access to the permission.
This means that the default policy for each permission (which in Datasette core is ""allow"" for `view-database` and friends) will be ignored. It also means that any other `permission_allowed` plugins will not get their turn once this plugin has executed.
You can change this on a per-rule basis using ``""fallback"": true``:
```json
{
""action"": ""view-table"",
""resource"": [""mydatabase"", ""mytable""],
""sql"": ""select * from admins where user_id = :actor_id"",
""fallback"": true
}
```
When running in fallback mode, a query result returning no rows will cause the plugin hook to return ``None`` - which means ""I have no opinion on this permission, fall back to other plugins or the default"".
In this mode you can still return `False` (for ""deny access"") by returning a single row with a single value of `-1`. For example:
```json
{
""action"": ""view-table"",
""resource"": [""mydatabase"", ""mytable""],
""sql"": ""select -1 from banned where user_id = :actor_id"",
""fallback"": true
}
```
","
datasette-permissions-sql
Datasette plugin for configuring permission checks using SQL queries
Installation
Install this plugin in the same environment as Datasette.
This plugin lets you define rules containing SQL queries that are executed to see if the currently authenticated actor has permission to perform certain actions.
Consider a canned query which authenticated users should only be able to execute if a row in the users table says that they are a member of staff.
That users table in the mydatabase.db database could look like this:
id  username  is_staff
1   cleopaws  0
2   simon     1
Authenticated users have an actor that looks like this:
{
""id"": 2,
""username"": ""simon""
}
To configure the canned query to only be executable by staff users, add the following to your metadata.json:
{
""plugins"": {
""datasette-permissions-sql"": [
{
""action"": ""view-query"",
""resource"": [""mydatabase"", ""promote_to_staff""],
""sql"": ""SELECT * FROM users WHERE is_staff = 1 AND id = :actor_id""
}
]
},
""databases"": {
""mydatabase"": {
""queries"": {
""promote_to_staff"": {
""sql"": ""UPDATE users SET is is_staff=1 WHERE id=:id"",
""write"": true
}
}
}
}
}
The ""datasette-permissions-sql"" key is a list of rules. Each of those rules has the following shape:
{
""action"": ""name-of-action"",
""resource"": [""resource identifier to run this on""],
""sql"": ""SQL query to execute"",
""database"": ""mydatabase""
}
Both ""action"" and ""resource"" are optional. If present, the SQL query will only be executed on permission checks that match the action and, if present, the resource indicators.
""database"" is also optional: it specifies the named database that the query should be executed against. If it is not present the first connected database will be used.
If the SQL query returns any rows the action will be allowed. If it returns no rows, the plugin hook will return False and deny access to that action.
The SQL query is called with a number of named parameters. You can use any of these as part of the query.
The list of parameters is as follows:
action - the action, e.g. ""view-database""
resource_1 - the first component of the resource, if one was passed
resource_2 - the second component of the resource, if available
actor_* - a parameter for every key on the actor. Usually actor_id is present.
If any rows are returned, the permission check passes. If no rows are returned the check fails.
Another example table, this time granting explicit access to individual tables. Consider a table called table_access that looks like this:
user_id  database  table
1        mydb      dogs
2        mydb      dogs
1        mydb      cats
The following SQL query would grant users 1 and 2 access to the dogs table in the mydb.db database - but would forbid user 2 from accessing the cats table:
SELECT
*
FROM
table_access
WHERE
user_id = :actor_id
AND ""database"" = :resource_1
AND ""table"" = :resource_2
In a metadata.yaml configuration file that would look like this:
databases:
  mydb:
    allow_sql: {}
plugins:
  datasette-permissions-sql:
  - action: view-table
    sql: |-
      SELECT
        *
      FROM
        table_access
      WHERE
        user_id = :actor_id
        AND ""database"" = :resource_1
        AND ""table"" = :resource_2
We're using allow_sql: {} here to disable arbitrary SQL queries. This prevents users from running select * from cats directly to work around the permissions limits.
Fallback mode
The default behaviour of this plugin is to take full control of specified permissions. The SQL query will directly control if the user is allowed or denied access to the permission.
This means that the default policy for each permission (which in Datasette core is ""allow"" for view-database and friends) will be ignored. It also means that any other permission_allowed plugins will not get their turn once this plugin has executed.
You can change this on a per-rule basis using ""fallback"": true:
{
""action"": ""view-table"",
""resource"": [""mydatabase"", ""mytable""],
""sql"": ""select * from admins where user_id = :actor_id"",
""fallback"": true
}
When running in fallback mode, a query result returning no rows will cause the plugin hook to return None - which means ""I have no opinion on this permission, fall back to other plugins or the default"".
In this mode you can still return False (for ""deny access"") by returning a single row with a single value of -1. For example:
{
""action"": ""view-table"",
""resource"": [""mydatabase"", ""mytable""],
""sql"": ""select -1 from banned where user_id = :actor_id"",
""fallback"": true
}
",,,,,,
271665336,MDEwOlJlcG9zaXRvcnkyNzE2NjUzMzY=,datasette-auth-tokens,simonw/datasette-auth-tokens,0,9599,https://github.com/simonw/datasette-auth-tokens,Datasette plugin for authenticating access using API tokens,0,2020-06-11T23:23:30Z,2021-10-15T00:52:53Z,2021-10-15T00:54:20Z,,34,4,4,Python,1,1,1,1,0,1,0,0,0,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin""]",1,0,4,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,3,"# datasette-auth-tokens
[](https://pypi.org/project/datasette-auth-tokens/)
[](https://github.com/simonw/datasette-auth-tokens/releases)
[](https://github.com/simonw/datasette-auth-tokens/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-auth-tokens/blob/main/LICENSE)
Datasette plugin for authenticating access using API tokens
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-auth-tokens
## Hard-coded tokens
Read about Datasette's [authentication and permissions system](https://datasette.readthedocs.io/en/latest/authentication.html).
This plugin lets you configure secret API tokens which can be used to make authenticated requests to Datasette.
First, create a random API token. A useful recipe for doing that is the following:
$ python -c 'import secrets; print(secrets.token_hex(32))'
5f9a486dd807de632200b17508c75002bb66ca6fde1993db1de6cbd446362589
Decide on the actor that this token should represent, for example:
```json
{
""bot_id"": ""my-bot""
}
```
You can then use `""allow""` blocks to provide that token with permission to access specific actions. To enable access to a configured writable SQL query you could use this in your `metadata.json`:
```json
{
""plugins"": {
""datasette-auth-tokens"": {
""tokens"": [
{
""token"": {
""$env"": ""BOT_TOKEN""
},
""actor"": {
""bot_id"": ""my-bot""
}
}
]
}
},
""databases"": {
"":memory:"": {
""queries"": {
""show_version"": {
""sql"": ""select sqlite_version()"",
""allow"": {
""bot_id"": ""my-bot""
}
}
}
}
}
}
```
This uses Datasette's [secret configuration values mechanism](https://datasette.readthedocs.io/en/stable/plugins.html#secret-configuration-values) to allow the secret token to be passed as an environment variable.
Run Datasette like this:
BOT_TOKEN=""this-is-the-secret-token"" \
datasette -m metadata.json
You can now run authenticated API queries like this:
$ curl -H 'Authorization: Bearer this-is-the-secret-token' \
'http://127.0.0.1:8001/:memory:/show_version.json?_shape=array'
[{""sqlite_version()"": ""3.31.1""}]
Additionally you can allow passing the token as a query string parameter, although that's disabled by default given the security implications of URLs with secret tokens included. This may be useful to easily allow embedding data between different services.
Simply enable it using the `param` config value:
```json
{
""plugins"": {
""datasette-auth-tokens"": {
""tokens"": [
{
""token"": {
""$env"": ""BOT_TOKEN""
},
""actor"": {
""bot_id"": ""my-bot""
}
}
],
""param"": ""_auth_token""
}
},
""databases"": {
"":memory:"": {
""queries"": {
""show_version"": {
""sql"": ""select sqlite_version()"",
""allow"": {
""bot_id"": ""my-bot""
}
}
}
}
}
}
```
You can now run authenticated API queries like this:
$ curl 'http://127.0.0.1:8001/:memory:/show_version.json?_shape=array&_auth_token=this-is-the-secret-token'
[{""sqlite_version()"": ""3.31.1""}]
## Tokens from your database
As an alternative (or in addition) to the hard-coded list of tokens you can store tokens in a database table and configure the plugin to access them using a SQL query.
Your query needs to take a `:token_id` parameter and return at least two columns: one called `token_secret` and one called `actor_*` - usually `actor_id`. Further `actor_` prefixed columns can be returned to provide more details for the authenticated actor.
Here's a simple example of a configuration query:
```sql
select actor_id, actor_name, token_secret from tokens where token_id = :token_id
```
This can run against a table like this one:
| token_id | token_secret | actor_id | actor_name |
| -------- | ------------ | -------- | ---------- |
| 1 | bd3c94f51fcd | 78 | Cleopaws |
| 2 | 86681b4d6f66 | 32 | Pancakes |
The tokens are formed as the token ID, then a hyphen, then the token secret. For example:
- `1-bd3c94f51fcd`
- `2-86681b4d6f66`
The SQL query will be executed with the portion before the hyphen as the `:token_id` parameter.
The `token_secret` value returned by the query will be compared to the portion of the token after the hyphen to check if the token is valid.
Columns with a prefix of `actor_` will be used to populate the actor dictionary. In the above example, a token of `2-86681b4d6f66` will become an actor dictionary of `{""id"": 32, ""name"": ""Pancakes""}`.
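To try this out locally you could create a matching table yourself. A minimal sketch using the column names from the example above (the table layout shown is just one possible arrangement):
```sql
create table tokens (
    token_id integer primary key,
    token_secret text,
    actor_id integer,
    actor_name text
);
insert into tokens (token_secret, actor_id, actor_name) values
    ('bd3c94f51fcd', 78, 'Cleopaws'),
    ('86681b4d6f66', 32, 'Pancakes');
```
With this table in place, a request sent with the header `Authorization: Bearer 1-bd3c94f51fcd` would authenticate as the Cleopaws actor.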
To configure this, use a `""query""` block in your plugin configuration like this:
```json
{
""plugins"": {
""datasette-auth-tokens"": {
""query"": {
""sql"": ""select actor_id, actor_name, token_secret from tokens where token_id = :token_id"",
""database"": ""tokens""
}
}
},
""databases"": {
""tokens"": {
""allow"": {}
}
}
}
```
The `""sql""` key here contains the SQL query. The `""database""` key has the name of the attached database file that the query should be executed against - in this case it would execute against `tokens.db`.
### Securing your tokens
Anyone with access to your Datasette instance can use it to read the `token_secret` column in your tokens table. This probably isn't what you want!
To avoid this, you should lock down access to that table. The configuration example above shows how to do this using an `""allow"": {}` block. Consult Datasette's [Permissions documentation](https://datasette.readthedocs.io/en/stable/authentication.html#permissions) for more information about how to lock down this kind of access.
","
datasette-auth-tokens
Datasette plugin for authenticating access using API tokens
Installation
Install this plugin in the same environment as Datasette.
Decide on the actor that this token should represent, for example:
{
""bot_id"": ""my-bot""
}
You can then use ""allow"" blocks to provide that token with permission to access specific actions. To enable access to a configured writable SQL query you could use this in your metadata.json:
Additionally you can allow passing the token as a query string parameter, although that's disabled by default given the security implications of URLs with secret tokens included. This may be useful to easily allow embedding data between different services.
As an alternative (or in addition) to the hard-coded list of tokens you can store tokens in a database table and configure the plugin to access them using a SQL query.
Your query needs to take a :token_id parameter and return at least two columns: one called token_secret and one called actor_* - usually actor_id. Further actor_ prefixed columns can be returned to provide more details for the authenticated actor.
Here's a simple example of a configuration query:
select actor_id, actor_name, token_secret from tokens where token_id = :token_id
This can run against a table like this one:
token_id  token_secret  actor_id  actor_name
1         bd3c94f51fcd  78        Cleopaws
2         86681b4d6f66  32        Pancakes
The tokens are formed as the token ID, then a hyphen, then the token secret. For example:
1-bd3c94f51fcd
2-86681b4d6f66
The SQL query will be executed with the portion before the hyphen as the :token_id parameter.
The token_secret value returned by the query will be compared to the portion of the token after the hyphen to check if the token is valid.
Columns with a prefix of actor_ will be used to populate the actor dictionary. In the above example, a token of 2-86681b4d6f66 will become an actor dictionary of {""id"": 32, ""name"": ""Pancakes""}.
To configure this, use a ""query"" block in your plugin configuration like this:
The ""sql"" key here contains the SQL query. The ""database"" key has the name of the attached database file that the query should be executed against - in this case it would execute against tokens.db.
Securing your tokens
Anyone with access to your Datasette instance can use it to read the token_secret column in your tokens table. This probably isn't what you want!
To avoid this, you should lock down access to that table. The configuration example above shows how to do this using an ""allow"": {} block. Consult Datasette's Permissions documentation for more information about how to lock down this kind of access.
",1,public,0,,,
272098486,MDEwOlJlcG9zaXRvcnkyNzIwOTg0ODY=,datasette-psutil,simonw/datasette-psutil,0,9599,https://github.com/simonw/datasette-psutil,Datasette plugin adding a /-/psutil debugging endpoint,0,2020-06-13T22:57:07Z,2022-03-07T15:36:30Z,2022-03-07T15:35:57Z,https://datasette.io/plugins/datasette-psutil,12,2,2,Python,1,1,1,1,0,0,0,0,1,apache-2.0,"[""datasette"", ""datasette-io"", ""datasette-plugin"", ""psutil""]",0,1,2,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-psutil
[](https://pypi.org/project/datasette-psutil/)
[](https://github.com/simonw/datasette-psutil/releases)
[](https://github.com/simonw/datasette-psutil/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-psutil/blob/main/LICENSE)
Datasette plugin adding a `/-/psutil` debugging endpoint
## Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-psutil
## Usage
Visit `/-/psutil` on your Datasette instance to see various information provided by [psutil](https://psutil.readthedocs.io/).
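For example, assuming a database file called `mydata.db` (any database will do) and Datasette's default port:

    $ datasette mydata.db

Then open `http://127.0.0.1:8001/-/psutil` in your browser.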
## Demo
https://latest-with-plugins.datasette.io/-/psutil is a live demo of this plugin, hosted on Google Cloud Run.
","
datasette-psutil
Datasette plugin adding a /-/psutil debugging endpoint
Installation
Install this plugin in the same environment as Datasette.
$ pip install datasette-psutil
Usage
Visit /-/psutil on your Datasette instance to see various information provided by psutil.
",1,public,0,,,
274264484,MDEwOlJlcG9zaXRvcnkyNzQyNjQ0ODQ=,sqlite-generate,simonw/sqlite-generate,0,9599,https://github.com/simonw/sqlite-generate,Tool for generating demo SQLite databases,0,2020-06-22T23:36:44Z,2021-02-27T15:25:26Z,2021-02-27T15:25:24Z,https://sqlite-generate-demo.datasette.io/,56,17,17,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""sqlite"", ""datasette-io"", ""datasette-tool""]",0,0,17,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,2,"# sqlite-generate
[](https://pypi.org/project/sqlite-generate/)
[](https://github.com/simonw/sqlite-generate/releases)
[](https://github.com/simonw/sqlite-generate/blob/master/LICENSE)
Tool for generating demo SQLite databases
## Installation
Install this plugin using `pip`:
$ pip install sqlite-generate
## Demo
You can see a demo of the database generated using this command running in [Datasette](https://github.com/simonw/datasette) at https://sqlite-generate-demo.datasette.io/
The demo is generated using the following command:
sqlite-generate demo.db --seed seed --fts --columns=10 --fks=0,3 --pks=0,2
## Usage
To generate a SQLite database file called `data.db` with 10 randomly named tables in it, run the following:
sqlite-generate data.db
You can use the `--tables` option to generate a different number of tables:
sqlite-generate data.db --tables 20
You can run the command against the same database file multiple times to keep adding new tables, using different settings for each batch of generated tables.
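For example, you might generate two batches with different shapes - a sketch using the options described in this section:

    sqlite-generate data.db --tables 5 --rows 10
    sqlite-generate data.db --tables 3 --columns 12 --rows 100,500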
By default each table will contain a random number of rows between 0 and 200. You can customize this with the `--rows` option:
sqlite-generate data.db --rows 20
This will insert 20 rows into each table.
sqlite-generate data.db --rows 500,2000
This inserts a random number of rows between 500 and 2000 into each table.
Each table will have 5 columns. You can change this using `--columns`:
sqlite-generate data.db --columns 10
`--columns` can also accept a range:
sqlite-generate data.db --columns 5,15
You can control the random number seed used with the `--seed` option. This will result in the exact same database file being created by multiple runs of the tool:
sqlite-generate data.db --seed=myseed
By default each table will contain between 0 and 2 foreign key columns to other tables. You can control this using the `--fks` option, with either a single number or a range:
sqlite-generate data.db --columns=20 --fks=5,15
Each table will have a single primary key column called `id`. You can use the `--pks=` option to change the number of primary key columns on each table. Drop it to 0 to generate [rowid tables](https://www.sqlite.org/rowidtable.html). Increase it above 1 to generate tables with compound primary keys. Or use a range to get a random selection of different primary key layouts:
sqlite-generate data.db --pks=0,2
To configure [SQLite full-text search](https://www.sqlite.org/fts5.html) for all columns of type text, use `--fts`:
sqlite-generate data.db --fts
This will use FTS5 by default. To use [FTS4](https://www.sqlite.org/fts3.html) instead, use `--fts4`.
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd sqlite-generate
python -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
","
To generate a SQLite database file called data.db with 10 randomly named tables in it, run the following:
sqlite-generate data.db
You can use the --tables option to generate a different number of tables:
sqlite-generate data.db --tables 20
You can run the command against the same database file multiple times to keep adding new tables, using different settings for each batch of generated tables.
By default each table will contain a random number of rows between 0 and 200. You can customize this with the --rows option:
sqlite-generate data.db --rows 20
This will insert 20 rows into each table.
sqlite-generate data.db --rows 500,2000
This inserts a random number of rows between 500 and 2000 into each table.
Each table will have 5 columns. You can change this using --columns:
sqlite-generate data.db --columns 10
--columns can also accept a range:
sqlite-generate data.db --columns 5,15
You can control the random number seed used with the --seed option. This will result in the exact same database file being created by multiple runs of the tool:
sqlite-generate data.db --seed=myseed
By default each table will contain between 0 and 2 foreign key columns to other tables. You can control this using the --fks option, with either a single number or a range:
sqlite-generate data.db --columns=20 --fks=5,15
Each table will have a single primary key column called id. You can use the --pks= option to change the number of primary key columns on each table. Drop it to 0 to generate rowid tables. Increase it above 1 to generate tables with compound primary keys. Or use a range to get a random selection of different primary key layouts:
This will use FTS5 by default. To use FTS4 instead, use --fts4.
Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd sqlite-generate
python -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
",,,,,,
291339086,MDEwOlJlcG9zaXRvcnkyOTEzMzkwODY=,airtable-export,simonw/airtable-export,0,9599,https://github.com/simonw/airtable-export,"Export Airtable data to YAML, JSON or SQLite files on disk",0,2020-08-29T19:51:37Z,2021-06-08T17:30:30Z,2021-04-09T23:41:52Z,https://datasette.io/tools/airtable-export,41,33,33,Python,1,1,1,1,0,5,0,0,6,apache-2.0,"[""yaml"", ""airtable"", ""airtable-api"", ""datasette-io"", ""datasette-tool""]",5,6,33,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,5,3,"# airtable-export
[](https://pypi.org/project/airtable-export/)
[](https://github.com/simonw/airtable-export/releases)
[](https://github.com/simonw/airtable-export/actions?query=workflow%3ATest)
[](https://github.com/simonw/airtable-export/blob/master/LICENSE)
Export Airtable data to files on disk
## Installation
Install this tool using `pip`:
$ pip install airtable-export
## Usage
You will need to know the following information:
- Your Airtable base ID - this is a string starting with `app...`
- Your Airtable API key - this is a string starting with `key...`
- The names of each of the tables that you wish to export
You can export all of your data to a folder called `export/` by running the following:
airtable-export export base_id table1 table2 --key=key
This example would create two files: `export/table1.yml` and `export/table2.yml`.
Rather than passing the API key using the `--key` option you can set it as an environment variable called `AIRTABLE_KEY`.
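For example, assuming a POSIX shell (`key` here stands in for your real API key):

    export AIRTABLE_KEY=key
    airtable-export export base_id table1 table2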
## Export options
By default the tool exports your data as YAML.
You can also export as JSON or as [newline delimited JSON](http://ndjson.org/) using the `--json` or `--ndjson` options:
airtable-export export base_id table1 table2 --key=key --ndjson
You can pass multiple format options at once. This command will create a `.json`, `.yml` and `.ndjson` file for each exported table:
airtable-export export base_id table1 table2 \
--key=key --ndjson --yaml --json
### SQLite database export
You can export tables to a SQLite database file using the `--sqlite database.db` option:
airtable-export export base_id table1 table2 \
--key=key --sqlite database.db
This can be combined with other format options. If you only specify `--sqlite` the export directory argument will be ignored.
The SQLite database will have a table created for each table you export. Those tables will have a primary key column called `airtable_id`.
If you run this command against an existing SQLite database records with matching primary keys will be over-written by new records from the export.
## Request options
By default the tool uses [python-httpx](https://www.python-httpx.org)'s default configurations.
You can override the `user-agent` using the `--user-agent` option:
airtable-export export base_id table1 table2 --key=key --user-agent ""Airtable Export Robot""
You can override the [timeout during a network read operation](https://www.python-httpx.org/advanced/#fine-tuning-the-configuration) using the `--http-read-timeout` option. If not set, this defaults to 5s.
airtable-export export base_id table1 table2 --key=key --http-read-timeout 60
## Running this using GitHub Actions
[GitHub Actions](https://github.com/features/actions) is GitHub's workflow automation product. You can use it to run `airtable-export` in order to back up your Airtable data to a GitHub repository. Doing this gives you a visible commit history of changes you make to your Airtable data - like [this one](https://github.com/natbat/rockybeaches/commits/main/airtable).
To run this for your own Airtable database you'll first need to add the following secrets to your GitHub repository:
AIRTABLE_BASE_ID
The base ID, a string beginning `app...`
AIRTABLE_KEY
Your Airtable API key
AIRTABLE_TABLES
A space separated list of the Airtable tables that you want to backup. If any of these contain spaces you will need to enclose them in single quotes, e.g. 'My table with spaces in the name' OtherTableWithNoSpaces
Once you have set those secrets, add the following as a file called `.github/workflows/backup-airtable.yml`:
```yaml
name: Backup Airtable
on:
workflow_dispatch:
schedule:
- cron: '32 0 * * *'
jobs:
build:
runs-on: ubuntu-latest
steps:
- name: Check out repo
uses: actions/checkout@v2
- name: Set up Python
uses: actions/setup-python@v2
with:
python-version: 3.8
- uses: actions/cache@v2
name: Configure pip caching
with:
path: ~/.cache/pip
key: ${{ runner.os }}-pip-
restore-keys: |
${{ runner.os }}-pip-
- name: Install airtable-export
run: |
pip install airtable-export
- name: Backup Airtable to backups/
env:
AIRTABLE_BASE_ID: ${{ secrets.AIRTABLE_BASE_ID }}
AIRTABLE_KEY: ${{ secrets.AIRTABLE_KEY }}
AIRTABLE_TABLES: ${{ secrets.AIRTABLE_TABLES }}
run: |-
airtable-export backups $AIRTABLE_BASE_ID $AIRTABLE_TABLES -v
- name: Commit and push if it changed
run: |-
git config user.name ""Automated""
git config user.email ""actions@users.noreply.github.com""
git add -A
timestamp=$(date -u)
git commit -m ""Latest data: ${timestamp}"" || exit 0
git push
```
This will run once a day (at 32 minutes past midnight UTC) and will also run if you manually click the ""Run workflow"" button, see [GitHub Actions: Manual triggers with workflow_dispatch](https://github.blog/changelog/2020-07-06-github-actions-manual-triggers-with-workflow_dispatch/).
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd airtable-export
python -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
","
airtable-export
Export Airtable data to files on disk
Installation
Install this tool using pip:
$ pip install airtable-export
Usage
You will need to know the following information:
Your Airtable base ID - this is a string starting with app...
Your Airtable API key - this is a string starting with key...
The names of each of the tables that you wish to export
You can export all of your data to a folder called export/ by running the following:
GitHub Actions is GitHub's workflow automation product. You can use it to run airtable-export in order to back up your Airtable data to a GitHub repository. Doing this gives you a visible commit history of changes you make to your Airtable data - like this one.
To run this for your own Airtable database you'll first need to add the following secrets to your GitHub repository:
AIRTABLE_BASE_ID
The base ID, a string beginning `app...`
AIRTABLE_KEY
Your Airtable API key
AIRTABLE_TABLES
A space separated list of the Airtable tables that you want to backup. If any of these contain spaces you will need to enclose them in single quotes, e.g. 'My table with spaces in the name' OtherTableWithNoSpaces
Once you have set those secrets, add the following as a file called .github/workflows/backup-airtable.yml:
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd airtable-export
python -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and tests:
pip install -e '.[test]'
To run the tests:
pytest
",,,,,,
293361514,MDEwOlJlcG9zaXRvcnkyOTMzNjE1MTQ=,geocode-sqlite,eyeseast/geocode-sqlite,0,25778,https://github.com/eyeseast/geocode-sqlite,Geocode rows in a SQLite database table,0,2020-09-06T21:05:39Z,2022-11-02T19:19:56Z,2022-11-07T17:31:05Z,,125,223,223,Python,1,1,1,1,0,6,0,0,8,apache-2.0,[],6,8,223,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,6,5,"# geocode-sqlite
[](https://pypi.org/project/geocode-sqlite/)
[](https://github.com/eyeseast/geocode-sqlite/releases)
[](https://github.com/eyeseast/geocode-sqlite/actions?query=workflow%3ATest)
[](https://github.com/eyeseast/geocode-sqlite/blob/master/LICENSE)
Geocode rows from a SQLite table
## Installation
Install this tool using `pip` or `pipx`:
```sh
# install inside a virtualenv
pip install geocode-sqlite
# install globally
pipx install geocode-sqlite
```
## Usage
Let's say you have a spreadsheet with addresses in it, and you'd like to map those locations.
First, create a SQLite database and insert rows from that spreadsheet using `sqlite-utils`.
```sh
sqlite-utils insert data.db data data.csv --csv
```
Now, geocode it using OpenStreetMap's Nominatim geocoder.
```sh
geocode-sqlite nominatim data.db data \
--location=""{address}, {city}, {state} {zip}"" \
--delay=1 \
--user-agent=""this-is-me""
```
In the command above, you're using Nominatim, which is free and only asks for a unique user agent (`--user-agent`).
This will connect to a database (`data.db`) and read all rows from the table `data` (skipping any that already
have both a `latitude` and `longitude` column filled).
You're also telling the geocoder how to extract a location query (`--location`) from a row of data, using Python's
built-in string formatting, and setting a rate limit (`--delay`) of one request per second.
For each row where geocoding succeeds, `latitude` and `longitude` will be populated. If you hit an error, or a rate limit,
run the same query and pick up where you left off.
The resulting table layout can be visualized with [datasette-cluster-map](https://datasette.io/plugins/datasette-cluster-map).
Under the hood, this package uses the excellent [geopy](https://geopy.readthedocs.io/en/latest/) library, which is stable and thoroughly road-tested. If you need help understanding a particular geocoder's options, consult [geopy's documentation](https://geopy.readthedocs.io/en/latest/#module-geopy.geocoders).
### Supported Geocoders
The CLI currently supports these geocoders:
- `bing`
- `googlev3`
- `mapquest` (and `open-mapquest`)
- `mapbox`
- `nominatim`
- `opencage`
#### Adding new geocoders
1. Open an issue with the name of the geocoding service as the ticket title ([example](https://github.com/eyeseast/geocode-sqlite/issues/35)). Put any noteworthy implementation details in the ticket body, like where to get an API key if one is required.
2. Fork the repo and add a geocoder.
3. Add an example to the `Makefile`. Add tests if there's new shared functionality.
### Common arguments and options
Each geocoder needs to know where to find the data it's working with. These are the first two arguments:
- `database`: a path to a SQLite file, which must already exist
- `table`: the name of a table, in that database, which exists and has data to geocode
From there, we have a set of options passed to every geocoder:
- `location`: a [string format](https://docs.python.org/3/library/stdtypes.html#str.format) that will be expanded with each row to build a full query, to be geocoded
- `delay`: a delay between each call (some services require this)
- `latitude`: latitude column name
- `longitude`: longitude column name
- `geojson`: store results as GeoJSON, instead of in latitude and longitude columns
- `spatialite`: store results in a SpatiaLite geometry column, instead of in latitude and longitude columns
- `raw`: store raw geocoding results in a JSON column
Each geocoder takes additional, specific arguments beyond these, such as API keys. Again, [geopy's documentation](https://geopy.readthedocs.io/en/latest/#module-geopy.geocoders) is an excellent resource.
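For example, a Nominatim run that also renames the coordinate columns and keeps the raw response might look like this - a sketch assuming the options above are exposed as `--latitude`, `--longitude` and `--raw` flags, with `lat` and `lng` as hypothetical column names:
```sh
geocode-sqlite nominatim data.db data \
    --location=""{address}, {city}, {state} {zip}"" \
    --delay=1 \
    --user-agent=""this-is-me"" \
    --latitude lat \
    --longitude lng \
    --raw
```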
## Using SpatiaLite
The `--spatialite` flag will store results in a [geometry column](https://www.gaia-gis.it/gaia-sins/spatialite-cookbook-5/cookbook_topics.adminstration.html#topic_TABLE_to_SpatialTable), instead of `latitude` and `longitude` columns. This is useful if you're doing other GIS operations, such as using a [spatial index](https://www.gaia-gis.it/fossil/libspatialite/wiki?name=SpatialIndex). See the [SpatiaLite cookbook](https://www.gaia-gis.it/gaia-sins/spatialite-cookbook-5/index.html) and [functions list](https://www.gaia-gis.it/gaia-sins/spatialite-sql-latest.html) for more of what's possible.
## Capturing additional geocoding data
Geocoding services typically return more data than just coordinates. This might include accuracy, normalized addresses or other context. This can be captured using the `--raw` flag. By default, this will add a `raw` column and store the full geocoding response as JSON. If you want to rename that column, pass a value, like `--raw custom_raw`.
The shape of this response object will vary between services. You can query specific values using [SQLite's built-in JSON functions](https://www.sqlite.org/json1.html). For example, this will work with Google's geocoder:
```sql
select
json_extract(raw, '$.formatted_address') as address,
json_extract(raw, '$.geometry.location_type') as location_type
from
innout_test
```
Check each geocoding service's documentation for what's included in the response.
## Python API
The command line interface aims to support the most common options for each geocoder. For more fine-grained control, use the Python API.
As with the CLI, this assumes you already have a SQLite database and a table of location data.
```python
from geocode_sqlite import geocode_table
from geopy.geocoders import Nominatim
# create a geocoder instance, with some extra options
nominatim = Nominatim(user_agent=""this-is-me"", domain=""nominatim.local.dev"", scheme=""http"")
# assuming our database is in the same directory
count = geocode_table(""data.db"", ""data"", query_template=""{address}, {city}, {state} {zip}"")
# when it's done
print(f""Geocoded {count} rows"")
```
Any [geopy geocoder](https://geopy.readthedocs.io/en/latest/#module-geopy.geocoders) can be used with the Python API.
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
```sh
cd geocode-sqlite
python -m venv .venv
source .venv/bin/activate
```
Or if you are using `pipenv`:
```sh
pipenv shell
```
Now install the dependencies and tests:
```sh
pip install -e '.[test]'
```
To run the tests:
```sh
pytest
```
Please remember that this library is mainly glue code between other well-tested projects, specifically: [click](https://click.palletsprojects.com/), [geopy](https://geopy.readthedocs.io/en/stable/) and [sqlite-utils](https://sqlite-utils.datasette.io/en/stable/). Tests should focus on making sure those parts fit together correctly. We can assume the parts themselves already work.
To that end, there is a test geocoder included: `geocode_sqlite.testing.DummyGeocoder`. That geocoder works with an included dataset of In-N-Out Burger locations provided by [AllThePlaces](https://www.alltheplaces.xyz/). It works like a normal GeoPy geocoder, except it will only return results for In-N-Out locations using the included database.
","
Let's say you have a spreadsheet with addresses in it, and you'd like to map those locations.
First, create a SQLite database and insert rows from that spreadsheet using sqlite-utils.
sqlite-utils insert data.db data data.csv --csv
Now, geocode it using OpenStreetMap's Nominatim geocoder.
In the command above, you're using Nominatim, which is free and only asks for a unique user agent (--user-agent).
This will connect to a database (data.db) and read all rows from the table data (skipping any that already
have both a latitude and longitude column filled).
You're also telling the geocoder how to extract a location query (--location) from a row of data, using Python's
built-in string formatting, and setting a rate limit (--delay) of one request per second.
For each row where geocoding succeeds, latitude and longitude will be populated. If you hit an error, or a rate limit,
run the same query and pick up where you left off.
Under the hood, this package uses the excellent geopy library, which is stable and thoroughly road-tested. If you need help understanding a particular geocoder's options, consult geopy's documentation.
Supported Geocoders
The CLI currently supports these geocoders:
bing
googlev3
mapquest (and open-mapquest)
mapbox
nominatim
opencage
Adding new geocoders
Open an issue with the name of the geocoding service as the ticket title (example). Put any noteworthy implementation details in the ticket body, like where to get an API key if one is required.
Fork the repo and add a geocoder.
Add an example to the Makefile. Add tests if there's new shared functionality.
Common arguments and options
Each geocoder needs to know where to find the data it's working with. These are the first two arguments:
database: a path to a SQLite file, which must already exist
table: the name of a table, in that database, which exists and has data to geocode
From there, we have a set of options passed to every geocoder:
location: a string format that will be expanded with each row to build a full query, to be geocoded
delay: a delay between each call (some services require this)
latitude: latitude column name
longitude: longitude column name
geojson: store results as GeoJSON, instead of in latitude and longitude columns
spatialite: store results in a SpatiaLite geometry column, instead of in latitude and longitude columns
raw: store raw geocoding results in a JSON column
Each geocoder takes additional, specific arguments beyond these, such as API keys. Again, geopy's documentation is an excellent resource.
",1,public,0,,0,0
303218369,MDEwOlJlcG9zaXRvcnkzMDMyMTgzNjk=,evernote-to-sqlite,dogsheep/evernote-to-sqlite,0,53015001,https://github.com/dogsheep/evernote-to-sqlite,Tools for converting Evernote content to SQLite,0,2020-10-11T21:45:49Z,2021-08-26T19:01:54Z,2021-08-26T19:02:47Z,,51,24,24,Python,1,1,1,1,0,4,0,0,3,apache-2.0,"[""datasette-io"", ""datasette-tool"", ""dogsheep"", ""evernote"", ""sqlite""]",4,3,24,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,53015001,4,4,"# evernote-to-sqlite
[](https://pypi.org/project/evernote-to-sqlite/)
[](https://github.com/dogsheep/evernote-to-sqlite/releases)
[](https://github.com/dogsheep/evernote-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/dogsheep/evernote-to-sqlite/blob/master/LICENSE)
Tools for converting Evernote content to SQLite. See [Building an Evernote to SQLite exporter](https://simonwillison.net/2020/Oct/16/building-evernote-sqlite-exporter/) for background on this project.
## Installation
Install this tool using `pip`:
$ pip install evernote-to-sqlite
## Usage
Currently the only available command is `evernote-to-sqlite enex`, which converts Evernote's ENEX export files into a SQLite database.
You can create [an ENEX export](https://help.evernote.com/hc/en-us/articles/209005557-Export-notes-and-notebooks-as-ENEX-or-HTML) in the Evernote desktop application by selecting some notes (or all of your notes) and using the `File -> Export Notes...` menu option.
This used to be able to export everything in one go, but it looks like more recent Evernote versions only allow exporting up to fifty notes at a time, or let you export an entire notebook by right-clicking on the notebook and selecting ""Export notebook..."".
You can convert that file to SQLite like so:
$ evernote-to-sqlite enex evernote.db MyNotes.enex
This will display a progress bar and create a SQLite database file called `evernote.db`.
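If you have [Datasette](https://datasette.io/) installed, you can then browse the resulting database like so:
$ datasette evernote.db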
### Limitations
Unfortunately the ENEX export format does not include a unique identifier for each note. This means you cannot use this tool to re-import notes after they have been updated - you should consider this tool to be a one-time transformation of an ENEX file into an equivalent SQLite database.
ENEX exports also do not include details of which notebook a note belongs to.
## Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd evernote-to-sqlite
python -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
",,,,,,
305199661,MDEwOlJlcG9zaXRvcnkzMDUxOTk2NjE=,sphinx-to-sqlite,simonw/sphinx-to-sqlite,0,9599,https://github.com/simonw/sphinx-to-sqlite,Create a SQLite database from Sphinx documentation,0,2020-10-18T21:26:55Z,2020-12-19T05:08:12Z,2020-10-22T04:55:45Z,,9,2,2,Python,1,1,1,1,0,0,0,0,2,apache-2.0,"[""sqlite"", ""sphinx"", ""datasette-io"", ""datasette-tool""]",0,2,2,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,2,"# sphinx-to-sqlite
[](https://pypi.org/project/sphinx-to-sqlite/)
[](https://github.com/simonw/sphinx-to-sqlite/releases)
[](https://github.com/simonw/sphinx-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/sphinx-to-sqlite/blob/master/LICENSE)
Create a SQLite database from Sphinx documentation.
## Demo
You can see the results of running this tool against the [Datasette documentation](https://docs.datasette.io/) at https://latest-docs.datasette.io/docs/sections
## Installation
Install this tool using `pip`:
$ pip install sphinx-to-sqlite
## Usage
First run `sphinx-build` with the `-b xml` option to create XML files in your `_build/` directory.
Then run:
$ sphinx-to-sqlite docs.db path/to/_build
To build the SQLite database.
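For example, assuming your Sphinx source files live in `docs/`, the full sequence might look like this:
$ sphinx-build -b xml docs/ docs/_build/xml
$ sphinx-to-sqlite docs.db docs/_build/xml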
## Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd sphinx-to-sqlite
python -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
",,,,,,
315796015,MDEwOlJlcG9zaXRvcnkzMTU3OTYwMTU=,datasette-ripgrep,simonw/datasette-ripgrep,0,9599,https://github.com/simonw/datasette-ripgrep,"Web interface for searching your code using ripgrep, built as a Datasette plugin",0,2020-11-25T01:26:36Z,2022-04-24T03:48:42Z,2022-06-30T22:45:03Z,https://ripgrep.datasette.io,55,58,58,Python,1,1,1,1,0,1,0,0,6,apache-2.0,"[""codesearch"", ""datasette"", ""datasette-io"", ""datasette-plugin"", ""ripgrep""]",1,6,58,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,3,"# datasette-ripgrep
[](https://pypi.org/project/datasette-ripgrep/)
[](https://github.com/simonw/datasette-ripgrep/releases)
[](https://github.com/simonw/datasette-ripgrep/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-ripgrep/blob/main/LICENSE)
Web interface for searching your code using [ripgrep](https://github.com/BurntSushi/ripgrep), built as a [Datasette](https://datasette.io/) plugin
For background on this project see [datasette-ripgrep: deploy a regular expression search engine for your source code](https://simonwillison.net/2020/Nov/28/datasette-ripgrep/).
## Demo
Try this plugin out at https://ripgrep.datasette.io/-/ripgrep - where you can run regular expression searches across the source code of Datasette and all of the `datasette-*` plugins belonging to the [simonw GitHub user](https://github.com/simonw).
Some example searches:
- [with.\*AsyncClient](https://ripgrep.datasette.io/-/ripgrep?pattern=with.*AsyncClient) - regular expression search for `with.*AsyncClient`
- [.plugin_config, literal=on](https://ripgrep.datasette.io/-/ripgrep?pattern=.plugin_config\(&literal=on) - a non-regular expression search for `.plugin_config(`
- [with.\*AsyncClient glob=datasette/\*\*](https://ripgrep.datasette.io/-/ripgrep?pattern=with.*AsyncClient&glob=datasette%2F%2A%2A) - search for that pattern only within the `datasette/` top folder
- [""sqlite-utils\["">\] glob=setup.py](https://ripgrep.datasette.io/-/ripgrep?pattern=%22sqlite-utils%5B%22%3E%5D&glob=setup.py) - a regular expression search for packages that depend on either `sqlite-utils` or `sqlite-utils>=some-version`
- [test glob=!\*.html](https://ripgrep.datasette.io/-/ripgrep?pattern=test&glob=%21*.html) - search for the string `test` but exclude results in HTML files
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-ripgrep
The `rg` executable needs to be [installed](https://github.com/BurntSushi/ripgrep/blob/master/README.md#installation) such that it can be run by this tool.
## Usage
This plugin requires configuration: it needs a `path` setting so that it knows where to run searches.
Create a `metadata.json` file that looks like this:
```json
{
""plugins"": {
""datasette-ripgrep"": {
""path"": ""/path/to/your/files""
}
}
}
```
Now run Datasette using `datasette -m metadata.json`. The plugin will add an interface at `/-/ripgrep` for running searches.
## Plugin configuration
The `""path""` configuration is required. Optional extra configuration options are:
- `time_limit` - floating point number. The `rg` process will be terminated if it takes longer than this limit. The default is one second, `1.0`.
- `max_lines` - integer. The `rg` process will be terminated if it returns more than this number of lines. The default is `2000`.
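For example, this `metadata.json` allows longer and larger searches against the same path (the values shown are illustrative):
```json
{
    ""plugins"": {
        ""datasette-ripgrep"": {
            ""path"": ""/path/to/your/files"",
            ""time_limit"": 3.0,
            ""max_lines"": 5000
        }
    }
}
```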
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-ripgrep
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
",1,public,0,,0,
330304628,MDEwOlJlcG9zaXRvcnkzMzAzMDQ2Mjg=,datasette-query-history,bretwalker/datasette-query-history,0,181698,https://github.com/bretwalker/datasette-query-history,,0,2021-01-17T03:13:34Z,2021-04-18T03:06:36Z,2021-01-17T23:11:37Z,,75,3,3,JavaScript,1,1,1,1,0,0,0,0,0,apache-2.0,[],0,0,3,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,0,1,"# datasette-query-history
[](https://pypi.org/project/datasette-query-history/)
[](https://github.com/bretwalker/datasette-query-history/releases)
[](https://github.com/bretwalker/datasette-query-history/actions?query=workflow%3ATest)
[](https://github.com/bretwalker/datasette-query-history/blob/main/LICENSE)
Datasette plugin that keeps a list of the queries you've run and lets you rerun them.
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-query-history
## Usage
Click the `Query History` button on the SQL editor page to see previous queries.
Click the ⬆︎ button to replace the current query with a previous query.
Click the `Clear Query History` button to clear the list of previous queries.
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-query-history
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
Download map tiles and store them in an MBTiles database
Installation
Install this tool using pip:
$ pip install download-tiles
Usage
This tool downloads tiles from a specified TMS (Tile Map Server) server for a specified bounding box and range of zoom levels and stores those tiles in an MBTiles SQLite database. It is a command-line wrapper around the Landez Python library.
Please use this tool responsibly. Consult the usage policies of the tile servers you are interacting with, for example the OpenStreetMap Tile Usage Policy.
Running the following will download zoom levels 0-3 of OpenStreetMap, 85 tiles total, and store them in a SQLite database called world.mbtiles:
download-tiles world.mbtiles
You can customize which tile and zoom levels are downloaded using command options:
--zoom-levels=0-3 or -z=0-3
The different zoom levels to download. Specify a single number, e.g. 15, or a range of numbers e.g. 0-4. Be careful with this setting as you can easily go over the limits requested by the underlying tile server.
--bbox=3.9,-6.3,14.5,10.2 or -b=3.9,-6.3,14.5,10.2
The bounding box to fetch. Should be specified as min-lon,min-lat,max-lon,max-lat. You can use bboxfinder.com to find these for different areas.
--city=london or --country=madagascar
These options can be used instead of --bbox. The city or country specified will be looked up using the Nominatim API and used to derive a bounding box.
--show-bbox
Use this option to output the bounding box that was retrieved for the --city or --country without downloading any tiles.
--name=Name
A name for this tile collection, used for the name field in the metadata table. If not specified a UUID will be used, or if you used --city or --country the name will be set to the full name of that place.
The tile server URL to use. This should include {z} and {x} and {y} specifiers, and can optionally include {s} for subdomains.
The default URL used here is for OpenStreetMap, http://{s}.tile.openstreetmap.org/{z}/{x}/{y}.png
--tiles-subdomains=a,b,c
A comma-separated list of subdomains to use for the {s} parameter.
--verbose
Use this option to turn on verbose logging.
--cache-dir=/tmp/tiles
Provide a directory to cache downloaded tiles between runs. This can be useful if you are worried you might not have used the correct options for the bounding box or zoom levels.
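For example, the following command (with an illustrative bounding box) would fetch zoom levels 0-10 for central London, cache the downloaded tiles between runs and name the collection:
download-tiles london.mbtiles --bbox=-0.6,51.3,0.4,51.7 --zoom-levels=0-10 --cache-dir=/tmp/tiles --name=London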
Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd download-tiles
python -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",,,,,,
346597557,MDEwOlJlcG9zaXRvcnkzNDY1OTc1NTc=,tableau-to-sqlite,simonw/tableau-to-sqlite,0,9599,https://github.com/simonw/tableau-to-sqlite,Fetch data from Tableau into a SQLite database,0,2021-03-11T06:12:02Z,2021-06-10T04:40:44Z,2021-04-29T16:11:03Z,,212,8,8,Python,1,1,1,1,0,2,0,0,2,apache-2.0,"[""datasette-io"", ""datasette-tool""]",2,2,8,main,"{""admin"": false, ""push"": false, ""pull"": false}",,,2,1,"# tableau-to-sqlite
[](https://pypi.org/project/tableau-to-sqlite/)
[](https://github.com/simonw/tableau-to-sqlite/releases)
[](https://github.com/simonw/tableau-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/tableau-to-sqlite/blob/master/LICENSE)
Fetch data from Tableau into a SQLite database. A wrapper around [TableauScraper](https://github.com/bertrandmartel/tableau-scraping/).
## Installation
Install this tool using `pip`:
$ pip install tableau-to-sqlite
## Usage
If you have the URL to a Tableau dashboard like this:
https://results.mo.gov/t/COVID19/views/VaccinationsDashboard/Vaccinations
You can pass that directly to the tool:
tableau-to-sqlite tableau.db \
https://results.mo.gov/t/COVID19/views/VaccinationsDashboard/Vaccinations
This will create a SQLite database called `tableau.db` containing one table for each of the worksheets in that dashboard.
If the dashboard is hosted on https://public.tableau.com/ you can instead provide the view name. This will be two strings separated by a `/` symbol - something like this:
OregonCOVID-19VaccineProviderEnrollment/COVID-19VaccineProviderEnrollment
Now run the tool like this:
tableau-to-sqlite tableau.db \
OregonCOVID-19VaccineProviderEnrollment/COVID-19VaccineProviderEnrollment
## Get the data as JSON or CSV
If you're building a [git scraper](https://simonwillison.net/2020/Oct/9/git-scraping/) you may want to convert the data gathered by this tool to CSV or JSON to check into your repository.
You can do that using [sqlite-utils](https://sqlite-utils.datasette.io/). Install it using `pip`:
pip install sqlite-utils
You can dump out a table as JSON like so:
sqlite-utils rows tableau.db \
'Admin Site and County Map Site No Info' > tableau.json
Or as CSV like this:
sqlite-utils rows tableau.db --csv \
'Admin Site and County Map Site No Info' > tableau.csv
## Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd tableau-to-sqlite
python -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
",,,,,,
347263722,MDEwOlJlcG9zaXRvcnkzNDcyNjM3MjI=,django-sql-dashboard,simonw/django-sql-dashboard,0,9599,https://github.com/simonw/django-sql-dashboard,Django app for building dashboards using raw SQL queries,0,2021-03-13T03:38:23Z,2022-04-19T01:13:12Z,2022-04-20T00:27:39Z,https://django-sql-dashboard.datasette.io/,513,335,335,Python,1,1,1,1,0,28,0,0,25,apache-2.0,"[""dashboards"", ""datasette-io"", ""datasette-tool"", ""django"", ""sql""]",28,25,335,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,28,9,"# django-sql-dashboard
[](https://pypi.org/project/django-sql-dashboard/)
[](https://github.com/simonw/django-sql-dashboard/releases)
[](https://github.com/simonw/django-sql-dashboard/actions?query=workflow%3ATest)
[](http://django-sql-dashboard.datasette.io/en/latest/?badge=latest)
[](https://github.com/simonw/django-sql-dashboard/blob/main/LICENSE)
Django SQL Dashboard provides an authenticated interface for executing read-only SQL queries directly against your PostgreSQL database, bringing a useful subset of [Datasette](https://datasette.io/) to Django.
Applications include ad-hoc analysis and debugging, plus the creation of reporting dashboards that can be shared with team members or published online.
See my blog for [more about this project](https://simonwillison.net/2021/May/10/django-sql-dashboard/), including [a video demo](https://www.youtube.com/watch?v=ausrmMZkPEY).
Features include:
- Safely run one or more read-only SQL queries against your database and view the results in your browser
- Bookmark queries and share those links with other members of your team
- Create [saved dashboards](https://django-sql-dashboard.datasette.io/en/latest/saved-dashboards.html) from your queries, with full control over who can view and edit them
- [Named parameters](https://django-sql-dashboard.datasette.io/en/latest/sql.html#sql-parameters) such as `select * from entries where id = %(id)s` will be turned into form fields, allowing quick creation of interactive dashboards
- Produce [bar charts](https://django-sql-dashboard.datasette.io/en/latest/widgets.html#bar-label-bar-quantity), [progress bars](https://django-sql-dashboard.datasette.io/en/latest/widgets.html#total-count-completed-count) and more from SQL queries, with the ability to easily create new [custom dashboard widgets](https://django-sql-dashboard.datasette.io/en/latest/widgets.html#custom-widgets) using the Django template system
- Write SQL queries that safely construct and render [markdown](https://django-sql-dashboard.datasette.io/en/latest/widgets.html#markdown) and [HTML](https://django-sql-dashboard.datasette.io/en/latest/widgets.html#html)
- Export the full results of a SQL query as a downloadable CSV or TSV file, using a combination of Django's [streaming HTTP response](https://docs.djangoproject.com/en/3.2/ref/request-response/#django.http.StreamingHttpResponse) mechanism and PostgreSQL [server-side cursors](https://www.psycopg.org/docs/usage.html#server-side-cursors) to efficiently stream large amounts of data without running out of resources
- Copy and paste the results of SQL queries directly into tools such as Google Sheets or Excel
- Uses Django's authentication system, so dashboard accounts can be granted using Django's Admin tools
## Documentation
Full documentation is at [django-sql-dashboard.datasette.io](https://django-sql-dashboard.datasette.io/)
## Screenshot
## Alternatives
- [django-sql-explorer](https://github.com/groveco/django-sql-explorer) provides a related set of functionality that also works against database backends other than PostgreSQL
","
",1,public,0,,,
361014273,MDEwOlJlcG9zaXRvcnkzNjEwMTQyNzM=,datasette-dashboards,rclement/datasette-dashboards,0,1238873,https://github.com/rclement/datasette-dashboards,Datasette plugin providing data dashboards from metadata,0,2021-04-23T21:56:48Z,2022-09-21T13:03:39Z,2022-10-07T07:18:03Z,https://datasette-dashboards-demo.vercel.app,1746,74,74,Python,1,1,1,1,0,3,0,0,3,apache-2.0,"[""dashboards"", ""data-visualization"", ""datasette"", ""datasette-io"", ""datasette-plugin"", ""sql"", ""vega-lite""]",3,3,74,master,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,3,1,"# datasette-dashboards
> Datasette plugin providing data dashboards from metadata
[](https://pypi.org/project/datasette-dashboards/)
[](https://github.com/rclement/datasette-dashboards/actions/workflows/ci-cd.yml)
[](https://codecov.io/gh/rclement/datasette-dashboards)
[](https://github.com/rclement/datasette-dashboards/blob/master/LICENSE)
Try out a live demo at [https://datasette-dashboards-demo.vercel.app](https://datasette-dashboards-demo.vercel.app/-/dashboards)
**WARNING**: this plugin is still experimental and not ready for production.
Some breaking changes might happen between releases before reaching a stable version.
Use it at your own risk!

## Installation
Install this plugin in the same environment as Datasette:
```bash
$ datasette install datasette-dashboards
```
## Usage
Define dashboards within `metadata.yml` / `metadata.json`:
```yaml
plugins:
datasette-dashboards:
my-dashboard:
title: My Dashboard
description: Showing some nice metrics
layout:
- [analysis-note, events-count]
- [analysis-note, events-source]
filters:
date_start:
name: Date Start
type: date
default: ""2021-01-01""
date_end:
name: Date End
type: date
charts:
analysis-note:
library: markdown
display: |-
# Analysis notes
> A quick rundown of events statistics and KPIs
events-count:
title: Total number of events
db: jobs
query: SELECT count(*) as count FROM events
library: metric
display:
field: count
prefix:
suffix:
events-source:
title: Number of events by source
db: jobs
query: SELECT source, count(*) as count FROM events WHERE TRUE [[ AND date >= date(:date_start) ]] [[ AND date <= date(:date_end) ]] GROUP BY source ORDER BY count DESC
library: vega
display:
mark: { type: bar, tooltip: true }
encoding:
color: { field: source, type: nominal }
theta: { field: count, type: quantitative }
```
A new menu entry is now available, pointing at `/-/dashboards` to access all defined dashboards.
### Properties
Dashboard properties:
| Property | Type | Description |
| ------------- | -------- | --------------------- |
| `title` | `string` | Dashboard title |
| `description` | `string` | Dashboard description |
| `layout` | `array` | Dashboard layout |
| `filters` | `object` | Dashboard filters |
Dashboard filters:
| Property | Type | Description |
| --------- | ------------------ | -------------------------------------- |
| `name` | `string` | Filter display name |
| `type` | `string` | Filter type (`text`, `date`, `number`) |
| `default` | `string`, `number` | (optional) Filter default value |
| `min` | `number` | (optional) Filter minimum value |
| `max` | `number` | (optional) Filter maximum value |
| `step` | `number` | (optional) Filter stepping value |
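For example, a numeric filter with bounds and a default value could be defined like this (the names and values are illustrative):
```yaml
filters:
  min_count:
    name: Minimum Count
    type: number
    default: 10
    min: 0
    max: 500
    step: 10
```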
Common chart properties for all chart types:
| Property | Type | Description |
| --------- | -------- | -------------------------------------------------------- |
| `title` | `string` | Chart title |
| `db` | `string` | Database name against which to run the query |
| `query` | `string` | SQL query to run and extract data from |
| `library` | `string` | One of supported libraries: `vega`, `markdown`, `metric` |
| `display` | `object` | Chart display specification (depends on the library used) |
To define SQL queries using dashboard filters:
```sql
SELECT * FROM mytable [[ WHERE col >= :my_filter ]]
```
```sql
SELECT * FROM mytable WHERE TRUE [[ AND col1 = :my_filter_1 ]] [[ AND col2 = :my_filter_2 ]]
```
#### Vega properties
Available configuration for `vega` charts:
| Property | Type | Description |
| --------- | -------- | ------------------------- |
| `library` | `string` | Must be set to `vega` |
| `display` | `object` | Vega specification object |
Notes about the `display` property:
- Requires a valid [Vega specification object](https://vega.github.io/vega-lite/docs/)
- Some fields are pre-defined: `$schema`, `title`, `width`, `view`, `config`, `data`
- All fields are passed along as-is (overriding pre-defined fields if any)
- Only `mark` and `encoding` fields are required as the bare-minimum
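For example, a minimal chart supplying only `mark` and `encoding` in its `display` could look like this (the query and field names are illustrative):
```yaml
events-by-day:
  title: Number of events by day
  db: jobs
  query: SELECT date as day, count(*) as count FROM events GROUP BY day
  library: vega
  display:
    mark: { type: line, tooltip: true }
    encoding:
      x: { field: day, type: temporal }
      y: { field: count, type: quantitative }
```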
#### Markdown properties
Available configuration for `markdown` chart:
| Property | Type | Description |
| --------- | -------- | ------------------------------------------------- |
| `library` | `string` | Must be set to `markdown` |
| `display` | `string` | Multi-line string containing the Markdown content |
Note:
- Some common properties do not apply and can be omitted: `title`, `db`, `query`
- Markdown rendering is done by [`datasette-render-markdown`](https://datasette.io/plugins/datasette-render-markdown)
- To configure Markdown rendering, extensions can be enabled in [metadata](https://datasette.io/plugins/datasette-render-markdown#user-content-markdown-extensions)
#### Metric properties
Available configuration for `metric` chart:
| Property | Type | Description |
| ---------------- | -------- | ----------------------------------------- |
| `library` | `string` | Must be set to `metric` |
| `display.field` | `string` | Numerical field to be displayed as metric |
| `display.prefix` | `string` | Prefix to be displayed before metric |
| `display.suffix` | `string` | Suffix to be displayed after metric |
Note:
- The `display.field` must reference a single numerical value from the SQL query
(e.g. numerical `number` field in `SELECT count(*) as number FROM events`)
### Dashboard layout
The default dashboard layout will present two charts per row (one per row on mobile).
To make use of custom dashboard layout using [CSS Grid Layout](https://developer.mozilla.org/en-US/docs/Web/CSS/CSS_Grid_Layout),
define the `layout` array property as a grid / matrix:
- Each entry represents a row of charts
- Each column refers to a chart by its property name
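For example, this layout (reusing the chart names from the example above) places the two event charts side by side with the analysis note filling the row beneath them; repeating a chart name across adjacent cells should make it span them, mirroring how `analysis-note` appears in both rows of the main example:
```yaml
layout:
  - [events-count, events-source]
  - [analysis-note, analysis-note]
```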
## Development
To set up this plugin locally, first check out the code.
Then create a new virtual environment and install the required dependencies:
```bash
pipenv install -d
pipenv shell
```
To run the tests:
```bash
pytest
```
## Demo
With the development environment set up, you can run the demo locally:
```bash
datasette --metadata demo/metadata.yml demo/jobs.db
```
## License
Licensed under Apache License, Version 2.0
Copyright (c) 2021 - present Romain Clement
","
",1,public,0,,0,
390535500,MDEwOlJlcG9zaXRvcnkzOTA1MzU1MDA=,datasette-remote-metadata,simonw/datasette-remote-metadata,0,9599,https://github.com/simonw/datasette-remote-metadata,Periodically refresh Datasette metadata from a remote URL,0,2021-07-28T23:17:19Z,2021-12-13T19:40:51Z,2021-12-13T19:40:48Z,,8,3,3,Python,1,1,1,1,0,0,0,0,0,apache-2.0,[],0,0,3,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-remote-metadata
[](https://pypi.org/project/datasette-remote-metadata/)
[](https://github.com/simonw/datasette-remote-metadata/releases)
[](https://github.com/simonw/datasette-remote-metadata/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-remote-metadata/blob/main/LICENSE)
Periodically refresh Datasette metadata from a remote URL
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-remote-metadata
## Usage
Add the following to your `metadata.json`:
```json
{
""plugins"": {
""datasette-remote-metadata"": {
""url"": ""https://example.com/remote-metadata.yml""
}
}
}
```
The plugin will fetch the specified metadata from that URL at startup and combine it with any existing metadata. You can use a URL to either a JSON file or a YAML file.
It will periodically refresh that metadata - by default every 30 seconds, unless you specify an alternative `""ttl""` value in the plugin configuration.
## Configuration
Available configuration options are as follows:
- `""url""` - the URL to retrieve remote metadata from. Can link to a JSON or a YAML file.
- `""ttl""` - integer value in secords: how frequently should the script check for fresh metadata. Defaults to 30 seconds.
- `""headers""` - a dictionary of additional request headers to send.
- `""cachebust""` - if true, a random `?0.29508` value will be added to the query string of the remote metadata to bust any intermediary caches.
This example `metadata.json` configuration refreshes every 10 seconds, uses cache busting and sends an `Authorization: Bearer xyz` header with the request:
```json
{
""plugins"": {
""datasette-remote-metadata"": {
""url"": ""https://example.com/remote-metadata.yml"",
""ttl"": 10,
""cachebust"": true,
""headers"": {
""Authorization"": ""Bearer xyz""
}
}
}
}
```
Here is the equivalent example if you are using `metadata.yaml` for configuration:
```yaml
plugins:
datasette-remote-metadata:
url: https://example.com/remote-metadata.yml
ttl: 10
cachebust: true
headers:
Authorization: Bearer xyz
```
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-remote-metadata
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
",1,public,0,,,
423984522,R_kgDOGUV9ig,s3-credentials,simonw/s3-credentials,0,9599,https://github.com/simonw/s3-credentials,A tool for creating credentials for accessing S3 buckets,0,2021-11-02T20:09:50Z,2022-09-05T15:12:46Z,2022-09-15T23:43:10Z,https://s3-credentials.readthedocs.io,204,129,129,Python,1,1,1,1,0,10,0,0,18,apache-2.0,"[""aws"", ""boto3"", ""s3""]",10,18,129,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,10,2,"# s3-credentials
[](https://pypi.org/project/s3-credentials/)
[](https://github.com/simonw/s3-credentials/releases)
[](https://github.com/simonw/s3-credentials/actions?query=workflow%3ATest)
[](https://s3-credentials.readthedocs.org/)
[](https://github.com/simonw/s3-credentials/blob/master/LICENSE)
A tool for creating credentials for accessing S3 buckets
For project background, see [s3-credentials: a tool for creating credentials for S3 buckets](https://simonwillison.net/2021/Nov/3/s3-credentials/) on my blog.
## Installation
pip install s3-credentials
## Basic usage
To create a new S3 bucket and output credentials that can be used with only that bucket:
```
% s3-credentials create my-new-s3-bucket --create-bucket
Created bucket: my-new-s3-bucket
Created user: s3.read-write.my-new-s3-bucket with permissions boundary: arn:aws:iam::aws:policy/AmazonS3FullAccess
Attached policy s3.read-write.my-new-s3-bucket to user s3.read-write.my-new-s3-bucket
Created access key for user: s3.read-write.my-new-s3-bucket
{
""UserName"": ""s3.read-write.my-new-s3-bucket"",
""AccessKeyId"": ""AKIAWXFXAIOZOYLZAEW5"",
""Status"": ""Active"",
""SecretAccessKey"": ""..."",
""CreateDate"": ""2021-11-03 01:38:24+00:00""
}
```
The tool can do a lot more than this. See the [documentation](https://s3-credentials.readthedocs.io/) for details.
## Documentation
- [Full documentation](https://s3-credentials.readthedocs.io/)
- [Command help reference](https://s3-credentials.readthedocs.io/en/stable/help.html)
- [Release notes](https://github.com/simonw/s3-credentials/releases)
","
",1,public,0,,0,
427128866,R_kgDOGXV4Ig,git-history,simonw/git-history,0,9599,https://github.com/simonw/git-history,Tools for analyzing Git history using SQLite,0,2021-11-11T20:07:06Z,2022-10-20T20:33:28Z,2022-10-21T23:06:33Z,,117,114,114,Python,1,1,1,1,0,11,0,0,20,apache-2.0,"[""git"", ""sqlite""]",11,20,114,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,11,3,"# git-history
[](https://pypi.org/project/git-history/)
[](https://github.com/simonw/git-history/releases)
[](https://github.com/simonw/git-history/actions?query=workflow%3ATest)
[](https://github.com/simonw/git-history/blob/master/LICENSE)
Tools for analyzing Git history using SQLite
For background on this project see [git-history: a tool for analyzing scraped data collected using Git and SQLite](https://simonwillison.net/2021/Dec/7/git-history/).
[Measuring traffic during the Half Moon Bay Pumpkin Festival](https://simonwillison.net/2022/Oct/19/measuring-traffic/) describes a project using this tool in detail.
## Installation
Install this tool using `pip`:
$ pip install git-history
## Demos
[git-history-demos.datasette.io](http://git-history-demos.datasette.io/) hosts three example databases created using this tool:
- [pge-outages](https://git-history-demos.datasette.io/pge-outages) shows a history of PG&E (the electricity supplier) [outages](https://pgealerts.alerts.pge.com/outagecenter/), using data collected in [simonw/pge-outages](https://github.com/simonw/pge-outages) converted using [pge-outages.sh](https://github.com/simonw/git-history/blob/main/demos/pge-outages.sh)
- [ca-fires](https://git-history-demos.datasette.io/ca-fires) shows a history of fires in California reported on [fire.ca.gov/incidents](https://www.fire.ca.gov/incidents/), from data in [simonw/ca-fires-history](https://github.com/simonw/ca-fires-history) converted using [ca-fires.sh](https://github.com/simonw/git-history/blob/main/demos/ca-fires.sh)
- [sf-bay-511](https://git-history-demos.datasette.io/sf-bay-511) has records of San Francisco Bay Area traffic and transit incident data from [511.org](https://511.org/), collected in [dbreunig/511-events-history](https://github.com/dbreunig/511-events-history) converted using [sf-bay-511.sh](https://github.com/simonw/git-history/blob/main/demos/sf-bay-511.sh)
The demos are deployed using [Datasette](https://datasette.io/) on [Google Cloud Run](https://cloud.google.com/run/) by [this GitHub Actions workflow](https://github.com/simonw/git-history/blob/main/.github/workflows/deploy-demos.yml).
## Usage
This tool can be run against a Git repository that holds a file that contains JSON, CSV/TSV or some other format and which has multiple versions tracked in the Git history. Read [Git scraping: track changes over time by scraping to a Git repository](https://simonwillison.net/2020/Oct/9/git-scraping/) to understand how you might create such a repository.
The `file` command analyzes the history of an individual file within the repository, and generates a SQLite database table that represents the different versions of that file over time.
The file is assumed to contain multiple objects - for example, the results of scraping an electricity outage map or a CSV file full of records.
Assume you have a file called `incidents.json` that is a JSON array of objects, with multiple versions of that file recorded in a repository. Each version of that file might look something like this:
```json
[
{
""IncidentID"": ""abc123"",
""Location"": ""Corner of 4th and Vermont"",
""Type"": ""fire""
},
{
""IncidentID"": ""cde448"",
""Location"": ""555 West Example Drive"",
""Type"": ""medical""
}
]
```
Change directory into the GitHub repository in question and run the following:
git-history file incidents.db incidents.json
This will create a new SQLite database in the `incidents.db` file with three tables:
- `commits` containing a row for every commit, with a `hash` column, the `commit_at` date and a foreign key to a `namespace`.
- `item` containing a row for every item in every version of the `incidents.json` file - with an extra `_commit` column that is a foreign key back to the `commits` table.
- `namespaces` containing a single row. This allows you to build multiple tables for different files, using the `--namespace` option described below.
The database schema for this example will look like this:
```sql
CREATE TABLE [namespaces] (
[id] INTEGER PRIMARY KEY,
[name] TEXT
);
CREATE UNIQUE INDEX [idx_namespaces_name]
ON [namespaces] ([name]);
CREATE TABLE [commits] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[hash] TEXT,
[commit_at] TEXT
);
CREATE UNIQUE INDEX [idx_commits_namespace_hash]
ON [commits] ([namespace], [hash]);
CREATE TABLE [item] (
[IncidentID] TEXT,
[Location] TEXT,
[Type] TEXT
);
```
If you have 10 historic versions of the `incidents.json` file and each one contains 30 incidents, you will end up with 10 * 30 = 300 rows in your `item` table.
### Track the history of individual items using IDs
If your objects have a unique identifier - or multiple columns that together form a unique identifier - you can use the `--id` option to de-duplicate and track changes to each of those items over time.
This provides a much more interesting way to apply this tool.
If there is a unique identifier column called `IncidentID` you could run the following:
git-history file incidents.db incidents.json --id IncidentID
The database schema used here is very different from the one used without the `--id` option.
If you have already imported history, the command will skip any commits that it has seen already and just process new ones. This means that even though an initial import could be slow subsequent imports should run a lot faster.
This command will create six tables - `commits`, `item`, `item_version`, `columns`, `item_changed` and `namespaces`.
Here's the full schema:
```sql
CREATE TABLE [namespaces] (
[id] INTEGER PRIMARY KEY,
[name] TEXT
);
CREATE UNIQUE INDEX [idx_namespaces_name]
ON [namespaces] ([name]);
CREATE TABLE [commits] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[hash] TEXT,
[commit_at] TEXT
);
CREATE UNIQUE INDEX [idx_commits_namespace_hash]
ON [commits] ([namespace], [hash]);
CREATE TABLE [item] (
[_id] INTEGER PRIMARY KEY,
[_item_id] TEXT
, [IncidentID] TEXT, [Location] TEXT, [Type] TEXT, [_commit] INTEGER);
CREATE UNIQUE INDEX [idx_item__item_id]
ON [item] ([_item_id]);
CREATE TABLE [item_version] (
[_id] INTEGER PRIMARY KEY,
[_item] INTEGER REFERENCES [item]([_id]),
[_version] INTEGER,
[_commit] INTEGER REFERENCES [commits]([id]),
[IncidentID] TEXT,
[Location] TEXT,
[Type] TEXT,
[_item_full_hash] TEXT
);
CREATE TABLE [columns] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[name] TEXT
);
CREATE UNIQUE INDEX [idx_columns_namespace_name]
ON [columns] ([namespace], [name]);
CREATE TABLE [item_changed] (
[item_version] INTEGER REFERENCES [item_version]([_id]),
[column] INTEGER REFERENCES [columns]([id]),
PRIMARY KEY ([item_version], [column])
);
CREATE VIEW item_version_detail AS select
commits.commit_at as _commit_at,
commits.hash as _commit_hash,
item_version.*,
(
select json_group_array(name) from columns
where id in (
select column from item_changed
where item_version = item_version._id
)
) as _changed_columns
from item_version
join commits on commits.id = item_version._commit;
CREATE INDEX [idx_item_version__item]
ON [item_version] ([_item]);
```
#### item table
The `item` table will contain the most recent version of each row, de-duplicated by ID, plus the following additional columns:
- `_id` - a numeric integer primary key, used as a foreign key from the `item_version` table.
- `_item_id` - a hash of the values of the columns specified using the `--id` option to the command. This is used for de-duplication when processing new versions.
- `_commit` - a foreign key to the `commits` table, representing the most recent commit to modify this item.
#### item_version table
The `item_version` table will contain a row for each captured differing version of that item, plus the following columns:
- `_id` - a numeric ID for the item version record.
- `_item` - a foreign key to the `item` table.
- `_version` - the numeric version number, starting at 1 and incrementing for each captured version.
- `_commit` - a foreign key to the `commits` table.
- `_item_full_hash` - a hash of this version of the item. This is used internally by the tool to identify items that have changed between commits.
The other columns in this table represent columns in the original data that have changed since the previous version. If the value has not changed, it will be represented by a `null`.
If a value was previously set but has been changed back to `null` it will still be represented as `null` in the `item_version` row. You can identify these using the `item_changed` many-to-many table described below.
You can use the `--full-versions` option to store full copies of the item at each version, rather than just storing the columns that have changed.
#### item_version_detail view
This SQL view joins `item_version` against `commits` to add three further columns: `_commit_at` with the date of the commit, `_commit_hash` with the Git commit hash, and `_changed_columns` with a JSON array of the names of the columns that changed in that version.
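For example, you could list every captured change to a single incident with a query like this (a sketch against the schema above):
```sql
select
  _commit_at,
  _version,
  Location,
  Type,
  _changed_columns
from item_version_detail
where _item = (
  select _id from item where IncidentID = 'abc123'
)
order by _version desc
```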
#### item_changed
This many-to-many table indicates exactly which columns were changed in an `item_version`.
- `item_version` is a foreign key to a row in the `item_version` table.
- `column` is a foreign key to a row in the `columns` table.
This table will have the largest number of rows, which is why it stores just two integers in order to save space.
#### columns
The `columns` table stores column names. It is referenced by `item_changed`.
- `id` - an integer ID.
- `name` - the name of the column.
- `namespace` - a foreign key to `namespaces`, for when multiple file histories share the same database.
#### Reserved column names
Note that `_id`, `_item_full_hash`, `_item`, `_item_id`, `_version`, `_commit`, `_commit_at`, `_commit_hash`, `_changed_columns`, `rowid` are considered reserved column names for the purposes of this tool.
If your data contains any of these they will be renamed to add a trailing underscore, for example `_id_`, `_item_`, `_version_`, to avoid clashing with the reserved columns.
If you have a column with a name such as `_commit_` it will be renamed too, adding an additional trailing underscore, so `_commit_` becomes `_commit__` and `_commit__` becomes `_commit___`.
### Additional options
- `--repo DIRECTORY` - the path to the Git repository, if it is not the current working directory.
- `--branch TEXT` - the Git branch to analyze - defaults to `main`.
- `--id TEXT` - as described above: pass one or more columns that uniquely identify a record, so that changes to that record can be calculated over time.
- `--full-versions` - instead of recording just the columns that have changed in the `item_version` table record a full copy of each version of the item.
- `--ignore TEXT` - one or more columns to ignore - they will not be included in the resulting database.
- `--csv` - treat the data as CSV or TSV rather than JSON, and attempt to guess the correct dialect
- `--dialect` - use a specific CSV dialect. Options are `excel`, `excel-tab` and `unix` - see [the Python CSV documentation](https://docs.python.org/3/library/csv.html#csv.excel) for details.
- `--skip TEXT` - one or more full Git commit hashes that should be skipped. You can use this if some of the data in your revision history is corrupted in a way that prevents this tool from working.
- `--start-at TEXT` - skip commits prior to the specified commit hash.
- `--start-after TEXT` - skip commits up to and including the specified commit hash, then start processing from the following commit.
- `--convert TEXT` - custom Python code for a conversion, described below.
- `--import TEXT` - additional Python modules to import for `--convert`.
- `--ignore-duplicate-ids` - if a single version of a file has the same ID in it more than once, the tool will exit with an error. Use this option to ignore the error and instead pick just the first of the duplicates.
- `--namespace TEXT` - use this if you wish to include the history of multiple different files in the same database. The default is `item` but you can set it to something else, which will produce tables with names like `yournamespace` and `yournamespace_version`.
- `--wal` - Enable WAL mode on the created database file. Use this if you plan to run queries against the database while `git-history` is creating it.
- `--silent` - don't show the progress bar.
### CSV and TSV data
If the data in your repository is a CSV or TSV file you can process it by adding the `--csv` option. This will attempt to detect which delimiter is used by the file, so the same option works for both comma- and tab-separated values.
git-history file trees.db trees.csv --id TreeID
You can also specify the CSV dialect using the `--dialect` option.
### Custom conversions using --convert
If your data is not already either CSV/TSV or a flat JSON array, you can reshape it using the `--convert` option.
The format needed by this tool is an array of dictionaries, as demonstrated by the `incidents.json` example above.
If your data does not fit this shape, you can provide a snippet of Python code that converts the on-disk content of each stored file into a Python list of dictionaries.
For example, if your stored files each look like this:
```json
{
""incidents"": [
{
""id"": ""552"",
""name"": ""Hawthorne Fire"",
""engines"": 3
},
{
""id"": ""556"",
""name"": ""Merlin Fire"",
""engines"": 1
}
]
}
```
You could use the following Python snippet to convert them to the required format:
```python
json.loads(content)[""incidents""]
```
(The `json` module is exposed to your custom function by default.)
You would then run the tool like this:
git-history file database.db incidents.json \
--id id \
--convert 'json.loads(content)[""incidents""]'
The `content` variable is always a `bytes` object representing the content of the file at a specific moment in the repository's history.
You can import additional modules using `--import`. This example shows how you could read a CSV file that uses `;` as the delimiter:
git-history file trees.db ../sf-tree-history/Street_Tree_List.csv \
--repo ../sf-tree-history \
--import csv \
--import io \
--convert '
fp = io.StringIO(content.decode(""utf-8""))
return list(csv.DictReader(fp, delimiter="";""))
' \
--id TreeID
You can import nested modules such as [ElementTree](https://docs.python.org/3/library/xml.etree.elementtree.html) using `--import xml.etree.ElementTree`, then refer to them in your function body as `xml.etree.ElementTree`. For example, if your tracked data was in an `items.xml` file that looked like this:
```xml
<items>
  <item id=""1"" name=""Gin"" />
  <item id=""2"" name=""Tonic"" />
</items>
```
You could load it using the following `--convert` script:
```
git-history file items.xml --convert '
tree = xml.etree.ElementTree.fromstring(content)
return [el.attrib for el in tree.iter(""item"")]
' --import xml.etree.ElementTree --id id
```
If your Python code spans more than one line it needs to include a `return` statement.
You can also use Python generators in your `--convert` code, for example:
git-history file stats.db package-stats/stats.json \
--repo package-stats \
--convert '
data = json.loads(content)
for key, counts in data.items():
for date, count in counts.items():
yield {
""package"": key,
""date"": date,
""count"": count
}
' --id package --id date
This conversion function expects data that looks like this:
```json
{
""airtable-export"": {
""2021-05-18"": 66,
""2021-05-19"": 60,
""2021-05-20"": 87
}
}
```
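To see what that conversion yields, here is the same generator run as a standalone snippet against the sample document above. This is just an illustration; when used with `--convert` the `content` bytes and the surrounding function are supplied by the tool:
```python
import json

# The sample stats.json document shown above, as bytes
content = b'{""airtable-export"": {""2021-05-18"": 66, ""2021-05-19"": 60, ""2021-05-20"": 87}}'

def convert(content):
    data = json.loads(content)
    for key, counts in data.items():
        for date, count in counts.items():
            yield {
                ""package"": key,
                ""date"": date,
                ""count"": count
            }

for row in convert(content):
    print(row)
# {'package': 'airtable-export', 'date': '2021-05-18', 'count': 66}
# ...
```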
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd git-history
python -m venv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
To update the schema examples in this README file:
cog -r README.md
","
This tool can be run against a Git repository that holds a file that contains JSON, CSV/TSV or some other format and which has multiple versions tracked in the Git history. Read Git scraping: track changes over time by scraping to a Git repository to understand how you might create such a repository.
The file command analyzes the history of an individual file within the repository, and generates a SQLite database table that represents the different versions of that file over time.
The file is assumed to contain multiple objects - for example, the results of scraping an electricity outage map or a CSV file full of records.
Assuming you have a file called incidents.json that is a JSON array of objects, with multiple versions of that file recorded in a repository. Each version of that file might look something like this:
[
{
""IncidentID"": ""abc123"",
""Location"": ""Corner of 4th and Vermont"",
""Type"": ""fire""
},
{
""IncidentID"": ""cde448"",
""Location"": ""555 West Example Drive"",
""Type"": ""medical""
}
]
Change directory into the GitHub repository in question and run the following:
git-history file incidents.db incidents.json
This will create a new SQLite database in the incidents.db file with three tables:
commits containing a row for every commit, with a hash column, the commit_at date and a foreign key to a namespace.
item containing a row for every item in every version of the filename.json file - with an extra _commit column that is a foreign key back to the commit table.
namespaces containing a single row. This allows you to build multiple tables for different files, using the --namespace option described below.
The database schema for this example will look like this:
CREATE TABLE [namespaces] (
[id] INTEGER PRIMARY KEY,
[name] TEXT
);
CREATE UNIQUE INDEX [idx_namespaces_name]
ON [namespaces] ([name]);
CREATE TABLE [commits] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[hash] TEXT,
[commit_at] TEXT
);
CREATE UNIQUE INDEX [idx_commits_namespace_hash]
ON [commits] ([namespace], [hash]);
CREATE TABLE [item] (
[IncidentID] TEXT,
[Location] TEXT,
[Type] TEXT
);
If you have 10 historic versions of the incidents.json file and each one contains 30 incidents, you will end up with 10 * 30 = 300 rows in your item table.
Track the history of individual items using IDs
If your objects have a unique identifier - or multiple columns that together form a unique identifier - you can use the --id option to de-duplicate and track changes to each of those items over time.
This provides a much more interesting way to apply this tool.
If there is a unique identifier column called IncidentID you could run the following:
The database schema used here is very different from the one used without the --id option.
If you have already imported history, the command will skip any commits that it has seen already and just process new ones. This means that even though an initial import could be slow, subsequent imports should run a lot faster.
This command will create six tables - commits, item, item_version, columns, item_changed and namespaces.
Here's the full schema:
CREATE TABLE [namespaces] (
[id] INTEGER PRIMARY KEY,
[name] TEXT
);
CREATE UNIQUE INDEX [idx_namespaces_name]
ON [namespaces] ([name]);
CREATE TABLE [commits] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[hash] TEXT,
[commit_at] TEXT
);
CREATE UNIQUE INDEX [idx_commits_namespace_hash]
ON [commits] ([namespace], [hash]);
CREATE TABLE [item] (
[_id] INTEGER PRIMARY KEY,
[_item_id] TEXT
, [IncidentID] TEXT, [Location] TEXT, [Type] TEXT, [_commit] INTEGER);
CREATE UNIQUE INDEX [idx_item__item_id]
ON [item] ([_item_id]);
CREATE TABLE [item_version] (
[_id] INTEGER PRIMARY KEY,
[_item] INTEGER REFERENCES [item]([_id]),
[_version] INTEGER,
[_commit] INTEGER REFERENCES [commits]([id]),
[IncidentID] TEXT,
[Location] TEXT,
[Type] TEXT,
[_item_full_hash] TEXT
);
CREATE TABLE [columns] (
[id] INTEGER PRIMARY KEY,
[namespace] INTEGER REFERENCES [namespaces]([id]),
[name] TEXT
);
CREATE UNIQUE INDEX [idx_columns_namespace_name]
ON [columns] ([namespace], [name]);
CREATE TABLE [item_changed] (
[item_version] INTEGER REFERENCES [item_version]([_id]),
[column] INTEGER REFERENCES [columns]([id]),
PRIMARY KEY ([item_version], [column])
);
CREATE VIEW item_version_detail AS select commits.commit_at as _commit_at,
commits.hash as _commit_hash,
item_version.*,
(
select json_group_array(name) from columns
where id in (
select column from item_changed
where item_version = item_version._id
)
) as _changed_columns
from item_version
join commits on commits.id = item_version._commit;
CREATE INDEX [idx_item_version__item]
ON [item_version] ([_item]);
item table
The item table will contain the most recent version of each row, de-duplicated by ID, plus the following additional columns:
_id - a numeric integer primary key, used as a foreign key from the item_version table.
_item_id - a hash of the values of the columns specified using the --id option to the command. This is used for de-duplication when processing new versions.
_commit - a foreign key to the commit table, representing the most recent commit to modify this item.
item_version table
The item_version table will contain a row for each captured differing version of that item, plus the following columns:
_id - a numeric ID for the item version record.
_item - a foreign key to the item table.
_version - the numeric version number, starting at 1 and incrementing for each captured version.
_commit - a foreign key to the commit table.
_item_full_hash - a hash of this version of the item. This is used internally by the tool to identify items that have changed between commits.
The other columns in this table represent columns in the original data that have changed since the previous version. If the value has not changed, it will be represented by a null.
If a value was previously set but has been changed back to null it will still be represented as null in the item_version row. You can identify these using the item_changed many-to-many table described below.
You can use the --full-versions option to store full copies of the item at each version, rather than just storing the columns that have changed.
item_version_detail view
This SQL view joins item_version against commits to add three further columns: _commit_at with the date of the commit, _commit_hash with the Git commit hash, and _changed_columns with a JSON array of the names of the columns that changed in that version.
item_changed
This many-to-many table indicates exactly which columns were changed in an item_version.
item_version is a foreign key to a row in the item_version table.
column is a foreign key to a row in the columns table.
This table will have the largest number of rows, which is why it stores just two integers per row to save space.
columns
The columns table stores column names. It is referenced by item_changed.
id - an integer ID.
name - the name of the column.
namespace - a foreign key to namespaces, used when multiple file histories share the same database.
Reserved column names
Note that _id, _item_full_hash, _item, _item_id, _version, _commit, _commit_at, _commit_hash, _changed_columns, rowid are considered reserved column names for the purposes of this tool.
If your data contains any of these they will be renamed to add a trailing underscore, for example _id_, _item_, _version_, to avoid clashing with the reserved columns.
If you have a column with a name such as _commit_ it will be renamed too, adding an additional trailing underscore, so _commit_ becomes _commit__ and _commit__ becomes _commit___.
Additional options
--repo DIRECTORY - the path to the Git repository, if it is not the current working directory.
--branch TEXT - the Git branch to analyze - defaults to main.
--id TEXT - as described above: pass one or more columns that uniquely identify a record, so that changes to that record can be calculated over time.
--full-versions - instead of recording just the columns that have changed in the item_version table, record a full copy of each version of the item.
--ignore TEXT - one or more columns to ignore - they will not be included in the resulting database.
--csv - treat the data as CSV or TSV rather than JSON, and attempt to guess the correct dialect.
--dialect - use a specific CSV dialect. Options are excel, excel-tab and unix - see the Python CSV documentation for details.
--skip TEXT - one or more full Git commit hashes that should be skipped. You can use this if some of the data in your revision history is corrupted in a way that prevents this tool from working.
--start-at TEXT - skip commits prior to the specified commit hash.
--start-after TEXT - skip commits up to and including the specified commit hash, then start processing from the following commit.
--convert TEXT - custom Python code for a conversion, described below.
--import TEXT - additional Python modules to import for --convert.
--ignore-duplicate-ids - if a single version of a file has the same ID in it more than once, the tool will exit with an error. Use this option to ignore the error and instead pick just the first of the duplicates.
--namespace TEXT - use this if you wish to include the history of multiple different files in the same database. The default is item but you can set it to something else, which will produce tables with names like yournamespace and yournamespace_version.
--wal - Enable WAL mode on the created database file. Use this if you plan to run queries against the database while git-history is creating it.
--silent - don't show the progress bar.
CSV and TSV data
If the data in your repository is a CSV or TSV file you can process it by adding the --csv option. This will attempt to detect which delimiter is used by the file, so the same option works for both comma- and tab-separated values.
git-history file trees.db trees.csv --id TreeID
You can also specify the CSV dialect using the --dialect option.
Custom conversions using --convert
If your data is not already either CSV/TSV or a flat JSON array, you can reshape it using the --convert option.
The format needed by this tool is an array of dictionaries, as demonstrated by the incidents.json example above.
If your data does not fit this shape, you can provide a snippet of Python code that converts the on-disk content of each stored file into a Python list of dictionaries.
For example, if your stored files each look like this:
You can import nested modules such as ElementTree using --import xml.etree.ElementTree, then refer to them in your function body as xml.etree.ElementTree. For example, if your tracked data was in an items.xml file that looked like this:
You could load it using the following --convert script:
git-history file items.xml --convert '
tree = xml.etree.ElementTree.fromstring(content)
return [el.attrib for el in tree.iter(""item"")]
' --import xml.etree.ElementTree --id id
If your Python code spans more than one line it needs to include a return statement.
You can also use Python generators in your --convert code, for example:
git-history file stats.db package-stats/stats.json \
--repo package-stats \
--convert '
data = json.loads(content)
for key, counts in data.items():
for date, count in counts.items():
yield {
""package"": key,
""date"": date,
""count"": count
}
' --id package --id date
This conversion function expects data that looks like this:
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd git-history
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
441024802,R_kgDOGkmBIg,datasette-tiddlywiki,simonw/datasette-tiddlywiki,0,9599,https://github.com/simonw/datasette-tiddlywiki,Run TiddlyWiki in Datasette and save Tiddlers to a SQLite database,0,2021-12-23T01:05:56Z,2022-02-14T08:57:33Z,2022-03-08T01:36:10Z,,426,22,22,HTML,1,1,1,1,0,0,0,0,3,apache-2.0,[],0,3,22,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,2,"# datasette-tiddlywiki
[](https://pypi.org/project/datasette-tiddlywiki/)
[](https://github.com/simonw/datasette-tiddlywiki/releases)
[](https://github.com/simonw/datasette-tiddlywiki/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-tiddlywiki/blob/main/LICENSE)
Run [TiddlyWiki](https://tiddlywiki.com/) in Datasette and save Tiddlers to a SQLite database
Read more about this project [on my blog](https://simonwillison.net/2021/Dec/24/datasette-tiddlywiki/).
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-tiddlywiki
## Usage
Start Datasette with a `tiddlywiki.db` database. You can create it if it does not yet exist using `--create`.
You need to be signed in as the `root` user to write to the wiki, so use the `--root` option and click on the link it provides:
% datasette tiddlywiki.db --create --root
http://127.0.0.1:8001/-/auth-token?token=456670f1e8d01a8a33b71e17653130de17387336e29afcdfb4ab3d18261e6630
# ...
Navigate to `/-/tiddlywiki` on your instance to interact with TiddlyWiki.
## Authentication and permissions
By default, the wiki can be read by anyone who has permission to read the `tiddlywiki.db` database. Only the signed in `root` user can write to it.
You can sign in using the `--root` option described above, or you can set a password for that user using the [datasette-auth-passwords](https://datasette.io/plugins/datasette-auth-passwords) plugin and sign in using the `/-/login` page.
You can use the `edit-tiddlywiki` permission to grant edit permissions to other users, using another plugin such as [datasette-permissions-sql](https://datasette.io/plugins/datasette-permissions-sql).
You can use the `view-database` permission against the `tiddlywiki` database to control who can view the wiki.
Datasette's permissions mechanism is described in full in [the Datasette documentation](https://docs.datasette.io/en/stable/authentication.html).
## Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-tiddlywiki
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-tiddlywiki
Run TiddlyWiki in Datasette and save Tiddlers to a SQLite database
Navigate to /-/tiddlywiki on your instance to interact with TiddlyWiki.
Authentication and permissions
By default, the wiki can be read by anyone who has permission to read the tiddlywiki.db database. Only the signed in root user can write to it.
You can sign in using the --root option described above, or you can set a password for that user using the datasette-auth-passwords plugin and sign in using the /-/login page.
You can use the edit-tiddlywiki permission to grant edit permissions to other users, using another plugin such as datasette-permissions-sql.
You can use the view-database permission against the tiddlywiki database to control who can view the wiki.
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-tiddlywiki
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
459821110,R_kgDOG2hQNg,google-drive-to-sqlite,simonw/google-drive-to-sqlite,0,9599,https://github.com/simonw/google-drive-to-sqlite,Create a SQLite database containing metadata from Google Drive,0,2022-02-16T02:16:29Z,2022-05-17T00:30:43Z,2022-05-21T16:56:11Z,https://datasette.io/tools/google-drive-to-sqlite,74,133,133,Python,1,1,1,1,0,11,0,0,9,apache-2.0,[],11,9,133,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,11,3,"# google-drive-to-sqlite
[](https://pypi.org/project/google-drive-to-sqlite/)
[](https://github.com/simonw/google-drive-to-sqlite/releases)
[](https://github.com/simonw/google-drive-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/google-drive-to-sqlite/blob/master/LICENSE)
Create a SQLite database containing metadata from [Google Drive](https://www.google.com/drive)
For background on this project, see [Google Drive to SQLite](https://simonwillison.net/2022/Feb/20/google-drive-to-sqlite/) on my blog.
If you use Google Drive, and especially if you have shared drives with other people, there's a good chance you have hundreds or even thousands of files that you may not be fully aware of.
This tool can download metadata about those files - their names, sizes, folders, content types, permissions, creation dates and more - and store them in a SQLite database.
This lets you use SQL to analyze your Google Drive contents, using [Datasette](https://datasette.io/) or the SQLite command-line tool or any other SQLite database browsing software.
## Installation
Install this tool using `pip`:
pip install google-drive-to-sqlite
## Quickstart
Authenticate with Google Drive by running:
google-drive-to-sqlite auth
Now create a SQLite database with metadata about all of the files you have starred using:
google-drive-to-sqlite files starred.db --starred
You can explore the resulting database using [Datasette](https://datasette.io/):
$ pip install datasette
$ datasette starred.db
INFO: Started server process [24661]
INFO: Uvicorn running on http://127.0.0.1:8001
## Authentication
> :warning: **This application has not yet been verified by Google** - you may find you are unable to authenticate until that verification is complete. [#10](https://github.com/simonw/google-drive-to-sqlite/issues/10)
>
> You can work around this issue by [creating your own OAuth client ID key](https://til.simonwillison.net/googlecloud/google-oauth-cli-application) and passing it to the `auth` command using `--google-client-id` and `--google-client-secret`.
First, authenticate with Google Drive using the `auth` command:
$ google-drive-to-sqlite auth
Visit the following URL to authenticate with Google Drive
https://accounts.google.com/o/oauth2/v2/auth?...
Then return here and paste in the resulting code:
Paste code here:
Follow the link, sign in with Google Drive and then copy and paste the resulting code back into the tool.
This will save an authentication token to the file called `auth.json` in the current directory.
To specify a different location for that file, use the `--auth` option:
google-drive-to-sqlite auth --auth ~/google-drive-auth.json
The `auth` command also provides options for using a different scope, Google client ID and Google client secret. You can use these to create your own custom authentication tokens that can work with other Google APIs; see [issue #5](https://github.com/simonw/google-drive-to-sqlite/issues/5) for details.
Full `--help`:
```
Usage: google-drive-to-sqlite auth [OPTIONS]
Authenticate user and save credentials
Options:
-a, --auth FILE Path to save token, defaults to auth.json
--google-client-id TEXT Custom Google client ID
--google-client-secret TEXT Custom Google client secret
--scope TEXT Custom token scope
--help Show this message and exit.
```
To revoke the token that is stored in `auth.json`, such that it cannot be used to access Google Drive in the future, run the `revoke` command:
google-drive-to-sqlite revoke
Or if your token is stored in another location:
google-drive-to-sqlite revoke -a ~/google-drive-auth.json
You will need to obtain a fresh token using the `auth` command in order to continue using this tool.
## google-drive-to-sqlite files
To retrieve metadata about the files in your Google Drive, or a folder or search within it, use the `google-drive-to-sqlite files` command.
This will default to writing details about every file in your Google Drive to a SQLite database:
google-drive-to-sqlite files files.db
Files and folders will be written to database tables, which will be created if they do not yet exist. The database schema is [shown below](#database-schema).
If a file or folder already exists, based on a matching `id`, it will be replaced with fresh data.
Instead of writing to SQLite you can use `--json` to output as JSON, or `--nl` to output as newline-delimited JSON:
google-drive-to-sqlite files --nl
Use `--folder ID` to retrieve everything in a specified folder and its sub-folders:
google-drive-to-sqlite files files.db --folder 1E6Zg2X2bjjtPzVfX8YqdXZDCoB3AVA7i
Use `--q QUERY` to use a [custom search query](https://developers.google.com/drive/api/v3/reference/query-ref):
google-drive-to-sqlite files files.db -q ""viewedByMeTime > '2022-01-01'""
The following shortcut options help build queries:
- `--full-text TEXT` to search for files where the full text matches a search term
- `--starred` for files and folders you have starred
- `--trashed` for files and folders in the trash
- `--shared-with-me` for files and folders that have been shared with you
- `--apps` for Google Apps documents, spreadsheets, presentations and drawings (equivalent to setting all of the next four options)
- `--docs` for Google Apps documents
- `--sheets` for Google Apps spreadsheets
- `--presentations` for Google Apps presentations
- `--drawings` for Google Apps drawings
You can combine these - for example, this returns all files that you have starred and that were shared with you:
google-drive-to-sqlite files highlights.db \
--starred --shared-with-me
Multiple options are treated as AND, with the exception of the Google Apps options which are treated as OR - so the following would retrieve all spreadsheets and presentations that have also been starred:
google-drive-to-sqlite files highlights.db \
--starred --sheets --presentations
You can use `--stop-after X` to stop after retrieving X files, useful for trying out a new search pattern and seeing results straight away.
The `--import-json` and `--import-nl` options are mainly useful for testing and developing this tool. They allow you to replay the JSON or newline-delimited JSON that was previously fetched using `--json` or `--nl` and use it to create a fresh SQLite database, without needing to make any outbound API calls:
# Fetch all starred files from the API, write to starred.json
google-drive-to-sqlite files -q 'starred = true' --json > starred.json
# Now import that data into a new SQLite database file
google-drive-to-sqlite files starred.db --import-json starred.json
Full `--help`:
```
Usage: google-drive-to-sqlite files [OPTIONS] [DATABASE]
Retrieve metadata for files in Google Drive, and write to a SQLite database or
output as JSON.
google-drive-to-sqlite files files.db
Use --json to output JSON, --nl for newline-delimited JSON:
google-drive-to-sqlite files files.db --json
Use a folder ID to recursively fetch every file in that folder and its sub-
folders:
google-drive-to-sqlite files files.db --folder
1E6Zg2X2bjjtPzVfX8YqdXZDCoB3AVA7i
Fetch files you have starred:
google-drive-to-sqlite files starred.db --starred
Options:
-a, --auth FILE Path to auth.json token file
--folder TEXT Files in this folder ID and its sub-folders
-q TEXT Files matching this query
--full-text TEXT Search for files with text match
--starred Files you have starred
--trashed Files in the trash
--shared-with-me Files that have been shared with you
--apps Google Apps docs, spreadsheets, presentations and
drawings
--docs Google Apps docs
--sheets Google Apps spreadsheets
--presentations Google Apps presentations
--drawings Google Apps drawings
--json Output JSON rather than write to DB
--nl Output newline-delimited JSON rather than write to DB
--stop-after INTEGER Stop paginating after X results
--import-json FILE Import from this JSON file instead of the API
--import-nl FILE Import from this newline-delimited JSON file
-v, --verbose Send verbose output to stderr
--help Show this message and exit.
```
## google-drive-to-sqlite download FILE_ID
The `download` command can be used to download files from Google Drive.
You'll need one or more file IDs, which look something like `0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB`.
To download the file, run this:
google-drive-to-sqlite download 0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB
This will detect the content type of the file and use that as the extension - so if this file is a JPEG the file would be downloaded as:
0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB.jpeg
You can pass multiple file IDs to the command at once.
To hide the progress bar and filename output, use `-s` or `--silent`.
If you are downloading a single file you can use the `-o` option to specify a filename and location:
google-drive-to-sqlite download 0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB \
-o my-image.jpeg
Use `-o -` to write the file contents to standard output:
google-drive-to-sqlite download 0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB \
-o - > my-image.jpeg
Full `--help`:
```
Usage: google-drive-to-sqlite download [OPTIONS] FILE_IDS...
Download one or more files to disk, based on their file IDs.
The file content will be saved to a file with the name:
FILE_ID.ext
Where the extension is automatically picked based on the type of file.
If you are downloading a single file you can specify a filename with -o:
google-drive-to-sqlite download MY_FILE_ID -o myfile.txt
Options:
-a, --auth FILE Path to auth.json token file
-o, --output FILE File to write to, or - for standard output
-s, --silent Hide progress bar and filename
--help Show this message and exit.
```
## google-drive-to-sqlite export FORMAT FILE_ID
The `export` command can be used to export Google Docs documents, spreadsheets and presentations in a number of different formats.
You'll need one or more document IDs, which look something like `10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU`. You can find these by looking at the URL of your document on the Google Docs site.
To export that document as PDF, run this:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU
The file will be exported as:
10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU-export.pdf
You can pass multiple file IDs to the command at once.
For the `FORMAT` option you can use any of the mime type options listed [on this page](https://developers.google.com/drive/api/v3/ref-export-formats) - for example, to export as an Open Office document you could use:
google-drive-to-sqlite export \
application/vnd.oasis.opendocument.text \
10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU
For convenience the following shortcuts for common file formats are provided:
- Google Docs: `html`, `txt`, `rtf`, `pdf`, `doc`, `zip`, `epub`
- Google Sheets: `xls`, `pdf`, `csv`, `tsv`, `zip`
- Presentations: `ppt`, `pdf`, `txt`
- Drawings: `jpeg`, `png`, `svg`
The `zip` option returns a zip file of HTML. `txt` returns plain text. The others should be self-evident.
To hide the filename output, use `-s` or `--silent`.
If you are exporting a single file you can use the `-o` option to specify a filename and location:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU \
-o my-document.pdf
Use `-o -` to write the file contents to standard output:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU \
-o - > my-document.pdf
Full `--help`:
```
Usage: google-drive-to-sqlite export [OPTIONS] FORMAT FILE_IDS...
Export one or more files to the specified format.
Usage:
google-drive-to-sqlite export pdf FILE_ID_1 FILE_ID_2
The file content will be saved to a file with the name:
FILE_ID-export.ext
Where the extension is based on the format you specified.
Available export formats can be seen here:
https://developers.google.com/drive/api/v3/ref-export-formats
Or you can use one of the following shortcuts:
- Google Docs: html, txt, rtf, pdf, doc, zip, epub
- Google Sheets: xls, pdf, csv, tsv, zip
- Presentations: ppt, pdf, txt
- Drawings: jpeg, png, svg
""zip"" returns a zip file of HTML.
If you are exporting a single file you can specify a filename with -o:
google-drive-to-sqlite export zip MY_FILE_ID -o myfile.zip
Options:
-a, --auth FILE Path to auth.json token file
-o, --output FILE File to write to, or - for standard output
-s, --silent Hide progress bar and filename
--help Show this message and exit.
```
## google-drive-to-sqlite get URL
The `get` command makes authenticated requests to the specified URL, using credentials derived from the `auth.json` file.
For example:
$ google-drive-to-sqlite get 'https://www.googleapis.com/drive/v3/about?fields=*'
{
""kind"": ""drive#about"",
""user"": {
""kind"": ""drive#user"",
""displayName"": ""Simon Willison"",
# ...
If the resource you are fetching supports pagination you can use `--paginate key` to paginate through all of the rows in a specified key. For example, the following API has a `nextPageToken` key and a `files` list, suggesting it supports pagination:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files
{
""kind"": ""drive#fileList"",
""nextPageToken"": ""~!!~AI9...wogHHYlc="",
""incompleteSearch"": false,
""files"": [
{
""kind"": ""drive#file"",
""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"",
""name"": ""Title of a spreadsheet"",
""mimeType"": ""application/vnd.google-apps.spreadsheet""
},
To paginate through everything in the `files` list you would use `--paginate files` like this:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files --paginate files
[
{
""kind"": ""drive#file"",
""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"",
""name"": ""Title of a spreadsheet"",
""mimeType"": ""application/vnd.google-apps.spreadsheet""
},
# ...
Add `--nl` to stream paginated data as newline-delimited JSON:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files --paginate files --nl
{""kind"": ""drive#file"", ""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"", ""name"": ""Title of a spreadsheet"", ""mimeType"": ""application/vnd.google-apps.spreadsheet""}
{""kind"": ""drive#file"", ""id"": ""1E6Zg2X2bjjtPzVfX8YqdXZDCoB3AVA7i"", ""name"": ""Subfolder"", ""mimeType"": ""application/vnd.google-apps.folder""}
Add `--stop-after 5` to stop after 5 records - useful for testing.
Full `--help`:
```
Usage: google-drive-to-sqlite get [OPTIONS] URL
Make an authenticated HTTP GET to the specified URL
Options:
-a, --auth FILE Path to auth.json token file
--paginate TEXT Paginate through all results in this key
--nl Output paginated data as newline-delimited JSON
--stop-after INTEGER Stop paginating after X results
-v, --verbose Send verbose output to stderr
--help Show this message and exit.
```
## Database schema
The database created by this tool has the following schema:
```sql
CREATE TABLE [drive_users] (
[permissionId] TEXT PRIMARY KEY,
[kind] TEXT,
[displayName] TEXT,
[photoLink] TEXT,
[me] INTEGER,
[emailAddress] TEXT
);
CREATE TABLE [drive_folders] (
[id] TEXT PRIMARY KEY,
[_parent] TEXT,
[_owner] TEXT,
[lastModifyingUser] TEXT,
[kind] TEXT,
[name] TEXT,
[mimeType] TEXT,
[starred] INTEGER,
[trashed] INTEGER,
[explicitlyTrashed] INTEGER,
[parents] TEXT,
[spaces] TEXT,
[version] TEXT,
[webViewLink] TEXT,
[iconLink] TEXT,
[hasThumbnail] INTEGER,
[thumbnailVersion] TEXT,
[viewedByMe] INTEGER,
[createdTime] TEXT,
[modifiedTime] TEXT,
[modifiedByMe] INTEGER,
[shared] INTEGER,
[ownedByMe] INTEGER,
[viewersCanCopyContent] INTEGER,
[copyRequiresWriterPermission] INTEGER,
[writersCanShare] INTEGER,
[folderColorRgb] TEXT,
[quotaBytesUsed] TEXT,
[isAppAuthorized] INTEGER,
[linkShareMetadata] TEXT,
FOREIGN KEY([_parent]) REFERENCES [drive_folders]([id]),
FOREIGN KEY([_owner]) REFERENCES [drive_users]([permissionId]),
FOREIGN KEY([lastModifyingUser]) REFERENCES [drive_users]([permissionId])
);
CREATE TABLE [drive_files] (
[id] TEXT PRIMARY KEY,
[_parent] TEXT,
[_owner] TEXT,
[lastModifyingUser] TEXT,
[kind] TEXT,
[name] TEXT,
[mimeType] TEXT,
[starred] INTEGER,
[trashed] INTEGER,
[explicitlyTrashed] INTEGER,
[parents] TEXT,
[spaces] TEXT,
[version] TEXT,
[webViewLink] TEXT,
[iconLink] TEXT,
[hasThumbnail] INTEGER,
[thumbnailVersion] TEXT,
[viewedByMe] INTEGER,
[createdTime] TEXT,
[modifiedTime] TEXT,
[modifiedByMe] INTEGER,
[shared] INTEGER,
[ownedByMe] INTEGER,
[viewersCanCopyContent] INTEGER,
[copyRequiresWriterPermission] INTEGER,
[writersCanShare] INTEGER,
[quotaBytesUsed] TEXT,
[isAppAuthorized] INTEGER,
[linkShareMetadata] TEXT,
FOREIGN KEY([_parent]) REFERENCES [drive_folders]([id]),
FOREIGN KEY([_owner]) REFERENCES [drive_users]([permissionId]),
FOREIGN KEY([lastModifyingUser]) REFERENCES [drive_users]([permissionId])
);
```
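For example, once a `files.db` database has been created by the `files` command, you could list your largest files and who last modified them with a query along these lines. This is a sketch; note that `quotaBytesUsed` is stored as text, so it is cast to an integer for sorting:
```python
import sqlite3

conn = sqlite3.connect('files.db')
rows = conn.execute(
    '''
    select drive_files.name,
           cast(drive_files.quotaBytesUsed as integer) as bytes,
           drive_users.displayName as last_modified_by
    from drive_files
    left join drive_users
        on drive_users.permissionId = drive_files.lastModifyingUser
    order by bytes desc
    limit 10
    '''
)
for name, bytes_used, modified_by in rows:
    print(f'{bytes_used or 0:>12}  {name}  ({modified_by})')
```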
## Thumbnails
You can construct a thumbnail image for a known file ID using the following URL:
https://drive.google.com/thumbnail?sz=w800-h800&id=FILE_ID
Users who are signed into Google Drive and have permission to view a file will be redirected to a thumbnail version of that file. You can tweak the `w800` and `h800` parameters to request different thumbnail sizes.
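That URL pattern is easy to generate in bulk. As a sketch, the snippet below reads file IDs out of a `files.db` database created by the `files` command and prints a thumbnail URL for each one (800 pixels is just the example size from above):
```python
import sqlite3

def thumbnail_url(file_id, width=800, height=800):
    # Builds the thumbnail URL pattern described above
    return f'https://drive.google.com/thumbnail?sz=w{width}-h{height}&id={file_id}'

conn = sqlite3.connect('files.db')
for (file_id,) in conn.execute('select id from drive_files limit 5'):
    print(thumbnail_url(file_id))
```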
## Privacy policy
This tool requests access to your Google Drive account in order to retrieve metadata about your files there. It also offers a feature that can download the content of those files.
The credentials used to access your account are stored in the `auth.json` file on your computer. The metadata and content retrieved from Google Drive is also stored only on your own personal computer.
At no point do the developers of this tool gain access to any of your data.
## Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd google-drive-to-sqlite
python -m venv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
google-drive-to-sqlite
Create a SQLite database containing metadata from Google Drive
If you use Google Drive, and especially if you have shared drives with other people, there's a good chance you have hundreds or even thousands of files that you may not be fully aware of.
This tool can download metadata about those files - their names, sizes, folders, content types, permissions, creation dates and more - and store them in a SQLite database.
This lets you use SQL to analyze your Google Drive contents, using Datasette or the SQLite command-line tool or any other SQLite database browsing software.
Installation
Install this tool using pip:
pip install google-drive-to-sqlite
Quickstart
Authenticate with Google Drive by running:
google-drive-to-sqlite auth
Now create a SQLite database with metadata about all of the files you have starred using:
google-drive-to-sqlite files starred.db --starred
You can explore the resulting database using Datasette:
$ pip install datasette
$ datasette starred.db
INFO: Started server process [24661]
INFO: Uvicorn running on http://127.0.0.1:8001
Authentication
⚠️ This application has not yet been verified by Google - you may find you are unable to authenticate until that verification is complete. #10
You can work around this issue by creating your own OAuth client ID key and passing it to the auth command using --google-client-id and --google-client-secret.
First, authenticate with Google Drive using the auth command:
$ google-drive-to-sqlite auth
Visit the following URL to authenticate with Google Drive
https://accounts.google.com/o/oauth2/v2/auth?...
Then return here and paste in the resulting code:
Paste code here:
Follow the link, sign in with Google Drive and then copy and paste the resulting code back into the tool.
This will save an authentication token to the file called auth.json in the current directory.
To specify a different location for that file, use the --auth option:
The auth command also provides options for using a different scope, Google client ID and Google client secret. You can use these to create your own custom authentication tokens that can work with other Google APIs; see issue #5 for details.
Full --help:
Usage: google-drive-to-sqlite auth [OPTIONS]
Authenticate user and save credentials
Options:
-a, --auth FILE Path to save token, defaults to auth.json
--google-client-id TEXT Custom Google client ID
--google-client-secret TEXT Custom Google client secret
--scope TEXT Custom token scope
--help Show this message and exit.
To revoke the token that is stored in auth.json, such that it cannot be used to access Google Drive in the future, run the revoke command:
google-drive-to-sqlite revoke
Or if your token is stored in another location:
google-drive-to-sqlite revoke -a ~/google-drive-auth.json
You will need to obtain a fresh token using the auth command in order to continue using this tool.
google-drive-to-sqlite files
To retrieve metadata about the files in your Google Drive, or a folder or search within it, use the google-drive-to-sqlite files command.
This will default to writing details about every file in your Google Drive to a SQLite database:
google-drive-to-sqlite files files.db
Files and folders will be written to database tables, which will be created if they do not yet exist. The database schema is shown below.
If a file or folder already exists, based on a matching id, it will be replaced with fresh data.
Instead of writing to SQLite you can use --json to output as JSON, or --nl to output as newline-delimited JSON:
google-drive-to-sqlite files --nl
Use --folder ID to retrieve everything in a specified folder and its sub-folders:
Multiple options are treated as AND, with the exception of the Google Apps options which are treated as OR - so the following would retrieve all spreadsheets and presentations that have also been starred:
You can use --stop-after X to stop after retrieving X files, useful for trying out a new search pattern and seeing results straight away.
The --import-json and --import-nl options are mainly useful for testing and developing this tool. They allow you to replay the JSON or newline-delimited JSON that was previously fetched using --json or --nl and use it to create a fresh SQLite database, without needing to make any outbound API calls:
# Fetch all starred files from the API, write to starred.json
google-drive-to-sqlite files -q 'starred = true' --json > starred.json
# Now import that data into a new SQLite database file
google-drive-to-sqlite files starred.db --import-json starred.json
Full --help:
Usage: google-drive-to-sqlite files [OPTIONS] [DATABASE]
Retrieve metadata for files in Google Drive, and write to a SQLite database or
output as JSON.
google-drive-to-sqlite files files.db
Use --json to output JSON, --nl for newline-delimited JSON:
google-drive-to-sqlite files files.db --json
Use a folder ID to recursively fetch every file in that folder and its sub-
folders:
google-drive-to-sqlite files files.db --folder
1E6Zg2X2bjjtPzVfX8YqdXZDCoB3AVA7i
Fetch files you have starred:
google-drive-to-sqlite files starred.db --starred
Options:
-a, --auth FILE Path to auth.json token file
--folder TEXT Files in this folder ID and its sub-folders
-q TEXT Files matching this query
--full-text TEXT Search for files with text match
--starred Files you have starred
--trashed Files in the trash
--shared-with-me Files that have been shared with you
--apps Google Apps docs, spreadsheets, presentations and
drawings
--docs Google Apps docs
--sheets Google Apps spreadsheets
--presentations Google Apps presentations
--drawings Google Apps drawings
--json Output JSON rather than write to DB
--nl Output newline-delimited JSON rather than write to DB
--stop-after INTEGER Stop paginating after X results
--import-json FILE Import from this JSON file instead of the API
--import-nl FILE Import from this newline-delimited JSON file
-v, --verbose Send verbose output to stderr
--help Show this message and exit.
google-drive-to-sqlite download FILE_ID
The download command can be used to download files from Google Drive.
You'll need one or more file IDs, which look something like 0B32uDVNZfiEKLUtIT1gzYWN2NDI4SzVQYTFWWWxCWUtvVGNB.
Usage: google-drive-to-sqlite download [OPTIONS] FILE_IDS...
Download one or more files to disk, based on their file IDs.
The file content will be saved to a file with the name:
FILE_ID.ext
Where the extension is automatically picked based on the type of file.
If you are downloading a single file you can specify a filename with -o:
google-drive-to-sqlite download MY_FILE_ID -o myfile.txt
Options:
-a, --auth FILE Path to auth.json token file
-o, --output FILE File to write to, or - for standard output
-s, --silent Hide progress bar and filename
--help Show this message and exit.
google-drive-to-sqlite export FORMAT FILE_ID
The export command can be used to export Google Docs documents, spreadsheets and presentations in a number of different formats.
You'll need one or more document IDs, which look something like 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU. You can find these by looking at the URL of your document on the Google Docs site.
To export that document as PDF, run this:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU
For convenience the following shortcuts for common file formats are provided:
Google Docs: html, txt, rtf, pdf, doc, zip, epub
Google Sheets: xls, pdf, csv, tsv, zip
Presentations: ppt, pdf, txt
Drawings: jpeg, png, svg
The zip option returns a zip file of HTML. txt returns plain text. The others should be self-evident.
To hide the filename output, use -s or --silent.
If you are exporting a single file you can use the -o option to specify a filename and location:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU \
-o my-document.pdf
Use -o - to write the file contents to standard output:
google-drive-to-sqlite export pdf 10BOHGDUYa7lBjUSo26YFCHTpgEmtXabdVFaopCTh1vU \
-o - > my-document.pdf
Full --help:
Usage: google-drive-to-sqlite export [OPTIONS] FORMAT FILE_IDS...
Export one or more files to the specified format.
Usage:
google-drive-to-sqlite export pdf FILE_ID_1 FILE_ID_2
The file content will be saved to a file with the name:
FILE_ID-export.ext
Where the extension is based on the format you specified.
Available export formats can be seen here:
https://developers.google.com/drive/api/v3/ref-export-formats
Or you can use one of the following shortcuts:
- Google Docs: html, txt, rtf, pdf, doc, zip, epub
- Google Sheets: xls, pdf, csv, tsv, zip
- Presentations: ppt, pdf, txt
- Drawings: jpeg, png, svg
""zip"" returns a zip file of HTML.
If you are exporting a single file you can specify a filename with -o:
google-drive-to-sqlite export zip MY_FILE_ID -o myfile.zip
Options:
-a, --auth FILE Path to auth.json token file
-o, --output FILE File to write to, or - for standard output
-s, --silent Hide progress bar and filename
--help Show this message and exit.
google-drive-to-sqlite get URL
The get command makes authenticated requests to the specified URL, using credentials derived from the auth.json file.
If the resource you are fetching supports pagination you can use --paginate key to paginate through all of the rows in a specified key. For example, the following API has a nextPageToken key and a files list, suggesting it supports pagination:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files
{
""kind"": ""drive#fileList"",
""nextPageToken"": ""~!!~AI9...wogHHYlc="",
""incompleteSearch"": false,
""files"": [
{
""kind"": ""drive#file"",
""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"",
""name"": ""Title of a spreadsheet"",
""mimeType"": ""application/vnd.google-apps.spreadsheet""
},
To paginate through everything in the files list you would use --paginate files like this:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files --paginate files
[
{
""kind"": ""drive#file"",
""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"",
""name"": ""Title of a spreadsheet"",
""mimeType"": ""application/vnd.google-apps.spreadsheet""
},
# ...
Add --nl to stream paginated data as newline-delimited JSON:
$ google-drive-to-sqlite get https://www.googleapis.com/drive/v3/files --paginate files --nl
{""kind"": ""drive#file"", ""id"": ""1YEsITp_X8PtDUJWHGM0osT-TXAU1nr0e7RSWRM2Jpyg"", ""name"": ""Title of a spreadsheet"", ""mimeType"": ""application/vnd.google-apps.spreadsheet""}
{""kind"": ""drive#file"", ""id"": ""1E6Zg2X2bjjtPzVfX8YqdXZDCoB3AVA7i"", ""name"": ""Subfolder"", ""mimeType"": ""application/vnd.google-apps.folder""}
Add --stop-after 5 to stop after 5 records - useful for testing.
Full --help:
Usage: google-drive-to-sqlite get [OPTIONS] URL
Make an authenticated HTTP GET to the specified URL
Options:
-a, --auth FILE Path to auth.json token file
--paginate TEXT Paginate through all results in this key
--nl Output paginated data as newline-delimited JSON
--stop-after INTEGER Stop paginating after X results
-v, --verbose Send verbose output to stderr
--help Show this message and exit.
Database schema
The database created by this tool has the following schema:
Users who are signed into Google Drive and have permission to view a file will be redirected to a thumbnail version of that file. You can tweak the w800 and h800 parameters to request different thumbnail sizes.
Privacy policy
This tool requests access to your Google Drive account in order to retrieve metadata about your files there. It also offers a feature that can download the content of those files.
The credentials used to access your account are stored in the auth.json file on your computer. The metadata and content retrieved from Google Drive is also stored only on your own personal computer.
At no point do the developers of this tool gain access to any of your data.
Development
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd google-drive-to-sqlite
python -m venv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
461322238,R_kgDOG383_g,sqlite-colorbrewer,eyeseast/sqlite-colorbrewer,0,25778,https://github.com/eyeseast/sqlite-colorbrewer,A custom function to use ColorBrewer scales in SQLite queries,0,2022-02-19T21:53:46Z,2022-03-03T17:16:40Z,2022-03-02T03:04:56Z,,19,4,4,Python,1,1,1,1,0,0,0,0,0,apache-2.0,[],0,0,4,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# sqlite-colorbrewer
[](https://pypi.org/project/sqlite-colorbrewer/)
[](https://github.com/eyeseast/sqlite-colorbrewer/releases)
[](https://github.com/eyeseast/sqlite-colorbrewer/actions?query=workflow%3ATest)
[](https://github.com/eyeseast/sqlite-colorbrewer/blob/main/LICENSE)
A custom function to use [ColorBrewer](https://colorbrewer2.org/) scales in SQLite queries.
Colors are exported from [here](https://colorbrewer2.org/export/colorbrewer.json).
## Installation
To install as a Python library and use with the [standard SQLite3 module](https://docs.python.org/3/library/sqlite3.html):
pip install sqlite-colorbrewer
To install this plugin in the same environment as Datasette:
datasette install sqlite-colorbrewer
## Usage
If you're using this library with Datasette, it will be automatically registered as a plugin and available for use in SQL queries, like so:
```sql
SELECT colorbrewer('Blues', 9, 0);
```
That will return a single value: `""rgb(247,251,255)""`
To use with a SQLite connection outside of Datasette, use the `register` function:
```python
>>> import sqlite3
>>> import sqlite_colorbrewer
>>> conn = sqlite3.connect(':memory:')
>>> sqlite_colorbrewer.register(conn)
>>> cursor = conn.execute(""SELECT colorbrewer('Blues', 9, 0);"")
>>> result = next(cursor)
>>> print(result)
rgb(247,251,255)
```
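As a further sketch, the same function can pick a colour for each row of an ordinary table; the `counties` table and its `bucket` values here are invented purely for illustration:
```python
import sqlite3
import sqlite_colorbrewer

conn = sqlite3.connect(':memory:')
sqlite_colorbrewer.register(conn)

# A made-up table with a pre-computed bucket (0-8) for each row
conn.execute('create table counties (name text, bucket integer)')
conn.executemany(
    'insert into counties values (?, ?)',
    [('Clark', 0), ('Lake', 4), ('Hughes', 8)],
)
for name, color in conn.execute(
    ""select name, colorbrewer('Blues', 9, bucket) from counties""
):
    print(name, color)
```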
## Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd sqlite-colorbrewer
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
To build `sqlite_colorbrewer/colorbrewer.py`:
./json_to_python.py
black . # to format the resulting file
## ColorBrewer
Copyright (c) 2002 Cynthia Brewer, Mark Harrower, and The Pennsylvania State University.
Licensed under the Apache License, Version 2.0 (the ""License""); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an ""AS IS"" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
See the [ColorBrewer updates](http://www.personal.psu.edu/cab38/ColorBrewer/ColorBrewer_updates.html) for updates to copyright information.
","
sqlite-colorbrewer
A custom function to use ColorBrewer scales in SQLite queries.
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd sqlite-colorbrewer
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
To build sqlite_colorbrewer/colorbrewer.py:
./json_to_python.py
black . # to format the resulting file
ColorBrewer
Copyright (c) 2002 Cynthia Brewer, Mark Harrower, and The Pennsylvania State University.
Licensed under the Apache License, Version 2.0 (the ""License""); you may not use this file except in compliance with the License. You may obtain a copy of the License at
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an ""AS IS"" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
",1,public,0,,,
462903750,R_kgDOG5dZxg,datasette-redirect-forbidden,simonw/datasette-redirect-forbidden,0,9599,https://github.com/simonw/datasette-redirect-forbidden,Redirect forbidden requests to a login page,0,2022-02-23T20:59:26Z,2022-02-23T22:00:12Z,2022-02-23T22:02:38Z,,7,0,0,Python,1,1,1,1,0,0,0,0,1,apache-2.0,[],0,1,0,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-redirect-forbidden
[](https://pypi.org/project/datasette-redirect-forbidden/)
[](https://github.com/simonw/datasette-redirect-forbidden/releases)
[](https://github.com/simonw/datasette-redirect-forbidden/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-redirect-forbidden/blob/main/LICENSE)
Redirect forbidden requests to a login page
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-redirect-forbidden
## Usage
Add the following to your `metadata.yml` (or `metadata.json`) file to configure the plugin:
```yaml
plugins:
datasette-redirect-forbidden:
redirect_to: /-/login
```
Any 403 forbidden pages will redirect to the specified page.
## Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-redirect-forbidden
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-redirect-forbidden
Redirect forbidden requests to a login page
Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-redirect-forbidden
Usage
Add the following to your metadata.yml (or metadata.json) file to configure the plugin:
Any 403 forbidden pages will redirect to the specified page.
Development
To set up this plugin locally, first checkout the code. Then create a new virtual environment:
cd datasette-redirect-forbidden
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,"{""id"": 400878073, ""node_id"": ""MDEwOlJlcG9zaXRvcnk0MDA4NzgwNzM="", ""name"": ""datasette-plugin-template-repository"", ""full_name"": ""simonw/datasette-plugin-template-repository"", ""private"": false, ""owner"": {""login"": ""simonw"", ""id"": 9599, ""node_id"": ""MDQ6VXNlcjk1OTk="", ""avatar_url"": ""https://avatars.githubusercontent.com/u/9599?v=4"", ""gravatar_id"": """", ""url"": ""https://api.github.com/users/simonw"", ""html_url"": ""https://github.com/simonw"", ""followers_url"": ""https://api.github.com/users/simonw/followers"", ""following_url"": ""https://api.github.com/users/simonw/following{/other_user}"", ""gists_url"": ""https://api.github.com/users/simonw/gists{/gist_id}"", ""starred_url"": ""https://api.github.com/users/simonw/starred{/owner}{/repo}"", ""subscriptions_url"": ""https://api.github.com/users/simonw/subscriptions"", ""organizations_url"": ""https://api.github.com/users/simonw/orgs"", ""repos_url"": ""https://api.github.com/users/simonw/repos"", ""events_url"": ""https://api.github.com/users/simonw/events{/privacy}"", ""received_events_url"": ""https://api.github.com/users/simonw/received_events"", ""type"": ""User"", ""site_admin"": false}, ""html_url"": ""https://github.com/simonw/datasette-plugin-template-repository"", ""description"": ""GitHub template repository for creating new Datasette plugins, using the simonw/datasette-plugin cookiecutter template"", ""fork"": false, ""url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository"", ""forks_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/forks"", ""keys_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/keys{/key_id}"", ""collaborators_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/collaborators{/collaborator}"", ""teams_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/teams"", ""hooks_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/hooks"", ""issue_events_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/issues/events{/number}"", ""events_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/events"", ""assignees_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/assignees{/user}"", ""branches_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/branches{/branch}"", ""tags_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/tags"", ""blobs_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/git/blobs{/sha}"", ""git_tags_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/git/tags{/sha}"", ""git_refs_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/git/refs{/sha}"", ""trees_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/git/trees{/sha}"", ""statuses_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/statuses/{sha}"", ""languages_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/languages"", ""stargazers_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/stargazers"", ""contributors_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/contributors"", ""subscribers_url"": 
""https://api.github.com/repos/simonw/datasette-plugin-template-repository/subscribers"", ""subscription_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/subscription"", ""commits_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/commits{/sha}"", ""git_commits_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/git/commits{/sha}"", ""comments_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/comments{/number}"", ""issue_comment_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/issues/comments{/number}"", ""contents_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/contents/{+path}"", ""compare_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/compare/{base}...{head}"", ""merges_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/merges"", ""archive_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/{archive_format}{/ref}"", ""downloads_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/downloads"", ""issues_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/issues{/number}"", ""pulls_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/pulls{/number}"", ""milestones_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/milestones{/number}"", ""notifications_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/notifications{?since,all,participating}"", ""labels_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/labels{/name}"", ""releases_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/releases{/id}"", ""deployments_url"": ""https://api.github.com/repos/simonw/datasette-plugin-template-repository/deployments"", ""created_at"": ""2021-08-28T19:50:28Z"", ""updated_at"": ""2021-12-21T20:45:06Z"", ""pushed_at"": ""2021-12-21T20:45:02Z"", ""git_url"": ""git://github.com/simonw/datasette-plugin-template-repository.git"", ""ssh_url"": ""git@github.com:simonw/datasette-plugin-template-repository.git"", ""clone_url"": ""https://github.com/simonw/datasette-plugin-template-repository.git"", ""svn_url"": ""https://github.com/simonw/datasette-plugin-template-repository"", ""homepage"": """", ""size"": 6, ""stargazers_count"": 12, ""watchers_count"": 12, ""language"": null, ""has_issues"": true, ""has_projects"": true, ""has_downloads"": true, ""has_wiki"": true, ""has_pages"": false, ""forks_count"": 0, ""mirror_url"": null, ""archived"": false, ""disabled"": false, ""open_issues_count"": 0, ""license"": null, ""allow_forking"": true, ""is_template"": true, ""topics"": [], ""visibility"": ""public"", ""forks"": 0, ""open_issues"": 0, ""watchers"": 12, ""default_branch"": ""main"", ""permissions"": {""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}, ""temp_clone_token"": """"}",,
467679579,R_kgDOG-A5Ww,shot-scraper,simonw/shot-scraper,0,9599,https://github.com/simonw/shot-scraper,A command-line utility for taking automated screenshots of websites,0,2022-03-08T21:21:02Z,2022-11-15T14:38:08Z,2022-11-16T04:28:52Z,https://shot-scraper.datasette.io,188,775,775,Python,1,1,1,1,0,40,0,0,16,apache-2.0,"[""playwright"", ""playwright-python"", ""scraping"", ""screenshot-utility"", ""screenshots""]",40,16,775,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,40,3,"# shot-scraper
[](https://pypi.org/project/shot-scraper/)
[](https://github.com/simonw/shot-scraper/releases)
[](https://github.com/simonw/shot-scraper/actions?query=workflow%3ATest)
[](https://github.com/simonw/shot-scraper/blob/master/LICENSE)
A command-line utility for taking automated screenshots of websites
For background on this project see [shot-scraper: automated screenshots for documentation, built on Playwright](https://simonwillison.net/2022/Mar/10/shot-scraper/).
## Documentation
- [Full documentation for shot-scraper](https://shot-scraper.datasette.io/)
- [Tutorial: Automating screenshots for the Datasette documentation using shot-scraper](https://simonwillison.net/2022/Oct/14/automating-screenshots/)
- [Release notes](https://github.com/simonw/shot-scraper/releases)
## Get started with GitHub Actions
To get started without installing any software, use the [shot-scraper-template](https://github.com/simonw/shot-scraper-template) template to create your own GitHub repository which takes screenshots of a page using `shot-scraper`. See [Instantly create a GitHub repository to take screenshots of a web page](https://simonwillison.net/2022/Mar/14/shot-scraper-template/) for details.
## Quick installation
You can install the `shot-scraper` CLI tool using [pip](https://pip.pypa.io/):
pip install shot-scraper
# Now install the browser it needs:
shot-scraper install
## Taking your first screenshot
You can take a screenshot of a web page like this:
shot-scraper https://datasette.io/
This will create a screenshot in a file called `datasette-io.png`.
Many more options are available, see [Taking a screenshot](https://shot-scraper.datasette.io/en/stable/screenshots.html) for details.
## Examples
- The [shot-scraper-demo](https://github.com/simonw/shot-scraper-demo) repository uses this tool to capture recently spotted owls in El Granada, CA according to [this page](https://www.owlsnearme.com/?place=127871), and to generate an annotated screenshot illustrating a Datasette feature as described [in my blog](https://simonwillison.net/2022/Mar/10/shot-scraper/#a-complex-example).
- The [Datasette Documentation](https://docs.datasette.io/en/latest/) uses screenshots taken by `shot-scraper` running in the [simonw/datasette-screenshots](https://github.com/simonw/datasette-screenshots) GitHub repository, described in detail in [Automating screenshots for the Datasette documentation using shot-scraper](https://simonwillison.net/2022/Oct/14/automating-screenshots/).
- Ben Welsh built [@newshomepages](https://twitter.com/newshomepages), a Twitter bot that uses `shot-scraper` and GitHub Actions to take screenshots of news website homepages and publish them to Twitter. The code for that lives in [palewire/news-homepages](https://github.com/palewire/news-homepages).
- [scrape-hacker-news-by-domain](https://github.com/simonw/scrape-hacker-news-by-domain) uses `shot-scraper javascript` to scrape a web page. See [Scraping web pages from the command-line with shot-scraper](https://simonwillison.net/2022/Mar/14/scraping-web-pages-shot-scraper/) for details of how this works.
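The `shot-scraper javascript` command used there evaluates a JavaScript expression against a page and outputs the result - a minimal illustrative invocation (the URL and expression here are placeholder examples, not taken from that repository) looks like this:
shot-scraper javascript https://datasette.io/ 'document.title'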
","
shot-scraper
A command-line utility for taking automated screenshots of websites
The shot-scraper-demo repository uses this tool to capture recently spotted owls in El Granada, CA according to this page, and to generate an annotated screenshot illustrating a Datasette feature as described in my blog.
Ben Welsh built @newshomepages, a Twitter bot that uses shot-scraper and GitHub Actions to take screenshots of news website homepages and publish them to Twitter. The code for that lives in palewire/news-homepages.
",1,public,0,,0,0
470338069,R_kgDOHAjKFQ,datasette-hashed-urls,simonw/datasette-hashed-urls,0,9599,https://github.com/simonw/datasette-hashed-urls,Optimize Datasette performance behind a caching proxy,0,2022-03-15T21:31:52Z,2022-03-17T03:00:34Z,2022-03-24T17:58:05Z,,38,3,3,Python,1,1,1,1,0,0,0,0,0,apache-2.0,[],0,0,3,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-hashed-urls
[](https://pypi.org/project/datasette-hashed-urls/)
[](https://github.com/simonw/datasette-hashed-urls/releases)
[](https://github.com/simonw/datasette-hashed-urls/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-hashed-urls/blob/main/LICENSE)
Optimize Datasette performance behind a caching proxy
When you open a database file in immutable mode using the `-i` option, Datasette calculates a SHA-256 hash of the contents of that file on startup.
This content hash can then optionally be used to create URLs that are guaranteed to change if the contents of the file change in the future.
The result is pages that can be cached indefinitely by both browsers and caching proxies - providing a significant performance boost.
## Demo
A demo of this plugin is running at https://datasette-hashed-urls.vercel.app/
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-hashed-urls
## Usage
Once installed, this plugin will act on any immutable database files that are loaded into Datasette:
datasette -i fixtures.db
The database will automatically be renamed to incorporate a hash of the contents of the SQLite file - so the above database would be served as:
http://127.0.0.1:8001/fixtures-aa7318b
Every page that accesses that database, including JSON endpoints, will be served with the following far-future cache expiry header:
cache-control: max-age=31536000, public
Here `max-age=31536000` is the number of seconds in a year.
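You can verify the header from the command line - for example, assuming Datasette is running locally on the default port with the hashed database name shown above:
curl -s -o /dev/null -D - http://127.0.0.1:8001/fixtures-aa7318b
The response headers printed by this command should include that `cache-control` value.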
A caching proxy such as Cloudflare can then be used to cache and accelerate content served by Datasette.
When the database file is updated and the server is restarted, the hash will change and content will be served from a new URL. Any hits to the previous hashed URLs will be automatically redirected.
If you run Datasette using the `--crossdb` option to enable [cross-database queries](https://docs.datasette.io/en/stable/sql_queries.html#cross-database-queries) the `_memory` database will also have a hash added to its URL - in this case, the hash will be a combination of the hashes of the other attached databases.
## Configuration
You can use the `max_age` plugin configuration setting to change the cache duration specified in the `cache-control` HTTP header.
To set the cache expiry time to one hour you would add this to your Datasette `metadata.json` configuration file:
```json
{
""plugins"": {
""datasette-hashed-urls"": {
""max_age"": 3600
}
}
}
```
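Assuming that configuration is saved as `metadata.json`, you could then start Datasette with it like this (an illustrative command, not from the original README):
datasette -i fixtures.db -m metadata.json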
## History
This functionality used to ship as part of Datasette itself, as a feature called [Hashed URL mode](https://docs.datasette.io/en/0.60.2/performance.html#hashed-url-mode).
That feature has been deprecated and will be removed in Datasette 1.0. This plugin should be used as an alternative.
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-hashed-urls
python3 -mvenv venv
source venv/bin/activate
Or if you are using `pipenv`:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-hashed-urls
Optimize Datasette performance behind a caching proxy
When you open a database file in immutable mode using the -i option, Datasette calculates a SHA-256 hash of the contents of that file on startup.
This content hash can then optionally be used to create URLs that are guaranteed to change if the contents of the file change in the future.
The result is pages that can be cached indefinitely by both browsers and caching proxies - providing a significant performance boost.
Install this plugin in the same environment as Datasette.
$ datasette install datasette-hashed-urls
Usage
Once installed, this plugin will act on any immutable database files that are loaded into Datasette:
datasette -i fixtures.db
The database will automatically be renamed to incorporate a hash of the contents of the SQLite file - so the above database would be served as:
http://127.0.0.1:8001/fixtures-aa7318b
Every page that accesses that database, including JSON endpoints, will be served with the following far-future cache expiry header:
cache-control: max-age=31536000, public
Here max-age=31536000 is the number of seconds in a year.
A caching proxy such as Cloudflare can then be used to cache and accelerate content served by Datasette.
When the database file is updated and the server is restarted, the hash will change and content will be served from a new URL. Any hits to the previous hashed URLs will be automatically redirected.
If you run Datasette using the --crossdb option to enable cross-database queries the _memory database will also have a hash added to its URL - in this case, the hash will be a combination of the hashes of the other attached databases.
Configuration
You can use the max_age plugin configuration setting to change the cache duration specified in the cache-control HTTP header.
To set the cache expiry time to one hour you would add this to your Datasette metadata.json configuration file:
This functionality used to ship as part of Datasette itself, as a feature called Hashed URL mode.
That feature has been deprecated and will be removed in Datasette 1.0. This plugin should be used as an alternative.
Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-hashed-urls
python3 -mvenv venv
source venv/bin/activate
Or if you are using pipenv:
pipenv shell
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
474176116,R_kgDOHENadA,datasette-packages,simonw/datasette-packages,0,9599,https://github.com/simonw/datasette-packages,Show a list of currently installed Python packages,0,2022-03-25T22:04:23Z,2022-03-25T22:04:45Z,2022-07-03T02:41:55Z,,9,0,0,Python,1,1,1,1,0,0,0,0,1,apache-2.0,[],0,1,0,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-packages
[](https://pypi.org/project/datasette-packages/)
[](https://github.com/simonw/datasette-packages/releases)
[](https://github.com/simonw/datasette-packages/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-packages/blob/main/LICENSE)
Show a list of currently installed Python packages
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-packages
## Usage
Visit `/-/packages` to see a list of installed Python packages.
Visit `/-/packages.json` to get that back as JSON.
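For example, assuming Datasette is running locally on the default port, you can fetch that JSON from the command line:
curl http://127.0.0.1:8001/-/packages.json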
## Demo
The output of this plugin can be seen here:
- https://latest-with-plugins.datasette.io/-/packages
- https://latest-with-plugins.datasette.io/-/packages.json
## With datasette-graphql
If you have version 2.1 or higher of the [datasette-graphql](https://datasette.io/plugins/datasette-graphql) plugin installed, you can also query the list of packages using this GraphQL query:
```graphql
{
packages {
name
version
}
}
```
[Demo of this query](https://latest-with-plugins.datasette.io/graphql?query=%7B%0A%20%20packages%20%7B%0A%20%20%20%20name%0A%20%20%20%20version%0A%20%20%7D%0A%7D).
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-packages
python3 -mvenv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-packages
Show a list of currently installed Python packages
Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-packages
Usage
Visit /-/packages to see a list of installed Python packages.
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-packages
python3 -mvenv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,0,
474468776,R_kgDOHEfRqA,datasette-auth0,simonw/datasette-auth0,0,9599,https://github.com/simonw/datasette-auth0,Datasette plugin that authenticates users using Auth0,0,2022-03-26T21:19:31Z,2022-03-27T17:59:49Z,2022-03-28T03:04:52Z,,11,3,3,Python,1,1,1,1,0,0,0,0,0,apache-2.0,"[""auth0"", ""datasette"", ""datasette-plugin""]",0,0,3,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-auth0
[](https://pypi.org/project/datasette-auth0/)
[](https://github.com/simonw/datasette-auth0/releases)
[](https://github.com/simonw/datasette-auth0/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-auth0/blob/main/LICENSE)
Datasette plugin that authenticates users using [Auth0](https://auth0.com/)
See [Simplest possible OAuth authentication with Auth0](https://til.simonwillison.net/auth0/oauth-with-auth0) for more about how this plugin works.
## Installation
Install this plugin in the same environment as Datasette.
$ datasette install datasette-auth0
## Demo
You can try this out at [datasette-auth0-demo.datasette.io](https://datasette-auth0-demo.datasette.io/) - click on the top right menu icon and select ""Sign in with Auth0"".
## Initial configuration
First, create a new application in Auth0. You will need the domain, client ID and client secret for that application.
The domain should be something like `mysite.us.auth0.com`.
Add `http://127.0.0.1:8001/-/auth0-callback` to the list of Allowed Callback URLs.
Then configure these plugin secrets using `metadata.yml`:
```yaml
plugins:
datasette-auth0:
domain:
""$env"": AUTH0_DOMAIN
client_id:
""$env"": AUTH0_CLIENT_ID
client_secret:
""$env"": AUTH0_CLIENT_SECRET
```
Only the `client_secret` needs to be kept secret, but for consistency I recommend using the `$env` mechanism for all three.
In development, you can run Datasette and pass in environment variables like this:
```
AUTH0_DOMAIN=""your-domain.us.auth0.com"" \
AUTH0_CLIENT_ID=""...client-id-goes-here..."" \
AUTH0_CLIENT_SECRET=""...secret-goes-here..."" \
datasette -m metadata.yml
```
If you are deploying using `datasette publish` you can pass these using `--plugin-secret`. For example, to deploy using Cloud Run you might run the following:
```
datasette publish cloudrun mydatabase.db \
--install datasette-auth0 \
--plugin-secret datasette-auth0 domain ""your-domain.us.auth0.com"" \
--plugin-secret datasette-auth0 client_id ""your-client-id"" \
--plugin-secret datasette-auth0 client_secret ""your-client-secret"" \
--service datasette-auth0-demo
```
Once your Datasette instance is deployed, you will need to add its callback URL to the ""Allowed Callback URLs"" list in Auth0.
The callback URL should be something like:
https://url-to-your-datasette/-/auth0-callback
## Usage
Once installed, a ""Sign in with Auth0"" menu item will appear in the Datasette main menu.
You can sign in and then visit the `/-/actor` page to see full details of the `auth0` profile that has been authenticated.
You can then use [Datasette permissions](https://docs.datasette.io/en/stable/authentication.html#configuring-permissions-in-metadata-json) to grant or deny access to different parts of Datasette based on the authenticated user.
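For example, to restrict access to a specific database to a single signed-in user you could add an `allow` block to your metadata - a minimal sketch, where `mydatabase` and the `id` value are placeholders you would replace with your own database name and the authenticated actor's id:
```yaml
databases:
  mydatabase:
    allow:
      id: replace-with-actor-id
```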
## Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-auth0
python3 -mvenv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
datasette-auth0
Datasette plugin that authenticates users using Auth0
If you are deploying using datasette publish you can pass these using --plugin-secret. For example, to deploy using Cloud Run you might run the following:
Once your Datasette instance is deployed, you will need to add its callback URL to the ""Allowed Callback URLs"" list in Auth0.
The callback URL should be something like:
https://url-to-your-datasette/-/auth0-callback
Usage
Once installed, a ""Sign in with Auth0"" menu item will appear in the Datasette main menu.
You can sign in and then visit the /-/actor page to see full details of the auth0 profile that has been authenticated.
You can then use Datasette permissions to grant or deny access to different parts of Datasette based on the authenticated user.
Development
To set up this plugin locally, first check out the code. Then create a new virtual environment:
cd datasette-auth0
python3 -mvenv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
474473836,R_kgDOHEflbA,datasette-nteract-data-explorer,hydrosquall/datasette-nteract-data-explorer,0,9020979,https://github.com/hydrosquall/datasette-nteract-data-explorer,automatic visual data explorer for datasette,0,2022-03-26T21:47:17Z,2022-10-04T03:28:02Z,2022-10-19T00:35:29Z,https://datasette-nteract-data-explorer.vercel.app/,137,8,8,TypeScript,1,1,1,1,0,1,0,0,8,apache-2.0,"[""automatic-viz"", ""datasette"", ""datasette-plugin"", ""dataviz""]",1,8,8,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,1,1,,,1,public,0,,0,
479175467,R_kgDOHI-jKw,pypi-to-sqlite,simonw/pypi-to-sqlite,0,9599,https://github.com/simonw/pypi-to-sqlite,Load data about Python packages from PyPI into SQLite,0,2022-04-07T23:09:02Z,2022-04-08T15:16:03Z,2022-04-08T16:29:53Z,,24,2,2,Python,1,1,1,1,0,0,0,0,1,apache-2.0,[],0,1,2,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# pypi-to-sqlite
[](https://pypi.org/project/pypi-to-sqlite/)
[](https://github.com/simonw/pypi-to-sqlite/releases)
[](https://github.com/simonw/pypi-to-sqlite/actions?query=workflow%3ATest)
[](https://github.com/simonw/pypi-to-sqlite/blob/master/LICENSE)
Load data about Python packages from PyPI into SQLite
## Installation
Install this tool using `pip`:
pip install pypi-to-sqlite
## Usage
To create a SQLite database with details of one or more packages, run:
pypi-to-sqlite pypi.db datasette sqlite-utils
You can also process JSON that you have previously saved to disk like so:
curl -o datasette.json https://pypi.org/pypi/datasette/json
pypi-to-sqlite pypi.db -f datasette.json
The tool will create three tables: `packages`, `versions` and `releases`. The full table schema is shown below.
To create the tables with a prefix, use `--prefix prefix`. For example:
pypi-to-sqlite pypi.db datasette --prefix pypi_
This will create tables called `pypi_packages`, `pypi_versions` and `pypi_releases`.
## Demo
You can see examples of tables created using this tool running in [Datasette](https://datasette.io/) here:
- [packages](https://datasette.io/content/pypi_packages)
- [versions](https://datasette.io/content/pypi_versions)
- [releases](https://datasette.io/content/pypi_releases)
## Database schema
```sql
CREATE TABLE [packages] (
[name] TEXT PRIMARY KEY,
[summary] TEXT,
[classifiers] TEXT,
[description] TEXT,
[author] TEXT,
[author_email] TEXT,
[description_content_type] TEXT,
[home_page] TEXT,
[keywords] TEXT,
[license] TEXT,
[maintainer] TEXT,
[maintainer_email] TEXT,
[package_url] TEXT,
[platform] TEXT,
[project_url] TEXT,
[project_urls] TEXT,
[release_url] TEXT,
[requires_dist] TEXT,
[requires_python] TEXT,
[version] TEXT,
[yanked] INTEGER,
[yanked_reason] TEXT
);
CREATE TABLE [versions] (
[id] TEXT PRIMARY KEY,
[package] TEXT REFERENCES [packages]([name]),
[name] TEXT
);
CREATE TABLE [releases] (
[md5_digest] TEXT PRIMARY KEY,
[package] TEXT REFERENCES [packages]([name]),
[version] TEXT REFERENCES [versions]([id]),
[packagetype] TEXT,
[filename] TEXT,
[comment_text] TEXT,
[digests] TEXT,
[has_sig] INTEGER,
[python_version] TEXT,
[requires_python] TEXT,
[size] INTEGER,
[upload_time] TEXT,
[upload_time_iso_8601] TEXT,
[url] TEXT,
[yanked] INTEGER,
[yanked_reason] TEXT
);
```
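For example, once you have built a database you could count the release files recorded for each package with a query like this (an illustrative query, not part of the tool itself):
```sql
select packages.name, count(*) as release_count
from releases
  join packages on releases.package = packages.name
group by packages.name
order by release_count desc;
```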
## pypi-to-sqlite --help
```
Usage: pypi-to-sqlite [OPTIONS] DB_PATH [PACKAGE]...
Load data about Python packages from PyPI into SQLite
Usage example:
pypi-to-sqlite pypy.db datasette sqlite-utils
Use -f to load data from a JSON file instead:
pypi-to-sqlite pypy.db -f datasette.json
Created tables will be packages, versions and releases
To create tables called pypi_packages, pypi_versions, pypi_releases use
--prefix pypi_:
pypi-to-sqlite pypy.db datasette sqlite-utils --prefix pypi_
Options:
--version Show the version and exit.
-f, --file FILENAME Import JSON from this file
-d, --delay FLOAT Wait this many seconds between requests
--prefix TEXT Prefix to use for the created database tables
--help Show this message and exit.
```
## Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd pypi-to-sqlite
python -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
","
pypi-to-sqlite
Load data about Python packages from PyPI into SQLite
Installation
Install this tool using pip:
pip install pypi-to-sqlite
Usage
To create a SQLite database with details of one or more packages, run:
pypi-to-sqlite pypi.db datasette sqlite-utils
You can also process JSON that you have previously saved to disk like so:
Usage: pypi-to-sqlite [OPTIONS] DB_PATH [PACKAGE]...
Load data about Python packages from PyPI into SQLite
Usage example:
pypi-to-sqlite pypy.db datasette sqlite-utils
Use -f to load data from a JSON file instead:
pypi-to-sqlite pypy.db -f datasette.json
Created tables will be packages, versions and releases
To create tables called pypi_packages, pypi_versions, pypi_releases use
--prefix pypi_:
pypi-to-sqlite pypy.db datasette sqlite-utils --prefix pypi_
Options:
--version Show the version and exit.
-f, --file FILENAME Import JSON from this file
-d, --delay FLOAT Wait this many seconds between requests
--prefix TEXT Prefix to use for the created database tables
--help Show this message and exit.
Development
To contribute to this tool, first check out the code. Then create a new virtual environment:
cd pypi-to-sqlite
python -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
pip install -e '.[test]'
To run the tests:
pytest
",1,public,0,,,
485962807,R_kgDOHPc0Nw,datasette-total-page-time,simonw/datasette-total-page-time,0,9599,https://github.com/simonw/datasette-total-page-time,Add a note to the Datasette footer measuring the total page load time,0,2022-04-26T22:09:58Z,2022-04-26T22:10:27Z,2022-04-26T22:11:48Z,,0,0,0,Python,1,1,1,1,0,0,0,0,0,apache-2.0,[],0,0,0,main,"{""admin"": false, ""maintain"": false, ""push"": false, ""triage"": false, ""pull"": false}",,,0,1,"# datasette-total-page-time
[](https://pypi.org/project/datasette-total-page-time/)
[](https://github.com/simonw/datasette-total-page-time/releases)
[](https://github.com/simonw/datasette-total-page-time/actions?query=workflow%3ATest)
[](https://github.com/simonw/datasette-total-page-time/blob/main/LICENSE)
Add a note to the Datasette footer measuring the total page load time
## Installation
Install this plugin in the same environment as Datasette.
datasette install datasette-total-page-time
## Usage
Once this plugin is installed, a note will appear in the footer of every page showing how long the page took to generate.
> Queries took 326.74ms · Page took 386.310ms
## How it works
Measuring how long a page takes to load and then injecting that note into the page is tricky, because you need to finish generating the page before you know how long it took to load it!
This plugin uses the [asgi_wrapper](https://docs.datasette.io/en/stable/plugin_hooks.html#asgi-wrapper-datasette) plugin hook to measure the time taken by Datasette and then inject the following JavaScript at the bottom of the response, after the closing `