Skip to main content

Index DBnomics data with Apache Solr for full-text and faceted search

Project description

DBnomics Solr

Index DBnomics data into Apache Solr for full-text and faceted search.

Requirements:

  • a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.

See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.

Configuration

Environment variables:

Index a provider

Replace wto by the real provider slug in the following command:

dbnomics-solr index-provider /path/to/wto-json-data

Full mode vs incremental mode

When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.

When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.

It is possible to force the full mode with the --full option.

Bare repositories

The script has an option --bare-repo-fallback which tries to add .git at the end of the storage dir name, if not found.

Remove all data from a provider

To remove all the documents related to a provider (type:provider, type:dataset and type:series):

dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>

# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbnomics_solr-1.1.15.tar.gz (26.0 kB view hashes)

Uploaded Source

Built Distribution

dbnomics_solr-1.1.15-py3-none-any.whl (30.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page