dlt-metabase-source
Parent tables
Stateful tables: these get replaced on each load
'stats', 'cards', 'collections', 'dashboards', 'databases', 'metrics', 'pulses',
'tables', 'segments', 'users', 'fields'
Append (event) tables: these endpoints buffer a small event window, so you need to merge (deduplicate) the loaded data afterwards.
To do: add time filter parameters so only the requested time range is loaded.
'activity', 'logs'
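Because the append endpoints can re-deliver overlapping event windows, the same event may land in the table twice. A minimal sketch of the post-load merge, using an in-memory SQLite database as a stand-in for the warehouse; the `id` and `detail` columns are illustrative assumptions, not the actual Metabase schema:

```python
import sqlite3

# In-memory stand-in for the warehouse target.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE activity (id INTEGER, detail TEXT)")

# Two overlapping load windows append a duplicate event (id 2).
con.executemany("INSERT INTO activity VALUES (?, ?)",
                [(1, "login"), (2, "query"), (2, "query"), (3, "dashboard")])

# Merge step: keep one row per event id.
con.execute("""
    CREATE TABLE activity_merged AS
    SELECT id, detail FROM activity GROUP BY id
""")
rows = con.execute("SELECT id, detail FROM activity_merged ORDER BY id").fetchall()
print(rows)  # [(1, 'login'), (2, 'query'), (3, 'dashboard')]
```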
Some of these tables have sub-tables.
To join a parent table to a sub-table, use the join condition parent.dlt_id = child.parent_dlt_id.
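The parent/child linkage can be illustrated with a small SQLite sketch. Only the `dlt_id` / `parent_dlt_id` column convention comes from the description above; the table names and payload columns here are made up for the example:

```python
import sqlite3

con = sqlite3.connect(":memory:")
# Hypothetical parent table and sub-table following the dlt_id convention.
con.execute("CREATE TABLE dashboards (dlt_id TEXT, name TEXT)")
con.execute("CREATE TABLE dashboards__cards (parent_dlt_id TEXT, card TEXT)")
con.execute("INSERT INTO dashboards VALUES ('a1', 'Sales')")
con.executemany("INSERT INTO dashboards__cards VALUES (?, ?)",
                [("a1", "revenue"), ("a1", "orders")])

# Join the parent table to its sub-table rows.
rows = con.execute("""
    SELECT p.name, c.card
    FROM dashboards p
    JOIN dashboards__cards c ON p.dlt_id = c.parent_dlt_id
    ORDER BY c.card
""").fetchall()
print(rows)  # [('Sales', 'orders'), ('Sales', 'revenue')]
```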
Usage
Optionally, create a virtual environment:
python3 -m venv ./dlt_metabase_env4
source ./dlt_metabase_env4/bin/activate
Install the library:
pip install dlt-metabase-source
If the library cannot be found, ensure you have the required Python version (3.8+), as specified in the pyproject.toml file.
You can run the snippet file below to load a sample data set. You will need to add your target credentials first.
python run_load.py
First, import the loading method and add your credentials:
from dlt_metabase_source import load
# target credentials
# example for bigquery
creds = {
    "type": "service_account",
    "project_id": "zinc-mantra-353207",
    "private_key_id": "example",
    "private_key": "",
    "client_email": "example@zinc-mantra-353207.iam.gserviceaccount.com",
    "client_id": "100909481823688180493",
}
# or example for redshift:
# creds = ["redshift", "database_name", "schema_name", "user_name", "host", "password"]
# Metabase credentials
url = 'http....com'
user = 'example@ai'
password = 'dolphins'
Now you can use the code below to do a serial load. The mock_data=True flag below will load sample data; remove it or set it to False to load your own data.

# remove tables from this list if you only want some endpoints
tables = ['activity', 'logs', 'stats', 'cards', 'collections', 'dashboards', 'databases', 'metrics', 'pulses',
          'tables', 'segments', 'users', 'fields']
load(url=url,
     user=user,
     password=password,
     target_credentials=creds,
     tables=tables,
     schema_name='metabase',
     mock_data=True)
Or, for a parallel load, create Airflow tasks for each table like so:

for table in tables:
    load(url=url,
         user=user,
         password=password,
         target_credentials=creds,
         tables=[table],
         schema_name='metabase',
         mock_data=True)
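If you are not running Airflow, the same per-table fan-out can be parallelized in plain Python. This sketch substitutes a stub for the load() call just to show the pattern; the stub's behavior is an assumption for illustration, not the library's actual API:

```python
from concurrent.futures import ThreadPoolExecutor

tables = ['activity', 'logs', 'stats', 'cards']

def load_one(table):
    # Stand-in for load(..., tables=[table], ...); here we only
    # record which table would be loaded.
    return f"loaded {table}"

# One worker per table, mirroring one Airflow task per table.
with ThreadPoolExecutor(max_workers=len(tables)) as pool:
    results = list(pool.map(load_one, tables))

print(results)  # ['loaded activity', 'loaded logs', 'loaded stats', 'loaded cards']
```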
If you want to build your own pipeline or consume the source differently:

from dlt_metabase_source import MetabaseSource, MetabaseMockSource

prod = MetabaseSource(url='http....com',
                      user='example@ai',
                      password='dolphins')

dummy = MetabaseMockSource()

for task in dummy.tasks():
    print(task['table_name'])
    for row in task['data']:
        print(row)
Hashes for dlt-metabase-source-0.0.25.tar.gz

Algorithm | Hash digest
---|---
SHA256 | f07249a9f0156cf64bb5cfd610f39e3f65198334b3e10ded84bed59c7b44828a
MD5 | 32c882ac1c7e484d5d8e07faa1aa0440
BLAKE2b-256 | 07cbb040f2197c3877b4f5b31f933798adc5b47f96f2f3cae986eb41944abb09

Hashes for dlt_metabase_source-0.0.25-py3-none-any.whl

Algorithm | Hash digest
---|---
SHA256 | 4d4eda7842e4b9d6e6c65e414af0fff01254fdff26d650b209fc0e455baa214b
MD5 | 03215da6836bddc7ab8149303ac0307b
BLAKE2b-256 | 71eac49b5db59d16c1b133f75073fe2ce812a4de1cc0f8757b63ebcce56194c7