Skip to main content

Tuberia... when data engineering meets software engineering

Project description

                      ▄▄▄█████▓ █    ██  ▄▄▄▄   ▓█████  ██▀███   ██▓ ▄▄▄
                      ▓  ██▒ ▓▒ ██  ▓██▒▓█████▄ ▓█   ▀ ▓██ ▒ ██▒▓██▒▒████▄
                      ▒ ▓██░ ▒░▓██  ▒██░▒██▒ ▄██▒███   ▓██ ░▄█ ▒▒██▒▒██  ▀█▄
                      ░ ▓██▓ ░ ▓▓█  ░██░▒██░█▀  ▒▓█  ▄ ▒██▀▀█▄  ░██░░██▄▄▄▄██
                        ▒██▒ ░ ▒▒█████▓ ░▓█  ▀█▓░▒████▒░██▓ ▒██▒░██░ ▓█   ▓██▒
                        ▒ ░░   ░▒▓▒ ▒ ▒ ░▒▓███▀▒░░ ▒░ ░░ ▒▓ ░▒▓░░▓   ▒▒   ▓▒█░
                          ░    ░░▒░ ░ ░ ▒░▒   ░  ░ ░  ░  ░▒ ░ ▒░ ▒ ░  ▒   ▒▒ ░
                        ░       ░░░ ░ ░  ░    ░    ░     ░░   ░  ▒ ░  ░   ▒
                                  ░      ░         ░  ░   ░      ░        ░  ░

Tuberia CI pipeline status Tuberia coverage status Tuberia contributors Tuberia total downloads Tuberia downloads per month
Data engineering meets software engineering.


Getting started

You need:

  • Spark 3.2.
  • Java JDK 11 (Required by Spark).
  • Poetry.
  • Make.

Once you have all the tools installed just open a shell on the root folder of the project and install the dependencies in a new virtual environment with:

$ make install

The previous command also installs some pre-commits.

Check that your package is installed with:

$ poetry run tuberia
▄▄▄█████▓     ██  ▄▄▄▄   ▓█████  ██▀███   ██▓ ▄▄▄
▓  ██▒ ▓▒ ██  ▓██▒▓█████▄ ▓█    ▓██  ██▒▓██▒▒████▄
▒ ▓██░ ▒░▓██  ▒██░▒██▒ ▄██▒███   ▓██ ░▄█ ▒▒██▒▒██  ▀█▄
░ ▓██▓  ▓▓█  ░██░▒██░█▀  ▒▓█   ▒██▀▀█▄  ░██░░██▄▄▄▄██
  ▒██▒  ▒▒█████▓ ░▓█  ▀█▓░▒████▒░██▓ ▒██▒░██░ ▓█   ▓██▒
   ░░   ░▒▓▒   ░▒▓███▀▒░░ ▒░ ░░ ▒▓ ░▒▓░░▓   ▒▒   ▓▒█░
        ░░▒░   ▒░▒          ░▒  ▒░       ▒▒          ░░░                 ░░                                                           ░
Version 0.0.0

If you can see that funky logo your installation is correct. Note that the version may change.

If you do not want to use poetry run in front of all your commands just activate the virtual environment with poetry shell. Use exit if you want to deactivate the environment.

How do I build the package?

You can build the package without installing the dependencies or without a proper Spark installation. Use make build or just make. You should see something like:

$ make
poetry build
Building tuberia (0.0.0)
  - Building sdist
  - Built tuberia-0.0.0.tar.gz
  - Building wheel
  - Built tuberia-0.0.0-py3-none-any.whl

How do I run tests?

Run tests locally with:

$ make test

Contribution guidelines

  • The code is auto-formatted by Black, so you can write the code without following any style guide and Black will take care of making it consistent with the current codebase.
  • Write tests: test not added in the PR, test that will never be added.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tuberia-0.0.0.tar.gz (3.9 kB view hashes)

Uploaded Source

Built Distribution

tuberia-0.0.0-py3-none-any.whl (3.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page