
ETL programming in Python

Project description

pygrametl (pronounced py-gram-e-t-l) is a Python framework that provides commonly used functionality for developing Extract-Transform-Load (ETL) processes.

pygrametl lets developers write the ETL process in Python instead of drawing it in a graphical user interface. To facilitate this, pygrametl provides object-oriented abstractions for commonly used operations, such as providing a uniform interface to data from various sources, processing data in parallel, maintaining slowly changing dimensions, and creating snowflake schemas.

Providing these abstractions as a framework instead of as an integrated application allows pygrametl to integrate seamlessly with other Python code. Developers can quickly create ETL flows using the provided abstractions and still have direct access to a complete programming language when more complex operations are needed.
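To make the idea of a dimension abstraction combined with ordinary Python code more concrete, here is a minimal sketch that uses only the standard library. It does not use pygrametl and is not its API: the class TinyDimension, its lookup and ensure methods, and the product and sale tables are all hypothetical names chosen for illustration.

```python
import sqlite3


class TinyDimension:
    """Hypothetical stand-in for a dimension abstraction (NOT pygrametl's API)."""

    def __init__(self, conn, name, key, attributes):
        self.conn = conn
        self.name = name              # dimension table name
        self.key = key                # surrogate key column
        self.attributes = attributes  # columns that describe a member

    def lookup(self, row):
        """Return the key of the member matching row, or None if absent."""
        where = " AND ".join(f"{a} = ?" for a in self.attributes)
        cur = self.conn.execute(
            f"SELECT {self.key} FROM {self.name} WHERE {where}",
            [row[a] for a in self.attributes],
        )
        found = cur.fetchone()
        return found[0] if found else None

    def ensure(self, row):
        """Return the member's key, inserting the member first if it is new."""
        key = self.lookup(row)
        if key is None:
            cols = ", ".join(self.attributes)
            marks = ", ".join("?" for _ in self.attributes)
            cur = self.conn.execute(
                f"INSERT INTO {self.name} ({cols}) VALUES ({marks})",
                [row[a] for a in self.attributes],
            )
            key = cur.lastrowid
        return key


# Assumed toy schema: one dimension table and one fact table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE product (productid INTEGER PRIMARY KEY, name TEXT, category TEXT)"
)
conn.execute("CREATE TABLE sale (productid INTEGER, amount REAL)")

product = TinyDimension(conn, "product", "productid", ["name", "category"])

# Made-up source rows; a real flow would read them from a file or database.
source_rows = [
    {"name": "Tea", "category": "drinks", "amount": "4.50"},
    {"name": "Bun", "category": "bakery", "amount": "2.00"},
    {"name": "Tea", "category": "drinks", "amount": "4.50"},
]

for row in source_rows:
    row["amount"] = float(row["amount"])    # transform step in plain Python
    row["productid"] = product.ensure(row)  # lookup-or-insert in the dimension
    conn.execute(
        "INSERT INTO sale VALUES (?, ?)", (row["productid"], row["amount"])
    )
conn.commit()

product_count = conn.execute("SELECT COUNT(*) FROM product").fetchone()[0]
sale_count = conn.execute("SELECT COUNT(*) FROM sale").fetchone()[0]
```

The loop body is regular Python, so any additional cleaning or branching logic can sit next to the dimension call. The "Tea" product appears twice in the source rows but is stored once in the dimension, while each sale is stored as its own fact row.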

Note: The parallel capabilities of pygrametl are currently considered experimental and are subject to change. In many cases they give better results when Jython is used instead of CPython, because Jython has no global interpreter lock (GIL).


Source distribution: pygrametl-2.3.2.tar.gz (99.0 kB)
