Composable filesystem hooks and operators for Airflow.

airflow-fs

airflow-fs is a Python package that provides hooks and operators for manipulating files across a variety of file systems using Apache Airflow.

Why airflow-fs?

airflow-fs implements a single interface for different file system hooks, in contrast to Airflow's built-in file system hooks and operators. This approach makes it possible to interact with files independently of the underlying file system, using a common set of operators for general operations such as copying and deleting files.

Currently, airflow-fs supports the following file systems: local, FTP, HDFS, S3 and SFTP. Support for additional file systems can be added by implementing an additional file system hook that adheres to the required hook interface. See the documentation for more details.
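
To illustrate the idea of a shared hook interface, here is a minimal, self-contained sketch. It does not use the airflow-fs API: the names BaseFsHook, LocalFsHook, InMemoryFsHook and copy_file are hypothetical and chosen for this example only, and the in-memory backend merely stands in for a remote file system. Refer to the documentation for the actual hook interface.

```python
import abc
import io
import os
import shutil
import tempfile


class BaseFsHook(abc.ABC):
    """Hypothetical common interface (NOT the actual airflow-fs API)."""

    @abc.abstractmethod
    def open(self, path, mode="rb"):
        """Return a binary file-like object for the given path."""

    @abc.abstractmethod
    def exists(self, path):
        """Return True if the path exists on this file system."""

    @abc.abstractmethod
    def remove(self, path):
        """Delete the file at the given path."""


class LocalFsHook(BaseFsHook):
    """Backend for the local file system."""

    def open(self, path, mode="rb"):
        return open(path, mode)

    def exists(self, path):
        return os.path.exists(path)

    def remove(self, path):
        os.remove(path)


class _MemoryWriter(io.BytesIO):
    """Buffer that stores its contents in a dict when closed."""

    def __init__(self, store, path):
        super().__init__()
        self._store = store
        self._path = path

    def close(self):
        if not self.closed:
            self._store[self._path] = self.getvalue()
        super().close()


class InMemoryFsHook(BaseFsHook):
    """Toy backend standing in for a remote file system."""

    def __init__(self):
        self._files = {}

    def open(self, path, mode="rb"):
        if "w" in mode:
            return _MemoryWriter(self._files, path)
        return io.BytesIO(self._files[path])

    def exists(self, path):
        return path in self._files

    def remove(self, path):
        del self._files[path]


def copy_file(src_hook, src_path, dst_hook, dst_path):
    """Copy a file between any two hooks that follow the interface."""
    with src_hook.open(src_path, "rb") as src, dst_hook.open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst)


if __name__ == "__main__":
    local, memory = LocalFsHook(), InMemoryFsHook()
    with tempfile.TemporaryDirectory() as tmp_dir:
        source = os.path.join(tmp_dir, "example.txt")
        with local.open(source, "wb") as file_:
            file_.write(b"hello")
        copy_file(local, source, memory, "example.txt")
        print(memory.exists("example.txt"))
```

Because copy_file depends only on the interface, supporting another file system in this sketch means writing one more subclass; the copy logic itself stays unchanged.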

Documentation

Detailed documentation is available at: https://jrderuiter.github.io/airflow-fs.

License

This software is freely available under the MIT license.

History

Version 0.1.0

Initial release supporting local, FTP, HDFS, S3 and SFTP file systems.
