Apache Airflow: Open source Workflow management
Apache Airflow is the open source workflow management solution we offer to our customers.
- Creation of a data warehouse: multiple data sources to be unified in a single database.
- Simplification of Business Intelligence (BI) analysis processes.
- ETL and data Integration: introduction and maintenance of data retrieval, cleaning and loading processes.
Define, schedule, execute and monitor
data integration workflows.
Web interface for administration
API in the
What is Apache Airflow
Airflow is an open source solution for defining, scheduling, executing and monitoring data integration workflows.
The application core allows writing software components that are orchestrated and executed according to a graph schema (DAG). The platform features many ready-to-use integrations for more immediate and extensive coverage of all the needs
of a business process.
The product is maintained and evolved by the Apache Software Foundation.
Better collaboration and
unification of systems
Reduction of errors and
Data extraction and upload
Coordinates third party systems to perform data extraction and loading tasks
and cost savings
Ability to centralize the definition and management of data flows
Several integrations already in place such as Amazon S3, Azure, Apache Spark, etc...