Apache's Open-Source World of Data Tools
Apache is the open-source hub for a wide range of data tools. Let's explore them and understand which tool fits your use case.
Mar 8, 2023 · 6 min read
ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) are both data integration processes used to move and transform data from one system to another. The main difference between ETL and ELT is the order in which the data is processed. In ...
Spinning up an Airflow environment by following an online tutorial? Make sure you are incorporating best practices.
Apache Airflow has Celery, Kubernetes, CeleryKubernetes, and Dask executors. This post explores what they are and how to use them.
Get started with Apache Airflow