Google Cloud Composer: Overcoming The Short-living Tasks Problem


Running short-living tasks

Figure: echo task information from the Airflow UI
# time echo my-awesome-string

real 0m0.000s
user 0m0.000s
sys 0m0.000s
# date +"%9N"; echo my-awesome-string ; date +"%9N"
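
The same kind of measurement can be sketched in Python. This is only an illustration, not part of the original setup: it launches the external `echo` binary through `subprocess`, so the figure it reports includes process start-up and will be larger than the shell built-in timed above. It assumes a Unix-like system where `echo` is on the `PATH`.

```python
import subprocess
import time

# Time a trivial command end to end, including process start-up.
start = time.perf_counter()
result = subprocess.run(
    ["echo", "my-awesome-string"],
    capture_output=True,  # collect stdout instead of printing it directly
    text=True,            # decode stdout as str rather than bytes
    check=True,           # raise if the command exits with a non-zero status
)
elapsed = time.perf_counter() - start

print(result.stdout, end="")
print(f"elapsed: {elapsed * 1000:.3f} ms")
```

Even with process start-up included, the work itself typically finishes far faster than a scheduler can dispatch it, which is what makes such a task "short-living".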

A more realistic, yet simple pipeline

  1. Run a query on BigQuery using the BigQuery operator.
  2. Export the data to GCS.
  3. Compose the exported file into a single file.
  4. Copy the composed file to S3.

Figure: Sample Pipeline
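
The four steps above form a strictly linear chain, where each step consumes what the previous one produced. A minimal sketch of that shape in plain Python follows. It deliberately avoids any Airflow or cloud SDK calls: `Step`, `run_pipeline`, the step names, and every bucket, dataset, and path string are hypothetical placeholders chosen for illustration, not names from the original pipeline.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One stage of the pipeline (hypothetical stand-in for an Airflow task)."""
    name: str
    action: Callable[[str], str]


def run_pipeline(steps: List[Step], initial: str) -> List[str]:
    """Run the steps strictly in order, feeding each output to the next step."""
    log = []
    current = initial
    for step in steps:
        current = step.action(current)
        log.append(f"{step.name}: {current}")
    return log


# Placeholder actions: each returns a made-up location string
# instead of calling BigQuery, GCS, or S3.
steps = [
    Step("run_query", lambda sql: "bq://example_dataset.result_table"),
    Step("export_to_gcs", lambda table: "gs://example-bucket/export/part-*.csv"),
    Step("compose_file", lambda shards: "gs://example-bucket/export/result.csv"),
    Step("copy_to_s3", lambda gcs_path: "s3://example-bucket/result.csv"),
]

log = run_pipeline(steps, "SELECT 1")
for line in log:
    print(line)
```

In a real DAG each step would be its own task, so a pipeline like this pays the scheduling overhead four times even when the individual steps finish quickly.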

The Chain Airflow plugin




