I've read Airflow's FAQ about "What's the deal with start_date?", but it still isn't clear to me why using a dynamic start_date is recommended against.
The first run happens at start_date + schedule_interval. Airflow doesn't run the DAG at start_date itself; it always runs at start_date + schedule_interval.
As the documentation mentions, if you give a dynamic start_date, e.g. datetime.now(), together with some schedule_interval (say 1 hour), that run will never execute: now() moves along with time, so datetime.now() + 1 hour is always in the future and is never reached.
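To make the reasoning above concrete, here is a minimal sketch in plain Python. It is not Airflow's scheduler code, only a simplified model that assumes the DAG file is re-evaluated on every scheduler pass; the helper first_run_due and the simulated clock are hypothetical names used for illustration.

```python
from datetime import datetime, timedelta


def first_run_due(start_date, interval, now):
    """Simplified model: the first run is due once start_date + interval has passed."""
    return now >= start_date + interval


interval = timedelta(hours=1)
clock = datetime(2024, 1, 1, 0, 0)         # simulated wall clock
static_start = datetime(2024, 1, 1, 0, 0)  # constant start_date

static_due = []
dynamic_due = []
for _ in range(4):  # four simulated scheduler passes, 30 minutes apart
    # Constant start_date: the target time start_date + interval stays fixed.
    static_due.append(first_run_due(static_start, interval, clock))

    # Dynamic start_date: re-evaluated as "now" on every pass (assumption),
    # so start_date + interval is always one hour ahead of the clock.
    dynamic_start = clock
    dynamic_due.append(first_run_due(dynamic_start, interval, clock))

    clock += timedelta(minutes=30)

print(static_due)   # [False, False, True, True]
print(dynamic_due)  # [False, False, False, False]
```

With a constant start_date the first run becomes due after one interval; with a start_date that tracks the clock, the due time moves forward on every pass and is never reached.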
The scheduler expects to see a constant start date and interval. If you change it, the scheduler might not notice until it reloads the DagBag, and if the new start date doesn't line up with your old schedule, it might break depends_on_past behavior.
If you don't need depends_on_past the simplest might be to stop using the scheduler, set the start date to some arbitrary old date, and externally trigger the DAG however you like using a crontab or similar.