Kubernetes Executor: non-existent upstream failed #19329

@bertrand-buffat

Apache Airflow version

2.1.3

Operating System

Debian GNU/Linux

Versions of Apache Airflow Providers

apache-airflow-providers-amazon==1.4.0
apache-airflow-providers-cncf-kubernetes==2.0.2

Deployment

Other 3rd-party Helm chart

Deployment details

No response

What happened

Some tasks are randomly set to Upstream Failed even though their upstream task actually succeeded.
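To make the symptom concrete, here is a minimal sketch of how the affected task instances could be picked out of a DAG run. It works on plain dictionaries rather than on Airflow's models, so `find_suspect_upstream_failed`, the `states` mapping, and the `upstream` mapping are all hypothetical names for illustration. It also assumes every task uses the default `all_success` trigger rule.

```python
from typing import Dict, List


def find_suspect_upstream_failed(
    states: Dict[str, str], upstream: Dict[str, List[str]]
) -> List[str]:
    """Return task ids marked upstream_failed although all direct upstreams succeeded.

    Assumes the default all_success trigger rule for every task (an assumption,
    not something stated in this report).
    """
    suspects = []
    for task_id, state in states.items():
        if state != "upstream_failed":
            continue
        parents = upstream.get(task_id, [])
        # A task with no upstream tasks cannot legitimately be judged here.
        if parents and all(states.get(p) == "success" for p in parents):
            suspects.append(task_id)
    return sorted(suspects)


# Hypothetical run: "transform" is flagged even though "extract" succeeded.
states = {
    "extract": "success",
    "transform": "upstream_failed",
    "load": "upstream_failed",
}
upstream = {"transform": ["extract"], "load": ["transform"]}

suspects = find_suspect_upstream_failed(states, upstream)
print(suspects)
```

In this example only `transform` is reported. `load` is skipped because its own upstream task is not in the `success` state, so its Upstream Failed state is an expected consequence rather than the inconsistency described here.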

What you expected to happen

I expect the downstream tasks to be scheduled once their upstream tasks have succeeded.

How to reproduce

The failure happens randomly during peak scheduling hours.

Screenshot 2021-10-30 at 03 25 08

We use the Kubernetes Executor.

Config for scheduler:

[scheduler]
job_heartbeat_sec = 5
clean_tis_without_dagrun_interval = 15.0
scheduler_heartbeat_sec = 5
num_runs = -1
processor_poll_interval = 1
min_file_process_interval = 300
dag_dir_list_interval = 300
print_stats_interval = 30
pool_metrics_interval = 5.0
scheduler_health_check_threshold = 30
orphaned_tasks_check_interval = 300.0
child_process_log_directory = /home/airflow/logs/scheduler
scheduler_zombie_task_threshold = 300
catchup_by_default = True
max_tis_per_query = 512
use_row_level_locking = True
max_dagruns_to_create_per_loop = 500
max_dagruns_per_loop_to_schedule = 500
schedule_after_task_execution = True
parsing_processes = 8
file_parsing_sort_mode = alphabetical
use_job_schedule = True
allow_trigger_in_future = False
dependency_detector = airflow.serialization.serialized_objects.DependencyDetector
run_duration = -1
statsd_port = 8125
statsd_prefix = airflow
max_threads = 5
authenticate = False
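For anyone comparing their own setup against this one, the following sketch reads a few of the values above with Python's standard `configparser` and converts them to typed values. The choice of options is an assumption about which settings matter most under peak scheduling load; it is not a claim about the cause of the problem. The inline `SAMPLE` string stands in for a real `airflow.cfg`.

```python
import configparser

# A subset of the [scheduler] section posted above, inlined so the sketch
# is self-contained. In practice this would be read from airflow.cfg.
SAMPLE = """
[scheduler]
scheduler_heartbeat_sec = 5
max_tis_per_query = 512
max_dagruns_per_loop_to_schedule = 500
schedule_after_task_execution = True
use_row_level_locking = True
"""

parser = configparser.ConfigParser()
parser.read_string(SAMPLE)
sched = parser["scheduler"]

# Convert the raw strings to typed values for easier comparison.
settings = {
    "scheduler_heartbeat_sec": sched.getint("scheduler_heartbeat_sec"),
    "max_tis_per_query": sched.getint("max_tis_per_query"),
    "max_dagruns_per_loop_to_schedule": sched.getint(
        "max_dagruns_per_loop_to_schedule"
    ),
    "schedule_after_task_execution": sched.getboolean(
        "schedule_after_task_execution"
    ),
    "use_row_level_locking": sched.getboolean("use_row_level_locking"),
}

for name, value in settings.items():
    print(f"{name} = {value!r}")
```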

Anything else

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!
