본문 바로가기
Data/Data Engineering

[Airflow] 6-2. SubDagOperator -> TaskGroup 변환하기 (ImportError : SubDagOperator )

by DenverAlmighty 2024. 3. 6.
반응형

velog 마이그레이션 

원글: [Airflow] 6-1. SubDagOperator -> TaskGroup 변환하기 (ImportError : SubDagOperator ) (2022년 9월 11일)

 

Task를 그룹화하기 위해 SubDagOperator를 사용하면 아래와 같은 오류가 발생한다.

Broken DAG: [/opt/airflow/dags/group_dag.py] Traceback (most recent call last):
  File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
  File "/opt/airflow/dags/group_dag.py", line 3, in <module>
    from airflow.operators.bash import SubDagOperator
ImportError: cannot import name 'SubDagOperator' from 'airflow.operators.bash' (/home/airflow/.local/lib/python3.7/site-packages/airflow/operators/bash.py)

SubDagOperator를 사용할 수 없다는 오류인데, Airflow Docs를 찾아보면 SubDagOperator 대신 TaskGroup을 사용하라고 나온다.

https://airflow.apache.org/docs/apache-airflow/stable/_api/airflow/operators/subdag/index.html#module-airflow.operators.subdag

from airflow import DAG
from airflow.operators.bash import BashOperator
#from airflow.operators.bash import SubDagOperator
from subdags.subdag_sleeps import subdag_sleeps
from datetime import datetime

with DAG('group_dag', start_date=datetime(2022, 9, 1),
    schedule_interval='@daily', catchup=False) as dag:

    args = {'start_date': dag.start_date, 'schedule_interval': dag.schedule_interval, 'catchup': dag.catchup}

    '''
    sleeps = SubDagOperator(
        task_id='sleeps',
        subdag=subdag_sleeps(dag.dag_id, 'sleeps', args)
    )
    '''
    sleeps = sleep_tasks()
    

    check_files = BashOperator(
        task_id='check_files',
        bash_command='sleep 10'
    )

    sleeps >> check_files
from airflow import DAG
from airflow.operators.bash import BashOperator
/# 추가
from airflow.utils.task_group import TaskGroup

#def subdag_sleeps(parent_dag_id, child_dag_id, args):
def sleep_tasks():

    # 그룹 이름 정의
    with TaskGroup("sleeps", tooltip="Sleep tasks") as group:    
    #with DAG(f"{parent_dag_id}.{child_dag_id}", start_date=args['start_date'], schedule_interval=args['schedule_interval'], catchup=args['catchup']) as dag:

        sleep_a = BashOperator(
            task_id='sleep_a',
            bash_command='sleep 10'
        )

        sleep_b = BashOperator(
            task_id='sleep_b',
            bash_command='sleep 10'
        )

        sleep_c = BashOperator(
            task_id='sleep_c',
            bash_command='sleep 10'
        )
        return dag
 
728x90
반응형