2017-04-08 180 views
4

这里是我的DAG文件和BashOperator任务:错误调用BashOperator:bash命令失败

my_dag = { 
dag_id = 'my_dag', 
start_date = datetime(year=2017, month=3, day=28), 
schedule_interval='01***', 
} 

my_bash_task = BashOperator(
task_id="my_bash_task", 
bash_command=bash_command, 
dag=my_dag) 

bash_command = "/home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh ""

在此之后answer我甚至给了bash的文件后空间,以避免TemplateNotFound错误。但是,当运行此任务给了我这个错误:airflow.exceptions.AirflowException:Bash命令失败

bash_command文件内容为:

#!/bin/bash 
DATABASE=db_name 
FILE=$DATABASE-`date +%F-%H%M%S`.backup 
export PGPASSWORD=password 
pg_dump -h localhost -p 5432 -U developer -F c -b -v -f ~/Dropbox/database_backup/location/$FILE db_name 
unset PGPASSWORD 

但是代替bash_command指向bash的文件写入多行字符串命令的工作原理:

bash_command = """ 
DATABASE=db_name 
FILE=$DATABASE-`date +%F-%H%M%S`.backup 
export PGPASSWORD=password 
pg_dump -h localhost -p 5432 -U developer -F c -b -v -f ~/Dropbox/database_backup/location/$FILE db_name 
unset PGPASSWORD 
""" 

正因为如此,我假设错误不是因为bash命令。 我甚至尝试用#!/ bin/sh替换bash文件中的#!/ bin/bash,但这也不起作用。

我跑了终止符sh db_back_up_bash.sh,它工作正常。

更新实际代码:

bash_file_location_to_backup_db = '{{"/home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh"}}' 
# bash_file_location_to_backup_db = "/home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh " 
bash_command = """ 
DATABASE=ksaprice_scraping 
FILE=$DATABASE-`date +%F-%H%M%S`.backup 
export PGPASSWORD=password 
pg_dump -h localhost -p 5432 -U developer -F c -b -v -f ~/Dropbox/database_backup/ksaprice/$FILE ksaprice_scraping 
unset PGPASSWORD 
""" 

backup_scraped_db_in_dropbox_task = BashOperator(
    task_id="backup_scraped_db_in_dropbox_task", 
    # bash_command=bash_command,# this works fine 
    bash_command=bash_file_location_to_backup_db,#this give error :airflow.exceptions.AirflowException: Bash command failed 
    dag=dag_crawl 
) 

错误跟踪:

[2017-04-11 20:02:14,905] {bash_operator.py:90} INFO - Output: 
2017-04-11 20:02:14,905 | INFO| root : Output: 
[2017-04-11 20:02:14,906] {bash_operator.py:94} INFO - /tmp/airflowtmp7FffJ2/backup_scraped_db_in_dropbox_taskQ6IVxm: line 1: /home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh: Permission denied 
2017-04-11 20:02:14,906 | INFO| root : /tmp/airflowtmp7FffJ2/backup_scraped_db_in_dropbox_taskQ6IVxm: line 1: /home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh: Permission denied 
[2017-04-11 20:02:14,906] {bash_operator.py:97} INFO - Command exited with return code 126 
2017-04-11 20:02:14,906 | INFO| root : Command exited with return code 126 
[2017-04-11 20:02:14,906] {models.py:1417} ERROR - Bash command failed 
Traceback (most recent call last): 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/models.py", line 1374, in run 
    result = task_copy.execute(context=context) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/operators/bash_operator.py", line 100, in execute 
    raise AirflowException("Bash command failed") 
AirflowException: Bash command failed 
2017-04-11 20:02:14,906 | ERROR| root : Bash command failed 
Traceback (most recent call last): 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/models.py", line 1374, in run 
    result = task_copy.execute(context=context) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/operators/bash_operator.py", line 100, in execute 
    raise AirflowException("Bash command failed") 
AirflowException: Bash command failed 
[2017-04-11 20:02:14,907] {models.py:1441} INFO - Marking task as FAILED. 
2017-04-11 20:02:14,907 | INFO| root : Marking task as FAILED. 
[2017-04-11 20:02:14,947] {models.py:1462} ERROR - Bash command failed 
2017-04-11 20:02:14,947 | ERROR| root : Bash command failed 
Traceback (most recent call last): 
    File "/home/jak/my_projects/workflow_env/bin/airflow", line 28, in <module> 
    args.func(args) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/bin/cli.py", line 585, in test 
    ti.run(ignore_task_deps=True, ignore_ti_state=True, test_mode=True) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/utils/db.py", line 53, in wrapper 
    result = func(*args, **kwargs) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/models.py", line 1374, in run 
    result = task_copy.execute(context=context) 
    File "/home/jak/my_projects/workflow_env/local/lib/python2.7/site-packages/airflow/operators/bash_operator.py", line 100, in execute 
    raise AirflowException("Bash command failed") 
airflow.exceptions.AirflowException: Bash command faile 

回答

1

我认为这是在气流中的错误,神社不应该指望.SH文件包含模板信息在BashOperator中。

我周围有通过将命令转换为格式神社将正确解释:

bash_command = '{{"/home/jak/my_projects/workflow_env/repo_workflow/db_backup_bash.sh"}}'

+0

按照你的建议,结果在错误:** airflow.exceptions.AirflowException:bash命令失败**。我认为你的@jhnclvr结果与此anwser具有相同的影响:http://stackoverflow.com/questions/42259298/bashoperator-doent-run-bash-file-apache-airflow – javed

+0

你确定你仍然得到:TemplateNotFound?我已经使用这种模式很多次,现在我的BashOperators,它工作正常。如果可能的话,也许在这里发布整个堆栈跟踪,但我认为现在只有一个实际的异常/错误,当你运行bash命令。 – jhnclvr

+0

不是TemplateNotFound错误,请检查有问题的代码 – javed