메뉴 건너뛰기




Volumn , Issue , 2007, Pages 357-375

Workflow management in Condor

Author keywords

[No Author keywords available]

Indexed keywords


EID: 84892356132     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-1-84628-757-2_22     Document Type: Chapter
Times cited : (109)

References (2)
  • 1
    • 84892245767 scopus 로고    scopus 로고
    • Note that DAGMan assumes the batch system guarantees that it will not "lose" jobs after they have been successfully submitted. Currently, if the job is lost by the batch system after being successfully submitted by DAGMan, DAGMan will wait indefinitely for the status of the job in the queue to change. An explicit query for the status of submitted jobs as opposed to waiting for the batch system to record job status changes may be necessary to address this. Also, if a job languishes in the queue forever, DAGMan currently is not able to "timeout" and remove the job and mark it as failed. When removing jobs, detecting and responding to the failure of a remove operation leaving a job "stuck" in the queue is an interesting question
    • Note that DAGMan assumes the batch system guarantees that it will not "lose" jobs after they have been successfully submitted. Currently, if the job is lost by the batch system after being successfully submitted by DAGMan, DAGMan will wait indefinitely for the status of the job in the queue to change. An explicit query for the status of submitted jobs (as opposed to waiting for the batch system to record job status changes) may be necessary to address this. Also, if a job languishes in the queue forever, DAGMan currently is not able to "timeout" and remove the job and mark it as failed. When removing jobs, detecting and responding to the failure of a remove operation (leaving a job "stuck" in the queue) is an interesting question.
  • 2
    • 84892244079 scopus 로고    scopus 로고
    • recent versions of Condor, the job can be edited to contain "noop job = true" which immediately terminates the job successfully
    • In recent versions of Condor, the job can be edited to contain "noop job = true" which immediately terminates the job successfully.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.