[gram-user] globusrun-ws: Job failed: The executable could not be started.
Neha Sharma
neha at fnal.gov
Mon Apr 6 11:30:21 CDT 2009
Hi
Yes, it does work with jobmanager Fork and jobmanager Condor
Cemon is basically jobmanager condor modified to perform matchmaking
between an incoming job and various available resources.
-Neha
On Apr 6, 2009, at 11:13 AM, Martin Feller wrote:
> Does it work with Fork as local resource manager (-Ft Fork)?
> Just curious: what is Cemon?
>
> -Martin
>
>
> Neha Sharma wrote:
>> Hi
>>
>> I am not able to figure out what could be the cause of this error.
>> I am
>> wondering if anyone on this list has seen this before..
>>
>> globusrun-ws: Job failed: The executable could not be started.
>>
>>
>> The command that I run is:
>> +++++++++++++++++++++++
>> globusrun-ws -dbg -submit -Jf neha.epr.fg -F fermigridosg1.fnal.gov:
>> 9443
>> -Ft Cemon -streaming -se n.err -so n.out -c /bin/true
>>
>> The executable exists on the ws container node.
>>
>> Running container in full debug mode does not show anything besides
>> the
>> same error as above
>>
>> === REQUEST MESSAGE (length 4834) (time 1239033124.965094000) ===
>> <soapenv:Envelope
>> xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/"
>> xmlns:xsd="http://www.w3.org/2001/XMLSchema"
>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
>> xmlns:wsa="http://schemas.xmlsoap.org/ws/2004/03/
>> addressing"><soapenv:Header><wsa:MessageID
>> soapenv:mustUnderstand="0">uuid:e03764a0-22c2-11de-8cea-
>> bf45018d1031</wsa:MessageID><wsa:To
>> soapenv:mustUnderstand="0">https://fnpcsrv1.fnal.gov:39240/wsrf/services/NotificationConsumerService
>> </wsa:To><wsa:Action
>> soapenv:mustUnderstand="0">http://docs.oasis-open.org/wsn/2004/06/wsn-WS-BaseNotification/Notify
>> </wsa:Action><wsa:From
>> soapenv:mustUnderstand="0"><wsa:Address>http://schemas.xmlsoap.org/ws/2004/03/addressing/role/anonymous
>> </wsa:Address></wsa:From><ns06:ResourceID
>> ns04:type="ns05:string"
>> xmlns:ns04="http://www.w3.org/2001/XMLSchema-instance"
>> xmlns:ns05="http://www.w3.org/2001/XMLSchema"
>> xmlns:ns06="http://www.globus.org/docs.oasis-open.org/wsn/2004/06/wsn-WS-BaseNotification-1.2-draft-01.wsdl
>> "
>> soapenv:mustUnderstand="0">dc10e2c0-22c2-11de-8ed9-001422086c92</
>> ns06:ResourceID></soapenv:Header><soapenv:Body><Notify
>> xmlns="http://docs.oasis-open.org/wsn/2004/06/wsn-WS-BaseNotification-1.2-draft-01.xsd
>> "><NotificationMessage><Topic
>> Dialect="http://docs.oasis-open.org/wsn/2004/06/TopicExpression/Simple
>> "
>> xmlns:ns1="http://www.globus.org/namespaces/2004/10/gram/job/
>> types">ns1:state</Topic><ProducerReference><wsa:Address>https://131.225.107.165:9443/wsrf/services/ManagedJobFactoryService
>> </wsa:Address><wsa:ReferenceProperties><ns2:ResourceID
>> xmlns:ns2="http://www.globus.org/namespaces/2004/10/gram/
>> job">dcbd5910-22c2-11de-8cea-bf45018d1031</ns2:ResourceID></
>> wsa:ReferenceProperties><wsa:ReferenceParameters/></
>> ProducerReference><Message
>> xsi:type="ns3:StateChangeNotificationMessageWrapperType"
>> xmlns:ns3="http://www.globus.org/namespaces/2004/10/gram/
>> job"><ns3:stateChangeNotificationMessage><ns4:state
>> xmlns:ns4="http://www.globus.org/namespaces/2004/10/gram/job/
>> types">Failed</ns4:state><ns5:fault
>> xmlns:ns5="http://www.globus.org/namespaces/2004/10/gram/job/
>> faults"><ns5:executionFailedFault><ns6:Timestamp
>> xmlns:ns6="http://docs.oasis-open.org/wsrf/2004/06/wsrf-WS-BaseFaults-1.2-draft-01.xsd
>> ">2009-04-06T15:52:00.310Z</ns6:Timestamp><ns7:Originator
>> xmlns:ns7="http://docs.oasis-open.org/wsrf/2004/06/wsrf-WS-BaseFaults-1.2-draft-01.xsd
>> "><wsa:Address>https://131.225.107.165:9443/wsrf/services/ManagedJobFactoryService
>> </
>> wsa:Address
>> ><wsa:ReferenceProperties><ns3:ResourceID>dcbd5910-22c2-11de-8cea-
>> bf45018d1031</ns3:ResourceID></
>> wsa:ReferenceProperties><wsa:ReferenceParameters/></
>> ns7:Originator><ns8:Description
>> xmlns:ns8="http://docs.oasis-open.org/wsrf/2004/06/wsrf-WS-BaseFaults-1.2-draft-01.xsd
>> ">The
>> executable could not be started.</ns8:Description><ns9:FaultCause
>> xmlns:ns9="http://docs.oasis-open.org/wsrf/2004/06/wsrf-WS-BaseFaults-1.2-draft-01.xsd
>> "><ns9:Timestamp>2009-04-06T15:52:00.310Z</
>> ns9:Timestamp><ns9:ErrorCode
>> dialect="http://www.globus.org/fault/stacktrace">
>> at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native
>> Method)
>> at
>> sun
>> .reflect
>> .NativeConstructorAccessorImpl
>> .newInstance(NativeConstructorAccessorImpl.java:39)
>>
>> at
>> sun
>> .reflect
>> .DelegatingConstructorAccessorImpl
>> .newInstance(DelegatingConstructorAccessorImpl.java:27)
>>
>> at java.lang.reflect.Constructor.newInstance(Constructor.java:494)
>> at java.lang.Class.newInstance0(Class.java:350)
>> at java.lang.Class.newInstance(Class.java:303)
>> at org.globus.exec.utils.FaultUtils.makeFault(FaultUtils.java:485)
>> at
>> org
>> .globus
>> .exec.utils.FaultUtils.createExecutionFailedFault(FaultUtils.java:
>> 396)
>>
>> at
>> org
>> .globus
>> .exec
>> .service
>> .exec.StateMachine.createFaultFromErrorCode(StateMachine.java:3120)
>>
>> at
>> org
>> .globus
>> .exec
>> .service.exec.StateMachine.processSubmitState(StateMachine.java:1172)
>>
>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at
>> sun
>> .reflect
>> .NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>>
>> at
>> sun
>> .reflect
>> .DelegatingMethodAccessorImpl
>> .invoke(DelegatingMethodAccessorImpl.java:25)
>>
>> at java.lang.reflect.Method.invoke(Method.java:585)
>> at
>> org
>> .globus
>> .exec.service.exec.StateMachine.processState(StateMachine.java:329)
>>
>> at org.globus.exec.service.exec.RunThread.run(RunThread.java:85)
>> </
>> ns9:ErrorCode><ns9:Description>org.globus.exec.generated.ExecutionFailedFaultType</ns9:Description></ns9:FaultCause><ns5:stateWhenFailureOccurred>Unsubmitted</ns5:stateWhenFailureOccurred><ns5:command>submit</ns5:command><ns5:gt2ErrorCode>17</ns5:gt2ErrorCode><ns5:attribute>stdin</ns5:attribute></ns5:executionFailedFault></ns5:fault><ns10:exitCode
>> xmlns:ns10="http://www.globus.org/namespaces/2004/10/gram/job/
>> types">0</ns10:exitCode><ns11:holding
>> xmlns:ns11="http://www.globus.org/namespaces/2004/10/gram/job/
>> types">false</ns11:holding></ns3:stateChangeNotificationMessage></
>> Message></NotificationMessage></Notify></soapenv:Body></
>> soapenv:Envelope>
>>
>> ----------------------------------------------
>> Current job state: Failed
>>
>> The sudoers file is also correct
>> ++++++++++++++++++++++++
>> # cat /etc/sudoers
>> Runas_Alias GLOBUSUSERS = ALL, !root
>>
>> globus ALL=(GLOBUSUSERS) \
>> NOPASSWD: \
>> /usr/local/vdt-1.10.1/globus/libexec/globus-job-manager-
>> script.pl *
>>
>> globus ALL=(GLOBUSUSERS) \
>> NOPASSWD: \
>> /usr/local/vdt-1.10.1/globus/libexec/globus-gram-local-proxy-
>> tool *
>>
>>
>>
>> Thanks
>> -Neha
>>
>>
>
More information about the gram-user
mailing list