[gridway-user] Installation test problem

Eduardo Huedo Cuesta ehuedo at fdi.ucm.es
Sat Aug 5 09:04:44 CDT 2006


Dear Ryan,

That problem is related to the GramJob API in GT 4.0.2. We will fix it in the next release of GridWay.
In the meantime, you can work around it by adjusting the polling interval used to update the status of jobs.

Best regards,

Eduardo.

----- Original message -----
From: Ryan.Fraser at csiro.au
Date: Monday, July 31, 2006 11:29 am
Subject: RE: [gridway-user] Installation test problem
To: Ryan.Fraser at csiro.au, gridway-user at globus.org

> Thanks Eduardo and Ruben
> 
> I installed Ruben's example (with the hostname changed) and now I can submit,
> but the job remains pending and then times out:
> 
> [fra283 at ng2test ~]$ gwhost 0
> 
> HID OS              ARCH   MHZ  %CPU  MEM(F/T)  DISK(F/T)    N(U/F/T)  LRMS  HOSTNAME
> 0   Linux2.6.12.6-x i686   3201 185   180/431   40461/74312  0/2/2     Fork  ng2test.ivec.org
> 
> QUEUENAME  SL(F/T)  WALLT  CPUT  COUNT  MAXR  MAXQ  STATUS  DISPATCH   PRIORITY
> default    2/1      0      0     0      0     0     0       Immediate  NULL
> 
> On closer inspection, I noticed that the job actually runs, but Globus
> isn't updating the state correctly in gwd.
> 
> Why would this be?
> 
> Cheers
> 
> Ryan
> 
>   _____  
> 
> From: Fraser, Ryan (E&M, Kensington) 
> Sent: Wednesday, 26 July 2006 3:26 PM
> To: gridway-user at globus.org
> Cc: Fraser, Ryan (E&M, Kensington)
> Subject: RE: [gridway-user] Installation test problem
> 
>  
> 
> Thanks Eduardo 
> 
> That made that error go away. Now, however, when I submit the job it
> goes to the GridWay scheduler but isn't passed to the GT4
> installation on the same machine. I'm just using static host
> information, and the contents of the file describing the site are:
> 
> HOSTNAME="ng2test.ivec.org" ARCH="i686" OS_NAME="Linux" OS_VERSION="2.6-xen"
> CPU_MODEL="Intel(R) Pentium(R) 4 CPU 3" CPU_MHZ=3201 CPU_FREE=185 CPU_SMP=2
> NODECOUNT=2 SIZE_MEM_MB=431 FREE_MEM_MB=180 SIZE_DISK_MB=74312
> FREE_DISK_MB=40461 FORK_NAME="Fork" LRMS_NAME="Fork"
> LRMS_TYPE="fork" QUEUE_NAME[0]="default" QUEUE_NODECOUNT[0]=1
> QUEUE_FREENODECOUNT[0]=1 QUEUE_MAXTIME[0]=0 QUEUE_MAXCPUTIME[0]=0
> QUEUE_MAXCOUNT[0]=0 QUEUE_MAXRUNNINGJOBS[0]=0 QUEUE_MAXJOBSINQUEUE[0]=0
> QUEUE_STATUS[0]="0" QUEUE_DISPATCHTYPE[0]="Immediate"
> 
>  
> 
> Is there something wrong with the details in this file?
> ng2test.ivec.org is a GT4 container and I'm simply trying to
> get the Fork engine running.
> 
> Any help is appreciated.
> 
> Cheers
> 
> Ryan
> 
>  
> 
>   _____  
> 
> From: owner-gridway-user at globus.org
> [mailto:owner-gridway-user at globus.org] On Behalf Of Fraser, Ryan (E&M,
> Kensington)
> Sent: Friday, 21 July 2006 1:50 PM
> To: gridway-user at globus.org
> Subject: [gridway-user] Installation test problem
> 
>  
> 
> Hi, I have just installed gw5 on a GT 4.0.2 box. It appears to have
> installed OK and GridWay runs (I'm running it in multi-user mode).
> 
>  
> 
> I am trying to submit the simple hostname job with gwsubmit as user
> fra283:
> 
> EXECUTABLE = /bin/hostname
> 
>  
> 
> For testing I am using a STATIC info provider for gt4 (ws):
> 
> IM_MAD = static:gw_im_mad_static:-l examples/im/host.static::gridftp:ws
> EM_MAD = ws:gw_em_mad_ws:rsl2
> TM_MAD = gridftp:gw_tm_mad_ftp:
> 
>  
> 
> And the host file has the following:
> 
> HOSTNAME="ng2test.ivec.org" ARCH="i686" OS_NAME="Linux" OS_VERSION="2.6-xen"
> CPU_MODEL="Intel(R) Pentium(R) 4 CPU 3" CPU_MHZ=3201 CPU_FREE=185 CPU_SMP=2
> NODECOUNT=2 SIZE_MEM_MB=431 FREE_MEM_MB=180 SIZE_DISK_MB=74312
> FREE_DISK_MB=40461 FORK_NAME="Fork" LRMS_NAME="Fork"
> LRMS_TYPE="fork" QUEUE_NAME[0]="default" QUEUE_NODECOUNT[0]=1
> QUEUE_FREENODECOUNT[0]=1 QUEUE_MAXTIME[0]=0 QUEUE_MAXCPUTIME[0]=0
> QUEUE_MAXCOUNT[0]=0 QUEUE_MAXRUNNINGJOBS[0]=0 QUEUE_MAXJOBSINQUEUE[0]=0
> QUEUE_STATUS[0]="0" QUEUE_DISPATCHTYPE[0]="Immediate"
> 
> I am initially just trying to get the scheduler to work with the local GT4
> Fork engine (I can do a globusrun-ws to Fork OK with this user).
> 
> The gwd.log has the following:
> 
> tail: /opt/gw5.0/var/gwd.log: file truncated
> Fri Jul 21 13:46:17 2006 [IM][I]: Discovering hosts.
> Fri Jul 21 13:46:17 2006 [IM][I]: Monitoring hosts.
> Fri Jul 21 13:46:17 2006 [IM][I]: Hosts discovered by MAD (static): ng2test.ivec.org
> Fri Jul 21 13:48:10 2006 [UM][I]: Loading execution MADs for user fra283.
> Fri Jul 21 13:48:11 2006 [UM][W]:       Mode rsl2  for execution MAD ws not specified or not supported, using mode rsl.
> Fri Jul 21 13:48:11 2006 [UM][I]:       Execution MAD ws loaded (exec:gw_em_mad_ws, mode:rsl2 ).
> Fri Jul 21 13:48:11 2006 [UM][I]: Loading transfer MADs for user fra283.
> Fri Jul 21 13:48:11 2006 [UM][I]:       Transfer MAD gridftp loaded (exec: gw_tm_mad_ftp, arg: ).
> Fri Jul 21 13:48:11 2006 [UM][I]: User fra283 registered.
> Fri Jul 21 13:48:11 2006 [DM][I]: New job 0 allocated and initialized.
> Fri Jul 21 13:48:12 2006 [DM][I]: Dispatching job 0 to ng2test.ivec.org(default).
> Fri Jul 21 13:48:17 2006 [IM][I]: Monitoring hosts.
> Fri Jul 21 13:48:17 2006 [EM][E]: Submission of job 0 failed: Unable to parse RSL.
> Fri Jul 21 13:48:17 2006 [EM][W]: Job 0 failed, retrying execution, 2 retries left.
> Fri Jul 21 13:48:17 2006 [EM][E]: Submission of job 0 failed: Unable to parse RSL.
> Fri Jul 21 13:48:17 2006 [EM][W]: Job 0 failed, retrying execution, 1 retries left.
> Fri Jul 21 13:48:17 2006 [EM][E]: Submission of job 0 failed: Unable to parse RSL.
> Fri Jul 21 13:48:17 2006 [EM][E]: Job 0 failed, no retries left.
> Fri Jul 21 13:48:19 2006 [DM][I]: Job 0 failed.
> 
>  
> 
>  
> 
> Any help is really appreciated
> 
> Ta
> 
>  
> 
> Ryan Fraser (SE)
> 
> CSIRO Exploration & Mining ,
> ARRC, 26 Dick Perry Ave,
> Kensington, WA 6151 Australia 
> Phone +61 8 6436 8760 Fax +61 8 6436 8555
> 
>  
> 
> 
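
The gwd.log excerpt quoted above is easier to read once the warnings and errors are separated from the informational lines. The following is a minimal sketch in Python, assuming every entry follows the "<timestamp> [COMPONENT][LEVEL]: message" layout seen in the excerpt. The function names parse_gwd_log and problems are hypothetical and are not part of GridWay, and the sample text is a shortened copy of the lines from this thread.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpt in this thread:
#   "Fri Jul 21 13:48:17 2006 [EM][E]: Submission of job 0 failed: ..."
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3} \w{3} +\d{1,2} \d{2}:\d{2}:\d{2} \d{4})\s+"
    r"\[(?P<component>[A-Z]+)\]\[(?P<level>[A-Z])\]:\s*(?P<message>.*)$"
)


def parse_gwd_log(text):
    """Return one dict per line that matches the assumed layout."""
    entries = []
    for raw in text.splitlines():
        # Tolerate mail quoting ("> ") in front of a pasted log line.
        line = raw.lstrip("> ").strip()
        match = LINE_RE.match(line)
        if match:
            entry = match.groupdict()
            # Collapse the padding some messages carry after the colon.
            entry["message"] = " ".join(entry["message"].split())
            entries.append(entry)
    return entries


def problems(entries):
    """Keep only warnings (W) and errors (E)."""
    return [e for e in entries if e["level"] in ("W", "E")]


SAMPLE = """\
Fri Jul 21 13:48:11 2006 [UM][W]:       Mode rsl2  for execution MAD ws not specified or not supported, using mode rsl.
Fri Jul 21 13:48:11 2006 [UM][I]: User fra283 registered.
Fri Jul 21 13:48:12 2006 [DM][I]: Dispatching job 0 to ng2test.ivec.org(default).
Fri Jul 21 13:48:17 2006 [EM][E]: Submission of job 0 failed: Unable to parse RSL.
Fri Jul 21 13:48:17 2006 [EM][W]: Job 0 failed, retrying execution, 2 retries left.
Fri Jul 21 13:48:17 2006 [EM][E]: Job 0 failed, no retries left.
Fri Jul 21 13:48:19 2006 [DM][I]: Job 0 failed.
"""

entries = parse_gwd_log(SAMPLE)
counts = Counter((e["component"], e["level"]) for e in problems(entries))

for e in problems(entries):
    print(f'{e["stamp"]} {e["component"]}/{e["level"]}: {e["message"]}')
```

Run against the excerpt from this thread, the first problem listed is the [UM][W] warning about mode rsl2, which is logged before the "Unable to parse RSL" errors from the execution manager.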