RESCOMP Archives

April 2006

RESCOMP@LISTSERV.MIAMIOH.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
David Woods <[log in to unmask]>
Reply To:
Research Computing Support <[log in to unmask]>, David Woods <[log in to unmask]>
Date:
Wed, 5 Apr 2006 15:09:42 -0400
Content-Type:
text/plain
Parts/Attachments:
text/plain (73 lines)
From the logs, it looks like both jobs ran on compute-1-7.

Looking at the logs on compute-1-7 in /var/spool/PBS, I don't see anything
unusual - but I'm not sure what something "bad" would look like.

It looks like the jobs ended around noon today - 2208 at 12:48 and 2209 at
11:49.  The logs show a Type 54 (PBS_BATCH_Copyfiles) and type 6
(PBS_BATCH_DeleteJob) request from the PBS server at the same time as the
jobs ended.  The entries in the logs like just like other job ending entries
I found in the logs.  The PBS temp directories for the jobs are gone from
/tmp so it looks like PBS cleaned up after the jobs.  

Dave

-----Original Message-----
From: Research Computing Support [mailto:[log in to unmask]] On
Behalf Of Robin
Sent: Wednesday, April 05, 2006 2:51 PM
To: [log in to unmask]
Subject: Fwd: Vanishing jobs ?

All,

Logs are at /var/spool/PBS/server_logs

20060404:04/04/2006 07:14:16;0100;Server@mulnx31;Job; 
2208.mulnx31;enqueuing into serial, state 1 hop 1
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job  
Queued at request of [log in to unmask], owner =  
[log in to unmask], job name = Minim81s6, queue = serial
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job  
Run at request of [log in to unmask] on hosts  
compute-1-7:ncpus=1
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job  
Modified at request of [log in to unmask]
20060405:04/05/2006 12:48:44;0080;Server@mulnx31;Job; 
2208.mulnx31;Obit received
20060405:04/05/2006 12:48:44;0010;Server@mulnx31;Job; 
2208.mulnx31;Exit_status=0 resources_used.cpupercent=98  
resources_used.cput=29:27:32 resources_used.mem=10588kb  
resources_used.ncpus=1 resources_used.vmem=98496kb  
resources_used.walltime=29:34:28
20060405:04/05/2006 12:48:44;0100;Server@mulnx31;Job; 
2208.mulnx31;dequeuing from serial, state 5

Hmm.. thoughts ?

Thanks,
--------------
Robin
[log in to unmask]
513-529-1483

"Academia politics is the most vicious precisely because the stake is  
so small" - Kissinger



Begin forwarded message:

> From: James Moller <[log in to unmask]>
> Date: April 5, 2006 2:03:31 PM EDT
> To: [log in to unmask]
> Subject: Vanishing jobs ?
> Reply-To: Research Computing Support <[log in to unmask]>,  
> James Moller <[log in to unmask]>
>
> Some of my jobs (viz, Minim81s6 - possible job id 2208 and  
> Minim81s7 -  possible job id 2209) appear to have vanished while  
> running this AM. The wall time does not appear to have been  
> reached. Also, no "dot o" file was left to advise of the outcome of  
> the job.

ATOM RSS1 RSS2