Subject: | |
From: | |
Reply To: | |
Date: | Wed, 5 Apr 2006 15:09:42 -0400 |
Content-Type: | text/plain |
Parts/Attachments: |
|
|
From the logs, it looks like both jobs ran on compute-1-7.
Looking at the logs on compute-1-7 in /var/spool/PBS, I don't see anything
unusual - but I'm not sure what something "bad" would look like.
It looks like the jobs ended around noon today - 2208 at 12:48 and 2209 at
11:49. The logs show a Type 54 (PBS_BATCH_Copyfiles) and type 6
(PBS_BATCH_DeleteJob) request from the PBS server at the same time as the
jobs ended. The entries in the logs like just like other job ending entries
I found in the logs. The PBS temp directories for the jobs are gone from
/tmp so it looks like PBS cleaned up after the jobs.
Dave
-----Original Message-----
From: Research Computing Support [mailto:[log in to unmask]] On
Behalf Of Robin
Sent: Wednesday, April 05, 2006 2:51 PM
To: [log in to unmask]
Subject: Fwd: Vanishing jobs ?
All,
Logs are at /var/spool/PBS/server_logs
20060404:04/04/2006 07:14:16;0100;Server@mulnx31;Job;
2208.mulnx31;enqueuing into serial, state 1 hop 1
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job
Queued at request of [log in to unmask], owner =
[log in to unmask], job name = Minim81s6, queue = serial
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job
Run at request of [log in to unmask] on hosts
compute-1-7:ncpus=1
20060404:04/04/2006 07:14:16;0008;Server@mulnx31;Job;2208.mulnx31;Job
Modified at request of [log in to unmask]
20060405:04/05/2006 12:48:44;0080;Server@mulnx31;Job;
2208.mulnx31;Obit received
20060405:04/05/2006 12:48:44;0010;Server@mulnx31;Job;
2208.mulnx31;Exit_status=0 resources_used.cpupercent=98
resources_used.cput=29:27:32 resources_used.mem=10588kb
resources_used.ncpus=1 resources_used.vmem=98496kb
resources_used.walltime=29:34:28
20060405:04/05/2006 12:48:44;0100;Server@mulnx31;Job;
2208.mulnx31;dequeuing from serial, state 5
Hmm.. thoughts ?
Thanks,
--------------
Robin
[log in to unmask]
513-529-1483
"Academia politics is the most vicious precisely because the stake is
so small" - Kissinger
Begin forwarded message:
> From: James Moller <[log in to unmask]>
> Date: April 5, 2006 2:03:31 PM EDT
> To: [log in to unmask]
> Subject: Vanishing jobs ?
> Reply-To: Research Computing Support <[log in to unmask]>,
> James Moller <[log in to unmask]>
>
> Some of my jobs (viz, Minim81s6 - possible job id 2208 and
> Minim81s7 - possible job id 2209) appear to have vanished while
> running this AM. The wall time does not appear to have been
> reached. Also, no "dot o" file was left to advise of the outcome of
> the job.
|
|
|