Newsgroups: comp.parallel.pvm
From: pauls@iil.intel.com (Paul Schuster)
Subject: building robust applications
Summary: problems with pvmd when building robust applications
Keywords: robust pvm
Organization: Intel Israel (74) Ltd.
Date: 26 May 1996 05:41:36 GMT
Message-ID: <4o8qug$7gn@ilx018.iil.intel.com>

Hi,

I am using PVM 3.3.10 as a message passing kernal for a prototype 
distributed data collection program.

One of my PVM tasks monitors the others + the pvmd's and attempts to 
restart them if they fail. Most of the time this works fine, but I
occasionally see that attempts to restart a failed pvmd repetitively
fail due to the existence of an old /tmp/pvmd.<UID> file. As soon as 
I remove that file on the remote machine, the restart succeeds. (Not
sure if it is because of this file or the named pipe it points to).

Has anyone else come across this problem ? Are there any tried and
tested solutions ?

Thanks,

Paul.

