Newsgroups: comp.parallel.pvm
From: Graham Edward Fagg <sssfagg@csres.cs.reading.ac.uk>
Subject: Re: problems with 3.3 patch level 6 on Solaris 2.x
Organization: University of Reading, U.K.
Date: Mon, 23 Jan 1995 15:15:47 +0000
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Message-ID: <Pine.SOL.3.91.950123151037.11117B-100000@suma3>

On Tue, 17 Jan 1995, Alexander Rausch wrote:
> I am using PVM 3.3.6 on a mixed set of SUN OS 4.1.x and Solaris 2.3. Sometimes
> (randomly) the Solaris machines seem to disconnect from the rest of the virtual
> machines indeed. The complete systems hangs after I have got the error message:
> libpvm [t40002]: pvmendtask() shmctl RMID: Invalid argument
> Any ideas?
> Alexander

What do the log files of the machine that have disconnected contain?
Is it pvmbailout() ? or something else?

We need to find out why the machines left in the first place.. not the 
servers reaction.

But for now I'm running the Solaris and SunOS machines as seperate 
clusters. They are a lot more stable that way. If you need to run both 
together try the previous patch level....

Graham.
===============================================================================
Graham Edward Fagg ||| *** Cluster Computing Lab. ***  ||| e-mail me some time
 Computer Science ||  Software Engineering Subject Grp.  ||  G.E.Fagg@rdg.ac.uk
0734-875123 7626 | http://www.cs.reading.ac.uk/people/gef/ | PVM/MPI/LINDA/VMD
===============================================================================


