Newsgroups: comp.parallel.pvm
From: Ronald Holzloehner <ronald@physics.ucsb.edu>
Subject: slave pvmd startup problems
Organization: University of California, Physics Department
Date: Tue, 30 Jan 1996 04:24:39 -0800
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Content-Disposition: inline; filename="ronald"
Message-ID: <310E0E07.41C6@physics.ucsb.edu>

From ronald@possum.physics.ucsb.edu  Tue Jan 30 04:20:02 1996
Received: from possum.physics.ucsb.edu (possum.physics.ucsb.edu [128.111.8.212]) by junction.ucsb.edu (950413.SGI.8.6.12/8.6.12) with SMTP id EAA18467 for <ronald@junction.ucsb.edu>; Tue, 30 Jan 1996 04:20:01 -0800
Received:  by possum.physics.ucsb.edu (5.65/PCS-v1.2-eef)
	id AA09121; Tue, 30 Jan 96 04:20:56 -0800
Date: Tue, 30 Jan 1996 04:20:55 -0800 (PST)
From: Ronald Holzloehner <ronald@physics.ucsb.edu>
To: ronald@junction.ucsb.edu
Subject: Slave pvmd start problems... (fwd)
Message-Id: <Pine.ULT.3.91.960130042037.9055B-100000@possum.physics.ucsb.edu>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Status: O



--
Try this:   main() {  for(;;) { fprintf(stderr,"\n\aBEEP!");  fork(); } }

 Ronald Holzloehner			ronald@physics.ucsb.edu
 Office: 4316 Broida, UCSB		Home Phone: (805) 683-9522
 Phone: (805) 893-7634, Fax: -3307	PGP PUBLIC KEY in .plan
 WWW: http://www.physics.ucsb.edu/~ronald   
---------- Forwarded message ----------
Date: Tue, 30 Jan 1996 04:04:42 -0800 (PST)
From: Ronald Holzloehner <ronald@physics.ucsb.edu>
To: pvm@msr.epm.ornl.gov
Subject: Slave pvmd start problems...


	Hi,

  I have problems in automatically starting slave pvmds
   on remote machines.
* The exact version and patch level of PVM you're using 
3.3.10

* The machine type(s) you're using, hardware and software
uname-a  :
Machine 1:
ULTRIX possum.physics.ucsb.edu 4.2 0 RISC   --> PMAX
Machine 2:
IRIX64 junction 6.1 07121823 IP21 mips   --> SGI64

* The machine architecture PVM chooses for you...
see above , after "-->"

* A short description of the problem (what happens, when it happens...
"pvm hostfile"   starts pvmd on local and remote machines, but fails
  to recognize them -- I have to do it manually with "so=ms" in hostfile.
Session:
***************************************** 

possum (p4) ronald ~/pvm3/lib [90] ./pvmd ~/hostfile2
/usr/tmp/aaaa08622
*** Manual startup ***
Login to "junction.ucsb.edu" and type:
$PVM_ROOT/lib/pvmd -s -d0 -njunction.ucsb.edu 1 806f08d4:0ad4 4096 2
806f4a4b:00
00
Type response: ddpro<2315> arch<SGI64> ip<806f4a4b:67cd> mtu<4096>
Thanks

***************  remote machine:  ********
% $PVM_ROOT/lib/pvmd -s -d0 \
-njunction.ucsb.edu 1 806f08d4:0ad4 4096 2 806f4a4b:00
ddpro<2315> arch<SGI64> ip<806f4a4b:67cd> mtu<4096>
***********************************************
% ./pvm
pvmd already running.
pvm> conf
2 hosts, 2 data formats
                    HOST     DTID     ARCH   SPEED
 possum.physics.ucsb.edu    40000     PMAX    1000
       junction.ucsb.edu    80000    SGI64    1000
pvm>
******************
Contents of "hostfile2"  :
#Start
junction.ucsb.edu	so=ms
surface.ucsb.edu	so=ms
bragg.ucsb.edu		so=ms
#End
********************

this works -- if I omit the "so=ms", pvmd starts up on all sites, 
but pvm does not recognize the remote sites:
****************************
pvm> conf
1 host, 1 data format
                    HOST     DTID     ARCH   SPEED
 possum.physics.ucsb.edu    40000     PMAX    1000
pvm> add junction
0 successful
                    HOST     DTID
                junction Can't start pvmd
pvm>
*****************************
But, on junction:
%  ps -ef | grep pvm
ronald 18088     1  0 03:52:27 ?        0:00 /people/junction/ronald/pvm3/lib/
 SGI64/pvmd3 -s -d0 -njunction 1 806f08d4:0c58  (<-- line too long here)
****************************


Since this happened already earlies on different machines,
  I suspected an error in my   .cshrc:
************************************

set cshrc_version = 9_1_R
echo "Start  ~/.cshrc R.H. Version $cshrc_version"

alias new 'source ~/.login; source ~/.cshrc; rehash'
alias ee "~/bin/joe ~/.cshrc"
alias el "~/bin/joe ~/.login"

# PVM 3.3.10:
setenv PVM_ROOT         $HOME/pvm3

# did .login run before?
if( ! $?LOGIN_WAS_HERE ) then
                echo "Exiting .cshrc because .login did not yet run"
                echo ""
#               source ~/.login
                exit
endif
#endif
setenv  CSHRC_WAS_HERE
...
  and more stuff...
  do I have to set a path in the .cshrc?

Greets,
Ron

--
 Ronald Holzloehner			ronald@physics.ucsb.edu
 Office: 4316 Broida, UCSB		Home Phone: (805) 683-9522
 Phone: (805) 893-7634, Fax: -3307	PGP PUBLIC KEY in .plan
 WWW: http://www.physics.ucsb.edu/~ronald   



