Newsgroups: comp.parallel.mpi
From: Steven VanderWiel <svw>
Subject: SGI MIPCH Question
Organization: University of Minnesota
Date: 11 Sep 1995 19:13:35 GMT
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Message-ID: <4321ov$d61@epx.cis.umn.edu>

Hi,

We are running version 1.0.10 of mpich on a cluster of 4 SGI Challenges
each of which has 4 processors.  When we compile with DEVICE, COMM = ch_shmem
everything works fine on a single machine.  When we compile with
DEVICE, COMM = ch_p4 everything works fine if we only use one processor on 
each machine.  However, when we set DEVICE, COMM = ch_p4 and try to run on 
all 16 processors of the cluster we have problems.  Specifically, the problem
manifests itself as follows:

     > mpirun -p4pg pgfile gauss_mpi
     p0_9146:  p4_error: more slaves than msg queues
     : 3
     P4 procgroup file is pgfile.
     >

where the contents of pgfile is:

    polar   3   /export/home/kittpeak/lilja/svw/bin/gauss_mpi
    grizzly 4   /export/home/kittpeak/lilja/svw/bin/gauss_mpi
    panda   4   /export/home/kittpeak/lilja/svw/bin/gauss_mpi
    kodiak  4   /export/home/kittpeak/lilja/svw/bin/gauss_mpi

Any ideas/advice?  Thanks.

Steve VanderWiel
University of Minnesota
Department of Electrical Engineering
4-174 EE/CSci Building
200 Union Street SE
Minneapolis, MN  55455

email : svw@ee.umn.edu
phone : (612) 626-8981
FAX   : 625-4583
WWW   : http://www-mount.ee.umn.edu/~svw


