Newsgroups: comp.parallel.mpi
From: lusk@donner.mcs.anl.gov (Rusty Lusk)
Subject: Re: SIGSEGV-Error using MPICH
Organization: Argonne National Laboratory
Date: 20 Mar 1996 16:10:05 GMT
Message-ID: <4ipakt$kku@milo.mcs.anl.gov>


|> One of the 4 processes is the master. This master wants to send some data to
|> the 3 slaves. These slaves are waiting for the receive. But then the following
|> output appears on the screen.
|> 
|> p0_26180:  p4_error: Found a dead connection while looking for messages: 1
|> bm_list_21829:  p4_error: interrupt SIGINT: 2
|> rm_l_0_16555:  p4_error: interrupt SIGINT: 2
|> p2_14250:  p4_error: interrupt SIGINT: 2
|> rm_l_0_24352:  p4_error: interrupt SIGINT: 2
|> p3_26911:  p4_error: interrupt SIGINT: 2
|> p1_18516:  p4_error: interrupt SIGSEGV: 11
|> rm_l_779447913_13653:  p4_error: interrupt SIGINT: 2

Process 1 experienced a segmentation fault, which was trapped by the
system, which shut down the other processes.  It is impossible to tell
from this output what caused the segmentation fault.  If you think the
problem is in MPICH rather than in your code, you should report the
problem to mpi-bugs@mcs.anl.gov.

Rusty

