Newsgroups: comp.parallel.mpi
From: rvdg@cs.utexas.edu (Robert van de Geijn)
Subject: Paper on Parallel Level 3 BLAS (MPI implementation)
Organization: CS Dept, University of Texas at Austin
Date: 12 Oct 1995 21:30:36 -0500
Message-ID: <45kj0c$v75@daffy.cs.utexas.edu>

We would like to announce the following paper that may be of
interest to the comp.parallel and comp.parallel.mpi community

Almadena Chtchelkanova, John Gunnels, Greg Morrow, James
Overfelt, Robert A. van de Geijn, "Parallel Implementation of BLAS:
General Techniques for Level 3 BLAS," TR-95-40, Department of Computer
Sciences, University of Texas, Oct. 1995. 

Abstract 

In this paper, we present straight forward techniques for a highly
efficient, scalable implementation of common matrix-matrix operations
generally known as the Level 3 Basic Linear Algebra Subprograms
(BLAS).  This work builds on our recent discovery of a parallel
matrix-matrix multiplication implementation, which has yielded
superior performance, and requires little work space.  We show that
the techniques used for the matrix-matrix multiplication naturally
extend to all important level 3 BLAS and thus this approach becomes an
enabling technology for efficient parallel implementation of these
routines and libraries that use BLAS.  Representative performance
results on the Intel Paragon system are given.

For further information, see 

    http://www.cs.utexas.edu/users/rvdg/reports.html

MPI codes associated with this paper will be released shortly.

Robert A. van de Geijn                  rvdg@cs.utexas.edu  
Associate Professor                     http://www.cs.utexas.edu/users/rvdg
Department of Computer Sciences         (Work)  (512) 471-9720
The University of Texas                 (Home)  (512) 251-8301 
Austin, TX 78712                        (FAX)   (512) 471-8885 


-- 


