[comp.sys.alliant] non-zero ipl

weening@Gang-of-Four.Stanford.EDU (Joe Weening) (03/07/90)

Here's my understanding based on conversations with Alliant C.S.
people: "ipl" refers to the interrupt level at which a processor is
running.  At various points, the kernel assumes it is at level 0; if
it checks and finds that it isn't, you get a panic message.

Note that there are several of these: "kernel non-zero ipl" and just
plain "non-zero ipl"; also "fbuserr" and "cbuserr" and maybe others.
Sometimes, it is OK to proceed through these problems.  To do this,
type <esc>P at the console.  However, this doesn't always work and you
may get strange events after doing so.

After we brought up 5.5, we got one of the non-zero ipl errors so
often that Alliant installed a patch to automatically proceed and not
type an error message.  This kept us running for a while, but then we
found that an application run by one of our users consistently caused
a different bug, that we could not proceed through.

Alliant told us that Concentrix 5.5.1 would soon be sent to people
having problems, so I debated whether to wait for it, but finally
decided to roll back to 5.0.  This was at the end of January, and we
haven't yet received 5.5.1, so I'm glad I decided to roll back.  We
don't really need any of the features that 5.5 supports.
--
Joe Weening                                Computer Science Dept.
weening@Gang-of-Four.Stanford.EDU          Stanford University

neray@Alliant.COM (Phil Neray) (03/10/90)

Reply-To: neray@alliant.Alliant.COM (Phil Neray)
Distribution:
Organization: Alliant Computer Systems, Littleton, MA
Keywords: Multi-Threading UNIX

In article <1990Mar7.013905.29368@Neon.Stanford.EDU>
weening@Gang-of-Four.Stanford.EDU (Joe Weening) writes:

>Alliant told us that Concentrix 5.5.1 would soon be sent to people
>having problems, so I debated whether to wait for it, but finally
>decided to roll back to 5.0.  This was at the end of January, and we
>haven't yet received 5.5.1, so I'm glad I decided to roll back.  We
>don't really need any of the features that 5.5 supports.

Concentrix 5.5 has some very useful features for systems that have
lots of users (and need high throughput), and/or systems with the
requirement for lots of aggregate I/O.

This is because Concentrix is now extensively multi-threaded.

Concentrix has always been a symmetric multiprocessing implementation
of UNIX (which means the kernel could run on any available
processor). With 5.5, the kernel itself has been parallelized,
allowing mutiple sections of the kernel to be executing simultaneously
on multiple processors.

Most super and mini-super multiprocessor UNIX implementations have
"coarse-grain" locking - ie., the entire file system gets locked, or
the entire virtual memory system, or the entire network system. 

Concentrix 5.5 has thousands of fine-grain locks that control access 
to critical data structures and code regions in the kernel. Also, 
Concentrix has some built-in strategies to minimize the duration of a
lock and the number of locks that can be held by a particular process.

(Other multi-threaded UNIX implementations that we know of include
Sequent and Encore.)

Multi-threading increases overall multi-user throughput, multi-stream
aggregate I/O, and multi-stream network throughput. For example, the
maximum number of simulated users supported on an FX/80, as measured
by the AIM Multiuser Benchmark, is now over 3x what it was with
Concentrix 5.0. (Of course, the FX/80 was designed for large number
crunching jobs, but most systems are also used for standard UNIX
development activities like compiling and file copying which is what
the AIM benchmark measures.)

The aggregate I/O with disk striping (TFS) is about 2x compared with
Concentrix 5.0, and the aggregate network I/O is also about 2x (this
is for smaller packet sizes which are representative of terminal and
X11 network traffic). "Lock-wait" has been virtually eliminated.

Note that multi-threading does NOT improve single-stream I/O (that's 
in development for a future release).

Also, this multi-threaded Concentrix has now been ported to the
FX/2800 (up to 28 Intel i860 40 Mhz 64-bit processors with 1 GB
memory), which should be a very high throughput system due to the
high scalar performance of the processors (>41 VAX MIPS, Dhrystone)
and increased I/O bandwidth (multiple 20 MB/s VME channels).

My apologies if this sounds too much like a commercial, but the
Concentrix developers (led by Jeff Jones and Tony Anzelmo) have done
some really fine work here and I thought you should know.

----------------------------------------------------------------------

Phil Neray			Domain:	neray@alliant.com
Alliant Computer Systems	UUCP:	{mit-eddie|linus}!alliant!neray
Littleton, MA 01460		Phone:	(508) 486-1429

-- 
Larry Fisher			Domain:	fisher@alliant.com
Alliant Computer Systems	UUCP:	{mit-eddie|linus}!alliant!fisher
Littleton, MA 01460		Phone:	(508) 486-1449

collins@osf.org (Jeff Collins) (03/13/90)

In article <3723@alliant.Alliant.COM>, neray@Alliant.COM (Phil Neray) writes:
> Concentrix 5.5 has some very useful features for systems that have
> lots of users (and need high throughput), and/or systems with the
> requirement for lots of aggregate I/O.

	Good words on the benefits of Concentrix 5.5 deleted for brevity...

> The aggregate I/O with disk striping (TFS) is about 2x compared with
> Concentrix 5.0, and the aggregate network I/O is also about 2x (this
> is for smaller packet sizes which are representative of terminal and
> X11 network traffic). "Lock-wait" has been virtually eliminated.
> 
	Actually this understates the improvements that have been made in 
	the I/O throughput.  A more accurate statement would be that "aggregate
	I/O now scales with the number of I/O channels".  Previous to Concentrix
	5.5, the aggregate I/O topped out fairly early.  With the complete
	multithreading available in 5.5, this no longer happens.  I am sure that
	someone at Alliant can give very explicit numbers comparing 5.5 to version
	5.0 (what about it Jeff?), but I know that it is very impressive.

> My apologies if this sounds too much like a commercial, but the
> Concentrix developers (led by Jeff Jones and Tony Anzelmo) have done
> some really fine work here and I thought you should know.
> 
	I have to agree, the Concentrix 5.5 developers did a great job.

  Jeffery A. Collins                            Phone: (617) 621-8958
  Open Software Foundation                      FAX:   (617) 225-2782
  11 Cambridge Center                           Email: collins@osf.org
  Cambridge MA 02142                                   uunet!osf.org!collins