[comp.os.vms] Buffered vs. direct I/O's

SYSRUTH@utorphys.BITNET (01/19/88)

In terms of CPU and other resource usage, I would like to know which of these
is more expensive and precisely why. We are considering changing our charging
algorithm to reflect the fact that things other than CPU are our main costs
these days, and I/O operations are among the things we might charge for. I
would assume that buffered I/O is generally to/from things like terminals
and tape drives (if using RMS and not direct QIO's), while direct would be
for devices such as array processors, image processors (and display screens),
etc. Where exactly do the disks come in? I guess most system devices must be
buffered somewhat, but to be honest, this is an area where my understanding
has always been a bit fuzzy. So is it more expensive to do a number of single
direct I/O's to a device, or to fill something up and flush it out a bunch
at a time?
     
If I had to make a guess, I'd say there probably isn't that much difference,
since direct I/O's may be smaller but more of them are needed, while buffered
I/O has to collect somewhere before the operation that writes/reads it.
     
Can someone explain this to me clearly and cleanly?
     
Many thanks to all who respond - this may be a question that puzzles others
as well.
     
Ruth Milner
Systems Manager
University of Toronto Physics
     
SYSRUTH@UTORPHYS (BITNET)

dp@JASPER.Palladian.COM (Jeffrey Del Papa) (01/19/88)

    Date: Mon, 18 Jan 88 12:06 EST
    From: SYSRUTH%utorphys.bitnet@RELAY.CS.NET


    In terms of CPU and other resource usage, I would like to know which of these
    is more expensive and precisely why. [...] So is it more expensive to do a
    number of single direct I/O's to a device, or to fill something up and
    flush it out a bunch at a time?

The difference is hardware and device driver dependent; it is not something
under the user's control. Disks, tapes, and much network hardware are DMA
devices and do direct I/O: the operation is queued, the user's buffer is
validated and wired down, and the hardware is started. Transfers then occur
directly to/from memory without CPU intervention.

When a buffered I/O is queued, the user's buffer is validated, a block of
non-paged pool is allocated, and, depending on the direction, the user's buffer
is copied into it. The driver then either sends characters from the pool buffer
or receives characters and stuffs them into it. Note that the driver may still
use DMA to actually effect the transfer with the outside world (e.g. most
modern terminal controllers are DMA, but the I/O is buffered because the driver
examines each character at length). At the end of a read, the pool buffer is
copied back into the user's. Since the data is transferred (at interrupt level)
to/from memory that is already wired (non-paged pool), the user's buffer does
not have to be wired down, and the copy operation is free to fault.
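
To make that concrete, here is a minimal VAX C sketch (added for illustration,
not from the original post; the device and file names are just stand-ins). Both
cases go through the same $QIO interface; whether a request is charged as
buffered or direct is a property of the driver function, not of the caller.

/* Sketch only: same service call, two different charging outcomes.
 * Built with VAX C / DEC C on VMS; system services link by default. */
#include <descrip.h>
#include <iodef.h>
#include <rms.h>
#include <ssdef.h>
#include <starlet.h>
#include <stdio.h>
#include <string.h>

int main(void)
{
    unsigned short chan, iosb[4];       /* channel and I/O status block */
    static char msg[] = "one buffered I/O";
    char blk[512];
    int status;
    struct FAB fab = cc$rms_fab;
    $DESCRIPTOR(term, "SYS$OUTPUT");

    /* Case 1: terminal write.  The terminal driver marks its functions
     * as buffered, so the message is copied into nonpaged pool and the
     * request bumps the process's BUFIO count. */
    status = sys$assign(&term, &chan, 0, 0);
    if (!(status & 1)) return status;
    status = sys$qiow(0, chan, IO$_WRITEVBLK, iosb, 0, 0,
                      msg, strlen(msg), 0, 0, 0, 0);
    sys$dassgn(chan);

    /* Case 2: disk read.  Open a file "user file open" style so RMS
     * hands back a bare channel (in fab$l_stv), then read virtual
     * block 1 with the same service.  The disk driver is a DMA device,
     * the buffer is wired down, and DIRIO is charged instead.
     * (LOGIN.COM is just a stand-in for any existing file.) */
    fab.fab$l_fna = "LOGIN.COM";
    fab.fab$b_fns = strlen(fab.fab$l_fna);
    fab.fab$l_fop = FAB$M_UFO;
    status = sys$open(&fab);
    if (!(status & 1)) return status;
    status = sys$qiow(0, (unsigned short) fab.fab$l_stv, IO$_READVBLK,
                      iosb, 0, 0, blk, sizeof blk, 1, 0, 0, 0);
    if ((status & 1) && (iosb[0] & 1))
        printf("read %u bytes with one direct I/O\n", iosb[1]);
    sys$dassgn((unsigned short) fab.fab$l_stv);
    return SS$_NORMAL;
}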

Which kind of I/O to perform is chosen by the driver writer in advance, based
on the nature of the hardware and the driver philosophy (it is a bit in the
driver tables that the driver writer controls). The large vs. small I/O you
mention in your message is a function of RMS record blocking. RMS always uses
a "natural" block for the device (for disks this is some multiple of a disk
sector; for tape it is the block size, which is usually much larger than a
single record, to avoid wasting tape) and (un)packs it for the user. No $QIO
is issued when the user asks RMS for a record or hands it one; that is just a
subroutine call (which transfers into exec mode), and all the resources spent
in the (un)packing are charged through the normal CPU-time mechanism. RMS will
do read-ahead (issuing $QIOs for several large blocks), but it tries hard to
write data out as soon as possible.
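
A small RMS sketch may make this concrete (added for illustration, not from
the original post; the file name and record size are invented). Each sys$put
below is only a procedure call into RMS running in exec mode; RMS packs the
records into its own buffer and issues a direct-I/O $QIO to the disk only when
it has a bufferful, so a thousand puts turn into a comparative handful of
DIRIOs plus ordinary CPU time for the packing.

#include <rms.h>
#include <ssdef.h>
#include <string.h>

int main(void)
{
    struct FAB fab = cc$rms_fab;        /* file access block   */
    struct RAB rab = cc$rms_rab;        /* record access block */
    char rec[20];
    int i, status;

    fab.fab$l_fna = "SCRATCH.DAT";      /* hypothetical output file */
    fab.fab$b_fns = strlen(fab.fab$l_fna);
    fab.fab$b_fac = FAB$M_PUT;
    fab.fab$b_rfm = FAB$C_VAR;          /* variable-length records */

    rab.rab$l_fab = &fab;

    status = sys$create(&fab);
    if (!(status & 1)) return status;
    status = sys$connect(&rab);
    if (!(status & 1)) return status;

    /* 1000 records of 20 bytes each.  The packing work is charged as
     * ordinary CPU time; only the block-sized writes RMS issues behind
     * the scenes show up as direct I/Os. */
    memset(rec, 'A', sizeof rec);
    for (i = 0; i < 1000; i++) {
        rab.rab$l_rbf = rec;
        rab.rab$w_rsz = sizeof rec;
        status = sys$put(&rab);
        if (!(status & 1)) break;
    }

    sys$close(&fab);                    /* flushes any partial buffer */
    return status;
}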

In general the only reasonable things to account for on a VMS system are CPU
time (and even the VMS mechanism there is not absolutely accurate - it isn't
consistent about whom it charges for some system operations, and it charges a
fixed amount for certain others no matter how much less time they actually
take), connect time, disk space, and trees destroyed. The I/O counts for a
given job vary too much with system load and tuning, and a page fault may or
may not involve a disk access, so no equitable charging is possible (and with
modern page printers, even accounting for tree consumption is difficult).

This may seem restrictive, but you will catch much user flak if running the
same program over the same data (something physics types tend to do) results
in greatly varying prices, and you will have enough gripes without that one.
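
As an aside on the accounting angle, the BUFIO and DIRIO totals under
discussion are per-process counters that can be sampled with $GETJPI. This
sketch is added for illustration and is not from the original post; it assumes
the standard item-list layout and the JPI$_BUFIO/JPI$_DIRIO/JPI$_CPUTIM item
codes from jpidef.h.

#include <jpidef.h>
#include <ssdef.h>
#include <starlet.h>
#include <stdio.h>

/* Standard VMS item-list entry: word length, word item code,
 * buffer address, optional return-length address. */
struct item {
    unsigned short buflen;
    unsigned short itemcode;
    void *bufadr;
    unsigned short *retlen;
};

int main(void)
{
    unsigned int bufio = 0, dirio = 0, cputim = 0;
    unsigned short iosb[4];
    int status;

    struct item itmlst[] = {
        { sizeof bufio,  JPI$_BUFIO,  &bufio,  0 },
        { sizeof dirio,  JPI$_DIRIO,  &dirio,  0 },
        { sizeof cputim, JPI$_CPUTIM, &cputim, 0 },
        { 0, 0, 0, 0 }                  /* list terminator */
    };

    /* Null PID and process name mean "the current process". */
    status = sys$getjpiw(0, 0, 0, itmlst, iosb, 0, 0);
    if (!(status & 1)) return status;

    printf("BUFIO=%u DIRIO=%u CPU=%u (10-ms ticks)\n",
           bufio, dirio, cputim);
    return SS$_NORMAL;
}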

<dp>

nagy%warner.hepnet@LBL.GOV (Frank J. Nagy, VAX Wizard & Guru) (01/19/88)

First, the difference between buffered and direct I/Os is determined
by the device driver.  Devices which can do direct I/O (disks, array
processors, image processors, tapes, etc.) do their DMA directly
to or from buffers in the user's virtual memory space.  Buffered
devices (terminals, some network operations, etc.) use buffers
allocated in system space in nonpaged pool.  In earlier versions
of VMS the difference reflected that between DMA devices and programmed
I/O devices (usually the DZ11 terminal controllers).  So, to summarize, the
difference comes down to how data is moved to/from the buffers in the user
process's virtual address space:

	direct I/O devices transfer data directly to/from the user
	buffers by locking those buffers into memory, double-mapping
	them into system address space (to allow non-process context
	reference by the device driver) and initiating DMA operations
	between these buffers and the device.

	buffered I/O devices transfer data between the user buffers and
	system buffers in nonpaged pool.  The device driver then uses
	either DMA (new terminal controllers, network interfaces) or
	programmed I/O to move data between the system buffers and the
	device.
     
> In terms of CPU and other resource usage, I would like to know which of these
> is more expensive and precisely why. 

Buffered I/O can be more expensive in CPU time (extra memory move) but
the amount of data transferred in each operation is usually fairly small
for buffered I/O.  Depending upon the device (such as DZ11s), programmed
I/O between the system buffers and the device can use *lots* of CPU time
(e.g. the DZ11 causes lots of interrupt processing).  Buffered I/O also uses
system memory resources (nonpaged pool).

Direct I/O requires CPU overhead to set up (map the user buffers into
system space, lock buffers into memory, setup bus adapter maps for the
Unibus or QBus) but then the DMA operation does not require the CPU.
Typically direct I/O is used for DMA devices and large operations
(disks and tapes).  

Note that disk I/O is still direct I/O even when done through RMS.  The RMS
buffers that the direct I/O goes to/from are in the process's virtual address
space, so RMS buffering does not count as buffered I/O per se.  RMS buffering
allows lots of record operations to be done to/from the RMS buffers, with
direct I/O to/from the disk in large chunks as set by the RMS buffer size and
multibuffer count parameters; a sketch follows.
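
Those two knobs are fields in the RAB; here is a hedged sketch of cranking
them up for a sequential read (added for illustration, not from the original
post; the file name is invented).  With a multiblock count of 32 and two
buffers with read-ahead enabled, RMS fetches 16KB per disk $QIO and can fill
one buffer while the program consumes records from the other.

#include <rms.h>
#include <rmsdef.h>
#include <ssdef.h>
#include <string.h>

int main(void)
{
    struct FAB fab = cc$rms_fab;
    struct RAB rab = cc$rms_rab;
    char buf[2048];
    int status;

    fab.fab$l_fna = "BIGFILE.DAT";      /* hypothetical input file */
    fab.fab$b_fns = strlen(fab.fab$l_fna);
    fab.fab$b_fac = FAB$M_GET;

    rab.rab$l_fab = &fab;
    rab.rab$b_mbc = 32;                 /* 32 blocks (16KB) per disk $QIO */
    rab.rab$b_mbf = 2;                  /* two buffers ...                */
    rab.rab$l_rop = RAB$M_RAH;          /* ... so RMS can read ahead      */
    rab.rab$l_ubf = buf;
    rab.rab$w_usz = sizeof buf;

    status = sys$open(&fab);
    if (!(status & 1)) return status;
    status = sys$connect(&rab);
    if (!(status & 1)) return status;

    /* Sequential record reads; RMS refills its buffers with large
     * direct I/Os and hands records back from memory. */
    while ((status = sys$get(&rab)) & 1)
        ;                               /* record is at rab$l_rbf, rab$w_rsz bytes */

    sys$close(&fab);
    return (status == RMS$_EOF) ? SS$_NORMAL : status;
}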


= Frank J. Nagy   "VAX Guru & Wizard"
= Fermilab Research Division EED/Controls
= HEPNET: WARNER::NAGY (43198::NAGY) or FNAL::NAGY (43009::NAGY)
= BitNet: NAGY@FNAL
= USnail: Fermilab POB 500 MS/220 Batavia, IL 60510