acp@ms.uky.edu (ACPNET consultant) (03/09/90)
This is on an AT&T 6386 WGS running Sysv/386 3.2.1.
The hard disk in the machine is ~145 megabytes, and the tape drive only
holds 60meg, so I'm trying to fit a full disk backup on one tape with
something like
find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0
Now, including dd in the pipe is *loads* faster than not including
it, but I haven't been able to find any dd options which let the 
tape stream.  It keeps stopping and backing up.  When using something
like ibs=32k obs=512 (the tape writes 512-byte records), dd reports
no full input blocks, all partial blocks--I assume this corresponds to
bursts produced by compress.  Something like bs=512 produces all full
blocks but doesn't affect the tape speed.
I assume something as slow as a tape drive could keep up with compress
if the buffering was done properly; it seems that when dd has a large
input buffer, it should hold off writing until the buffer was full;
this would at least allow the tape to stream for several records while
dd fills up its input buffer again.
Should I be using something other than dd?  Any other suggestions?
Kenneth Herron
-- 
acp@ms.uky.edu        University of Kentucky         ACP Network Consultant
ukma!acp         Dept. of Mathematics, room 715 POT          (606) 257-2975
                       Lexington, KY 40506plocher@sally.Sun.COM (John Plocher) (03/09/90)
+-- In <14496@s.ms.uky.edu> acp@ms.uky.edu (ACPNET consultant) writes | This is on an AT&T 6386 WGS running Sysv/386 3.2.1. | | find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0 | it, but I haven't been able to find any dd options which let the | tape stream. It keeps stopping and backing up. When using something | like ibs=32k obs=512 (the tape writes 512-byte records), dd reports +-- Try something like find . -print | cpio -oc | compress | dd of=/dev/rmt/c0s0 -obs 1024k this writes 1Mb at a time to the tape - and the drive *will* stream. Don't worry about tape blocking - the QIC tapes are set up to do all blocking internally, unlike the 9 track reel tape drives you may be familiar with... You may also want to explore the different cpio options avaliable: -I rawdevice which selects the Input device, -O rawdevice which selects the Output device, and -C size which specifies a blocking size -I and -O will detect end of tape and prompt you to change tapes and press return when ready to continue ... find . -print | cpio -oc -O /dev/rst/c0s0 -C 102400 Lastly, there are several programs on the net that are replacements for either cpio (afio), or dd (ddd) which allow double buffering, multi-process buffering, ... which may be another solution to your problems... -John
hcj@lzsc.ATT.COM (HC Johnson) (03/10/90)
In article <14496@s.ms.uky.edu>, acp@ms.uky.edu (ACPNET consultant) writes: > This is on an AT&T 6386 WGS running Sysv/386 3.2.1. > > find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0 > > Now, including dd in the pipe is *loads* faster than not including > it, but I haven't been able to find any dd options which let the > tape stream. It keeps stopping and backing up. When using something If you want your tape to screem ( in bursts) try: dd of=/dev/rmt/c0s0 of=512b this will output 512 BLOCKS of 512 at a time. The tape will run continuously for the big write, then rest while the buffer is refilled by the input to dd. Howard C. Johnson ATT Bell Labs att!lzsc!hcj hcj@lzsc.att.com
nvk@ddsw1.MCS.COM (Norman Kohn) (03/10/90)
In article <14496@s.ms.uky.edu> acp@ms.uky.edu (ACPNET consultant) writes: >find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0 > >Now, including dd in the pipe is *loads* faster than not including >it, but I haven't been able to find any dd options which let the >tape stream. It keeps stopping and backing up. When using something >like ibs=32k obs=512 (the tape writes 512-byte records), dd reports >no full input blocks, all partial blocks--I assume this corresponds to >bursts produced by compress. Something like bs=512 produces all full >blocks but doesn't affect the tape speed. > Try dd ibs=5120 obs=512k (as I recall, this is what ISC uses when loading the system from tape). With ISC it does indeed produce improvement, perhaps even better than bs=512k which I have also used. While I haven't looked, I don't think that the blocks written on tape are any different: cpio and the tape driver still prepare the blocks they want, and all you've done is modify the amount of data that arrives at one time down the pipe. Reading, of course, you reverse the respective ibs/obs sizes. -- Norman Kohn | ...ddsw1!nvk Chicago, Il. | days/ans svc: (312) 650-6840 | eves: (312) 373-0564
cpcahil@virtech.uucp (Conor P. Cahill) (03/12/90)
In article <1990Mar10.041015.23468@ddsw1.MCS.COM> nvk@ddsw1.MCS.COM (Norman Kohn) writes: >In article <14496@s.ms.uky.edu> acp@ms.uky.edu (ACPNET consultant) writes: >>find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0 >> >> [discusion of how to get dd to stream deleted] >> >Try dd ibs=5120 obs=512k While all the suggestions that I have seen for this may actually get the tape to stream on some systems, there is no answer for every system. In addition, if there is some other high priority stuff going on, there is no command that will get the tape to stream always. For example, the following dd will stream an entire tape to a file without stopping at all: dd if=/dev/rmt0 of=junkfile bs=50k However, if I am working on the console and cause just a few lines to be output on the console, the streaming will stop. This even occurs if I use bs=2048k. Whenever I type ls -l on the console the tape drive will stop and restart. So the answer is: If you are working on the console, you may never get the tape to stream for very large chunks. If you are working elsewhere, you might still have problems because of what you are running on the system. If you system is totally quiet (except for the backup stuff itself) you "should" be able to get the data to stream. PS - Another part of this is how fast the compress can feed the data to dd. If your tape drive can get ahead of the compress output, you can't stream. That is why some people were saying to use ...|dd obs=big_value that way dd stored up the data and then fed it to the tape in one big chunk. I guess I am just rambling on now so I will stop. Remember there is no one solution, and sometimes none at all, to getting a tape drive to stream. -- Conor P. Cahill (703)430-9247 Virtual Technologies, Inc., uunet!virtech!cpcahil 46030 Manekin Plaza, Suite 160 Sterling, VA 22170
friedl@mtndew.UUCP (Steve Friedl) (03/14/90)
It was suggested that this be used to send compressed data to a tape drive: > find . -print | cpio -oc | compress | dd [blocking options] > /dev/rmt/c0s0 And HC Johnson suggests: > > If you want your tape to screem ( in bursts) try: > dd of=/dev/rmt/c0s0 of=512b > this will output 512 BLOCKS of 512 at a time. The tape will run continuously > for the big write, then rest while the buffer is refilled by the input to dd. The adventurous might want to investigate another approach that might help the tape stream even more. The problem with the huge blocksizes is that it stops while the buffer refills, so why not use two processes sharing (at least) two buffers. The parent process reads into some huge buffer, then forks a child to write the buffer to the output place (the tape or whatever). While the child is writing the buffer, the parent is busy filling a second buffer. Once the second one is full, the parent waits for the child to finish, then forks again to let it write. This does involve a process switch each block, but if the blocks are large enough it might not be too bad. Of course, you've got to have copy-on-write VM or this become ridiculous really quickly. I have not actually done this, but I bet a donut that AT&T has. Their "ctccpio" command feeding the stupid 23MB tape unit on the 3B2 has lots of process switches, and I think this is how they manage to keep the tape streaming. It trashes the machine for the duration, but it is >much< faster than a dd with even a large blocksize (it could also be because ctccpio turns on stream mode with some ioctl). You could probably do an even better job on this with a reader and a writer with a common set of shared-memory buffers. Then you get the speed of double buffering without the process switches. While writing this article I started diddling with some actual code to handle the process-switch part. After ten minutes it was just about finished, and I might post it in a few days if I ever get it working. I suppose one could really get a big win by building the shared-memory version right into compress, so there was no extra pipeio going on. Thoughts, anybody? Steve -- Stephen J. Friedl, KA8CMY / Software Consultant / Tustin, CA / 3B2-kind-of-guy +1 714 544 6561 voice / friedl@vsi.com / {uunet,attmail}!mtndew!friedl "How could anybody look at Miss April and *not* believe in a God?" - me
les@chinet.chi.il.us (Leslie Mikesell) (03/15/90)
In article <371@mtndew.UUCP> friedl@mtndew.UUCP (Steve Friedl) writes: >The adventurous might want to investigate another approach that >might help the tape stream even more. The problem with the huge >blocksizes is that it stops while the buffer refills, so why not >use two processes sharing (at least) two buffers. >The parent process reads into some huge buffer, then forks a >child to write the buffer to the output place (the tape or >whatever). While the child is writing the buffer, the parent is >busy filling a second buffer. Once the second one is full, the >parent waits for the child to finish, then forks again to let it >write. "afio" does this if you use the -f flag and it works pretty well. >I have not actually done this, but I bet a donut that AT&T has. >Their "ctccpio" command feeding the stupid 23MB tape unit on the >3B2 has lots of process switches, and I think this is how they >manage to keep the tape streaming. It trashes the machine for >the duration, but it is >much< faster than a dd with even a large >blocksize (it could also be because ctccpio turns on stream mode >with some ioctl). The second guess is correct - afio also knows how to stream the 3B2 tape if you compile the option in and use the -C flag. Turning on streaming mode is not something an ordinary program like dd should attempt because (a) it is wierd - see the code in afio, and (b) writing the wrong sized buffer will crash the machine. The 2nd process doesn't really gain that much because the tape controller eats most of the CPU anyway (23M tape). >You could probably do an even better job on this with a reader >and a writer with a common set of shared-memory buffers. Then >you get the speed of double buffering without the process >switches. I started on this a long time ago, but after crashing the 3B2 I moved the test code to a 3B1 and found that writing to a device from shared memory would crash the machine and that you couldn't malloc() anything after attaching a shared memory segment (3B1 only). Again, the 3B1 floppy controller eats the CPU so there isn't a big gain anyway. I recently tried the same code on a 386 and found that it would run about 50-70K/sec from a cpio of the (ESDI) disk to the (125M) tape. The program uses an arbitrary number of arbitrary sized blocks in shared memory and can copy over to local memory on the device side if needed. The tricky part is that it tries to guess when the tape has stopped and lets most of the shared memory fill up before re-starting. It can usually do slightly better than afio even though there is an extra pipe involved (from cpio). >Thoughts, anybody? I'll mail you a copy - if you have time you can clean it up and post it or let me know what you think of the concept. Les Mikesell les@chinet.chi.il.us