[comp.unix.questions] Why partition disks?

rfinch@caldwr.water.ca.gov (Ralph Finch) (05/31/90)

We are about to receive two 1.2GB SCSI drives (Wren 7s) :-) :-) .
These will be used mainly for data storage with some space for home
directories and public domain stuff.  We are wondering how to
partition these things when they get here.  Actually we would like 1
partition per disk if it won't hurt anything.

1)  Does partitioning affect performance (capacity and speed)?

2)  Does partitioning affect fragmentation?  Does one have to worry
    about fragmentation with Unix, or SCSI, or ?

-- 
Ralph Finch			916-445-0088
rfinch@water.ca.gov		...ucbvax!ucdavis!caldwr!rfinch
Any opinions expressed are my own; they do not represent the DWR

bill@TWG.UUCP (Bill Irwin) (05/31/90)

In article <157@locke.water.ca.gov> rfinch@caldwr.water.ca.gov (Ralph Finch) writes:
>
>1)  Does partitioning affect performance (capacity and speed)?
>
>2)  Does partitioning affect fragmentation?  Does one have to worry
>    about fragmentation with Unix, or SCSI, or ?

Everything  I have ever read about disk management tells me that  smaller
file systems on a disk are more efficient than large ones.  There are far
fewer inodes to check, for one thing.

I'm  not  sure  if you really mean partitions or file systems.   I  would
strongly  recommend your boot disk be split into at least 2 file systems.
One  for  /dev/root,  and  only put the O/S and things  that  can't  live
anywhere else there.  The rest of the disk could be a 2nd and/or 3rd file
system.   Have  the O/S on its own file system allows you to reload  your
O/S  without  having  to  reinstall all your  software,  or  cherry  pick
applications out of a tape archive taken prior to the reload.

You  do need to be concerned about fragmentation with Unix, but I believe
there  are fsck options that can reorganize a file system somewhat.   I'm
not totally familiar with though, so I won't hazzard a wrong suggestion.

You  get a tremendous feeling of wealth with lots of disk space...but  it
always seems to get used up again somehow.....

--
Bill Irwin
TWG The Westrheim Group
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
uunet!van-bc!twg!bill     (604)431-9600 (voice)      (604)431-4629 (fax)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

meissner@osf.org (Michael Meissner) (06/01/90)

In article <157@locke.water.ca.gov> rfinch@caldwr.water.ca.gov (Ralph
Finch) writes:

| We are about to receive two 1.2GB SCSI drives (Wren 7s) :-) :-) .
| These will be used mainly for data storage with some space for home
| directories and public domain stuff.  We are wondering how to
| partition these things when they get here.  Actually we would like 1
| partition per disk if it won't hurt anything.
| 
| 1)  Does partitioning affect performance (capacity and speed)?
| 
| 2)  Does partitioning affect fragmentation?  Does one have to worry
|     about fragmentation with Unix, or SCSI, or ?

There is some folklore that things placed in the middle of the disk
will be accessed quicker than things placed on either end since the
head is typically somewhere near the middle.  Thus some system
managers have placed absolutely critical partitions in the middle to
deal with this (yes, I've done this in a previous life).  I'm not sure
it's worth it, unless you have some fixed size data base that accounts
for 80% of your disk traffic (though of course it might be an ideal
spot for /usr and/or swap).

It is possible that fsck and/or the kernel on some UNIX ports can't
deal with a 1.2G disk (which is why the original unix partitioned
disks).  If your system limits you to 16-bit inodes, you probably
should partition the disk, since you are getting into the range that
the system won't be able to allocate all of the available disk blocks
(I forget where the cutoff value is).  Also, cpio can only deal with
16-bit in binary mode or ~18 bits in decimal inodes.

Fragmentation will always exist, but UNIX gives you no tools to deal
with it (other than full dump and load).
--
Michael Meissner	email: meissner@osf.org		phone: 617-621-8861
Open Software Foundation, 11 Cambridge Center, Cambridge, MA

Catproof is an oxymoron, Childproof is nearly so

flint@gistdev.gist.com (Flint Pellett) (06/01/90)

Reasons for having more than one file system:

1. Each file system can have at most 65,535 inodes, because the inode
number is stored in a 16 bit field in the directories.  That means you
have a maximum of that many files in the file system-- which may sound
like a lot until you look at that 1.2 GB space and realize that if your
average file is less than about 18K bytes you'll run out of inodes
long before you are able to take advantage of the disk space.  Keeping
news in it's own file system (since it tends to use up lots and lots
of inodes) is often a good idea.

2. If you have any users you don't completely trust - (perhaps because
they are still naive and don't know what they are doing, not necessarily
because they are dishonest) you can restrict them to one file system,
but if there is only one file system then they may do something wrong
and use up all the space everywhere, which could then cause other
things that aren't theirs to bomb off with no space in /tmp, etc.
(Don't laugh- it happens.  I had a group of 3 college level users
end up copying all of /usr into the home directories of new logins
they were creating in /usr2 when they screwed up their creation script.
(They were copying /usr/$TEMPLATE to the new location when $TEMPLATE
was not set, or something like that.)  I first found out about it
when they complained that "Creating logins used to take 10 seconds
and now it takes 20 minutes per login...")

3. You need to think about how you are going to do backups.  If you have
a tape unit that handles 1.2GB/tape maybe this doesn't matter, or if you
have backup software better than the usual default of a cpio to the
tape it may not matter.  But if you are copying onto 150MB tapes with
the normal backup scripts that cpio one filesystem at a time to a series
of tapes it does matter- having to recover a single file that someone
accidentally deletes from the 7th tape of an 8 tape cpio archive will be
pretty painful and slow.  If you created 4 partitions of 300MB each,
then each backup will use at most 2 tapes;  recovering a file becomes
a lot less of a chore, because you won't have to hunt through more
than 2 tapes to find it.  This also makes the backups more "secure":
if tape 2 of your 8 tape backup is unreadable because Sally Operator
spilled Pepsi on it, only part of your backup becomes unuseable, not
all of it.
-- 
Flint Pellett, Global Information Systems Technology, Inc.
1800 Woodfield Drive, Savoy, IL  61874     (217) 352-1165
INTERNET: flint%gistdev@uxc.cso.uiuc.edu
UUCP:     uunet!gistdev!flint

andy@syma.sussex.ac.uk (Andy Clews) (06/01/90)

Someone else has already mentioned that partitioning does positively
affect performance, but you should also look at it from the practical
point of view.  Dividing different stuff into partitions means that you
can do dumps at varying intervals for stuff that's of varying
importance.  For example, we have a /scratch partition (used for
temporary work files and as a "parking" area) that is never dumped,
whereas the user partitions are dumped daily.  Having everything on one
partition forces you to dump everything and thus use more tape. 

You might also want to build file systems with different block sizes in
order to improve fragmentation. For example, our /news partition has
small block sizes due to large number of inodes and relatively small
mean file sizes. The user partitions have a wider spectrum of file size,
and not so many inodes, so are built with larger block sizes.

Should any serious corruption occur it will be far more disruptive to
have to rebuild one large partition, rather than to rebuild just a
single partition out of several.

Andy Clews, Computing Service, Univ. of Sussex, Brighton BN1 9QN, England
JANET: andy@syma.sussex.ac.uk   BITNET: andy%syma.sussex.ac.uk@uk.ac


-- 
Andy Clews, Computing Service, Univ. of Sussex, Brighton BN1 9QN, England
JANET: andy@syma.sussex.ac.uk   BITNET: andy%syma.sussex.ac.uk@uk.ac

meissner@osf.org (Michael Meissner) (06/02/90)

In article <912@gistdev.gist.com> flint@gistdev.gist.com (Flint
Pellett) writes:

| Reasons for having more than one file system:
| 
| 1. Each file system can have at most 65,535 inodes, because the inode
| number is stored in a 16 bit field in the directories. ...

Berkeley based systems typically use a long to hold the inode.

| 2. If you have any users you don't completely trust - (perhaps because
| they are still naive and don't know what they are doing, not necessarily
| because they are dishonest) you can restrict them to one file system,
| but if there is only one file system then they may do something wrong
| and use up all the space everywhere ...

Quotas in BSD are meant to solve this problem, but every BSD system
I've seen disables quotas.  However, in the cases where I was a system
admin, I prefered to use either full disks or partitions of disks for
each project, so that there was no gripping that project XYZ used all
of the disk space, and the people involved are out at a customer site
until next week.

| 3. You need to think about how you are going to do backups.  If you have
| a tape unit that handles 1.2GB/tape maybe this doesn't matter, or if you
| have backup software better than the usual default of a cpio to the
| tape it may not matter.  But if you are copying onto 150MB tapes with
| the normal backup scripts that cpio one filesystem at a time to a series
| of tapes it does matter- having to recover a single file that someone
| accidentally deletes from the 7th tape of an 8 tape cpio archive will be
| pretty painful and slow.  If you created 4 partitions of 300MB each,
| then each backup will use at most 2 tapes;  recovering a file becomes
| a lot less of a chore, because you won't have to hunt through more
| than 2 tapes to find it.  This also makes the backups more "secure":
| if tape 2 of your 8 tape backup is unreadable because Sally Operator
| spilled Pepsi on it, only part of your backup becomes unuseable, not
| all of it.

I believe that BSD's dump puts the index on the first tape, and
doesn't need to sequentially go through each tape, though I haven't
used dump directly in awhile.  That assumes that you can break your
projects into little 300M cubby holes as well.  You can partition too
much, and spend mucho time chasing symbolic links to make everything
look like one massive disk again (assuming you have a system that lets
you use symlinks).
--
Michael Meissner	email: meissner@osf.org		phone: 617-621-8861
Open Software Foundation, 11 Cambridge Center, Cambridge, MA

Catproof is an oxymoron, Childproof is nearly so

guy@auspex.auspex.com (Guy Harris) (06/02/90)

>1. Each file system can have at most 65,535 inodes, because the inode
>number is stored in a 16 bit field in the directories.

Speak for yourself.  There exist versions of UNIX with more modern file
systems that have 32-bit inumbers; the original poster didn't specify
what flavor of UNIX they have, so they may or may not have a 64K inode
limitation on their file systems.

rfinch@caldwr.water.ca.gov (Ralph Finch) (06/02/90)

In article <3419@auspex.auspex.com> guy@auspex.auspex.com (Guy Harris) writes:
>>1. Each file system can have at most 65,535 inodes, because the inode
>>number is stored in a 16 bit field in the directories.
>
>Speak for yourself.  There exist versions of UNIX with more modern file
>systems that have 32-bit inumbers; the original poster didn't specify
>what flavor of UNIX they have, so they may or may not have a 64K inode
>limitation on their file systems.

It's SunOS which I assume uses the 32-bit inumbers.  My thinking right
now is, one of the 1.2GB drives will be divided into 2 partitions, for
/home and /pub stuff; the other just 1 partition, for large files (we
run numerical models which tend to generate a few large files).  Most
concerns expressed so far have been about administering the drives,
which is valid, but right now we just want some handy scratch space,
mainly.
-- 
Ralph Finch			916-445-0088
rfinch@water.ca.gov		...ucbvax!ucdavis!caldwr!rfinch
Any opinions expressed are my own; they do not represent the DWR