perlman@wanginst.UUCP (Gary Perlman) (03/09/85)
Now that I am again at a non-profit organization,
I have restarted development on my statistical programs.
The first phase, now complete, has been to improve the existing programs.
I think there are enough enhancements that people already using
the programs would want to get a new version;
the programs are more robust, more efficient, and easier to use.
The documentation has been rewritten, and I will soon have finished
a video introduction to the package.
The programs are presently being ported to MSDOS,
and you should see some more information on that during 2nd quarter 85.
The usual blurbs on the package are listed below.
Current users should note the section on the highlights of version 5.0.
If you send me mail asking for more information, include your postal
address because it is much more convenient for me to mail paper copies.
UNIX|STAT 5.0
COMPACT DATA ANALYSIS PROGRAMS
UNIX|STAT is a set of UNIX System data manipulation and analysis
programs developed at the University of California, San Diego by Gary
Perlman (now teaching at the Wang Institute of Graduate Studies). The
programs are designed with the UNIX System philosophy that individual
programs should be designed as tools that do one task well and produce
output suitable for input via pipes to other programs. Interactive
use is supported in the UNIX System shell which also provides a
programming language for complex analyses. Typical usage involves a
pipeline of transformations of data followed by input to an analysis
program, summarized schematically by:
INPUT DATA | TRANSFORM | ANALYSIS | OUTPUT RESULTS
Functionality often built into statistical packages (e.g., graphics,
sorting and other data manipulation) is not re-invented in UNIX|STAT
which delegates such responsibility to standard UNIX System tools.
FEATURES
easy to use (negligible training period)
simple input formats (free format field oriented)
used in pipelines with other UNIX System utilities (sort, vi)
flexible data manipulation
data validation provided (range and type checking)
full documentation support (manual entries, tutorials)
extensible (many modular C functions)
faster than most packages (usually less than a second per analysis)
small enough for micros (10-25K byte programs)
runs on any UNIX System (V6, V7, 2.8BSD, 4BSD, III.0, System V, others)
public domain software (can't be distributed for gain)
in use at more than 300 UNIX System sites for five years
CHANGES FOR RELEASE 5.0 (March 5, 1985)
reworked to increase portability, reliability, and usability
all commands now use a standard option parser (getopt)
all calculations are now done in double precision
diagnostic error messages have been improved
regress now does a partial correlation analysis
colex and trans were added as alternatives for dm
F ratio probabilities are now better approximated
some inefficient input was optimized
some non-portable features of C were replaced so that
the programs now run under MSDOS on the IBM PC
the random number seeding has been improved
all programs now use a zero exit status on success
version control was added--we are now at release 5.0
UNIX|STAT is Public Domain
The programs have been released to the public and are distributed
to anyone who wants them. Persons wanting to get a copy of the
package should contact me directly. You can get the package for free
if you send me a tape and a self-addressed prepaid return mailer. Or
you can send me personally $20 US to cover the costs of a tape and mailing.
The distribution includes:
The C source files for all the programs.
The documentation source files.
A collection of test examples.
Contact:
Gary Perlman
Wang Institute of Graduate Studies
Tyng Road
Tyngsboro, MA 01879 USA
(617) 649-9731
uucp: decvax!wanginst!perlman
sdcsvax!sdcsla!perlman
csnet: perlman@wanginst
arpa: sdcsla!perlman@nprdc
NOTES:
UNIX|STAT is unsupported, though known bugs have been removed.
UNIX|STAT may not be distributed for profit.
UNIX|STAT is NOT a product of any company or organization.
UNIX|STAT is distributed on a `` use-at-your-own-risk basis.''
UNIX|STAT(1) UNIX User's Manual UNIX|STAT(1)
NAME
UNIX | STAT - compact data analysis programs
DESCRIPTION
UNIX | STAT is a set of data manipulation and analysis pro-
grams developed at the University of California, San Diego.
The programs are designed with the UNIX System philosophy
that individual programs should be designed as tools that do
one task well and produce output suitable for input via
pipes to other programs. Interactive use is supported in
the UNIX System shell which also provides a programming
language for complex analyses. Functionality often built
into statistical packages (e.g., graphics, sorting and other
data manipulation) is not re-invented in UNIX | STAT which
delegates such responsibility to standard UNIX System tools.
FEATURES
easy to use (negligible training period)
simple input formats (free format field oriented)
used in pipelines with other UNIX utilities (sort, vi)
data validation provided (range and type checking)
small enough for micros (10-25K byte programs fit on 11/23's)
runs on any UNIX (V6, V7, 2.8BSD, 4BSD, III.0, 5.0, others)
in use at > 400 UNIX sites for four years
DATA TRANSFORMATION PROGRAMS
abut join data files
dm column oriented data manipulator
io control and monitor input and output
maketrix create matrix type file from free-form file
perm randomly permute lines in a file
repeat repeat a pattern or file
reverse reverse lines and characters
series print a series of numbers
transpose transpose matrix type file
ANALYSIS PROGRAMS
anova multi-factor anova with repeated measures
calc interactive algebraic modeling calculator
critf/pof F-ratio/probability conversion functions
dataplot flexible data plotting
desc descriptions histograms, frequency tables
dprime signal detection d' and beta calculations
pair paired data statistics, regression, plots
regress multivariate linear regression
ts time series analysis and plots
validata verify data file consistency
vincent time-series comparison
AUTHOR
Gary Perlman (with the help of several others)