[comp.binaries.ibm.pc.d] Text to Speech converter

chris@attain.teradyne.com (Chris Maslyar) (02/05/91)

   I'm looking for a text-to-speech converter for an IBM PC/AT.  Such a 
   program would take text from either the keyboard or a file, apply a 
   few phoneme rules, and produce output through the PC's speaker (I'm not
   looking for quality).  I know that the Mac has several such programs.
   In fact even my old Apple II+ sports these abilities.  I figure there 
   has to be at least one text-speech program for the IBM?!?!

   Any help would be appreciated.

   Thanks

   Chris

heffron@falstaff.css.beckman.com (Matt Heffron) (02/06/91)

In <9840@certes.attain.teradyne.com> chris@attain.teradyne.com (Chris Maslyar) writes:
>   I'm looking for a text-to-speech converter for an IBM PC/AT.  Such a 
>   program would take text from either the keyboard or a file, apply a 
>   few phoneme rules, and produce output through the PC's speaker (I'm not
>   looking for quality).  I know that the Mac has several such programs.
>   In fact even my old Apple II+ sports these abilities.  I figure there 
>   has to be at least one text-speech program for the IBM?!?!

I have a program which I think I got from CompuServe (in Sept '88) which does
this.  The program has no apparent copyright (except as a string in the .exe
file which isn't displayed when the program is run).  I'm appending the .DOC
file below, as it includes the author's snail-mail address (as of 9/88).  I'll
think about posting the object (I don't have source) to c.b.i.p

-Matt Heffron

=====================================cut here==================================
                            TRAN

TRAN is a text-to-speech program for the IBM-PC.  It can read ASCII text
files, translate normal English spelling to phonemes, and sound out each
phoneme through the speaker of the IBM-PC. 

    usage: tran [+/-flags] [-options] [filename]

The filename is an ASCII text file (with no word processor formatting
codes).  If no filename is given TRAN reads input from the keyboard. 
The flags and options control various features of the program.  A '+'
turns the flag on and a '-' turns the flag off.  Options can use either
'+' or '-'. 

    flags: + = on, - = off
           p   type output of phoneme translation [off]
           r   add rule no. to phoneme translation [off]
           s   say output [on]
           t   echo input to console [off]
           T   do phoneme translation [on]
           v   type other internal information [off]

    options:
           c        say the time of day [once]
           C        say the time of day [every 10 seconds]
          d1 N      use N for space rate timing [5]
          d2 N      use N for voice rate timing [1]
           R        print all pronunciation rules
           ?        type usage

The following are example of ways to use the TRAN program.  You can have
tran type out and read this file with the command:

    tran +t tran.doc

If you also want to see the phoneme translation add the +p flag:

    tran +p +t tran.doc

You can save the phoneme translation in a file by typing:

    tran -s +p tran.doc > tran.phn

and listen to the phoneme at some other time file by typing:

    tran -T tran.phn

The TRAN program will say the time once if you type:

    tran +t +c

or will continue to say the time every 10 seconds if you type:

    tran +t +C

There are two timing parameters, d1 and d2, that control the rate that
TRAN speaks.  Making d1 larger increases the pauses between words and
making d2 larger lowers the pitch of the voice phonemes.  Both d1 and d2
must have a value of 1 or greater.  On an IBM-PC/XT, good values for the
timing parameters are d1=2 and d2=1.  If these parameters are not set
explicitly, the program will try to determine acceptable values
automatically.  Setting these values, will let TRAN by-pass the
automatic setting, which save a second or two starting the program. 
These values can be set on the command line:

    tran -d1 2 -d2 1 ... 

or by using the environment variable TRAN to pass these values as
d1,d2:

    set TRAN=2,1

On a 10 MHz IBM-PC/AT the timing prarameters need to be larger, d1=4
d2=13. 

Most of the speech-to-text rules used in the tran program come from an
article in an IEEE journal:

Elovitz, H.S., Johnson, R., McHugh, A., and Shore, J.E.  (1976). 
"Letter-to-Sound Rules for Automatic Translation of English Text to
Phonetics," IEEE Transactions on Acoustics, Speech, and Signal
Processing, Vol.  ASSP-24(6), 446-458. 

The program contains a set of 35 phonemes, each encoded as a sequence of
bits controlling the position (in or out) of the PC speaker.  The
phoneme codes come from the public domain program SPEECH by Andy
McGuire. 

Send your comments and suggestions to:

                                            Stephen Neely
                                            11230 Ohio St. 
                                            Omaha, NE 68131
=====================================cut here==================================
--
Matt Heffron                      heffron@falstaff.css.beckman.com
Beckman Instruments, Inc.         voice: (714) 961-3128
2500 N. Harbor Blvd. MS X-11, Fullerton, CA 92634-3100
Cute saying/disclaimer in development.