[comp.ai] UK Speech Recognition Breakthrough???

paulb@ttidca.TTI.COM (Paul Blumstein) (12/02/89)

I have heard rumors that somewhere in Great Britain there has been a major
breakthrough in the field of Speech Recognition.  However, I have not
heard anything except for vague references.

Is there someone on usenet that knows anything about this or can point me
to a reliable source of information?  If you feel that your answer is of
general interest, you may post it here.  If not (or you are too shy to
post), please send me e-mail.  If I get responses that I feel may interest
the community, I will post a summary.

Thank you, in advance.
=============================================================================
Paul Blumstein       | TB, or not TB, that is the congestion.  Consumption be
Citicorp/TTI         |  done about it?  Of cough, of cough. -- Woody Allen
Santa Monica, CA     +-------------------------------------------------------
{philabs,csun,psivax}!ttidca!paulb  or  paulb@ttidca.TTI.COM
DISCLAIMER: Everything & everyone is hereby disclaimed!

danforth@riacs.edu (Douglas G. Danforth) (12/07/89)

]
]I have heard rumors that somewhere in Great Britain there has been a major
]breakthrough in the field of Speech Recognition.  However, I have not
]heard anything except for vague references.
]
]Is there someone on usenet that knows anything about this or can point me
]to a reliable source of information?  If you feel that your answer is of
]general interest, you may post it here.  If not (or you are too shy to
]post), please send me e-mail.  If I get responses that I feel may interest
]the community, I will post a summary.
]
]Thank you, in advance.
]=============================================================================
]Paul Blumstein       | TB, or not TB, that is the congestion.  Consumption be
]Citicorp/TTI         |  done about it?  Of cough, of cough. -- Woody Allen
]Santa Monica, CA     +-------------------------------------------------------
]{philabs,csun,psivax}!ttidca!paulb  or  paulb@ttidca.TTI.COM
]DISCLAIMER: Everything & everyone is hereby disclaimed!
---------------------------------------------------------------------------------

   While attending the recent NIPS (Neural Information Processing Society)
conference in Denver, Colorado, U.S.A, I had the good fortune to be able to speak
to John Bridle of Royal Signals and Radar Establishment in Malvern, England
who works with Roger Moore who was mentioned on the CNN television boadcast.


   Dr. Bridle's response was simply that their system "ARMADA" was really
very similar to the SPHINX system created by Kai-Fu Lee at Carnegie-Mellon.
I believe the press picked up on the words "first of its kind" in the press
release and missed the phrase "to be developed in the UK" which lead to the
use of "breakthrough" in the CNN report.

   John was kind enough to furnish me with a copy of the press release which
I reproduce below.  For those not familiar with SPHINX I suggest you take a
look at the Carnegy-Mellon technical report: "Large-Vocabulary Speaker-Independent
Continuous Speech Recognition: The SPHINX System" by Kai-Fu Lee, April 18, 1988
CMU-CS-88-148.






     Douglas Danforth
     danforth@riacs.edu





-------------------------------------------------------------------------------
					SPEECH RESEARCH UNIT
					Royal Signals and Radar Establishment
					St. Andrews Road
					Malvern, Worcs.
					WR143PS
					0684-894074

				PRESS RELEASE
			     18 September, 1989



	   RSRE Speech Research Unit Demonstrates Large-Vocabulary
			Continuous Speech Recognition


    The Speech Research Unit at RSRE Malvern has announced the 'ARMADA' system
    which demonstrates large-vocabulary continuous speech recognition.  The
    system, which currently runs in near real-time on an array of 6
    transputers, is the first of its kind to be developed in the UK and is
    technically equivalent to advanced US systems developed under the Spoken
    Language Systems component of the DARPA Strategic Computing Initiative.

    'ARMADA' is the culmination of an intensive 9-month research project at
    RSRE and is the latest demonstration of advanced speech technology at the
    Speech Research Unit.  The SRU carries out a programme of generic research
    into speech recognition and synthesis and has contributed directly to
    earlier commercial speech recognition systems such as LOGOS from Logica,
    ASR1000 from Marconi and MICROTALK from the British Technology Group.

    The 'ARMADA' system demonstrates high-accuracy recognition of continuously
    spoken airborne reconnaissance reports.  This task involves well defined
    voice procedures which are typically of many potential civil and military
    applications.  In speaker dependent evaluations on a 500 word vocabulary
    and in the absence of any grammatical constraints the recogniser scores an
    impressive 82% average word accuracy.  This figure rises to 99% when a
    grammar is used to aid the recognition process.

    The advantages of the 'ARMADA' system stem from the use of advanced
    statistical techniques (based on hidden Markov modelling) to develop
    accurate context-sensitive models of the acoustic realisations of phonemes
    which are the building blocks of speech.  This means that, unlike previous
    systems, 'ARMADA' can recognise words which do not occur in the training
    data, needs minimal speaker enrolment and can easily be reconfigured for
    new applications.  These are all key requirements for practical
    large-vocabulary speech recognition.

    'ARMADA' presents a unique UK opportunity for assessing the potential of
    state-of-the-art speech recognition.  In addition, 'ARMADA' is a valuable
    tool for speech technology research, providing a testbed for novel
    algorithms and a vehicle for the integration of speech and natural language
    processing.