rich@GTE.COM (Rich Sutton) (05/25/89)
Seminar Announcement PHONEME DISCRIMINATION USING CONNECTIONIST NETWORKS R. Watrous Dept. of Computer Science, Univ. of Toronto Siemens Research and Technology Laboratories The application of connectionist networks to speech recognition is assessed using a set of representative phonetic discrimination problems chosen with respect to a theory of phonetics. A connectionist network model called the Temporal Flow Model is defined which represents temporal relationships using delay links and permits general patterns of connectivity. It is argued that the model has properties appropriate for time varying signals such as speech. Networks are trained using gradient descent methods of iterative nonlinear optimization to reduce the mean squared error between the actual and the desired response of the output units. Separate network solutions are demonstrated for all eight phonetic discrimination problems for one male speaker. The network solutions are analyzed carefully and are shown in every case to make use of known acoustic phonetic cues. The network solutions vary in the degree to which they make use of context dependent cues to achieve phoneme recognition. The network solutions were tested on data not used for training and achieved an average accuracy of 99.5%. It is concluded that acoustic phonetic speech recognition can be accomplished using connectionist networks. --------------------------------------------------------------------------- The talk will be at 11am on May 31 in the GTE Labs Auditorium. For further information contact Rich Sutton (Rich%gte.com@relay.cs.net or 617-466-4133). Non-GTE people should arrive early to be escorted to the auditoriumm.