Voice identification method and voice identification system

A speech recognition method and system, applied in speech recognition, speech analysis, instruments, etc., that addresses the problem that two differently organized decoding spaces cannot be merged directly into a single decoder, and that achieves the effect of improving recognition accuracy.

Active Publication Date: 2013-09-25
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

Due to the different organization of the decoding space of the two recognition methods, there is no way to directly integrate the two decoding spaces into one decoder.


Examples

Embodiment Construction

[0033] Embodiments of the invention will now be described in detail, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like parts throughout. The embodiments are described below in order to explain the present invention by referring to the figures. Also, descriptions of well-known functions and constructions will be omitted for clarity and conciseness.

[0034] Figure 1 is a flowchart illustrating a speech recognition method according to an exemplary embodiment of the present invention.

[0035] Referring to Figure 1, in step S101, speech input is received and speech frame features are extracted. For example, a 10-second utterance yields 1000 frame features. The speech input can be received and the frame features extracted by various prior-art methods, which are not described again here.
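The 10-second example in step S101 corresponds to one frame feature every 10 ms. The following is a minimal sketch of that arithmetic only, not of the feature extraction itself. The 10 ms frame shift is inferred from the example rather than stated in the text, and the function names `frame_count` and `frame_starts_ms` are hypothetical.

```python
from typing import List


def frame_count(duration_s: float, hop_ms: int = 10) -> int:
    """Approximate number of feature frames for an utterance.

    hop_ms is an assumed frame shift: the text only gives the example
    of 10 s -> 1000 frames, which works out to 10 ms per frame.
    """
    if duration_s < 0 or hop_ms <= 0:
        raise ValueError("duration must be non-negative and hop must be positive")
    duration_ms = round(duration_s * 1000)  # work in whole milliseconds
    return duration_ms // hop_ms


def frame_starts_ms(duration_s: float, hop_ms: int = 10) -> List[int]:
    """Start time in milliseconds of each frame, under the same assumption."""
    return [i * hop_ms for i in range(frame_count(duration_s, hop_ms))]


if __name__ == "__main__":
    print(frame_count(10.0))       # 1000, matching the example in step S101
    print(frame_starts_ms(0.05))   # [0, 10, 20, 30, 40]
```

A real front end would also apply an analysis window that is usually longer than the frame shift, so the exact count near the end of the utterance can differ by a frame or two.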

[0036] In step S102, speech decoding is performed on the input speech by using the decoding space to dete...


Abstract

Disclosed are a voice identification method and a voice identification system. The voice identification method comprises the steps of receiving voice input and extracting voice frame features, and conducting voice decoding on the input voice by using a decoding space to determine a voice decoding result. The decoding space comprises multiple decoding paths constructed on the basis of syntax rules. The decoding paths are of three types: one type comprises only language type module nodes, another type comprises only statistical language model nodes, and the third type comprises both language type module nodes and statistical language model nodes. A semantic parsing result is determined by tracing back through the nodes on the selected decoding path. The voice decoding comprises the steps of having the input voice traverse each decoding path in the decoding space, selecting the decoding path with the largest sum of a language layer score and an acoustic layer score, and determining the voice decoding result according to the triphone acoustic models of the nodes on the selected decoding path.
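The path selection rule in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented decoder: the class names `Node` and `DecodingPath`, the `kind` labels, and the use of precomputed log-domain scores are all hypothetical. A real decoder would compute the acoustic layer score frame by frame from the triphone acoustic models while traversing the path.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

LANGUAGE_TYPE = "language_type"   # hypothetical label for language type module nodes
STATISTICAL = "statistical"       # hypothetical label for statistical language model nodes


@dataclass(frozen=True)
class Node:
    label: str
    kind: str  # LANGUAGE_TYPE or STATISTICAL


@dataclass(frozen=True)
class DecodingPath:
    nodes: Tuple[Node, ...]
    language_score: float   # assumed precomputed, e.g. a log probability
    acoustic_score: float   # assumed precomputed, e.g. a log likelihood

    @property
    def total_score(self) -> float:
        # The abstract selects on the sum of the two layer scores.
        return self.language_score + self.acoustic_score

    @property
    def path_type(self) -> str:
        # Classify the path into one of the three types from the abstract.
        kinds = {node.kind for node in self.nodes}
        if kinds == {LANGUAGE_TYPE}:
            return "language_type_only"
        if kinds == {STATISTICAL}:
            return "statistical_only"
        if kinds == {LANGUAGE_TYPE, STATISTICAL}:
            return "mixed"
        raise ValueError("path is empty or contains an unknown node kind")


def select_best_path(paths: Sequence[DecodingPath]) -> DecodingPath:
    """Return the path with the largest language + acoustic score."""
    if not paths:
        raise ValueError("decoding space contains no paths")
    return max(paths, key=lambda p: p.total_score)


if __name__ == "__main__":
    space = [
        DecodingPath((Node("call", LANGUAGE_TYPE), Node("<name>", LANGUAGE_TYPE)), -12.0, -30.5),
        DecodingPath((Node("call", STATISTICAL), Node("mom", STATISTICAL)), -15.0, -25.0),
        DecodingPath((Node("call", LANGUAGE_TYPE), Node("mom", STATISTICAL)), -13.0, -26.0),
    ]
    best = select_best_path(space)
    print(best.path_type, best.total_score)
    # Walking the nodes of the selected path is where a semantic parse would be read off.
    print([node.label for node in best.nodes])
```

Enumerating every path is only practical for a toy decoding space. A production decoder would typically search the space with pruning rather than score each path in full.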

Description

Technical field

[0001] The present invention relates to speech recognition technology and, more specifically, to a speech recognition method and a speech recognition system that integrate sound recognition and semantic understanding by combining recognition based on a statistical language model with recognition based on grammatical rules.

Background technique

[0002] With the development of information technology, speech recognition technology has entered people's lives. In existing speech recognition technology, the commonly used recognition methods are recognition based on a statistical language model (Ngram) and recognition based on grammatical rules (grammar). Recognition based on the statistical language model combines all the speech layer information into an Ngram language model, and recognition is carried out on the decoding space composed of the Ngram model. The recognition based on grammatical rules organizes the languag...


Application Information

IPC(8): G10L15/02, G10L15/14
Inventor: 贾磊, 万广鲁
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD