
Voice endpoint determination

A speech endpointing technology, applied in speech analysis, speech recognition, and natural language analysis, that addresses imprecise or unsatisfactory processing results and enables fast processing.

Active Publication Date: 2020-10-20
GOOGLE LLC

AI Technical Summary

Problems solved by technology

If the endpointer identifies an incorrect start or end point for the speech input, the results of processing the speech input with a natural language processing system may be imprecise or unsatisfactory.



Detailed Description of Embodiments

[0018] Figure 1 is a diagram 100 of an example utterance and the signals used to determine, for a particular user, whether the user has finished speaking a voice query. In general, diagram 100 illustrates signals 103-118 that computing device 121 generates or detects while it processes incoming audio input. Computing device 121 receives audio data corresponding to utterance 124 through a microphone or other audio input device and generates a transcription of utterance 124 based on the user profile assigned to user 127.

[0019] The utterance timing 130 represents the timing of each word as user 127 speaks utterance 124 (in Figure 1, "Text Mom love you"). User 127 speaks with increasing pause lengths between successive words. The number of dots between words is proportional to the length of the pause between them. Each dot may represent a certain pe...


Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, are described for determining endpoints of speech. In one aspect, a method includes the action of accessing voice query log data that includes voice queries spoken by a particular user. The actions also include determining, based on that voice query log data, a pause threshold for the particular user. The actions also include receiving an utterance from the particular user. The actions also include determining that the particular user has stopped speaking for a period of time at least equal to the pause threshold. The actions also include processing the utterance as a voice query based on determining that the particular user has stopped speaking for at least that period of time.
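
The actions in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented method: the abstract does not say how the threshold is derived from the log data, so the sketch assumes each logged query is stored as a list of inter-word pause durations in seconds and takes the longest logged pause plus a margin. The function names `pause_threshold_from_logs` and `endpoint_index`, the `margin` and `default` parameters, and all pause values are hypothetical.

```python
from typing import Optional, Sequence


def pause_threshold_from_logs(
    logged_pauses: Sequence[Sequence[float]],
    margin: float = 0.2,
    default: float = 1.0,
) -> float:
    """Derive a per-user pause threshold (seconds) from voice query logs.

    Assumption: each log entry lists the pauses between words of one
    completed voice query. The rule used here (longest pause plus a
    margin) is illustrative only.
    """
    pauses = [p for query in logged_pauses for p in query]
    if not pauses:
        return default  # no history for this user: fall back to a default
    return max(pauses) + margin


def endpoint_index(silences: Sequence[float], threshold: float) -> Optional[int]:
    """Return the index of the first silence at least as long as the threshold.

    silences[i] is the silence observed after word i of the incoming
    utterance. None means the user has not yet paused for long enough.
    """
    for i, silence in enumerate(silences):
        if silence >= threshold:
            return i
    return None


# Hypothetical log for one user who pauses up to 0.9 s in mid-query.
user_log = [[0.2, 0.5, 0.9], [0.3, 0.4]]
threshold = pause_threshold_from_logs(user_log)

# Silence after each word of "Text Mom love you"; the last is trailing silence.
words = ["Text", "Mom", "love", "you"]
silences = [0.3, 0.6, 0.9, 1.5]
end = endpoint_index(silences, threshold)
query = " ".join(words[: end + 1]) if end is not None else None
```

With these assumed values, the 0.9-second pause after "love" stays below the user's threshold, so the utterance is only endpointed at the trailing silence and the whole query is processed.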

Description

[0001] Cross-Reference to Related Applications

[0002] This application claims the benefit of US Provisional Application No. 62/243,463, filed October 19, 2015, the contents of which are incorporated herein by reference.

Technical Field

[0003] The present disclosure relates generally to speech recognition, and one particular implementation relates to endpointing speech.

Background

[0004] Natural language processing systems typically use endpointers to determine when a user has started and finished speaking. In determining when an utterance begins or ends, some traditional endpointers evaluate the duration of pauses between words. For example, if the user says "what is <long pause> for dinner", a traditional endpointer may segment the speech input at the long pause and instruct the natural language processing system to try to process the incomplete phrase "what is" instead of the complete phrase "what is for dinner" ...
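
The failure mode described in the background can be reproduced with a small sketch. It assumes the utterance is available as (word, pause-after-word) pairs with made-up pause lengths, and it models a traditional endpointer as a single fixed pause threshold; `segment_by_pause` is a hypothetical helper, not part of any real speech library.

```python
from typing import List, Sequence, Tuple


def segment_by_pause(
    timed_words: Sequence[Tuple[str, float]], threshold: float
) -> List[str]:
    """Split words into segments wherever the following pause >= threshold.

    timed_words is a sequence of (word, pause_after_seconds) pairs.
    """
    segments: List[str] = []
    current: List[str] = []
    for word, pause_after in timed_words:
        current.append(word)
        if pause_after >= threshold:
            # The pause is long enough to be treated as the end of speech.
            segments.append(" ".join(current))
            current = []
    if current:
        segments.append(" ".join(current))
    return segments


# "what is <long pause> for dinner", with hypothetical pause lengths.
utterance = [("what", 0.1), ("is", 1.2), ("for", 0.1), ("dinner", 2.0)]

# A 1.0 s threshold cuts the query at the long pause.
short_cut = segment_by_pause(utterance, threshold=1.0)

# A 1.5 s threshold tolerates the pause and keeps the query whole.
long_cut = segment_by_pause(utterance, threshold=1.5)
```

Here `short_cut` holds the two fragments "what is" and "for dinner", while `long_cut` holds the single complete phrase, which shows why one threshold for all speakers can split a slow speaker's query.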


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/04, G10L15/05, G06F40/20, G06F40/279
CPC: G06F16/1815, G06F16/632, G10L15/04, G10L15/05, G06F16/685, G10L15/22, G10L17/02, G10L25/87, G10L2025/783, G10L15/065, G10L15/07, G10L15/26, G10L25/78
Inventor: Siddhi Tadpatrikar, Michael Buchanan, Pravir Kumar Gupta
Owner: GOOGLE LLC