Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Identifying spoken commands by templates of ordered voiced and unvoiced sound intervals

a voice and command technology, applied in the field of voice activation technology, can solve the problems of not being able to discriminate on frequency alone, many commands consumers expect to use cannot be used on frequency alone, and not being able to accept awkward commands or arbitrary limitations. , to achieve the effect of minimal hardware and minimal softwar

Inactive Publication Date: 2014-12-30
ELOQUI VOICE SYST LLC
View PDF20 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The invention is a method for recognizing spoken commands using an integrated signal that combines digital measurements. The method allows for the rapid and reliable identification of commands using a simple process that requires minimal software and hardware. The method can be used in a range of devices and applications, making it more cost-effective than previous methods. Overall, the invention enables a new level of efficiency and reliability for voice-activated applications.

Problems solved by technology

Unfortunately, the prior art serves these applications poorly.
Unfortunately, many commands that consumers expect to use cannot be discriminated on frequency alone, and many command words include brief phonemes for which frequency analysis is ambiguous at best.
Consumers are sensitive about their user interface, and do not gladly tolerate awkward commands or arbitrary limitations.
Converting the sound signal to a frequency spectrum is slower and more expensive than processing the sound as it arrives, in real-time.
For example, the dual-bandpass system of Ariav requires multiple gain stages with multiple filter components, and then the low- and high-frequency channels must be digitized separately, all of which increases board cost and software complexity.
Perhaps one could digitize an unfiltered signal instead, and then use Fourier analysis to separate the two frequency components; but this would require a greatly expanded processor and memory, negating any savings.
Moreover, key features of the sound wave are lost in a conventional FFT because it displaces phase information.
The frequency domain is a valid representation of sound only with complex-number or vector Fourier transformation, requiring even larger processors and memories, with costs that more than offset any other savings.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Identifying spoken commands by templates of ordered voiced and unvoiced sound intervals
  • Identifying spoken commands by templates of ordered voiced and unvoiced sound intervals
  • Identifying spoken commands by templates of ordered voiced and unvoiced sound intervals

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076]FIG. 1 shows graphs or traces, similar to oscilloscope traces, that display key signals related to command processing. These traces illustrate how the fast and slow variations in the sound signal are used to identify the voiced and unvoiced sound intervals in the command.

[0077]The first section, labeled “1.1 RESET command”, shows the letters of the spoken command RESET, but spread out so that they correspond to the timing of the other traces. The RE portion of the command is a voiced sound, then the S portion is unvoiced, followed by the second E which is voiced, followed by the unvoiced T sound. To recognize the command, all four sound portions must be detected and the sound type of each interval must be identified.

[0078]The trace labeled “1.2 Electronic signal”, shows an analog electronic signal 100 versus time. The electronic signal 100 is derived from the command sounds using a microphone and an amplifier without filtering. The electronic signal 100 includes four distinct ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method is disclosed for identifying a spoken command by detecting intervals of voiced and unvoiced sound, and then comparing the order of voiced and unvoiced sounds to a set of templates. Each template represents one of the predetermined acceptable commands of the application, and is associated with a predetermined action. When the order of voiced and unvoiced intervals in the spoken command matches the order in one of the templates, the associated action is thus selected. Silent intervals in the command may also be included for enhanced recognition. Efficient protocols are disclosed for discriminating voiced and unvoiced sounds, and for detecting the beginning and ending of each sound interval in the command, and for comparing the command sequence to the templates. In a sparse-command application, this method provides fast and robust recognition, and can be implemented with low-cost hardware and extremely minimal software.

Description

FIELD OF THE INVENTION[0001]The invention relates to voice-activation technology, and particularly to means for recognizing a spoken command by detecting time intervals containing voiced and unvoiced sound.BACKGROUND OF THE INVENTION[0002]Voice-activation technology is a rapidly evolving field. Fascinating applications appear almost daily. Prior art in this field is primarily directed toward the interpretation of free-form speech such as dictation and general questions. Most of the emerging applications, however, involve relatively simple devices that perform just a few specific operations. Desirable products that could be fully operated with a few predetermined commands include consumer devices (games, hobby devices, counters and timers, kitchen gadgets, home automation, exercise and sporting applications, toys, learning aids, products for the disabled), industrial systems (hands-free system interfaces, security monitoring, semi-autonomous machining and assembly, devices for rapid ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(United States)
IPC IPC(8): G10L25/93G10L15/10
CPCG10L25/51G10L25/93
Inventor NEWMAN, DAVID, EDWARD
Owner ELOQUI VOICE SYST LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products