Voice activation

A voice activation technology, applied in the field of speech detection and/or recognition, that addresses the problems of noise-triggered activation signals, limited complexity, and the unavailability of voice activity detection benefits in speech recognition systems, and achieves the effect of low cost.

Inactive Publication Date: 2006-07-20
DIALOG SEMICONDUCTOR GMBH
8 Cites, 51 Cited by

AI Technical Summary

Benefits of technology

[0027] A principal object of the present invention is to realize a very flexible and adaptable voice activation circuit module in the form of readily manufacturable integrated circuits at low cost.
[0028] Another principal object of the present invention is to provide an adaptable and flexible method for operating said voice activation circuit module, implementable with the help of integrated circuits.
[0029] A further principal object of the present invention is to include determinations of “Noise estimation” and “Speech estimation” values, done effectively without the use of Fast Fourier Transform (FFT) methods or zero crossing algorithms, solely by analyzing the modulation properties of the human voice.

Problems solved by technology

The disadvantage is that, without additional precautions, some kinds of noise can also lead to an activation signal.
Unfortunately, the benefits of voice activity detection are not ordinarily available in speech recognition systems, because identifying speech is very complex, time-consuming, and costly, and is also considered insufficiently reliable.
The main problem in performing reliable human speech detection and voice activation is that the speech detection procedures have to be adapted to all possible environmental and operational situations, so that the most suitable procedures (i.e., algorithms) and their optimum parameters are always chosen, since no single procedure is capable of fulfilling all the desired requirements under all conditions.
These solutions are therefore either usable with optimum results only for certain well-defined cases, and thus of somewhat limited complexity, or very complex, using extremely demanding algorithms that require great processing power, but in return offering greater flexibility and adaptability.
The limited applicability of such a low-cost circuit on the one hand, and the complexity and power demands of such a higher-quality circuit on the other, are the main disadvantages of these prior art solutions.
These disadvantages pose major problems for the wider adoption of such circuits.
It is therefore a challenge for the designer of such devices and circuits to achieve a solution that is both high-quality and low-cost.

Examples

Embodiment Construction

[0054] The preferred embodiments disclose a novel optimized circuit with a modular concept for a speech detection and voice activation system using modern integrated circuits, together with an exemplary implementation thereof.

[0055] As already stated above, speech detection procedures have to be adapted to all possible environmental and operational situations, so that the most suitable procedures (i.e., algorithms) and their optimum parameters are always chosen, since no single procedure is capable of fulfilling all the desired requirements under all conditions. It is therefore appropriate to answer certain relevant questions about the audio signal itself, about the environment, and about technical and manufacturing aspects, as listed again in the following:

[0056] Is the audio signal loudness high or low in comparison to background noise?

[0057] This question is the basis for the algorithm. If the signal to be detected is loud in comparison to the background noise, the us...
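
The question in [0056] can be illustrated with a minimal sketch that compares the level of a signal frame against the level of a background-noise frame. This is only an illustration, not the circuit or algorithm of the patent: the function names `level_db` and `is_loud_vs_noise`, the use of an RMS level, and the 6 dB margin are all assumptions made for the example.

```python
import math

def level_db(frame):
    """RMS level of a non-empty frame of samples, in dB relative to amplitude 1.0."""
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    # Clamp to avoid log10(0) for an all-zero frame.
    return 20.0 * math.log10(max(rms, 1e-12))

def is_loud_vs_noise(frame, noise_frame, margin_db=6.0):
    """True if `frame` is at least `margin_db` louder than `noise_frame`.

    `margin_db` is a hypothetical illustration value, not taken from the patent.
    """
    return level_db(frame) - level_db(noise_frame) >= margin_db

# Usage: a loud frame against quiet background noise.
speech_like = [0.5, -0.5, 0.5, -0.5]
background = [0.01, -0.01, 0.01, -0.01]
loud = is_loud_vs_noise(speech_like, background)
```

A fixed margin like this only answers the "high or low" question for one frame pair; how the answer is then used is cut off in the excerpt above and is not reconstructed here.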

Abstract

A circuit and a method are given to realize a very flexible voice activation system using a modular building block approach that is adaptively tailored to handle certain relevant and case-specific operational characteristics, describing most of the acoustically differing environmental cases to be found in the field of speech recognition. Included are determinations of “Noise estimation” and “Speech estimation” values, done effectively without the use of Fast Fourier Transform (FFT) methods or zero crossing algorithms, solely by analyzing the modulation properties of the human voice. Said circuit and method are designed to be implemented with a very economical number of components and can be realized with modern integrated circuit technologies.
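
To make the idea of time-domain noise and speech estimates more concrete, the following is a minimal sketch under stated assumptions. It is not the patented circuit, whose details are not given in this excerpt. It assumes that speech shows up as strong, short-lived rises of the signal envelope, while background noise is comparatively steady, so a noise floor that follows drops immediately and creeps upward slowly can serve as a noise estimate. The function names and the `rise` and `ratio` parameters are hypothetical. No FFT and no zero-crossing counting are used.

```python
def frame_levels(samples, frame_len):
    """Mean absolute amplitude per non-overlapping frame (a crude envelope)."""
    return [
        sum(abs(s) for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]

def detect_activity(levels, rise=0.02, ratio=3.0):
    """Return one flag per frame: True where the level exceeds ratio * noise floor.

    `rise` and `ratio` are illustration values, not taken from the patent.
    """
    noise = levels[0]  # assumption: the first frame holds background noise only
    flags = []
    for level in levels:
        if level < noise:
            noise = level                    # follow drops immediately
        else:
            noise += rise * (level - noise)  # creep upward slowly
        flags.append(level > ratio * noise)
    return flags

# Usage: quiet background, one loud burst, quiet background again.
samples = [0.015625, -0.015625] * 4 + [0.5, -0.5] * 2 + [0.015625, -0.015625] * 2
flags = detect_activity(frame_levels(samples, frame_len=4))
```

In this sketch a steady loud sound would gradually raise the noise floor and stop triggering, whereas a level that keeps dipping between bursts keeps the floor low. That is one simple way to exploit the fluctuating envelope of speech; other choices are equally possible.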

Description

BACKGROUND OF THE INVENTION

[0001] (1) Field of the Invention

[0002] The present invention generally relates to speech detection and/or recognition, and more particularly to a system, a circuit, and a concomitant method thereof for detecting the presence of a desired signal component within an acoustical signal, especially recognizing a component characterizing human speech. Even more particularly, the present invention provides human speaker recognition by means of a detection system that automatically generates activation trigger impulses at the moment voice activity is detected.

[0003] (2) Description of the Prior Art

[0004] Sound or acoustical signals are, alongside others such as video signals, one main category of analog (and most often noise-polluted) signals that modern telecommunications deals with; all such signals together, generally after transformation into digital form, are termed communication data signals. Analyzing and processing such sound signals is...

Application Information

Patent Type & Authority: Applications (United States)
IPC(8): G10L15/20, G10L25/78
CPC: G10L25/78
Inventor: SCHWENG, DETLEF
Owner: DIALOG SEMICONDUCTOR GMBH