Method and apparatus for estimating pitch frequency of voice signal

A pitch frequency estimation technology for speech signals, applied in speech analysis, speech recognition, instruments, etc., that addresses problems such as voiced segments being mistaken for unvoiced sounds.

Status: Inactive | Publication Date: 2004-09-01
NUANCE COMM INC

AI Technical Summary

Problems solved by technology

An error in the pitch estimate for one interval may persist in subsequent intervals, and segments of voiced sound may be mistaken for unvoiced sound.

Examples

Embodiment Construction

[0059] Figure 1 is a schematic diagram of a system 20 for analyzing and encoding speech signals according to a preferred embodiment of the present invention. The system includes an audio input device 22, such as a microphone, connected to an audio processor 24. Alternatively, audio input may be provided to the processor in analog or digital form over a communication line, or retrieved from a storage device. Processor 24 preferably comprises a general-purpose computer programmed with appropriate software to perform the functions described below. The software may be provided to the processor in electronic form, for example over a network, or on tangible media such as a CD-ROM or non-volatile memory. Alternatively or additionally, processor 24 may include a digital signal processor (DSP) or hardwired logic.

[0060] Figure 2 is a flowchart that schematically shows a method for processing signals using system 20, according to a preferred embodiment of the present invention. ...

Abstract

A pitch frequency of a speech signal is estimated by determining a line spectrum of a frame of the speech signal, the line spectrum including spectral lines having respective line amplitudes and frequencies. A predefined number of spectral lines having the highest amplitudes, fewer than the total number of spectral lines, is selected. A preliminary utility function is calculated over a pitch frequency range, providing for each pitch frequency in the range a preliminary utility function value that measures the compatibility of the selected spectral lines with that pitch frequency. A predefined number of preliminary pitch frequency candidates is identified, at least partly responsive to the preliminary utility function, where each candidate is a local maximum of the preliminary utility function. A final utility score is calculated for each of the candidates, and one of the candidates is selected as the estimated pitch frequency of the speech signal, at least partly responsive to the final utility scores.
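
The sequence of steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it starts from an already computed line spectrum (a list of frequency and amplitude pairs), and the triangular harmonic-matching weight in `utility`, the reuse of that weight over all lines as the final score, the rule of preferring the highest-frequency candidate whose final score is close to the best, and every name and default value (`num_lines`, `num_candidates`, `tolerance`, `closeness`, the 60 to 450 Hz grid) are assumptions made for illustration only.

```python
def select_strongest(lines, count):
    """Keep the `count` spectral lines with the highest amplitudes."""
    return sorted(lines, key=lambda line: line[1], reverse=True)[:count]


def utility(lines, pitch_hz, tolerance=0.1):
    """Hypothetical compatibility measure in the range 0..1.

    Each line contributes its amplitude, weighted by how close its
    frequency lies to an integer multiple of pitch_hz. The triangular
    weight is an assumption, not the function defined in the patent.
    """
    total = sum(amp for _, amp in lines)
    if not total:
        return 0.0
    score = 0.0
    for freq, amp in lines:
        ratio = freq / pitch_hz
        harmonic = round(ratio)
        if harmonic >= 1:
            deviation = abs(ratio - harmonic)
            score += amp * max(0.0, 1.0 - deviation / tolerance)
    return score / total


def estimate_pitch(lines, num_lines=5, num_candidates=3,
                   min_hz=60, max_hz=450, closeness=0.9):
    """Return a pitch estimate in Hz on a 1 Hz grid, or None."""
    # Step 1: keep only the strongest lines for the cheap preliminary pass.
    strongest = select_strongest(lines, num_lines)

    # Step 2: preliminary utility over the whole pitch range.
    grid = list(range(min_hz, max_hz + 1))
    prelim = [utility(strongest, f) for f in grid]

    # Step 3: candidates are the strongest local maxima of that function.
    peaks = [i for i in range(1, len(grid) - 1)
             if prelim[i] > prelim[i - 1] and prelim[i] >= prelim[i + 1]]
    peaks.sort(key=lambda i: prelim[i], reverse=True)
    candidates = [grid[i] for i in peaks[:num_candidates]]
    if not candidates:
        return None

    # Step 4: final score per candidate, here using all spectral lines.
    final = {f: utility(lines, f) for f in candidates}

    # Step 5: among candidates scoring close to the best, prefer the
    # highest frequency (an assumed rule that avoids sub-harmonic errors).
    best = max(final.values())
    return max(f for f in candidates if final[f] >= closeness * best)


# Synthetic line spectrum: harmonics of 200 Hz plus two weak stray lines.
example_lines = [(200.0, 1.0), (330.0, 0.05), (400.0, 0.8), (600.0, 0.6),
                 (710.0, 0.04), (800.0, 0.4), (1000.0, 0.3)]
print(estimate_pitch(example_lines))  # prints 200
```

In this example the candidates at 100 Hz and 200 Hz explain the harmonic lines equally well, which is why the sketch needs a tie-breaking rule in the final selection. Restricting the preliminary pass to a few strong lines keeps the scan over the pitch range cheap, and the more expensive final score is evaluated only for the handful of candidates.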

Description

Technical field

[0001] The present invention relates generally to methods and apparatus for processing audio signals, and in particular to methods for estimating the pitch of speech signals.

Background art

[0002] Speech sounds are generated by modulating the airflow in the vocal tract. Unvoiced sounds are produced from turbulent noise generated at a constriction in the vocal tract, while voiced sounds are excited in the larynx by periodic vibrations of the vocal cords. In general, the period of these laryngeal vibrations determines the pitch of the speech. Low-bit-rate speech coding schemes typically separate the modulation from the speech source (voiced or unvoiced) and encode the two parts separately. For the speech to be reconstructed correctly, the pitch of the voiced part of the speech must be estimated accurately at the time of encoding. Various techniques have been developed for this purpose, including time domain and frequency ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/28, G10L19/04, G10L25/90, G10L25/93
CPC: G10L25/90
Inventor: Alexander Sorin (亚历山大·索里恩)
Owner: NUANCE COMM INC