Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese tone recognition method based on time frequency crest line-Hough transformation

A time-frequency ridge line and identification method technology, applied in the field of tone identification, can solve the problems of complex calculation, small calculation amount of time domain method, complex algorithm, etc.

Inactive Publication Date: 2015-01-21
JIANGNAN UNIV
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The time-domain method has a small amount of calculation, but its anti-noise performance is poor and it is prone to frequency doubling or half-doubling. It is impossible for the pitch frequency trace extracted by one method to match the real pitch frequency trace completely.
In addition, after the tone features are extracted, classifiers such as support vector machines, Gaussian mixture models, and neural networks are generally used to identify them. It takes a training process to identify the tone. The algorithm is complex and the calculation time is long.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese tone recognition method based on time frequency crest line-Hough transformation
  • Chinese tone recognition method based on time frequency crest line-Hough transformation
  • Chinese tone recognition method based on time frequency crest line-Hough transformation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Step 1: Speech signal preprocessing and phoneme segmentation. After the signal is filtered and pre-emphasized, the endpoint detection is performed according to the short-term average amplitude difference and zero-crossing rate to remove the silent segment of the speech, and then the phoneme segmentation is performed to find the final part carrying the tone.

[0030] Step 2: Make the SPWVD time-frequency distribution diagram of the final. SPWVD is used to transform the time-frequency signal of the final signal to obtain the SPWVD time-frequency image. The time-frequency ridge is the area with high energy in the time-frequency image, and the time-frequency ridges of different tones change differently along the time axis. Because the final has strong harmonicity, several time-frequency ridges will appear in the time-frequency diagram at the same time.

[0031] Step 3: Perform binarization, thresholding and thinning processing on the time-frequency distribution image to o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a Chinese tone recognition method based on time frequency crest line-Hough transformation. Chinese tone recognition is converted into classification of the change trend of a line segment in a time frequency distribution diagram so that a new Chinese tone recognition method and technique can be acquired. The method includes the steps that firstly, final voice signals carrying Chinese tones are expressed through the SPWVD time frequency distribution diagram and tone information is shown through a group of similarly-parallel time frequency crest lines in the time frequency diagram; secondly, due to the fact that the main time frequency crest line is a region with larger energy in the diagram, the change trend of different tones is reflected, and in order to reduce the calculated amount, treatment such as binaryzation, thresholding and refining is conducted on the time frequency distribution diagram, and a center line segment of the main time frequency crest line reflecting the change trend of the tones is acquired; thirdly, Hough transformation is conducted on the time frequency distribution diagram containing the center line of the main crest line, so that the intercept and included angle parameters of the center line of the main crest line are acquired; finally, the tone type is judged according to the intercept and the included angle of the line segment and the coordinate values of a start point and an end point of the line segment.

Description

technical field [0001] The invention belongs to the technical field of tone recognition in speech synthesis and speech recognition. In the present invention, the final phonetic signal carrying Chinese tones is represented by a time-frequency distribution, and the tone information of Chinese is reflected in the ridge line variation trend in the time-frequency distribution image. After preprocessing such as thinning and thinning, the line segments reflecting the trend of tone change are obtained, and the Hough transform is performed on these line segments, and the tones of Chinese are recognized according to the Hough transform parameters. Background technique [0002] In addition to the non-stationary commonality of speech signals, Chinese speech signals also have tone features. Tone is one of the main attributes of Chinese, and it has functions such as word formation, meaning discrimination, and expression improvement. 30% of Chinese has the same sound but different tones. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L13/08
Inventor 于凤芹
Owner JIANGNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products