A color complex spectrogram construction method that enables speech reconstruction

The construction method, applied in the field of visual color spectrogram construction, addresses two problems of existing spectrograms: the lack of phase information, and the fact that pseudo-color rendering does not increase the information dimension.

Inactive Publication Date: 2017-04-19
NORTHEAST NORMAL UNIVERSITY

AI Technical Summary

Problems solved by technology

[0003] However, in most previous studies the spectrogram serves only as a visual display of spectral features, and the data actually analyzed is still the original speech signal rather than the spectrogram itself.
In particular, because the spectrogram is a visual expression of the amplitude-frequency characteristics of speech and lacks phase information, speech cannot be reconstructed from the spectrogram alone.
Although a color spectrogram uses three color channels, it is only a pseudo-color rendering of the grayscale spectrogram, so the color does not add any information dimension.
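
The point about missing phase can be illustrated with a small sketch. It is not taken from the patent: the sample values and the helper name `dft` are made up for illustration. It shows two different frames that share exactly the same magnitude spectrum, so the magnitudes alone cannot tell them apart.

```python
import cmath


def dft(x):
    """Direct O(N^2) DFT, adequate for a tiny illustrative frame."""
    n = len(x)
    return [
        sum(x[m] * cmath.exp(-2j * cmath.pi * k * m / n) for m in range(n))
        for k in range(n)
    ]


# Two hypothetical 8-sample frames: frame_b is frame_a circularly shifted by 3.
frame_a = [1.0, 0.5, -0.25, 0.0, 0.75, -1.0, 0.25, 0.5]
frame_b = frame_a[3:] + frame_a[:3]

mag_a = [abs(v) for v in dft(frame_a)]
mag_b = [abs(v) for v in dft(frame_b)]

# A circular shift changes only the phase of each DFT coefficient,
# so the magnitudes agree even though the samples differ.
same_magnitude = all(abs(a - b) < 1e-9 for a, b in zip(mag_a, mag_b))
same_signal = frame_a == frame_b
print(same_magnitude, same_signal)
```

A grayscale or pseudo-color spectrogram stores only values like `mag_a`, which is why it cannot be inverted to a unique waveform.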


Examples

Embodiment Construction

[0030] The following examples illustrate the present invention but do not limit its scope.

[0031] The specific embodiment of the present invention is divided into two parts comprising 9 modules in total; the process is shown in Figure 2. The following description takes a speech signal with a sampling rate of 16 kHz as an example:

[0032] 1. Speech framing module: First, the speech signal is windowed and divided into frames. For example, with a frame length of 1024 points, the signal is divided into M frames, forming a 1024×M framed signal matrix. The frequency-domain resolution is 16 kHz / 1024 ≈ 15.6 Hz;
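
The framing step can be sketched as follows. This is a minimal illustration, not the patent's implementation: the text does not name the window type or the frame overlap, so the Hann window and the non-overlapping hop used here are assumptions, and `frame_signal` is a hypothetical helper name.

```python
import math

SAMPLE_RATE_HZ = 16_000  # sampling rate from the example in the text
FRAME_LEN = 1024         # points per frame, from the text


def frame_signal(samples, frame_len=FRAME_LEN, hop=FRAME_LEN):
    """Split samples into windowed frames.

    Assumptions: Hann window, hop equal to the frame length (no overlap).
    Returns M frames of frame_len points each; the patent's 1024 x M matrix
    holds the same frames as columns.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        chunk = samples[start:start + frame_len]
        frames.append([s * w for s, w in zip(chunk, window)])
    return frames


# Spacing between DFT bins: 16000 / 1024 = 15.625 Hz (rounded to 15.6 Hz in the text).
freq_resolution_hz = SAMPLE_RATE_HZ / FRAME_LEN

# Stand-in input: one second of a 440 Hz tone.
signal = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE_HZ)
          for n in range(SAMPLE_RATE_HZ)]
frames = frame_signal(signal)
print(len(frames), len(frames[0]), freq_resolution_hz)
```

With no overlap, one second of 16 kHz audio yields 15 complete frames; a smaller `hop` would give overlapping frames and a larger M.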

[0033] 2. Fourier analysis module: According to formula (1), the FFT is applied to compute the DFT of each column of the 1024×M framed signal matrix, giving the 1024-point DFT of the corresponding column; per formulas (2), (3), (4), and (5), these columns make up the 1024×M time-frequency analysis matrix. The matrix is a complex m...
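
The per-frame transform can be sketched as below. Formulas (1) to (5) are not reproduced on this page, so the sketch assumes the standard DFT definition and a textbook radix-2 FFT; the names `fft` and `time_frequency_matrix` and the test tone are illustrative only.

```python
import cmath
import math

FRAME_LEN = 1024
SAMPLE_RATE_HZ = 16_000


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def time_frequency_matrix(frames):
    """Return one complex 1024-point spectrum per frame."""
    return [fft(frame) for frame in frames]


# Stand-in input: two frames of a 1000 Hz cosine, which falls exactly on
# bin 64 because 64 * (16000 / 1024) = 1000.
frame = [math.cos(2 * math.pi * 1000 * n / SAMPLE_RATE_HZ)
         for n in range(FRAME_LEN)]
spectra = time_frequency_matrix([frame, frame])
peak_bin = max(range(FRAME_LEN // 2), key=lambda k: abs(spectra[0][k]))
print(peak_bin, peak_bin * SAMPLE_RATE_HZ / FRAME_LEN)
```

Each entry of `spectra` is complex, so it carries both a real and an imaginary part; these are the quantities the color encoding has to preserve.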


Abstract

The invention provides a method for constructing a color complex spectrogram that supports speech reconstruction, and belongs to the technical field of speech signal processing. In the method, two color channels serve as the real part and the imaginary part of the Fourier transform: the position coordinates of the R-B composite color in the R-G-B color space correspond to the real part and the imaginary part, and G encodes the sign combination of the real part and the imaginary part. From the R-G-B color ratio, the real part, the imaginary part, and their signs can be recovered for the corresponding complex number. The spectrogram can therefore be processed as an image, for example to enhance the speech with image processing techniques, and the speech can then be reconstructed through the Fourier transform.
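
The encoding idea in the abstract can be sketched as a round trip. The abstract does not give the exact mapping, so everything specific here is an assumption: R and B hold the scaled magnitudes of the real and imaginary parts, G takes one of four hypothetical levels for the sign combination, channels are floats in [0, 1] rather than 8-bit values, and the helper names are invented.

```python
import cmath

# Hypothetical G levels for the sign combination (real < 0, imag < 0).
SIGN_TO_G = {(False, False): 0.0, (False, True): 1 / 3,
             (True, False): 2 / 3, (True, True): 1.0}
G_TO_SIGN = {g: s for s, g in SIGN_TO_G.items()}


def encode_pixel(z, scale):
    """Map one complex coefficient to an (R, G, B) triple of floats in [0, 1]."""
    return (abs(z.real) / scale, SIGN_TO_G[(z.real < 0, z.imag < 0)],
            abs(z.imag) / scale)


def decode_pixel(pixel, scale):
    """Recover the complex coefficient from an (R, G, B) triple."""
    r, g, b = pixel
    level = min(G_TO_SIGN, key=lambda candidate: abs(candidate - g))
    re_neg, im_neg = G_TO_SIGN[level]
    return complex(-r * scale if re_neg else r * scale,
                   -b * scale if im_neg else b * scale)


def dft(x, inverse=False):
    """Direct DFT or inverse DFT, adequate for a tiny illustrative frame."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[m] * cmath.exp(sign * 2j * cmath.pi * k * m / n)
               for m in range(n)) for k in range(n)]
    return [v / n for v in out] if inverse else out


frame = [0.0, 1.0, 0.5, -0.25, -1.0, 0.75, 0.25, -0.5]  # made-up samples
spectrum = dft(frame)
scale = max(max(abs(z.real), abs(z.imag)) for z in spectrum)
pixels = [encode_pixel(z, scale) for z in spectrum]       # one spectrogram column
decoded = [decode_pixel(p, scale) for p in pixels]
reconstructed = [v.real for v in dft(decoded, inverse=True)]
max_error = max(abs(a - b) for a, b in zip(frame, reconstructed))
print(max_error)
```

Because the pixels keep both parts and their signs, the inverse transform returns the original samples up to rounding. Quantizing the channels to 8 bits, as an ordinary image file would, makes the round trip approximate.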

Description

Technical field

[0001] The invention belongs to the field of speech signal processing and relates to a method for constructing a visual color spectrogram capable of realizing speech reconstruction.

Background technique

[0002] As a useful tool for speech analysis and phonetics, the spectrogram is a readable symbol system for studying speech information. It shows closely related time-domain and frequency-domain features and their relationship at the same time, which is impossible for a time-domain signal alone, a frequency-domain signal alone, or a simple juxtaposition of the two. Therefore, the amount of information carried by the spectrogram is far greater than the sum of the information carried by purely time-domain signals and purely frequency-domain signals. Recent research includes using image processing technology for texture feature extraction, combined with subsequent classifiers, to identify and confirm the voice identity...


Application Information

IPC(8): G10L21/10
Inventor: 王双维, 李广岩, 梁士利, 王春蕾, 曹晓林, 郑彩侠
Owner: NORTHEAST NORMAL UNIVERSITY