Method of converting whispered voice into normal voice based on radial group neutral network
A technology based on neural network and normal speech, applied in speech analysis, instruments, etc., can solve problems such as difficulty in extracting formants of ear speech, affecting call quality, and distorted speech bands, achieving confidential calls, good intelligibility, The effect of facilitating communication
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0039] Embodiment one: see attached Figures 1 to 4 as shown,
[0040] Ear speech has no pitch period, its energy is 20dB lower than normal speech, and its signal-to-noise ratio is lower. This voice signal not only has a low signal-to-noise ratio but also has poor intelligibility and clarity, which not only affects the quality of the call, but also easily causes fatigue. In this embodiment, an audio file in wav format with a sampling rate of 10KHz is selected, and the workflow of each step will be described in detail below.
[0041] Such as figure 1 As shown, the method of the present embodiment includes the following steps:
[0042] Step 11: Preprocessing the ear-to-ear speech. Firstly, pre-emphasis processing is performed on the ear speech. The purpose of pre-emphasis is to enhance the high-frequency part, make the spectrum of the signal flat, and keep it in the entire frequency band from low frequency to high frequency. The spectrum can be calculated with the same sign...
Embodiment 2
[0086] Embodiment two: see attached Figures 5 to 8 as shown,
[0087] The wav format audio file ear speech "a, o, e, i, u, v" of sampling rate 10KHz is respectively processed as follows: (1) use linear prediction method (LPC) to convert ear speech; (2) use the present invention The method converts ear speech. Figure 5-7 The waveform diagram and spectrogram of the normal speech and the speech "a" processed by the above two algorithms are given respectively. It can be seen that the spectrogram of converted speech by the method of the present invention is closer to the spectrogram of normal speech.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com