Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

48results about How to "Voice enhancement" patented technology

Headset Communication Method Under A Strong-Noise Environment And Headset

The invention discloses a headset communication method under a strong-noise environment and a headset. The method comprises: using earplugs to reduce medium and high frequency noises entering an ear canal, using an external connection cavity in parallel connection with the ear canal to divert medium and low frequency noises; using an internal microphone to pick up the sound in the ear canal and an environmental noise signal entering the ear canal, using an external microphone to pick up the environmental noise signal, and taking the external microphone signal as reference signals to eliminate the noise element in the internal microphone signal and remain the voice element to obtain transmitting terminal signals of the headset; using sound dynamic compression technology to cut down and compensate the signals picked up by the external microphone in terms of sound pressure level such that the sound pressure range is compressed to a range acceptable by human ears and the signals picked up by the external microphone and the receiving terminal signal received by the headset are broadcast together through a receiver of the headset. By means of the technical scheme of the present invention, the functions of protecting hearing, enhancing voice and monitoring a three-dimensional environment can be achieved comprehensively under strong-noise environments.
Owner:GOERTEK INC

Method and system for enhancing a speech signal of a human speaker in a video using visual information

A method and system for enhancing a speech signal is provided herein. The method may include the following steps: obtaining an original video, wherein the original video includes a sequence of original input images showing a face of at least one human speaker, and an original soundtrack synchronized with said sequence of images; and processing, using a computer processor, the original video, to yield an enhanced speech signal of said at least one human speaker, by detecting sounds that are acoustically unrelated to the speech of the at least one human speaker, based on visual data derived from the sequence of original input images.
Owner:YISSUM RES DEV CO OF THE HEBREW UNIV OF JERUSALEM LTD

Speech enhancement method and system, computer equipment and storage medium

The invention provides a speech enhancement method and system, computer equipment and a storage medium, and relates to the technical field of the human-machine speech interaction. The method comprisesthe following steps: collecting multi-channel acoustic signals through an acoustic vector sensor, preprocessing the multi-channel acoustic signals and acquiring a time-frequency spectrum, filtering the time-frequency spectrum and outputting a signal atlas; performing masking processing on the signal atlas through a nonlinear mask, and outputting an enhanced single-channel speech spectrogram; inputting the single-channel spectrogram into a deep neural network mask estimation model and outputting a mask spectrogram; performing time-frequency masking enhancement on the signal atlas through the mask spectrogram to acquire enhanced amplitude speech spectrogram; reconstructing through the enhanced amplitude speech spectrogram so as to output an enhanced target speech signal. The technical problem that the multi-channel speech enhancement is high in hardware cost, large in collection system volume, and high in operation complexity is solved, and the excellent speech enhancement effect can beacquired under difference interference noise types, strengths and room reverberation conditions.
Owner:PEKING UNIV SHENZHEN GRADUATE SCHOOL

Apparatus and method for codec signal in a communication system

InactiveUS20130132100A1Voice enhancementImprove voice and audio QoSsSpeech analysisFrequency bandVIT signals
The present invention relates to a codec apparatus and method for coding / decoding speech and audio signals in a communication system. In accordance with the present invention, a speech and audio signal in a time domain is transformed into a speech and audio signal in a frequency domain and calculating frequency coefficients of the speech and audio signal, the frequency coefficients are split by a plurality of sub-bands and the sub-band coefficients of the respective sub-bands are calculated from the frequency coefficients, and the sub-band coefficients are quantized depending on a characteristic of the plurality of sub-bands and sub-band quantization indices are calculated by quantizing the sub-band coefficients.
Owner:ELECTRONICS & TELECOMM RES INST

Mobile phone and method for processing down voice

The invention relates to a mobile phone and a downlink voice processing method, comprising a baseband chip and a voice filtration module. The baseband chip sends the instructions of initialization and function setting to the voice filtration module; the voice filtration module receives the voice signals input by a transmitter and sends the voice signals to the baseband chip after processing the voice signals; the baseband chip judges whether to execute echo cancellation to the voice signals, and if yes, the baseband chip converts the voice signals to pulse code modulation (PCM )data and sends the data to the voice filtration module; and the data is processed with echo cancellation by the voice filtration module and is converted to voice signals which are then sent back to the baseband chip. Through treatment to the downlink voice signals, even in the case that the counterpart who is in conversation with the user of such functional mobile phone is in a background in which the noise increases suddenly, the voice of the speaker can be automatically increased and the noise can be reduced; the echoes can be eliminated or reserved; and specific noises can be eliminated, thereby realizing that voices can be clearly heard in a noisy environment or under special weather conditions.
Owner:KONKA GROUP

Voice processing method and device based on generative adversarial network

ActiveCN110444224AVoice enhancementImprove packet loss compensation processing efficiencySpeech analysisNeural architecturesFrequency bandGenerative adversarial network
The invention is applicable to the technical field of voice communication, and provides a voice processing method and device based on a generative adversarial network. The method comprises the following steps: acquiring voice training samples, wherein the voice training samples include N groups of complete voice samples, packet loss voice samples corresponding to the complete voice samples, K groups of broadband voice samples and narrowband voice samples corresponding to the broadband voice samples; putting the voice training samples into the generative adversarial network to carry out packetloss compensation model training based on the packet loss voice samples and the complete voice samples, and band spreading model training based on the broadband voice samples and the narrowband voicesamples, thereby obtaining a voice processing system composed of a packet loss compensation model and a band spreading model; and processing an original voice to be processed through the voice processing system to obtain an enhanced voice after packet loss compensation or band spreading. According to the voice processing method and device, the packet loss compensation processing efficiency based on a packet loss voice in voice processing, and the band spreading processing performance based on a narrowband voice can be improved.
Owner:SHENZHEN UNIV

Headset communication method under a strong-noise environment and headset

ActiveUS9467769B2Reduce medium and high frequency noiseReduce noiseMicrophonesEar treatmentEnvironmental noiseIntermediate frequency
The invention discloses a headset communication method under a strong-noise environment and a headset. The method comprises: using earplugs to reduce medium and high frequency noises entering an ear canal, using an external connection cavity in parallel connection with the ear canal to divert medium and low frequency noises; using an internal microphone to pick up the sound in the ear canal and an environmental noise signal entering the ear canal, using an external microphone to pick up the environmental noise signal, and taking the external microphone signal as reference signals to eliminate the noise element in the internal microphone signal and remain the voice element to obtain transmitting terminal signals of the headset; using sound dynamic compression technology to cut down and compensate the signals picked up by the external microphone in terms of sound pressure level such that the sound pressure range is compressed to a range acceptable by human ears and the signals picked up by the external microphone and the receiving terminal signal received by the headset are broadcast together through a receiver of the headset. By means of the technical scheme of the present invention, the functions of protecting hearing, enhancing voice and monitoring a three-dimensional environment can be achieved comprehensively under strong-noise environments.
Owner:GOERTEK INC

Intelligent cloud boxing-data collection terminal

The invention relates to an intelligent cloud boxing-data collecting terminal. The terminal includes a shell body installed on the surface of a sandbag through ropes and an electric control module arranged on the shell body, wherein the electric control module is composed of a main control CPU, an acceleration sensor used for measuring direction of force conducted by a user and / or internal time of the force conducted by the user, and at least two left / right two-tone LED lights electrically connected to the main control CPU separately; the main control CPU is used for receiving a signal which is transmitted from the acceleration sensor and for measuring the direction of the force conducted by the user, and controlling the left / right two-tone LED lights in a corresponding direction to light up according to the signal. The main control CPU receives a signal which is transmitted from the acceleration sensor and for measuring intensity of the force conducted by the use; according to the strength of the signal, the main control CPU controls the luminance of the left / right two-tone LED lights in the corresponding direction. The intelligent cloud boxing-data collection terminal has the advantages of being reasonable in design, compact in structure and convenient to use.
Owner:广州巨科电子科技有限公司

Microphone array speech enhancement system and method based on multi-task network

PendingCN114694670AStrong noise reduction performanceVoice enhancementSpeech analysisFrequency domainSpeech enhancement
The invention discloses a microphone array voice enhancement system and method based on a multi-task network. The system is composed of a voice preprocessing module, a multi-task network module, a multi-task loss statistics module, a network weight calculation module and a voice reconstruction module. Wherein the voice preprocessing module acquires array voice, reference echo voice and target voice of each task as input voice and preprocesses the input voice; the multi-task network module completes reverberation removal, echo cancellation and noise reduction tasks of each sound channel of the array voice, fuses the multi-sound-channel voice and outputs the multi-sound-channel voice as enhanced voice; the multi-task loss statistics module is used for calculating the loss value of each task in the multi-task network module and counting the total loss of the network; the network weight calculation module calculates a gradient according to the total loss of the network, carries out back propagation on the gradient, and calculates the weight of the updated network; and the voice reconstruction module completes mapping from the frequency domain features to the time domain voice to obtain enhanced clean voice.
Owner:SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products