Audio mixing method, apparatus, device, and storage medium
An audio mixing technique for audio streams, applicable to speech analysis, speech recognition, instruments, and similar fields. It addresses the problem of volume reduction and achieves a clear human voice, good practicability, and a good user experience.
Examples
Embodiment 1
[0047] Figure 1 is a flow chart of the audio mixing method provided in Embodiment 1. This method is applicable to online voice scenarios such as multi-party conference calls or network conferences, and can be executed by software and/or hardware deployed on the server or the client. As shown in Figure 1, the mixing method provided in this embodiment includes:
[0048] S102. Receive at least two channels of audio stream data.
[0049] Receive the audio stream data of all channels that are in the working state in a conference call or web conference.
[0050] S104. Detect the type of the audio stream data of each channel with a pre-trained human voice detection model, so as to identify the human voice channel audio stream data and the noise channel audio stream data.
[0051] The pre-trained human voice detection model in this embodiment is preferably, but not limited to, a GMM model based on Gaussian probability density functions, an SVM model based on support vector machines, the DNN model b...
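Steps S102 and S104 can be illustrated with a minimal Python sketch. It assumes each channel arrives as a list of float samples and that the detection model is exposed as a callable returning True for human voice. The names `split_channels` and `toy_detector` are hypothetical, and `toy_detector` is only an amplitude-threshold stand-in so the example runs; it is not the GMM, SVM, or DNN model described above.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Samples = Sequence[float]


def split_channels(
    channels: Dict[str, Samples],
    is_voice: Callable[[Samples], bool],
) -> Tuple[Dict[str, Samples], Dict[str, Samples]]:
    """Label every received channel as human voice or noise (S104)."""
    voice: Dict[str, Samples] = {}
    noise: Dict[str, Samples] = {}
    for name, samples in channels.items():
        # The detector is pluggable; a pre-trained model would go here.
        if is_voice(samples):
            voice[name] = samples
        else:
            noise[name] = samples
    return voice, noise


def toy_detector(samples: Samples, threshold: float = 0.1) -> bool:
    """Placeholder detector (assumption): mean absolute amplitude threshold.

    This is NOT a real voice detection model; it only stands in for one.
    """
    if not samples:
        return False
    return sum(abs(s) for s in samples) / len(samples) >= threshold


# S102: at least two channels of audio stream data are received.
received = {
    "speaker": [0.5, -0.4, 0.6],
    "background": [0.01, -0.02, 0.0],
}
voice_channels, noise_channels = split_channels(received, toy_detector)
```

In a real deployment the callable would wrap whichever pre-trained model is chosen, and the rest of the pipeline would not need to change.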
Embodiment 2
[0069] Figure 2 is a flow chart of the sound mixing method provided by an embodiment of the present invention. As shown in Figure 2, relative to the foregoing embodiment, the mixing method provided by this embodiment further includes:
[0070] S1051. Determine whether the amplitude of the human voice channel audio stream data is lower than a preset adjustment amplitude.
[0071] S1052. If yes, normalize the human voice channel audio stream data to the first preset amplitude range, so as to update the human voice channel audio stream data.
[0072] Within the preset time interval, when the maximum amplitude of the human voice channel audio stream data is lower than the preset adjustment amplitude, the human voice channel audio stream data is normalized to the first preset amplitude range. This raises the amplitude of the human voice channel audio stream data, thereby increasing its amplitude before mixing, and increasing the ampl...
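A minimal sketch of steps S1051 and S1052 follows, under stated assumptions: samples are floats in [-1.0, 1.0], one call processes the samples of one preset time interval, and "normalizing to the first preset amplitude range" is interpreted as peak scaling to a target peak. The function name `boost_quiet_voice` and the values 0.2 and 0.8 are hypothetical and are not taken from the patent.

```python
from typing import List, Sequence


def boost_quiet_voice(
    samples: Sequence[float],
    adjust_threshold: float = 0.2,  # assumed "preset adjustment amplitude"
    target_peak: float = 0.8,       # assumed top of the first preset range
) -> List[float]:
    """Scale a quiet voice interval up so that its peak reaches target_peak."""
    # S1051: compare the maximum amplitude of the interval with the threshold.
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0 or peak >= adjust_threshold:
        # Silent or already loud enough: leave the data unchanged.
        return list(samples)
    # S1052: peak normalization, one possible reading of "normalize".
    gain = target_peak / peak
    return [s * gain for s in samples]


quiet = [0.05, -0.1, 0.02]
boosted = boost_quiet_voice(quiet)
```

Peak scaling keeps the waveform shape and cannot clip as long as `target_peak` stays at or below 1.0, which is why it is used here as the simplest interpretation.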
Embodiment 3
[0080] Figure 3 is a flow chart of the sound mixing method provided by an embodiment of the present invention. As shown in Figure 3, in order to further improve the human voice in the resulting mixed data, this embodiment, relative to the foregoing embodiments, preferably further includes:
[0081] S1053. Determine whether the difference between the amplitudes of the human voice channel audio stream data and the noise channel audio stream data is within a preset amplitude difference range.
[0082] S1054. If so, normalize the human voice channel audio stream data to the second preset amplitude range to update the human voice channel audio stream data, and normalize the noise channel audio stream data to the third preset amplitude range to update the noise channel audio stream data, wherein the second preset amplitude range is greater than the third preset amplitude range.
[0083] In this embodiment, after the human voice channel audio stream data a...
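Steps S1053 and S1054 can be sketched as follows. The sketch assumes that "amplitude" means the peak absolute sample value, that the difference is "within the preset range" when its absolute value does not exceed `max_diff`, and that normalizing to a range means peak scaling. The helper names and the default values are hypothetical illustrations, not figures from the patent.

```python
from typing import List, Sequence, Tuple


def peak(samples: Sequence[float]) -> float:
    """Largest absolute sample value, or 0.0 for empty input."""
    return max((abs(s) for s in samples), default=0.0)


def scale_to_peak(samples: Sequence[float], target: float) -> List[float]:
    """Peak-normalize samples so that their peak equals target."""
    p = peak(samples)
    if p == 0.0:
        return list(samples)
    return [s * target / p for s in samples]


def separate_levels(
    voice: Sequence[float],
    noise: Sequence[float],
    max_diff: float = 0.1,    # assumed preset amplitude difference range
    voice_peak: float = 0.8,  # assumed second preset amplitude range
    noise_peak: float = 0.2,  # assumed third preset amplitude range
) -> Tuple[List[float], List[float]]:
    """Push voice and noise apart when their amplitudes are too close."""
    if voice_peak <= noise_peak:
        raise ValueError("second range must be greater than third range")
    # S1053: are the two amplitudes within the preset difference range?
    if abs(peak(voice) - peak(noise)) <= max_diff:
        # S1054: voice goes to the higher range, noise to the lower one.
        return scale_to_peak(voice, voice_peak), scale_to_peak(noise, noise_peak)
    return list(voice), list(noise)


new_voice, new_noise = separate_levels([0.4, -0.5], [0.45, -0.3])
```

Because the second range is required to exceed the third, the voice channel ends up louder than the noise channel whenever the adjustment is triggered.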