Audio mixing method and equipment thereof
A technology of sound mixing and speech frames, which is used in speech analysis, instruments, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0027] This embodiment discloses a sound mixing method, such as figure 1 As shown, the method includes:
[0028] Step S1, the server receives speech coded data sent by at least two source clients.
[0029] In the present invention, the source client refers to the client that provides voice data, that is, the sending end; the sink client refers to the client that receives voice data, that is, the receiving end; in usage scenarios such as VOIP multi-person conferences, any A client can be either a source client or a sink client, and details will not be described later.
[0030] Step S2, the server uniformly decodes the received coded data into a PCM (pulse-code modulation, pulse code modulation) format.
[0031] It should be noted that, unless otherwise specified, the speech data in the subsequent steps refers to the decoded speech data in PCM format.
[0032] Step S3, the server judges whether the sampling rates and channels of the speech coding data transmitted by all sourc...
Embodiment 2
[0055] This embodiment is an improvement made to the first embodiment above. That is, during the mixing process of the server, the Figure 5 Anti-spill treatment shown.
[0056] Such as Figure 5 As shown, the audio mixing process disclosed in this embodiment includes the following steps:
[0057] Step S10, superimposing the aligned PCM data of all source clients.
[0058] Step S20, judging whether the superimposed current speech frame overflows. If it overflows, turn to the following step S30; if it does not overflow (not shown in the figure), the attenuation factor is 1, and the attenuation process is not performed on the current speech frame, and ends at a speech frame before the subsequent beginning to overflow the speech frame, its attenuation Factors are all 1.
[0059] Step S30, calculating the attenuation factor of the current speech frame, the attenuation factor being the ratio of the overflow threshold to the maximum overflow value, and performing attenuation pr...
Embodiment 3
[0066] Corresponding to Embodiment 1 of the above method, this embodiment discloses a server for processing audio mixing in a multi-person conference. Such as Figure 6 As shown, the server includes:
[0067] Receiving and decoding module 100, configured to receive and decode speech coded data sent by at least two source clients, and the decoded data is unified into PCM format;
[0068] Judging module 200, for judging whether the sampling rate and sound channel of the speech coded data transmitted by all source clients are consistent with the standard;
[0069] The channel conversion module 300 is used to extend or merge the voice data of all source clients different from the standard channel when the judgment module 200 judges that the channel of the voice data of the source client is different from the standard channel;
[0070] The re-sampling module 400 is used for re-sampling the voice data of the different source client based on the standard sampling rate when the judg...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com