A variable-length panoramic sound signal encoding and decoding method

This coding method belongs to sound signal technology (applied in speech analysis, instruments, etc.). It addresses the problem that editing audio affects the organization of all subsequent audio data and wastes time on re-encoding, and it thereby improves audio coding technology for panoramic sound.

Active Publication Date: 2020-08-11
WAVARTS TECH CO LTD

AI Technical Summary

Problems solved by technology

[0006] In the existing technology, the frame length of audio encoding is fixed, so once the audio data is reorganized, the organization of all subsequent audio data is inevitably affected.
Everything from the modified part to the end of the movie then has to be re-encoded, which wastes time; ideally, only the modified part would be re-encoded.

Examples

Embodiment 1

[0064] Example 1: When encoding the audio data shown in Figure 1, suppose the length of a certain frame is k samples (k ...; see Figure 2). As shown in Figure 3, encoding according to the method provided by the present invention outputs only a code stream with a length of k samples, and the length information of this audio frame is written in the frame_length field of the code stream (the length also needs to be multiplied by the bit depth and the number of AES channels).

[0065] When decoding, the code stream data of 1024 samples is read each time, and at least one frame of audio data is decoded from it. Within these 1024 samples, if an audio frame of length k (k≤1024) is encountered at the n-th sample position (for example, corresponding to the m-th audio frame), it is decoded according to the length information frame_length in the code stream. A frame of audio data can be decoded only once its code stream is complete.

[0066] 1. n+k=1024, then output the audio data ...
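The reading rule in [0065] can be illustrated with a minimal Python sketch. It is only an assumption-laden illustration, not the patented decoder: the code stream is modelled as a plain list of samples, the hypothetical `frame_lengths` argument stands in for the frame_length values a real decoder would parse from the stream, and `decode_stream` is an invented name.

```python
from typing import Iterable, Iterator, List

CHUNK = 1024  # samples read per decoding step, as in [0065]


def decode_stream(samples: List[int], frame_lengths: Iterable[int]) -> Iterator[List[int]]:
    """Yield complete frames while reading `samples` CHUNK samples at a time.

    `frame_lengths` is a hypothetical stand-in for the frame_length fields
    that a real decoder would read from the code stream.
    """
    buffer: List[int] = []
    lengths = iter(frame_lengths)
    pending = next(lengths, None)  # length of the frame currently awaited
    for start in range(0, len(samples), CHUNK):
        buffer.extend(samples[start:start + CHUNK])  # read one chunk
        # A frame is emitted only once all of its samples have arrived.
        while pending is not None and len(buffer) >= pending:
            yield buffer[:pending]
            del buffer[:pending]
            pending = next(lengths, None)


if __name__ == "__main__":
    stream = list(range(1024 + 640 + 1024))
    for frame in decode_stream(stream, [1024, 640, 1024]):
        print(len(frame))
```

In this sketch a short frame of 640 samples shifts the following full frame across a chunk boundary, so that frame is held in the buffer until the next read completes it.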

Embodiment 2

[0071] Embodiment 2: Assume that the video frame rate is 24fps, the audio sampling rate is 48000Hz, and the audio bit depth is 24bit.

[0072] As shown in Figure 5, the encoding method for inserting 200 video frames at the 1000th video frame of the movie is as follows:

[0073] Step 1: Calculate the number of audio samples corresponding to the 1000th video frame

[0074] StartPos=1000*48000 / 24=2,000,000;

[0075] Step 2: Calculate the index of the audio frame corresponding to the 1000th video frame

[0076] StartFrame=StartPos / 1024=1953 (rounded down), and then save the data after the 1953rd audio frame as the file Data0.pcm;

[0077] Step 3: Calculate the number of audio samples to be inserted: InsertSamples=200*48000 / 24=400,000;

[0078] Step 4: Count all audio samples that need to be reorganized

[0079] DataToReenc=InsertSamples+1024=401,024;

[0080] Step 5: Reorganize Audio Data

[0081] 401,024=391*1024+640, including 391 full audio frames and a short frame with a length of 640...
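The arithmetic of Steps 1 to 5 can be checked with a short Python sketch. The function and key names (`video_frame_to_sample`, `plan_insertion`, and the dictionary keys) are illustrative assumptions; only the formulas and the constants 24 fps, 48000 Hz and 1024 samples per frame come from the text above.

```python
FRAME_SIZE = 1024  # samples in one full audio frame


def video_frame_to_sample(video_frame: int, sample_rate: int = 48000, fps: int = 24) -> int:
    """Number of audio samples spanned by `video_frame` video frames."""
    return video_frame * sample_rate // fps


def plan_insertion(insert_at: int, inserted_video_frames: int) -> dict:
    """Reproduce Steps 1-5 for inserting video frames at `insert_at`."""
    start_pos = video_frame_to_sample(insert_at)                   # Step 1
    start_frame = start_pos // FRAME_SIZE                          # Step 2 (rounded down)
    insert_samples = video_frame_to_sample(inserted_video_frames)  # Step 3
    data_to_reenc = insert_samples + FRAME_SIZE                    # Step 4
    full_frames, short_frame = divmod(data_to_reenc, FRAME_SIZE)   # Step 5
    return {
        "StartPos": start_pos,
        "StartFrame": start_frame,
        "InsertSamples": insert_samples,
        "DataToReenc": data_to_reenc,
        "full_frames": full_frames,
        "short_frame_length": short_frame,
    }


if __name__ == "__main__":
    for name, value in plan_insertion(1000, 200).items():
        print(f"{name} = {value}")
```

Using `divmod` keeps the count of full frames and the length of the trailing short frame in one step, which mirrors the decomposition in [0081].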

Embodiment 3

[0088] Embodiment 3: It is assumed that the video frame rate is 24fps, the audio sampling rate is 48000Hz, and the audio bit depth is 24bit.

[0089] As shown in Figure 6, the encoding method for deleting 300 video frames at the 1000th video frame of the movie is as follows:

[0090] Step 1: Calculate the number of audio samples corresponding to the 1000th video frame

[0091] StartPos=1000*48000 / 24=2,000,000;

[0092] Step 2: Calculate the number of audio frames corresponding to the 1000th video frame: StartFrame=StartPos / 1024=1953;

[0093] Step 3: Calculate the number of samples corresponding to the deletion end point: EndPos=1300*48000 / 24=2,600,000;

Step 4: Calculate the number of audio frames corresponding to the deletion end point: EndFrame=EndPos / 1024=2539, and save the data after the 2539th audio frame as the file Data0.pcm;

[0094] Step 5: Count all audio samples that need to be reorganized

[0095] DataToReenc = (EndFrame-StartFrame+1)*1024 - (EndPos-StartPos) =10...
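The deletion formulas in Steps 1 to 5 can likewise be written as a small Python sketch. The names `video_frame_to_sample` and `plan_deletion` and the dictionary keys are hypothetical; the sketch assumes, as in Embodiment 2, that the divisions by 1024 are rounded down.

```python
FRAME_SIZE = 1024  # samples in one full audio frame


def video_frame_to_sample(video_frame: int, sample_rate: int = 48000, fps: int = 24) -> int:
    """Number of audio samples spanned by `video_frame` video frames."""
    return video_frame * sample_rate // fps


def plan_deletion(delete_from: int, deleted_video_frames: int) -> dict:
    """Reproduce Steps 1-5 for deleting video frames starting at `delete_from`."""
    start_pos = video_frame_to_sample(delete_from)                       # Step 1
    start_frame = start_pos // FRAME_SIZE                                # Step 2
    end_pos = video_frame_to_sample(delete_from + deleted_video_frames)  # Step 3
    end_frame = end_pos // FRAME_SIZE                                    # Step 4
    # Step 5: samples left in the touched audio frames after the deleted span is removed.
    data_to_reenc = (end_frame - start_frame + 1) * FRAME_SIZE - (end_pos - start_pos)
    return {
        "StartPos": start_pos,
        "StartFrame": start_frame,
        "EndPos": end_pos,
        "EndFrame": end_frame,
        "DataToReenc": data_to_reenc,
    }


if __name__ == "__main__":
    for name, value in plan_deletion(1000, 300).items():
        print(f"{name} = {value}")
```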

Abstract

The invention discloses a variable-length panoramic sound signal coding and decoding method. The coding comprises the steps of: calculating the number of audio sample points of a to-be-coded panoramic sound signal; dividing the audio sample points into complete frames according to the number of sampling points per frame; treating the portion that does not fill a complete frame as a short frame; coding the complete frame portion in the standard manner, coding the short frame portion in a short frame mode, and writing the length information of the short frame in a frame_length field in the code stream. The decoding comprises the steps of: reading, each time, the code stream data corresponding to the number of sampling points of one audio frame; performing decoding in the standard mode if the audio frame corresponding to the code stream data is a complete frame; and performing decoding according to the length information in the frame_length field of the code stream if the audio frame corresponding to the code stream data includes a short frame. The method supports adding, deleting and replacing any segment in a film project, achieves complete and accurate coding and decoding, and does not affect data that has already been encoded.
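As a rough illustration of the framing idea in the abstract, the following Python sketch splits a sample sequence into complete frames plus one trailing short frame and restores it again. It is a toy model under stated assumptions, not the patented codec: no actual compression is performed, the pair `(frame_length, samples)` is a hypothetical stand-in for a frame in the code stream, and the length is stored for every frame here for simplicity.

```python
from typing import List, Tuple

FRAME_SIZE = 1024  # samples in one complete frame

# Hypothetical frame model: (frame_length, samples)
Frame = Tuple[int, List[int]]


def encode(samples: List[int], frame_size: int = FRAME_SIZE) -> List[Frame]:
    """Split samples into complete frames and, if needed, one short frame."""
    frames: List[Frame] = []
    for start in range(0, len(samples), frame_size):
        payload = samples[start:start + frame_size]
        # The last slice may be shorter than frame_size; its real length
        # is recorded instead of padding it to a complete frame.
        frames.append((len(payload), payload))
    return frames


def decode(frames: List[Frame]) -> List[int]:
    """Concatenate frames, trusting each frame's recorded length."""
    out: List[int] = []
    for frame_length, payload in frames:
        out.extend(payload[:frame_length])
    return out


if __name__ == "__main__":
    pcm = list(range(2 * FRAME_SIZE + 640))
    coded = encode(pcm)
    print([length for length, _ in coded])
    print(decode(coded) == pcm)
```

Because the short frame carries its own length, re-encoding an edited segment changes only the frames of that segment; the frames before and after it keep their original boundaries.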

Description

Technical field

[0001] The invention relates to the technical field of coding and decoding processing of panoramic sound signals, in particular to a coding and decoding method of variable-length panoramic sound signals.

Background technique

[0002] After years of development, stereo, 5.1, 7.1 surround sound and other systems have been widely used. Due to the lack of sound height information, these systems can only present two-dimensional sound at most. In the real world, panoramic sound (also known as three-dimensional sound) is the most realistic presentation and expression of sound. Regardless of nature, art or audio-visual entertainment, panoramic sound is the future development trend.

[0003] In existing systems, panoramic sound can be a multi-channel signal (such as 9.1, 11.1, 13.1, and 22.2, etc.), multiple sound objects, or a combination of both. In the panoramic sound system, the multi-channel signal can be a surround sound signal, such as 5.1, 7.1, etc., or it ca...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G10L19/008
CPC: G10L19/008
Inventor: 潘兴德, 黄旭, 吴超刚, 陈笑天
Owner: WAVARTS TECH CO LTD