Video stream generation method and device, equipment and storage medium

A video stream and video frame technology, applied in the field of video stream generation methods, devices, equipment and storage media, that can solve problems such as difficulty locating the speaker, background noise, and inconvenient meetings.

Active Publication Date: 2021-05-11
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0003] In related technologies, indoor multi-person video conferencing requires a strong sense of connection and mutual awareness between the two sites. Traditional equipment suffers from background noise and interference from other voices, and the camera focuses poorly, which makes it difficult to locate the main speaker.
In audio and video communication in outdoor halls, stations, and open spaces, noisy backgrounds and interference from other voices make meetings inconvenient.

Examples

Example 1

[0046] In this example, as shown in Figure 3, the step of determining the first speaking user corresponding to the voice data includes:

[0047] Step 301: Acquire a panoramic image corresponding to the voice data, and analyze the lip shape features of the users in the panoramic image.

[0048] The lip shape features include, but are not limited to, lip shape change features.

[0049] In this embodiment, the panoramic image collected by the preset panoramic camera at the time point of the voice data can be used as the panoramic image corresponding to the voice data, and the lip shape features in the panoramic image can then be analyzed using image processing techniques.

[0050] Step 302: Determine the first speaking user corresponding to the voice data according to the lip shape features.

[0051] It is easy to understand that when a user utters different voice data, the corresponding lip shape features are obviously differ...
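
As an illustration of steps 301 and 302, the following is a minimal sketch that assumes lip shape change can be summarized as one mouth-opening ratio per user per frame. The function name `pick_speaking_user`, the user IDs, and the variance-based score are hypothetical and are not taken from the patent; extracting the ratios from the panoramic image is out of scope here.

```python
from statistics import pvariance


def pick_speaking_user(lip_openings):
    """Return the user ID whose lip opening varies most across frames.

    lip_openings maps a (hypothetical) user ID to a list of mouth-opening
    ratios, one per panoramic frame. Users with fewer than two frames are
    skipped because no change can be measured for them.
    """
    scores = {
        user: pvariance(values)
        for user, values in lip_openings.items()
        if len(values) >= 2
    }
    if not scores:
        return None
    # Assumption: the largest variance indicates the user who is speaking.
    return max(scores, key=scores.get)


frames = {
    "user_a": [0.10, 0.11, 0.10, 0.12],  # lips almost still
    "user_b": [0.05, 0.40, 0.10, 0.45],  # lips opening and closing
}
speaker = pick_speaking_user(frames)
```

A real system would likely combine such a score with the timing of the voice data; the variance heuristic is only one plausible choice.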

Example 2

[0054] In this example, as shown in Figure 4, the step of determining the first speaking user corresponding to the voice data includes:

[0055] Step 401: Calculate the time delay difference with which the voice data arrives at the first microphone and the second microphone in the preset microphone array.

[0056] It can be understood that the microphone array in this embodiment includes microphones arranged at two different positions, namely the first microphone and the second microphone.

[0057] In this embodiment, if the angle relative to the end-fire direction of the first microphone and the second microphone is a, the time delay difference with which the voice data arrives at the first microphone and the second microphone can be calculated using the following formula (1):

[0058]

[0059] Wherein, in the above formula (1), τ is the time delay...
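
Formula (1) is not reproduced in this extract, so the following sketch uses the common far-field relation for a two-microphone pair, τ = d·cos(a)/c, which may or may not match the patent's formula. The microphone spacing `spacing_m`, the speed of sound value, and both function names are assumptions made for illustration.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at about 20 °C


def delay_from_angle(spacing_m, angle_rad, c=SPEED_OF_SOUND_M_S):
    """Far-field time delay difference for a source at angle_rad from end-fire."""
    return spacing_m * math.cos(angle_rad) / c


def angle_from_delay(spacing_m, delay_s, c=SPEED_OF_SOUND_M_S):
    """Invert the relation above; the ratio is clamped to acos's domain."""
    ratio = max(-1.0, min(1.0, delay_s * c / spacing_m))
    return math.acos(ratio)


# Hypothetical setup: microphones 0.1 m apart, source 60 degrees off end-fire.
tau = delay_from_angle(0.1, math.radians(60))
angle = math.degrees(angle_from_delay(0.1, tau))
```

In practice the delay would be measured (for example by cross-correlating the two microphone signals) and the angle derived from it, which is why the inverse function is included.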

Abstract

The invention discloses a video stream generation method and device, equipment and a storage medium, and relates to the technical fields of speech, video processing, computer vision and deep learning. In a specific implementation, the method comprises: when voice data is detected, determining a first speaking user corresponding to the voice data; controlling a preset camera to focus on the first speaking user to capture a first video frame image, and collecting first speech data of the first speaking user; denoising the noise data in the first speech data to obtain first target data; and generating a video stream from the first target data and the first video frame image. In a video streaming scenario, the camera thus focuses on the speaker when capturing the video frame image, and noise from non-speakers is suppressed, which improves the quality of the video stream and meets video requirements in a variety of scenarios.
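
The four steps in the abstract can be sketched as a small pipeline. This is only an outline under stated assumptions: `StreamChunk`, `build_chunk`, and the callables passed in are hypothetical names, and the threshold-based `noise_gate` is a crude stand-in for whatever denoising the patent actually uses.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class StreamChunk:
    speaker: str
    frame: str          # stand-in for the first video frame image
    audio: List[float]  # stand-in for the first target data (denoised)


def noise_gate(samples: List[float], threshold: float = 0.05) -> List[float]:
    """Zero out samples whose magnitude is below threshold (assumed denoiser)."""
    return [s if abs(s) >= threshold else 0.0 for s in samples]


def build_chunk(
    voice: List[float],
    locate_speaker: Callable[[List[float]], str],
    capture_frame: Callable[[str], str],
    denoise: Callable[[List[float]], List[float]] = noise_gate,
) -> Optional[StreamChunk]:
    """Run the four steps once; return None when no voice data is present."""
    if not voice:
        return None
    speaker = locate_speaker(voice)   # determine the first speaking user
    frame = capture_frame(speaker)    # focus the camera on that user
    return StreamChunk(speaker, frame, denoise(voice))


chunk = build_chunk(
    [0.01, 0.30, -0.02, -0.25],
    locate_speaker=lambda voice: "user_b",
    capture_frame=lambda speaker: f"frame focused on {speaker}",
)
```

Passing the speaker locator and camera control in as callables keeps the sketch independent of any particular localization method, such as the lip shape or microphone array approaches in the examples.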

Description

Technical field

[0001] The present disclosure relates to the fields of speech technology, video processing technology, computer vision technology and deep learning technology, and in particular to a method, apparatus, device and storage medium for generating a video stream.

Background

[0002] With the development of computer technology, video scenarios based on computer technology are becoming more and more common, for example indoor or outdoor video conferences.

[0003] In related technologies, indoor multi-person video conferencing requires a strong sense of connection and mutual awareness between the two sites. Traditional equipment suffers from background noise and interference from other voices, and the camera focuses poorly, which makes it difficult to locate the main speaker. In audio and video communication in outdoor halls, stations, and open spaces, noisy backgrounds and interference from other voices make meetings inconvenient.

Contents of t...

Application Information

IPC(8): H04N7/15, H04N5/232, G10L21/0208, H04L29/06
CPC: H04N7/15, G10L21/0208, H04L65/403, H04L65/80, H04L65/762, H04N23/67
Inventor: 曹璨, 李峥, 戴宁, 姜俊, 王昕, 魏建强, 付明鑫
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD