Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method, system and storage medium for driving virtual human by voice in real time

A virtual human and voice technology, applied in the field of information security, can solve problems such as lack of timing information, inability to meet real-time performance, and reduced facial movement accuracy of virtual human images

Active Publication Date: 2022-03-04
北京影创信息科技有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the existing voice-driven virtual human technology cannot be well extended to real-time applications such as virtual meetings. Voice acquisition, neural network computing speed, audio and video delays and other issues cannot meet the real-time requirements; on the other hand, the voice-driven virtual human technology in real-time scenarios requires the length of the input voice segment to be as short as possible, so as to meet the real-time requirements
In the process of voice feature calculation, due to the lack of timing information and necessary semantic information in short voice segments, it is easy to reduce the accuracy of the facial movement of the avatar during the driving process, thereby reducing the sense of reality.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, system and storage medium for driving virtual human by voice in real time
  • Method, system and storage medium for driving virtual human by voice in real time
  • Method, system and storage medium for driving virtual human by voice in real time

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

[0054] figure 1 It is a flow chart of a method for driving a virtual human by voice in real time provided by the embodiment of the present application. Such as figure 1 As shown, the method for real-time voice-driven virtual human provided by the embodiment of the present application includes the following steps:

[0055] S1. Acquire the RGB image of the human face.

[0056] Specific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a method, system and storage medium for driving a virtual person by voice in real time. The method for driving a virtual person by voice in real time includes: obtaining an RGB image of a human face; performing 3D face reconstruction on the RGB image of a human face to obtain an RGB image of a human face Corresponding 3D face parameters; pre-acquisition of a piece of voice and save it in the cache queue after denoising processing; real-time collection of voice clips and save them in the cache queue after denoising processing; read all voice clips in the cache queue and Splicing, to get the spliced ​​speech clips, and use the spliced ​​speech clips and the pre-trained neural network to get the predicted 3D facial expression parameters; get the rendered RGB image according to the predicted 3D facial expression parameters and 3D face parameters . This application can achieve the real-time performance of the entire driving process without reducing the quality of the virtual human, so that the voice-driven virtual human technology can be used in various real-time applications.

Description

technical field [0001] The present application belongs to the technical field of information security, and in particular relates to a method, system and storage medium for real-time voice driving of a virtual human. Background technique [0002] Voice-driven virtual human technology is a kind of virtual human-driven technology, which uses voice to drive a preset virtual human model to generate a dynamic virtual human image that conforms to the voice content. In recent years, with the development and maturity of voice-driven virtual human technology, voice-driven virtual human technology has derived quite a lot of applications, such as virtual anchors, virtual customer service and virtual idols. Since the avatar often needs to be presented directly to the user, the user has high requirements for the authenticity and accuracy of the voice-driven results. [0003] However, the existing voice-driven virtual human technology cannot be well extended to real-time applications such...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06T13/20G06T17/00G06N3/04G06N3/08
CPCG06T13/205G06T17/00G06N3/08G06N3/045
Inventor 徐迪马宜祯张彦博常友坚毛文涛蔡宝军
Owner 北京影创信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products