Method and system for driving a virtual human in real time through voice, and storage medium

A voice-driven virtual human technology, applied in the field of information security, that addresses problems such as the lack of timing information in short voice segments, reduced accuracy of the virtual human's facial movements, and a reduced sense of realism.

Active Publication Date: 2021-12-21
北京影创信息科技有限公司

AI Technical Summary

Problems solved by technology

[0003] However, existing voice-driven virtual human technology does not extend well to real-time applications such as virtual meetings. On the one hand, voice acquisition, neural network computing speed, and audio and video delays prevent it from meeting real-time requirements. On the other hand, voice-driven virtual human technology in real-time scenarios requires the input voice segment to be as short as possible in order to meet those requirements. During voice feature calculation, short voice segments lack timing information and necessary semantic information, which tends to reduce the accuracy of the avatar's facial movement during driving and therefore the sense of realism.

Embodiment Construction

[0053] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

[0054] Figure 1 is a flow chart of a method for driving a virtual human by voice in real time provided by an embodiment of the present application. As shown in Figure 1, the method includes the following steps:

[0055] S1. Acquire the RGB image of the human face.

[0056] Specificall...

Abstract

The invention provides a method and system for driving a virtual human in real time through voice and a storage medium. The method for driving the virtual human in real time through voice comprises the steps: acquiring a face RGB image; performing 3D face reconstruction on the face RGB image to obtain a 3D face parameter corresponding to the face RGB image; pre-acquiring a voice clip, denoising the voice clip and then storing the voice clip in a cache queue; acquiring a voice clip in real time, denoising the voice clip, and storing the voice clip in the cache queue; reading all the voice clips in the cache queue and splicing the voice clips to obtain spliced voice clips, and obtaining predicted 3D facial expression parameters by using the spliced voice clips and a pre-trained neural network; and obtaining a rendered RGB image according to the predicted 3D face expression parameters and 3D face parameters. According to the method, the real-time performance of the whole driving process can be achieved on the basis of not reducing the quality of the virtual human, so that the voice-driven virtual human technology can be applied to various real-time applications.
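
The cache-queue steps in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `denoise`, `drive`, `predict`, and `queue_len` are hypothetical, the threshold-based denoiser is a placeholder, and the fixed-length queue that evicts its oldest clip is an assumption (the abstract only says that all clips in the queue are read and spliced). The pre-trained neural network is represented by a caller-supplied `predict` function, and rendering is left out.

```python
from collections import deque
from typing import Callable, Deque, Iterable, Iterator, List, Sequence

Clip = List[float]  # one short voice clip as raw samples (hypothetical representation)


def denoise(clip: Sequence[float], threshold: float = 0.01) -> Clip:
    """Placeholder denoiser (assumption): zero out samples below a noise floor."""
    return [s if abs(s) >= threshold else 0.0 for s in clip]


def drive(
    pre_clips: Iterable[Sequence[float]],
    live_clips: Iterable[Sequence[float]],
    predict: Callable[[Clip], List[float]],
    queue_len: int = 4,
) -> Iterator[List[float]]:
    """Yield one predicted expression-parameter vector per live clip.

    `predict` stands in for the pre-trained network; `queue_len` and the
    oldest-first eviction policy are assumptions made for this sketch.
    """
    cache: Deque[Clip] = deque(maxlen=queue_len)

    # Pre-acquired clips are denoised and cached first, so that the very
    # first live clip already has some preceding context.
    for clip in pre_clips:
        cache.append(denoise(clip))

    for clip in live_clips:
        # Each live clip is denoised and appended; once the queue is full,
        # deque(maxlen=...) silently drops the oldest clip.
        cache.append(denoise(clip))
        # Read every cached clip and splice them into one longer segment.
        spliced = [sample for cached in cache for sample in cached]
        yield predict(spliced)


if __name__ == "__main__":
    # Toy stand-in for the network: report spliced length and sample sum.
    def toy(spliced: Clip) -> List[float]:
        return [float(len(spliced)), sum(spliced)]

    pre = [[0.5, 0.5], [0.5, 0.5]]
    live = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]
    for params in drive(pre, live, toy, queue_len=3):
        print(params)
```

The design point this illustrates is that each newly captured clip can stay short, which keeps capture latency low, while the input to the predictor still spans several clips and so retains some temporal context.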

Description

Technical field

[0001] The present application belongs to the technical field of information security, and in particular relates to a method, system and storage medium for real-time voice driving of a virtual human.

Background technique

[0002] Voice-driven virtual human technology is a kind of virtual human-driven technology, which uses voice to drive a preset virtual human model to generate a dynamic virtual human image that conforms to the voice content. In recent years, with the development and maturity of voice-driven virtual human technology, voice-driven virtual human technology has derived quite a lot of applications, such as virtual anchors, virtual customer service and virtual idols. Since the avatar often needs to be presented directly to the user, the user has high requirements for the authenticity and accuracy of the voice-driven results.

[0003] However, the existing voice-driven virtual human technology cannot be well extended to real-time applications such...

Application Information

IPC(8): G06T13/20, G06T17/00, G06N3/04, G06N3/08
CPC: G06T13/205, G06T17/00, G06N3/08, G06N3/045
Inventor: 徐迪, 马宜祯, 张彦博, 常友坚, 毛文涛, 蔡宝军
Owner 北京影创信息科技有限公司