Method, system and storage medium for driving virtual human by voice in real time

A voice-driven virtual human technology, applied in the field of information security, that addresses problems such as the lack of timing information, failure to meet real-time requirements, and reduced accuracy of the virtual human's facial movements.

Active Publication Date: 2022-03-04

北京影创信息科技有限公司

Cites: 5. Cited by: 0.

Problems solved by technology

[0003] However, existing voice-driven virtual human technology does not extend well to real-time applications such as virtual meetings. On the one hand, voice acquisition, neural network computing speed, and audio and video delays keep it from meeting real-time requirements. On the other hand, voice-driven virtual human technology in real-time scenarios requires the input voice segment to be as short as possible in order to meet those requirements.

Short voice segments lack timing information and the necessary semantic information. As a result, during voice feature calculation the accuracy of the avatar's facial movements is easily reduced during driving, which in turn reduces the sense of realism.

Embodiment Construction

[0053] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

[0054] Figure 1 is a flow chart of a method for driving a virtual human by voice in real time provided by an embodiment of the present application. As shown in Figure 1, the method includes the following steps:

[0055] S1. Acquire the RGB image of the human face.

[0056] Specific...

Abstract

The present application provides a method, system and storage medium for driving a virtual human by voice in real time. The method includes: obtaining an RGB image of a human face; performing 3D face reconstruction on the RGB image to obtain the corresponding 3D face parameters; pre-acquiring a segment of voice, denoising it, and saving it in a cache queue; collecting voice clips in real time, denoising them, and saving them in the cache queue; reading all voice clips in the cache queue and splicing them to obtain a spliced voice clip, then using the spliced voice clip and a pre-trained neural network to obtain predicted 3D facial expression parameters; and obtaining a rendered RGB image from the predicted 3D facial expression parameters and the 3D face parameters. The application achieves real-time performance across the entire driving process without reducing the quality of the virtual human, so that voice-driven virtual human technology can be used in various real-time applications.
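The cache-queue steps in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `VoiceCache`, `denoise`, and `predict_expression` are hypothetical, the noise gate stands in for whatever denoising the application uses, the bounded queue length (`max_clips`) is an assumption not stated in the abstract, and the "network" is a trivial placeholder rather than a pre-trained model.

```python
from collections import deque
from typing import Deque, List, Sequence

Clip = List[float]


def denoise(clip: Sequence[float], threshold: float = 0.01) -> Clip:
    """Placeholder denoiser (assumption): a simple noise gate that zeroes quiet samples."""
    return [s if abs(s) >= threshold else 0.0 for s in clip]


class VoiceCache:
    """Cache queue of denoised voice clips.

    Assumption: the queue is bounded, so the oldest clip is dropped once
    max_clips is reached. The abstract does not specify an eviction policy.
    """

    def __init__(self, max_clips: int) -> None:
        self._queue: Deque[Clip] = deque(maxlen=max_clips)

    def push(self, clip: Sequence[float]) -> None:
        # Denoise first, then store, matching the order given in the abstract.
        self._queue.append(denoise(clip))

    def spliced(self) -> Clip:
        # Read every cached clip and concatenate them in arrival order.
        out: Clip = []
        for clip in self._queue:
            out.extend(clip)
        return out


def predict_expression(spliced: Sequence[float]) -> List[float]:
    """Stand-in for the pre-trained network (assumption): mean absolute amplitude."""
    if not spliced:
        return [0.0]
    return [sum(abs(s) for s in spliced) / len(spliced)]


cache = VoiceCache(max_clips=3)
cache.push([0.5, 0.005, -0.5])            # pre-acquired voice segment
for live_clip in ([0.2, 0.2], [0.0, -0.4]):
    cache.push(live_clip)                 # clip collected in real time
    params = predict_expression(cache.spliced())
```

The point of the sketch is that each short live clip is spliced with earlier audio before prediction, so the predictor sees more context than a single short clip provides while each new clip can still be kept short.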

Description

Technical field

[0001] The present application belongs to the technical field of information security, and in particular relates to a method, system and storage medium for driving a virtual human by voice in real time.

Background

[0002] Voice-driven virtual human technology is a type of virtual human driving technology that uses voice to drive a preset virtual human model, generating a dynamic virtual human image that matches the voice content. In recent years, as the technology has developed and matured, it has given rise to many applications, such as virtual anchors, virtual customer service and virtual idols. Because the avatar is often presented directly to the user, users place high demands on the authenticity and accuracy of the voice-driven results.

[0003] However, the existing voice-driven virtual human technology cannot be well extended to real-time applications such...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G06T13/20, G06T17/00, G06N3/04, G06N3/08

CPC: G06T13/205, G06T17/00, G06N3/08, G06N3/045

Inventor: 徐迪, 马宜祯, 张彦博, 常友坚, 毛文涛, 蔡宝军

Owner: 北京影创信息科技有限公司