A combined modeling method of speech recognition and synthesis for edge devices

A speech recognition and edge device technology, applied in the field of joint speech recognition-synthesis modeling. It addresses problems such as insufficient model performance, flaws in the operating process, and poor input data characteristics, with the stated effects of rich creativity, robust performance, and rich functionality.

Active Publication Date: 2022-07-01

NORTHWEST UNIV

4 Cites, 0 Cited by


AI Technical Summary

Problems solved by technology

The problems arise mostly from poor input data characteristics, insufficient performance of the model itself, and flaws in the operating process.



Embodiment Construction

[0029] The present invention is further described below with reference to the accompanying drawings and embodiments, but it is not limited to the following embodiments.

[0030] As shown in Figures 1, 2, and 3, an edge device-oriented speech recognition-synthesis joint modeling method includes the following steps:

[0031] 1) Collect dataset samples. These are divided into (a) clean audio recorded in a quiet environment and (b) different types of noise audio (specifically white noise, pink noise, speech babble, etc.; refer to the noise library for classification). All audio data is sampled at 16 kHz and stored in PCM format, covering six dialects: Shaanxi, Minnan, Changsha, Sichuan, Hebei, and Shanghai;

[0032] 2) Carry out data processing. First, perform noise fusion: add noise to the clean audio, then package and assemble the results into pairs of clean audio and the corresponding noise-added audio;

[0033] 3) Build an edge server and perform audio front-end processing on this layer of equipment for de-rever...
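Steps 1) and 2) above can be illustrated with a minimal sketch of decoding 16 kHz PCM audio and fusing noise into a clean clip to form a (clean, noisy) pair. The source does not say how the noise level is chosen, so mixing at a target signal-to-noise ratio (SNR) is an assumption here, as are the 16-bit little-endian mono sample format and the function names `pcm16_to_float`, `mix_at_snr`, and `make_pair`.

```python
import numpy as np

SAMPLE_RATE = 16_000  # from the source: all audio is sampled at 16 kHz


def pcm16_to_float(raw: bytes) -> np.ndarray:
    """Decode headerless PCM bytes into floats in [-1, 1).

    Assumption: 16-bit signed little-endian mono samples.
    """
    samples = np.frombuffer(raw, dtype="<i2")
    return samples.astype(np.float32) / 32768.0


def mix_at_snr(clean: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    """Add noise to clean audio so the mixture has the requested SNR in dB.

    Assumption: SNR-based mixing is a common choice, not the patent's stated one.
    """
    if len(clean) == 0 or len(noise) == 0:
        raise ValueError("clean and noise must be non-empty")
    # Loop the noise clip if it is shorter than the clean clip, then trim.
    reps = int(np.ceil(len(clean) / len(noise)))
    noise = np.tile(noise, reps)[: len(clean)]
    clean_power = float(np.mean(clean ** 2))
    noise_power = float(np.mean(noise ** 2))
    if clean_power == 0.0 or noise_power == 0.0:
        raise ValueError("clean and noise must not be silent")
    # Scale the noise so that 10*log10(clean_power / scaled_noise_power) == snr_db.
    scale = np.sqrt(clean_power / (noise_power * 10.0 ** (snr_db / 10.0)))
    return clean + scale * noise


def make_pair(clean: np.ndarray, noise: np.ndarray, snr_db: float):
    """Return the (clean, noise-added) pair used as one training sample."""
    return clean, mix_at_snr(clean, noise, snr_db)
```

For example, `make_pair(clean, white_noise, 10.0)` would return the clean clip together with a copy degraded to 10 dB SNR, where `white_noise` could be drawn from `np.random.default_rng().standard_normal(...)`. Pink noise and speech babble would be loaded from recorded clips in the same way as the clean audio.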


Abstract

A joint speech recognition-synthesis modeling method for edge devices. Drawing on research into real-time computing, the task distribution of edge computing strategies, and the inspiration of the entertainment game "copy without aliasing", the back end integrates speech recognition and speech synthesis technology into a model iteration method. Based on speech enhancement from the field of audio processing, a real-time, efficient processing module is constructed, and an iterative speech recognition and synthesis model for Chinese dialects is built on speech recognition and speech synthesis technology. The resulting real-time dialect processing model makes effective use of the richer processing capabilities of the edge environment and combines speech recognition with speech synthesis to yield a speech model with richer functions and more robust performance.
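One way to picture the iteration idea in the abstract is a round trip in which text is synthesized to audio, recognized back to text, and fed into the next round, with drift from the original measured each time. This is only an illustrative reading: the abstract does not specify the loop, and the names `round_trip`, `edit_distance`, and `char_error_rate`, as well as the `synthesize` and `recognize` callables, are hypothetical placeholders for real speech synthesis and recognition models.

```python
from typing import Callable, List


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings (character level)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def char_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalized by the reference length."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def round_trip(text: str,
               synthesize: Callable[[str], bytes],
               recognize: Callable[[bytes], str],
               rounds: int) -> List[str]:
    """Run text -> audio -> text repeatedly and return each round's transcript.

    Assumption: `synthesize` and `recognize` stand in for the speech synthesis
    and speech recognition models; they are not defined by the source.
    """
    history = []
    current = text
    for _ in range(rounds):
        audio = synthesize(current)
        current = recognize(audio)
        history.append(current)
    return history
```

Comparing each entry of the returned history against the original text with `char_error_rate` would show how much the content changes per round, which is the quantity a "copy without aliasing" style evaluation would try to keep low.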

Description

Technical field

[0001] The invention belongs to the technical fields of edge computing and audio research. It relates to edge servers, speech enhancement, speech recognition, speech synthesis, and neural networks, and in particular to an edge device-oriented speech recognition-synthesis joint modeling method.

Background technique

[0002] Since Industry 4.0, the rapid rise of artificial intelligence and the Internet of Things (IoT) has offered great potential for making daily life (clothing, food, housing, and transportation) more convenient, and many smart products have emerged accordingly. Meanwhile, with the development of edge computing in recent years, edge computing strategies can effectively distribute large-scale tasks, address real-time requirements, and improve a model's computing capability. This opens up broad possibilities for continuously strengthening and expanding the functions of smart products. ...


Application Information


Patent Type & Authority: Patents (China)

IPC(8): G10L13/02, G10L13/04, G10L15/02, G10L15/06, G10L15/26, G10L25/24, G10L25/30

CPC: G10L13/02, G10L13/04, G10L15/02, G10L15/063, G10L15/26, G10L25/24, G10L25/30

Inventor: 王海, 秦晨光, 张晓, 刘艺, 赵子鑫, 高岭, 任杰, 郑杰

Owner: NORTHWEST UNIV
