Voice segmentation model training method and device and electronic equipment

A segmentation model and training method, applied in speech analysis, speech recognition, instruments, etc., that addresses the low accuracy of speech segmentation caused by complex speech signal features.

Active Publication Date: 2020-06-19

SOUNDAI TECH CO LTD

9 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, the characteristics of speech signals are relatively complex, so the accuracy of speech segmentation is still relatively low, which has become an urgent problem to be solved.

Embodiment Construction

[0097] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0098] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders and/or in parallel. Additionally, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this regard. ...

Abstract

The embodiments of the present disclosure disclose a voice segmentation model training method and device, electronic equipment and a computer-readable storage medium. The training method of the voice segmentation model comprises the following steps: acquiring an original voice graph of a sample voice file; acquiring annotation information in the original voice graph; initializing model parameters; inputting the original voice graph into the voice segmentation model to obtain prediction information of a target voice, wherein the prediction information of the target voice is obtained from a plurality of feature graphs of different scales output by the voice segmentation model; calculating an error between the prediction information and the annotation information according to a target function, and updating the parameters of the voice segmentation model; and inputting the original voice graph into the voice segmentation model after the parameter update so as to iterate the parameter updating process. By training the voice segmentation model on the original voice graph, the method solves the technical problem in the prior art of inaccurate voice segmentation caused by complex voice signals.
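
The training steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented model: it assumes the voice graph is a frames-by-bins list of numbers, uses average pooling over time as a stand-in for the feature graphs of different scales, uses one linear layer per scale, takes binary cross-entropy as the target function, and updates parameters by plain gradient descent. All names (`SCALES`, `avg_pool_time`, `toy_sample`, `train`, and so on) are hypothetical.

```python
import math
import random

SCALES = (1, 2, 4)  # assumed time-pooling factors; the source does not specify them


def avg_pool_time(graph, factor):
    """Downsample a frames-by-bins graph along time by averaging groups of frames."""
    pooled = []
    for i in range(0, len(graph), factor):
        group = graph[i:i + factor]
        pooled.append([sum(col) / len(group) for col in zip(*group)])
    return pooled


def init_params(num_bins, seed=0):
    """Initialize one small random linear layer per scale."""
    rng = random.Random(seed)
    return [{"w": [rng.uniform(-0.1, 0.1) for _ in range(num_bins)], "b": 0.0}
            for _ in SCALES]


def predict(params, graph):
    """Return per-frame target-voice probabilities fused across all scales."""
    feature_graphs = [avg_pool_time(graph, s) for s in SCALES]
    probs = []
    for t in range(len(graph)):
        logit = 0.0
        for p, s, fg in zip(params, SCALES, feature_graphs):
            x = fg[t // s]  # coarse feature covering frame t at this scale
            logit += sum(wi * xi for wi, xi in zip(p["w"], x)) + p["b"]
        probs.append(1.0 / (1.0 + math.exp(-logit / len(SCALES))))
    return probs, feature_graphs


def loss_fn(probs, labels):
    """Binary cross-entropy between predictions and annotations (assumed target function)."""
    eps = 1e-9
    total = sum(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
                for p, y in zip(probs, labels))
    return -total / len(labels)


def train(graph, labels, steps=200, lr=0.5):
    """Iterate: forward pass, error computation, parameter update."""
    params = init_params(len(graph[0]))
    history = []
    for _ in range(steps):
        probs, feature_graphs = predict(params, graph)
        history.append(loss_fn(probs, labels))
        for p, s, fg in zip(params, SCALES, feature_graphs):
            grad_w = [0.0] * len(p["w"])
            grad_b = 0.0
            for t, (prob, y) in enumerate(zip(probs, labels)):
                g = (prob - y) / (len(SCALES) * len(labels))  # dLoss/dlogit for this scale
                for i, xi in enumerate(fg[t // s]):
                    grad_w[i] += g * xi
                grad_b += g
            p["w"] = [wi - lr * gi for wi, gi in zip(p["w"], grad_w)]
            p["b"] -= lr * grad_b
    return params, history


def toy_sample():
    """Synthetic 16-frame, 3-bin graph; frames 4..11 are annotated as target voice."""
    labels = [1 if 4 <= t < 12 else 0 for t in range(16)]
    graph = [[1.0 if y else 0.1, 0.8 if y else 0.2, 0.5] for y in labels]
    return graph, labels


if __name__ == "__main__":
    graph, labels = toy_sample()
    params, history = train(graph, labels)
    print("first loss:", history[0], "last loss:", history[-1])
```

Averaging the per-scale logits before the sigmoid is only one possible way to fuse the scales; the abstract says that the prediction is obtained from several feature graphs but does not say how they are combined.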

Description

Technical field

[0001] The present disclosure relates to the field of speech segmentation, and in particular to a training method, device, electronic equipment and computer-readable storage medium for a speech segmentation model.

Background technique

[0002] As a means of human-computer interaction, speech recognition technology is of great significance in freeing human hands. With the emergence of various smart speakers, voice interaction has become a new and valuable Internet portal. More and more smart devices have joined the trend of voice recognition and become a bridge for communication between people and devices. Voice segmentation technology is a branch of speech recognition technology that divides a piece of voice into different categories by time period. Segmenting the voices of speakers who do not talk simultaneously in a piece of voice, voice endpoint detection, wake-up word alignment, etc., all belong to the category of speech segmentati...
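
Dividing a piece of voice into categories by time period can be pictured as merging per-frame labels into labeled time segments. The sketch below is an illustration under assumptions and is not taken from the patent: the function name `frames_to_segments`, the label strings and the 10 ms default frame shift are all hypothetical.

```python
def frames_to_segments(frame_labels, frame_shift_s=0.01):
    """Merge runs of identical per-frame labels into (start_s, end_s, label) tuples."""
    segments = []
    start = 0
    for i in range(1, len(frame_labels) + 1):
        # Close the current run at the end of input or when the label changes.
        if i == len(frame_labels) or frame_labels[i] != frame_labels[start]:
            segments.append((start * frame_shift_s, i * frame_shift_s, frame_labels[start]))
            start = i
    return segments


if __name__ == "__main__":
    labels = ["silence", "silence", "speech", "speech", "speech", "silence"]
    for segment in frames_to_segments(labels, frame_shift_s=1.0):
        print(segment)
```

In a voice endpoint detection setting, the boundaries of the resulting "speech" segments would serve as the detected endpoints.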

Application Information

IPC(8): G10L15/06, G10L15/04

CPC: G10L15/063, G10L15/04, Y02T10/40

Inventor: 王超, 冯大航, 陈孝良

Owner: SOUNDAI TECH CO LTD
