Voice segmentation model training method and device and electronic equipment

A training method for a speech segmentation model, applied in speech analysis, speech recognition, instruments, etc., which addresses the problems of low speech segmentation accuracy and complex speech signal features.

Active Publication Date: 2020-06-19
SOUNDAI TECH CO LTD
9 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] However, the characteristics of speech signals are relatively complex, so the accuracy of speech segmentation is still relatively low, which has become an urgent problem to be solved.



Examples


Embodiment Construction

[0097] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0098] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders and/or in parallel. Additionally, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this regard. ...


Abstract

The embodiments of the present disclosure provide a training method and device for a voice segmentation model, electronic equipment, and a computer-readable storage medium. The training method comprises the following steps: acquiring an original voice graph of a sample voice file; acquiring annotation information in the original voice graph; initializing model parameters; inputting the original voice graph into the voice segmentation model to obtain prediction information of a target voice, wherein the prediction information is obtained from a plurality of feature maps of different scales output by the voice segmentation model; calculating an error between the prediction information and the annotation information according to an objective function, and updating the parameters of the voice segmentation model; and inputting the original voice graph into the voice segmentation model with the updated parameters, so as to iterate the parameter updating process. Because the voice segmentation model is trained on the original voice graph, the method addresses the technical problem in the prior art of inaccurate voice segmentation caused by complex voice signals.
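The training steps listed in the abstract can be illustrated with a toy loop. This is only a minimal sketch under stated assumptions, not the patented model: the "voice graph" is assumed to be a small frequency-by-time grid, the annotation is assumed to be one 0/1 label per frame, the multi-scale feature maps are replaced by simple time-averaged energies at window sizes 1, 2 and 4, and the objective function is assumed to be binary cross-entropy. All function names are hypothetical.

```python
import math
import random

def multi_scale_features(graph, scales=(1, 2, 4)):
    """Per-frame features at several time scales (hypothetical stand-in
    for the multi-scale feature maps output by the model)."""
    n_frames = len(graph[0])
    # Mean energy of each frame across all frequency bins.
    energy = [sum(row[t] for row in graph) / len(graph) for t in range(n_frames)]
    feats = []
    for t in range(n_frames):
        frame = []
        for s in scales:
            start = (t // s) * s              # window of size s containing frame t
            window = energy[start:start + s]
            frame.append(sum(window) / len(window))
        feats.append(frame)
    return feats

def predict(params, feats):
    """Per-frame probability that the frame belongs to the target voice."""
    w, b = params
    return [1.0 / (1.0 + math.exp(-(sum(wi * xi for wi, xi in zip(w, x)) + b)))
            for x in feats]

def objective(preds, labels, eps=1e-9):
    """Assumed objective: mean binary cross-entropy against the annotation."""
    return -sum(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
                for p, y in zip(preds, labels)) / len(labels)

def train(graph, labels, n_iters=200, lr=0.5, seed=0):
    rng = random.Random(seed)
    feats = multi_scale_features(graph)       # fixed here; learned in a real model
    w = [rng.uniform(-0.1, 0.1) for _ in feats[0]]   # initialize model parameters
    b = 0.0
    losses = []
    for _ in range(n_iters):
        preds = predict((w, b), feats)         # prediction information
        losses.append(objective(preds, labels))  # error vs. annotation
        grad_w, grad_b = [0.0] * len(w), 0.0
        for x, p, y in zip(feats, preds, labels):
            d = (p - y) / len(labels)          # gradient of the loss w.r.t. the logit
            for i, xi in enumerate(x):
                grad_w[i] += d * xi
            grad_b += d
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]  # update parameters
        b -= lr * grad_b
    return (w, b), losses

# Toy voice graph: 2 frequency bins x 8 frames, silence then target voice.
graph = [[0.0] * 4 + [1.0] * 4, [0.0] * 4 + [1.0] * 4]
labels = [0, 0, 0, 0, 1, 1, 1, 1]
params, losses = train(graph, labels)
```

In this sketch the multi-scale features are computed once because they have no trainable parameters; in a learned model the feature maps would be recomputed on every pass through the updated network.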

Description

Technical field

[0001] The present disclosure relates to the field of speech segmentation, and in particular to a training method, device, electronic equipment and computer-readable storage medium for a speech segmentation model.

Background technique

[0002] As a means of human-computer interaction, speech recognition technology is of great significance in freeing human hands. With the emergence of various smart speakers, voice interaction has become a valuable new Internet portal. More and more smart devices have joined the voice recognition trend and become a bridge for communication between people and devices. Voice segmentation technology is a branch of speech recognition technology that divides a piece of voice into different categories according to time periods; for example, segmenting the voices of non-simultaneous speakers in a piece of voice, voice endpoint detection, and wake-up word alignment all belong to the category of speech segmentati...
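To make "dividing a piece of voice into different categories according to time periods" concrete, the sketch below groups per-frame category labels into timed segments. It is an illustrative assumption rather than anything described in the disclosure: the per-frame labels, the `frame_seconds` parameter and the function name `frames_to_segments` are all hypothetical.

```python
from itertools import groupby

def frames_to_segments(frame_labels, frame_seconds=0.01):
    """Merge runs of identical per-frame labels into (start, end, label) segments.

    frame_labels: one category per frame, e.g. "sil", "A", "B" (assumed input).
    frame_seconds: assumed duration of one frame in seconds.
    """
    segments = []
    start = 0
    for label, run in groupby(frame_labels):
        length = len(list(run))               # number of consecutive frames with this label
        segments.append((start * frame_seconds,
                         (start + length) * frame_seconds,
                         label))
        start += length
    return segments

# Two non-simultaneous speakers ("A", "B") after leading silence ("sil").
segments = frames_to_segments(["sil", "sil", "A", "A", "A", "B"], frame_seconds=0.5)
```

The same representation covers endpoint detection (speech versus non-speech labels) and wake-word alignment (wake-word versus other labels), since both reduce to finding the boundaries of label runs.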


Application Information

IPC(8): G10L15/06, G10L15/04
CPC: G10L15/063, G10L15/04, Y02T10/40
Inventor: 王超, 冯大航, 陈孝良
Owner: SOUNDAI TECH CO LTD