Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice segmentation model training method and device and electronic equipment

A technology of segmentation model and training method, which is applied in speech analysis, speech recognition, instruments, etc., and can solve the problems of complex speech signal features and low speech segmentation accuracy targets.

Active Publication Date: 2020-06-19
SOUNDAI TECH CO LTD
View PDF6 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the characteristics of speech signals are relatively complex, so the accuracy target of speech segmentation is still relatively low, which has become an urgent problem to be solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice segmentation model training method and device and electronic equipment
  • Voice segmentation model training method and device and electronic equipment
  • Voice segmentation model training method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0084] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although certain embodiments of the present disclosure are shown in the drawings, it should be understood that the disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein; A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are for exemplary purposes only, and are not intended to limit the protection scope of the present disclosure.

[0085] It should be understood that the various steps described in the method implementations of the present disclosure may be executed in different orders, and / or executed in parallel. Additionally, method embodiments may include additional steps and / or omit performing illustrated steps. The scope of the present disclosure is not limited in this respect. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the disclosure discloses a voice segmentation model training method and device, electronic equipment and a computer readable storage medium. The training method of the voice segmentation model comprises the following steps: acquiring a voice feature map of a sample voice file; acquiring annotation information of a target voice in the voice feature map; initializing model parameters of a voice segmentation model; inputting the voice feature map into the voice segmentation model to obtain prediction information of a target voice output by the voice segmentation model; calculating an error between the prediction information and the annotation information according to a target function; updating parameters of the voice segmentation model according to the error; and inputtingthe voice feature map into the voice segmentation model after parameter updating to iterate the parameter updating process until the error is less than a first threshold. According to the method, thespeech segmentation model is trained through the speech feature image, and a technical problem of inaccurate speech segmentation caused by complex speech signals in the prior art is solved.

Description

technical field [0001] The present disclosure relates to the field of speech segmentation, and in particular to a training method, device, electronic equipment and computer-readable storage medium of a speech segmentation model. Background technique [0002] As a means of human-computer interaction, speech recognition technology is of great significance in liberating human hands. With the emergence of various smart speakers, voice interaction has become the new value of the Internet portal. More and more smart devices have joined the trend of voice recognition and become a bridge for communication between people and devices. Voice segmentation technology is a branch of speech recognition technology, which is used to divide a piece of voice into different categories according to time periods, such as segmenting the voices of non-simultaneous speakers in a piece of voice, voice endpoint detection and wake-up word alignment etc., all belong to the category of speech segmentati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/04
CPCG10L15/063G10L15/04Y02T10/40
Inventor 王超陈孝良冯大航
Owner SOUNDAI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products