Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice data processing method and device, equipment and storage medium

A technology of voice data and processing method, applied in the field of data processing, can solve the problems of complex expansion process and high expansion cost

Pending Publication Date: 2021-09-03
北京巅峰科技有限公司
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the embodiments of this specification is to provide a voice data processing method, device, equipment and storage medium to solve the problems of complicated expansion process and high expansion cost when expanding the data volume of the training samples of the voice recognition model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice data processing method and device, equipment and storage medium
  • Voice data processing method and device, equipment and storage medium
  • Voice data processing method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make those skilled in the art better understand the technical solutions in one or more embodiments of this specification, the following will describe the technical solutions in one or more embodiments of this specification with reference to the accompanying drawings in one or more embodiments of this specification. For a clear and complete description, it is obvious that the described embodiments are only one or more part of the embodiments of this specification, but not all of the embodiments. Based on one or more embodiments in this specification, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this document.

[0029] It should be noted that, if there is no conflict, one or more embodiments in this specification and the features of the embodiments may be combined with each other. One or more embodiments of the present specification will be described in detail below with re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

One or more embodiments of the invention provide a voice data processing method and device, equipment and a storage medium. The method comprises the following steps: acquiring to-be-processed voice data; randomly selecting a target voice processing operation from preset voice data processing operations, wherein each preset voice data processing operation comprises time domain masking, frequency domain masking, pitch conversion, volume conversion and audio noise addition; acquiring a value range corresponding to a voice processing parameter of the target voice processing operation, and randomly selecting a parameter value of the voice processing parameter of the target voice processing operation in the value range; and processing the to-be-processed voice data by using the target voice processing operation based on the parameter value. Through the embodiment of the invention, the problems of a complex expansion process and high expansion cost when the data volume of a training sample of a speech recognition model is expanded at present can be solved.

Description

technical field [0001] This document relates to the technical field of data processing, in particular to a voice data processing method, apparatus, device and storage medium. Background technique [0002] Speech recognition technology is an important research direction in the field of artificial intelligence. Speech recognition technology mainly converts speech into text through various speech recognition models such as ASR (Automatic Speech Recognition) models. No matter which speech recognition model is aimed at, the data volume of training samples is always the basis of model training. The larger the data volume of the training samples, the more accurate the speech recognition model obtained by training. In the prior art, the data volume of the training samples is expanded through a model. For example, a neural network model is trained, the speech data to be processed is processed through the neural network model, and the processed speech data is used as a training sam...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/26
CPCG10L15/02G10L15/06G10L15/26
Inventor 王亚东
Owner 北京巅峰科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products