Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice extraction method for target speaker, device and equipment and medium

A speech extraction and speaker technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as high cost and one-sidedness of business quality evaluation

Active Publication Date: 2021-04-27
PING AN BANK CO LTD
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The main purpose of this application is to provide a voice extraction method, device, equipment and medium for the target speaker, aiming to solve the problem that the service industry in the prior art conducts business quality assessment through manual sampling and unannounced visits, resulting in high costs and one-sided technical issues in the obtained business quality assessment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice extraction method for target speaker, device and equipment and medium
  • Voice extraction method for target speaker, device and equipment and medium
  • Voice extraction method for target speaker, device and equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0066] In order to solve the technical problems of high cost and one-sidedness of the service quality assessment obtained by the service industry in the prior art through manual sampling and unannounced visits, this application proposes a speech extraction method for the target speaker The method is applied in the technical field of artificial intelligence. The speech extraction method for the target speaker is by segmenting the speech of the target speaker in different directions, and then inputting the single speaker speech extraction model obtained based on TasNet netwo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of artificial intelligence, and discloses a voice extraction method for a target speaker, a device and equipment, and a medium, and the method comprises the steps: determining a plurality of first to-be-extracted voice data segments through employing a preset segmentation method according to first to-be-processed voice data in a first direction; performing segmentation extraction on second to-be-processed voice data in a second direction according to the plurality of first to-be-extracted voice data segments to obtain a plurality of second to-be-extracted voice data segments; performing data extraction on the plurality of first to-be-extracted voice data segments and the plurality of second to-be-extracted voice data segments at the same time to obtain a plurality of to-be-extracted voice data segment pairs; and inputting each to-be-extracted voice data segment pair into the single speaker voice extraction model for voice extraction to obtain a plurality of target speaker voice data segments, and splicing the target speaker voice data segments according to a time sequence to obtain target voice data of the target speaker. Therefore, the cost of service quality evaluation is reduced, and the comprehensiveness of service quality evaluation is improved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to a speech extraction method, device, equipment and medium for a target speaker. Background technique [0002] At present, the professional quality of service personnel is uneven, and there are problems of non-standard service speech skills and unfriendly attitudes. In order to improve the service quality of service personnel, manual sampling and unannounced visits are used to conduct business quality assessment, which consumes a lot of manpower and financial resources, resulting in high costs; Quality assessment is one-sided. Contents of the invention [0003] The main purpose of this application is to provide a voice extraction method, device, equipment and medium for the target speaker, aiming to solve the problem that the service industry in the prior art conducts business quality assessment through manual sampling and unannounced spot checks, r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/04G10L15/06G10L15/20G10L15/22
CPCG10L15/04G10L15/02G10L15/063G10L15/20G10L15/22
Inventor 张舒婷赖众程杨念慈何利斌李会璟王小红刘彦国
Owner PING AN BANK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products