Voice sensitive information detecting and filtering method based on unspecified people

A sensitive information, non-person-specific technology, applied in the field of multimedia content security, can solve problems such as missed detection rate, high false detection rate, large limitations, and difficulty, to ensure accuracy, ensure real-time performance, and improve accuracy rate Effect

Inactive Publication Date: 2015-10-28
HEFEI UNIV OF TECH
View PDF5 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although this method uses automatic computer processing to avoid waste of human resources, the speech recognition process is slow, and it is difficult to apply to real-time interactive voice programs such as TV and radio, and voice chat rooms that require high real-time performance.
[0004] To sum up, in the existing technology, non-specific person-oriented speech sensitive information detection and filtering methods have great limitations, high missed detection rate and false detection rate, and it is difficult to meet real-time requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice sensitive information detecting and filtering method based on unspecified people
  • Voice sensitive information detecting and filtering method based on unspecified people
  • Voice sensitive information detecting and filtering method based on unspecified people

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The invention constructs a sensitive word feature template database, and realizes detection and filtering of sensitive words in real-time voice or voice files based on the sensitive word feature template database.

[0030] see figure 1 , is a schematic flowchart of a method for detecting and filtering sensitive words in the present invention. The method constructs a sensitive word feature template database through a feature template training module; then realizes detection and filtering of sensitive words in real-time voice and voice files through a detection and filtering module.

[0031] Sensitive words in the present invention may include uncivilized words such as swear words, secret-related words related to national security, and the like.

[0032] figure 1 The process includes the following steps:

[0033] Step 101, receiving voice input of sensitive words, and performing endpoint detection on them. According to the statistical characteristics of speech, speech...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice sensitive information detecting and filtering method based on unspecified people. The method is capable of detecting and filtering real-time voices and voice files. The method includes the steps: carrying out end point detection for an original voice through an improved double threshold end point detection algorithm, extracting the Mel frequency inverted spectral coefficient characteristic of the voice, training an appropriate voice characteristic template through a self-learning time warping algorithm, and storing the template in a database; and then performing end point detection for the original voice through the improved double threshold end point detection algorithm, extracting the MFCC characteristic, comparing the extracted voice characteristic with the template in the sensitive word characteristic template database by combining rough matching with fine matching, detecting sensitive words input to the voice, and filtering the detected sensitive words.

Description

technical field [0001] The invention relates to multimedia content security technology, in particular to a non-specific person-oriented voice sensitive information detection and filtering method. Background technique [0002] With the development of telecommunications network technology, voice applications such as telephone voice and network audio are becoming more and more mature. However, voice communication containing sensitive information and illegal information is not conducive to social harmony and stability and the long-term stability of the country. How to detect sensitive information from massive speech data has become an urgent problem to be solved. [0003] The traditional method is to use artificial listening, which is only suitable for processing a small amount of speech, but it is inefficient and often consumes huge manpower and material resources when performing manual detection of massive speech information, but it is difficult to achieve satisfactory detect...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/18G10L15/06G10L25/54
Inventor 苏兆品张国富岳峰齐美彬蒋建国胡东辉
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products