Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for searching multimedia resource based on audio content retrieval

A multimedia resource, audio technology, applied in speech analysis, speech recognition, special data processing applications, etc., can solve the limitations of robustness and practicability, keyword recognizers can not achieve ideal results, can not be well satisfied Voice retrieval application requirements and other issues, to achieve the effect of accurate matching, comprehensive and reliable indexing

Inactive Publication Date: 2008-10-08
无锡微著网络有限公司
View PDF0 Cites 67 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Keyword recognition technology based on Hidden Markov Model is a very important aspect of speech retrieval. It occupies an important position in the specific content retrieval of speech. Due to the limitations of robustness and practicability of speech recognition technology, Using continuous speech recognition to build a large vocabulary, the recognizer of arbitrary keywords cannot achieve ideal results, and cannot well meet the application requirements of speech retrieval.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for searching multimedia resource based on audio content retrieval
  • Method for searching multimedia resource based on audio content retrieval
  • Method for searching multimedia resource based on audio content retrieval

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The multimedia resource retrieval method based on audio content retrieval comprises the following steps:

[0023] 1) The pre-processing server converts video and audio into standard speech to be recognized; as figure 1 As shown, the video material 1-1 and the voice material 1-2 are input to the preprocessing server S1, and the standard corpus 1-3 to be recognized is obtained through preprocessing.

[0024] 2) The speech recognition server trains the training corpus into an acoustic model, and matches the speech to be recognized with the acoustic model to obtain a semantic text index; figure 1 As shown, the training corpus 1-4 is input to the speech recognition server S2, the acoustic model is obtained through training, and stored in S2, the corpus 1-3 to be recognized and the acoustic model are input to the speech recognition server S2 together, and the corpus 1-3 to be recognized is obtained by matching. Semantic text indexing information in 3 1-5.

[0025] 3) The in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multimedia resource retrieval method based on audio content retrieval, comprising the following steps: 1) a pretreatment server converts video and audio into standard voice to be recognized; 2) a voice recognition server trains training corpus into an acoustical model, and matches the voice to be recognized with the acoustical model to obtain semantic text indexes; (3) an index server stores and recognizes keyword indexes, and matches the indexes with a retrieval condition to get a retrieval result. By using keyword retrieval technology in audio, the internal semantic information of audio / video resource can be obtained, the text type semantic information is indexed, therefore, the method provides more comprehensive and reliable audio / video resource information indexes, such that a retrieval system can get matched multimedia resource more accurately, and locate the precise position of retrieval keywords in audio / video.

Description

technical field [0001] The invention relates to a multimedia resource retrieval method based on audio content retrieval, in particular to a method for retrieving resources in the form of video and audio, finding the resource containing the retrieved information and giving the location of the retrieved information in the resource. Background technique [0002] In today's digital and network era, multimedia data has become the main part of the data transmitted on the Internet information highway. Multimedia content such as audio, images and video currently accounts for 15% of the Internet, and this number is still growing rapidly. The large-capacity and high-speed storage system provides a basic guarantee for the mass storage of audio and video, and the use of audio and video in various industries is becoming more and more extensive. How to obtain useful information from massive audio and video information, that is, the management and retrieval of audio and video information ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G10L15/08G10L15/14G10L15/02G10L15/06
Inventor 叶睿智
Owner 无锡微著网络有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products