
System and method for audio media recognition

A technology for automatic recognition of media content, applied in fields such as speech analysis and instruments, that addresses inefficient execution and aims for accuracy and scalability, reduced processor cost, and increased processing speed.

Active Publication Date: 2013-03-06
ADELPHOI
3 Cites, 12 Cited by

AI Technical Summary

Problems solved by technology

Standard neighbor-search algorithms do not perform very efficiently when the space being searched has a large number of dimensions.

Embodiment Construction

[0021] An example embodiment of the present invention provides an audio recognition system that processes incoming audio streams ("programs") and searches an internal database of music and sound effects ("tracks") to identify those tracks used within the program. One example of an output of an example embodiment may be in the form of a cue sheet listing a selection of audio tracks used and where they appear in the program.
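As an illustration of the cue sheet output described above, the following is a minimal sketch of how per-slice recognition results might be merged into cue sheet entries. The `Cue` type, the `build_cue_sheet` function, and the `slice_s` and `max_gap_s` parameters are hypothetical names introduced for this example; the source does not specify how the cue sheet is assembled.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One cue sheet entry: a track and where it appears in the program."""
    track_id: str
    start_s: float
    end_s: float


def build_cue_sheet(matches, slice_s=1.0, max_gap_s=2.0):
    """Merge per-slice matches into cue sheet entries.

    matches: iterable of (program_time_s, track_id), one per recognised slice.
    slice_s: assumed duration of one recognised slice, in seconds.
    max_gap_s: assumed largest gap that still counts as the same cue.
    """
    cues = []
    open_cues = {}  # track_id -> most recent Cue for that track
    for t, track_id in sorted(matches):
        cue = open_cues.get(track_id)
        if cue is not None and t - cue.end_s <= max_gap_s:
            # Close enough to the previous sighting: extend the existing cue.
            cue.end_s = max(cue.end_s, t + slice_s)
        else:
            # First sighting, or too long since the last one: start a new cue.
            cue = Cue(track_id, t, t + slice_s)
            cues.append(cue)
            open_cues[track_id] = cue
    return sorted(cues, key=lambda c: (c.start_s, c.track_id))
```

Calling `build_cue_sheet` with a list of `(time, track)` pairs returns entries ordered by start time, with overlapping tracks kept as separate entries.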

[0022] An example embodiment may work with a database of, for example, ten million seconds of music. Other embodiments can be extended to work with much larger databases, such as databases of billions of seconds of music, are able to identify clips of short duration, such as three seconds or less (for example, one second), and can process audio from a typical music station at about ten times real time on a conventional server computer.

[0023] The following are definitions of some of the terms used in this document:

[0024] An "audio track" is an...

Abstract

Automatic recognition of sample media content is provided. A spectrogram is generated for successive time slices of an audio signal. One or more sample hash vectors are generated for a time slice by calculating ratios of magnitudes of respective frequency bins from the spectrogram column for that time slice. In a primary evaluation stage, bits of the sample hash vector are matched exactly against entries in a look-up table to identify a group of one or more reference hash vectors. In a secondary evaluation stage, a degree of similarity between the sample hash vector and each reference hash vector in the group is determined to identify any reference hash vectors that are candidates for matching the sample media content, each reference hash vector representing a time slice of reference media content.
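The two-stage evaluation in the abstract can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the patented implementation: the choice of bin pairs, the use of log-ratios, the use of sign bits as the exact-match key, and Euclidean distance as the similarity measure are all assumptions made for the example, and every function name is hypothetical.

```python
import math
from collections import defaultdict


def hash_vector(column, pairs):
    """Build a hash vector from one spectrogram column.

    column: magnitudes of the frequency bins for one time slice.
    pairs: (i, j) bin index pairs whose magnitude ratio is taken
           (which bins to pair is an assumption of this sketch).
    """
    eps = 1e-12  # guards against division by zero and log(0)
    return [math.log((column[i] + eps) / (column[j] + eps)) for i, j in pairs]


def key_bits(vec):
    """Pack one bit per component (its sign) into an integer key.

    Assumption: the source only says that bits of the hash vector are
    matched exactly; it does not say which bits.
    """
    key = 0
    for v in vec:
        key = (key << 1) | (1 if v > 0 else 0)
    return key


def build_table(reference):
    """Look-up table from key to reference entries.

    reference: iterable of (track_id, slice_index, hash_vector).
    """
    table = defaultdict(list)
    for track_id, t, vec in reference:
        table[key_bits(vec)].append((track_id, t, vec))
    return table


def match(sample_vec, table, max_dist):
    """Primary stage: exact key look-up. Secondary stage: similarity check."""
    group = table.get(key_bits(sample_vec), [])  # primary evaluation
    candidates = []
    for track_id, t, ref_vec in group:           # secondary evaluation
        d = math.dist(sample_vec, ref_vec)       # Euclidean distance (assumed)
        if d <= max_dist:
            candidates.append((track_id, t, d))
    return sorted(candidates, key=lambda c: c[2])
```

Because the vector is built from ratios of magnitudes, scaling a whole column by a constant (a change in overall level) leaves the hash vector essentially unchanged. The exact look-up narrows the search to a small group cheaply, and only that group pays for the distance computation.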

Description

Technical field

[0001] The present invention relates to audio recognition systems and methods for automatic recognition of audio media content.

Background

[0002] Various audio recognition systems and methods are known for processing incoming audio streams ("programs") and searching internal databases of music and sound effects ("tracks") to identify those tracks used within the program.

[0003] In the real world, music is often only one layer of a program's audio. One of the challenges for audio recognition is to recognize the music even when other audio layers such as sound effects, voice-overs and ambience are present at the same time. Other distortions include equalization (adjusting the relative amounts of treble and bass in a track) and changes of tempo and/or pitch.

[0004] Some audio recognition techniques are based on directly performing a neighbor search on the computed hash values using standard algorithm...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G10L25/00
CPC: G10L25/51, G10L25/18, G10L25/00
Inventors: Alexander Paul Selby, Mark St John Owen
Owner: ADELPHOI