System and method for audio media recognition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for automatic recognition of media content, applied in speech analysis, instruments, etc., can solve problems such as inefficient execution, achieve accuracy and scalability, reduce processor cost, and increase processing speed

Active Publication Date: 2013-03-06

ADELPHOI

View PDF3 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Such standard algorithms do not perform very efficiently when the space being searched has a large number of dimensions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0021] An example embodiment of the present invention provides an audio recognition system that processes incoming audio streams ("programs") and searches an internal database of music and sound effects ("tracks") to identify those tracks used within the program. One example of an output of an example embodiment may be in the form of a cue sheet listing a selection of audio tracks used and where they appear in the program.

[0022] An example embodiment may work with a database of, for example, ten million seconds of music. Yet other embodiments can be extended to work with much larger databases, such as databases of billions of seconds of music, and are able to identify clips of duration, such as three seconds or less, such as one second, and can Audio from a typical music station is run in real time at about ten times the rate on a conventional server computer.

[0023] The following are definitions of some of the terms used in this article:

[0024] An "audio track" is an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Automatic recognition of sample media content is provided, A spectrogram is generated for successive time slices of audio signal. One or more sample hash vectors are generated for a time slice by calculating ratios of magnitudes of respective frequency bins from a column for the time slice. In a primary evaluation stage an exact match of bits of the sample hash vector is performed to entries in a look-up table to identify a group of one or more reference hash vectors. In a secondary evaluation stage a degree of similarity between the sample hash vector and each of the group of reference hash vectors is performed to identify any reference hash vectors that are candidates for matching the sample media content, each reference hash vector representing a time slice of reference media content.

Description

technical field [0001] The present invention relates to audio recognition systems and methods for automatic recognition of audio media content. Background technique [0002] Various audio recognition systems and methods are known for processing incoming audio streams ("programs") and searching internal databases of music and sound effects ("tracks") to identify those tracks used within the program. [0003] In the real world, music is often the only layer in a program's audio layers. One of the challenges for audio recognition is to recognize the identity of the music even in the context of other audio layers such as sound effects, voice-overs, ambience, etc. that are present at the same time. Other distortions include equalization (adjusting the relative amounts of tremble and bass in a track) and changing tempo and / or pitch. [0004] Some audio recognition techniques are based on directly performing a neighbor search on the computed hash values using standard algorithm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L25/00

CPCG10L25/51G10L25/18G10L25/00

Inventor 亚历山大·保罗·塞尔比马克·圣·约翰·欧文

Owner ADELPHOI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

System and method for audio media recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology