Human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction

An extraction method based on numbered musical notation, applied in the field of vocal melody extraction, that addresses the inability to obtain matching lyrics and pitch, the difficulty of obtaining lyric information, and the inability to extract the melody.

Active Publication Date: 2020-06-23

成都潜在人工智能科技有限公司



Problems solved by technology

[0004] The prior-art method (application number 201810537265.3) aims to extract the main melody track from multiple audio tracks, but it cannot extract the melody itself from that main melody track. It is also difficult for that method to obtain lyric information containing sub-track information.

It cannot obtain matching lyrics and pitch.

Embodiment Construction

[0041] The present invention is further described below with reference to the accompanying drawings:

[0042] As shown in Figure 1, a human voice melody extraction method based on numbered musical notation recognition and fundamental frequency extraction includes the following steps:

[0043] S1: Data preprocessing. Binarize the numbered musical notation file corresponding to the song to be processed, process the original audio file of the song into down-sampled mono audio, and separate the vocal waveform from the down-sampled mono audio. Specifically, this includes:

[0044] S101: Decode the original audio file of the song into wave format and normalize it to the range -1 to 1;

[0045] S102: Average the channels of the wave-format audio to obtain mono audio;

[0046] S103: Down-sample the mono audio to a sampling rate between 8000 and 44100 Hz;

[0047] S104: Binarize the numbered musical notation file corresponding to the song;

[0048] S105: Separate the vocal waveform from the down-sampled mono ...
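
Steps S101 to S104 can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: it assumes the decoded audio is 16-bit PCM held in a NumPy array of shape (frames, channels), uses plain linear interpolation in place of a proper anti-aliased resampler, and uses a fixed grayscale threshold of 128 for binarization. All function names and the threshold are hypothetical. Vocal separation (S105) is not shown because the source text is truncated at that step.

```python
import numpy as np


def normalize_pcm16(samples: np.ndarray) -> np.ndarray:
    """S101 (assumed 16-bit PCM input): scale samples into the range [-1, 1)."""
    return samples.astype(np.float64) / 32768.0


def to_mono(audio: np.ndarray) -> np.ndarray:
    """S102: average the channels of a (frames, channels) array."""
    if audio.ndim == 1:
        return audio  # already mono
    return audio.mean(axis=1)


def downsample(mono: np.ndarray, src_rate: int, dst_rate: int) -> np.ndarray:
    """S103: resample to a lower rate by linear interpolation.

    Illustrative only: no anti-aliasing filter is applied, which a
    production resampler would need.
    """
    if dst_rate > src_rate:
        raise ValueError("dst_rate must not exceed src_rate")
    n_out = int(round(len(mono) * dst_rate / src_rate))
    src_times = np.arange(len(mono)) / src_rate
    dst_times = np.arange(n_out) / dst_rate
    return np.interp(dst_times, src_times, mono)


def binarize(gray: np.ndarray, threshold: int = 128) -> np.ndarray:
    """S104: map a grayscale score image to 0 (dark ink) or 1 (background).

    The fixed threshold is an assumption; the source does not say how
    the threshold is chosen.
    """
    return (gray >= threshold).astype(np.uint8)


if __name__ == "__main__":
    stereo = np.array([[32767, -32768], [0, 16384]], dtype=np.int16)
    mono = to_mono(normalize_pcm16(stereo))
    print(mono.shape)
    print(len(downsample(np.zeros(44100), 44100, 16000)))
```

Averaging after normalization keeps the mono signal inside the same -1 to 1 range, so the later steps can treat every song on a common scale.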


Abstract

The invention discloses a human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction; the system applies the method. The method comprises the following steps: binarizing the numbered musical notation file corresponding to the song to be processed, processing the original audio file of the song into down-sampled mono audio, and separating the vocal waveform from the mono audio; identifying the note and lyric pairs in the numbered musical notation to obtain a list of lyrics and notes; searching the list of lyrics and notes according to the lyrics file to obtain a matching result sequence of lyric lines and notes; selecting a note, calculating the fundamental frequency of that note from the separated vocal waveform, calculating the frequency of every note from the calculated fundamental frequency and the relative relationship between the notes, and converting the frequency of each note into a MIDI pitch; and shifting the matching result sequence of lyric lines and notes so that its pitches match the MIDI pitches of the notes. In this way, a human voice melody whose pitch matches the melody can be extracted.
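
The pitch step in the abstract can be illustrated with a short sketch. The abstract does not state how the fundamental frequency is estimated or how the "relative relation of the notes" is encoded, so everything specific here is an assumption: a simple autocorrelation estimator stands in for the fundamental frequency calculation, numbered-notation degrees 1 to 7 are mapped to major-scale semitone offsets without accidentals, tuning is 12-tone equal temperament, and MIDI note 69 is taken as A4 = 440 Hz. The function names are hypothetical.

```python
import math

import numpy as np

# Assumed mapping: numbered-notation degrees 1-7 as major-scale semitone offsets.
DEGREE_TO_SEMITONES = {1: 0, 2: 2, 3: 4, 4: 5, 5: 7, 6: 9, 7: 11}


def estimate_f0(frame: np.ndarray, sample_rate: int,
                fmin: float = 80.0, fmax: float = 1000.0) -> float:
    """Estimate the fundamental frequency of one voiced frame.

    Uses the autocorrelation peak inside the lag range implied by
    [fmin, fmax]. This is a stand-in estimator, not the patent's.
    """
    frame = frame - frame.mean()
    acf = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    lag = min_lag + int(np.argmax(acf[min_lag:max_lag + 1]))
    return sample_rate / lag


def semitone_offset(degree: int, octave: int = 0) -> int:
    """Semitones above the tonic for a (degree, octave-shift) note."""
    return DEGREE_TO_SEMITONES[degree] + 12 * octave


def note_frequency(ref_freq: float, ref_note: tuple, note: tuple) -> float:
    """Frequency of `note`, given the measured frequency of `ref_note`."""
    delta = semitone_offset(*note) - semitone_offset(*ref_note)
    return ref_freq * 2.0 ** (delta / 12.0)


def freq_to_midi(freq: float) -> int:
    """Convert a frequency in Hz to the nearest MIDI note number."""
    return int(round(69 + 12 * math.log2(freq / 440.0)))


if __name__ == "__main__":
    sr = 8000
    t = np.arange(2048) / sr
    f0 = estimate_f0(np.sin(2 * np.pi * 200.0 * t), sr)
    # Treat the measured note as degree 1 and derive degree 5 from it.
    print(f0, freq_to_midi(note_frequency(f0, (1, 0), (5, 0))))
```

Measuring only one note and deriving the rest from score intervals means a single reliable pitch estimate is enough to place the whole melody, which appears to be the motivation for the step described above.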

Description

Technical field

[0001] The invention belongs to the technical field of audio processing, and in particular relates to a human voice melody extraction method and system based on numbered musical notation recognition and fundamental frequency extraction.

Background

[0002] With the development of computer technology, the main way music is distributed has changed from physical carriers such as tapes and CDs to network download and on-demand playback of digital music. To adapt to this change, music identification and retrieval technology is being applied more and more widely. Music information retrieval relies mainly on the main melody of a piece, which can be used for music analysis, music retrieval, music identification, similar-music recommendation, and so on.

[0003] The invention patent with application number 201810537265.3 discloses a method, device, terminal and storage medium for extracting the main melody trac...


Application Information


Patent Type & Authority: Applications (China)

IPC(8): G10L25/48, G10L25/51

CPC: G10L25/48, G10L25/51, G10H2210/056, G10H2210/061, Y02D30/70

Inventor: 尹学渊, 刘鑫忠, 江天宇

Owner: 成都潜在人工智能科技有限公司
