Audio and text synchronization method, computing device and storage medium

An audio and text synchronization technology, applied in the field of data processing, addresses the difficulty of synchronizing recorded audio with book text and the resulting lack of synchronized sentence display, with the aim of improving user experience, facilitating speech recognition, and improving the reading experience.

Active Publication Date: 2021-08-24

ZHANGYUE TECH CO LTD

9 Cites, 0 Cited by


Problems solved by technology

For audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, so the sentences being read cannot be displayed in sync during audio playback. When the user cannot hear or understand the content being read, the lack of synchronized sentence display makes the book's content hard to follow, resulting in a poor user experience.


Examples


Embodiment 1

[0023] Figure 1 is a schematic flowchart of a method for synchronizing audio and text according to Embodiment 1 of the present invention. As shown in Figure 1, the method includes the following steps:

[0024] In step S101, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0025] The book platform stores the original book text of each book and the audio recorded by a real person for each book. In this embodiment, the original book text of the book is called the first text, and the book text obtained by performing speech recognition on the audio is called the second text. In step S101, the audio and the first text that correspond to the same book and need to be synchronized are obtained from the book platform as the audio to be matched and the first text, and the first text is then segmented to obtain a first sentence set containing multiple sentences.

[0026] Step S102, segment the audio to obtain a set of audio clips, perf...
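The combining part of step S102 can be illustrated with a minimal sketch. It assumes speech recognition has already produced one sentence per audio segment; the `RecognizedSegment` type, its field names, and `build_second_text` are hypothetical names introduced here for illustration, not taken from the patent. The sketch concatenates the segment sentences into the second text and records which audio segment each character came from, so that a matched character range can later be traced back to audio.

```python
from dataclasses import dataclass


@dataclass
class RecognizedSegment:
    # Hypothetical container for one audio segment and its recognized sentence.
    start_s: float  # segment start time in the audio, in seconds
    end_s: float    # segment end time in the audio, in seconds
    text: str       # sentence assumed to come from a speech recognizer


def build_second_text(segments):
    """Concatenate segment sentences into the second text.

    Returns the second text and a list that maps every character
    position in it to the index of the segment it came from.
    """
    chars = []
    owners = []
    for index, segment in enumerate(segments):
        for ch in segment.text:
            chars.append(ch)
            owners.append(index)
    return "".join(chars), owners


segments = [
    RecognizedSegment(0.0, 1.5, "ab"),
    RecognizedSegment(1.5, 3.0, "cde"),
]
second_text, owners = build_second_text(segments)
```

How the audio is actually cut into segments and recognized is not covered by this sketch.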

Embodiment 2

[0034] Figure 2a is a schematic flowchart of a method for synchronizing audio and text according to Embodiment 2 of the present invention. As shown in Figure 2a, the method includes the following steps:

[0035] In step S201, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0036] The audio and the first text that correspond to the same book and need to be synchronized are obtained from the book platform, and the first text can then be segmented at specified punctuation marks to obtain the first sentence set. The specified punctuation marks may be those that indicate the end of a sentence, such as a period, a question mark, or an exclamation mark. Specifically, the symbol position of each specified punctuation mark contained in the first text can be identified, and the first segmentation point is determined according to the symbol p...
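A minimal sketch of punctuation-based segmentation follows. The set of end marks in `SENTENCE_END` and the rule of splitting immediately after the last mark in a run are assumptions made for illustration; the patent text is truncated before it states the exact rule for choosing segmentation points.

```python
# Assumed sentence-ending marks: full-width and ASCII period, question mark,
# and exclamation mark.
SENTENCE_END = "。？！.?!"


def split_sentences(text, end_marks=SENTENCE_END):
    """Split text into sentences at positions of sentence-ending punctuation."""
    sentences = []
    start = 0
    for i, ch in enumerate(text):
        if ch not in end_marks:
            continue
        # Keep runs such as "..." or "?!" together: split only after the
        # last mark of the run.
        next_is_mark = i + 1 < len(text) and text[i + 1] in end_marks
        if next_is_mark:
            continue
        sentence = text[start:i + 1].strip()
        if sentence:
            sentences.append(sentence)
        start = i + 1
    # Any trailing text without an end mark becomes the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


first_sentence_set = split_sentences("Hello world. How are you? Fine!")
```

Because every ASCII period counts as an end mark here, text such as "3.14" would be split as well; a production splitter would need extra rules for such cases.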

Embodiment 3

[0072] Embodiment 3 of the present invention provides a non-volatile computer storage medium, the storage medium stores at least one executable instruction, and the executable instruction can execute the method for synchronizing audio and text in any of the above method embodiments.

[0073] Specifically, the executable instruction can be used to make the processor perform the following operations: obtain the audio to be matched and the first text, and segment the first text to obtain the first sentence set; segment the audio to obtain a set of audio segments, perform speech recognition on each audio segment in the set to obtain each segment sentence, combine the segment sentences to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially extract a first sentence from the first sentence set, obtain the first character sequence corresponding to the first sentence, extract the second character sequence from the ...


Abstract

The invention discloses a method for synchronizing audio and text, a computing device and a storage medium. The method includes: acquiring the audio to be matched and a first text, and segmenting the first text to obtain a first sentence set; segmenting the audio to obtain a set of audio segments, performing speech recognition on each audio segment in the set to obtain each segment sentence, combining the segment sentences to obtain a second text, and obtaining the character sequence corresponding to the second text; sequentially extracting a first sentence from the first sentence set, obtaining the first character sequence corresponding to the first sentence, extracting a second character sequence from the character sequence corresponding to the second text according to a preset window, matching the first character sequence against the second character sequence to determine a third character sequence that matches the first character sequence, and establishing a synchronization relationship between the audio segment corresponding to the third character sequence and the first sentence. This solution allows the synchronization relationship between audio segments and sentences to be determined precisely.
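The window-based matching described in the abstract can be sketched roughly as below. This is not the patent's matching algorithm, which is not given in the visible text: the sketch swaps in Python's `difflib.SequenceMatcher` as the comparison step, and the window size, the `min_ratio` threshold, and the function names `align_sentences` and `segments_for_span` are all assumptions introduced for illustration.

```python
from difflib import SequenceMatcher


def align_sentences(sentences, second_text, window=40, min_ratio=0.6):
    """For each first sentence, find a matching span in the second text.

    Returns one (start, end) character span per sentence, or None when
    too few characters of the sentence are found inside the window.
    """
    spans = []
    cursor = 0  # position in second_text where the next window starts
    for first_seq in sentences:
        if not first_seq:
            spans.append(None)
            continue
        # Second character sequence: a window starting at the cursor.
        size = max(window, 2 * len(first_seq))
        second_seq = second_text[cursor:cursor + size]
        matcher = SequenceMatcher(None, first_seq, second_seq, autojunk=False)
        blocks = [b for b in matcher.get_matching_blocks() if b.size > 0]
        matched = sum(b.size for b in blocks)
        if not blocks or matched / len(first_seq) < min_ratio:
            spans.append(None)
            continue
        # Third character sequence: from the first to the last matched block.
        start = cursor + blocks[0].b
        end = cursor + blocks[-1].b + blocks[-1].size
        spans.append((start, end))
        cursor = end
    return spans


def segments_for_span(span, owners):
    """Map a character span to the audio segment indices it covers.

    owners[i] is assumed to hold the segment index of character i.
    """
    start, end = span
    return sorted(set(owners[start:end]))


spans = align_sentences(["abcdef", "ghijkl"], "abcdefghijkl")
```

Advancing the cursor after each match keeps the search local and in reading order, which is the role the preset window plays in the abstract. A sentence that fails to match leaves the cursor unchanged, so a later sentence can still be matched against the same region.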

Description

Technical field

[0001] The invention relates to the technical field of data processing, in particular to a method for synchronizing audio and text, a computing device and a storage medium.

Background

[0002] With the continuous development of e-book technology, users can not only read book content with their eyes, but also obtain book content by playing audiobooks. Obtaining book content by playing audiobooks can also be called listening to books, and it brings users a new reading experience. However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, so the sentences being read cannot be displayed in sync during audio playback. When the user cannot hear or understand the content being read, the lack of synchronized sentence display makes the book's content hard to follow, resulting in poor ...


Application Information


Patent Type & Authority: Patents (China)

IPC(8): G10L15/26, G06F40/289, G11B20/10

CPC: G10L15/26, G11B2020/10953, G06F40/289

Inventor: 陈梦瑶, 唐旺

Owner: ZHANGYUE TECH CO LTD
