Audio and text synchronization method, computing device and storage medium

A text and audio technology, applied in the field of data processing, can solve problems such as poor user experience, lack of synchronous sentences, difficulty in synchronizing audio and book text, etc., to improve user experience, facilitate speech recognition, and improve reading effects.

Active Publication Date: 2021-08-24
ZHANGYUE TECH CO LTD
View PDF9 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor user experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio and text synchronization method, computing device and storage medium
  • Audio and text synchronization method, computing device and storage medium
  • Audio and text synchronization method, computing device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0023] figure 1 A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 1 of the present invention, as shown in figure 1 As shown, the method includes the following steps:

[0024] In step S101, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0025] Among them, the original book text of each book and the audio recorded by a real person for each book are stored in the book platform. In this embodiment, the original book text of the book is called the first text. The resulting book text is called the second text. In step S101, the audio and the first text corresponding to the same book that need to be synchronized are obtained from the book platform as the audio and the first text to be matched, and then the first text is segmented to obtain multiple sentences The first statement set of .

[0026] Step S102, segment the audio to obtain a set of audio clips, perf...

Embodiment 2

[0034] Figure 2a A schematic flowchart showing a method for synchronizing audio and text according to Embodiment 2 of the present invention, as shown in Figure 2a As shown, the method includes the following steps:

[0035] In step S201, the audio to be matched and the first text are obtained, and the first text is segmented to obtain a first sentence set.

[0036]The audio and the first text corresponding to the same book that need to be synchronously processed are obtained from the book platform, and then the first text can be segmented according to the specified punctuation marks to obtain the first statement set. Wherein, the specified punctuation mark may be a punctuation mark used to indicate the end of a sentence, such as a period, a question mark, an exclamation mark, and the like. Specifically, the symbol position of the specified punctuation mark contained in the first text can be identified, and the first segmentation point is determined according to the symbol p...

Embodiment 3

[0072] Embodiment 3 of the present invention provides a non-volatile computer storage medium, the storage medium stores at least one executable instruction, and the executable instruction can execute the method for synchronizing audio and text in any of the above method embodiments.

[0073] Specifically, the executable instruction can be used to make the processor perform the following operations: obtain the audio to be matched and the first text, segment the first text to obtain the first statement set; segment the audio to obtain a collection of audio fragments, Perform speech recognition on each audio segment in the segment set to obtain each segment sentence, combine each segment sentence to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially extract the first sentence from the first sentence set, and obtain the first sentence For the corresponding first character sequence, extract the second character sequence from the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for synchronizing audio and text, a computing device and a storage medium, wherein the method includes: acquiring the audio to be matched and a first text, and segmenting the first text to obtain a first statement set; Perform segmentation to obtain a collection of audio fragments, perform speech recognition on each audio fragment in the audio fragment collection to obtain each fragment sentence, combine each fragment sentence to obtain the second text, and obtain the character sequence corresponding to the second text; sequentially from the first sentence extracting the first sentence from the collection, obtaining the first character sequence corresponding to the first sentence, extracting the second character sequence from the character sequence corresponding to the second text according to the preset window, and matching the first character sequence with the second character sequence, A third character sequence matching the first character sequence is determined, and a synchronization relationship between the audio segment corresponding to the third character sequence and the first sentence is established. This solution realizes the precise determination of the synchronous relationship between audio clips and sentences.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for synchronizing audio and text, a computing device and a storage medium. Background technique [0002] With the continuous development of e-book technology, users can not only read book content with their eyes, but also obtain book content by playing audiobooks. Among them, the way of obtaining book content by playing audiobooks can also be called the way of listening to books, and this way of listening to books brings a new reading experience to users. However, for audio recorded by a real person for a book, it is difficult to synchronize the audio with the text of the book, resulting in the inability to display the read sentences synchronously during audio playback. When the user cannot hear or understand the content read in the audio , due to the lack of display of synchronous statements, the content of the book cannot be well understood, resulting in poor ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/26G06F40/289G11B20/10
CPCG10L15/26G11B2020/10953G06F40/289
Inventor 陈梦瑶唐旺
Owner ZHANGYUE TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products