Time sequence mapping method for text to audio realized by computer

A computer-implemented audio technology, applied in the field of audio analysis, that addresses problems such as text animation that is coarser than word level, large file sizes, and individual spoken words that cannot be searched or interacted with. It achieves low data-transmission bandwidth and easy modification.

Inactive Publication Date: 2010-06-09
埃里克・路易斯・汉森

AI Technical Summary

Problems solved by technology

When a recorded speech is played while a written transcript of that speech is presented (or a text is presented while a spoken rendition of it is played), the listening reader (or reading listener) encounters several problems. The first is how to keep track of which point in the text corresponds to what is currently being spoken. Previous techniques deal with this problem in two ways; their shortcomings are analyzed below.
The second problem is that in speech-plus-text presentations, the individual written words that make up the text can be made machine-searchable, annotatable, and interactive, while the individual spoken words of the audio part cannot.
Previous technologies, while aware of the correspondence between text and audio, have failed to make audio containing speech machine-searchable, annotatable, and interactive.
The third problem is that interactive transmission of the audio component requires the development of a streaming protocol.
[0020] 1. Creating this kind of imagery takes a long time and requires highly skilled personnel.
[0021] 2. Even if only text is displayed and audio is played, creating this kind of video produces a large data file.
These large files consume correspondingly large amounts of bandwidth and data storage space, and thus limit which programmable or dedicated digital computing devices can download speech-plus-text presentations.
[0022] 3. The animation is fixed.
[0023] 4. Animations are usually coarser than word-level granularity.
[0025] 6. Interaction with the audio is limited to controlling the player.
[0026] 7. The audio is not machine-searchable or annotatable.
[0029] 10. The user cannot interact with the text itself.


Examples

Embodiment Construction

[0058] The present invention can be embodied in various forms. The details disclosed herein are therefore not to be read as limitations but as examples that teach one skilled in the art to use the invention on any suitable system or structure, or in any suitable manner.

[0059] Figure 1 shows a digital computing device 100 of the present invention. Digital computing device 100 comprises: 1. an input processor, 2. a general-purpose processor, 3. memory, 4. non-volatile digital storage, 5. an audio processor, 6. a video processor, and 7. a network adapter, all connected together through bus structure 8. Digital computing device 100 may be incorporated into a standard personal computer, cell phone, smart phone, palmtop computer, notebook computer, personal digital assistant, or the like, equipped with appropriate input, video display, and audio hardware. It can also be implemented with dedicated hardware and software. They can be integrated into consumer appl...


Abstract

The present invention provides a device, a method, and a computer-readable medium for creating a time-sequence mapping of text to audio. It also provides a device, a method, and a computer-readable medium for playing audio with animated text. A mapping agent (10) takes text (12) and the corresponding audio recording (11) as input and assigns begin and end times to text units (15). A player (50) takes the text (15), audio (17), and mapping (16) as input, and animates and displays the text (15) in synchrony with playback of the audio (17). The invention can be used to animate text while an audio recording plays; to control audio playback in place of traditional playback controls; to play and display annotations of voice recordings; and to achieve the characteristics of streaming audio without using an underlying streaming protocol.
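
As a rough illustration of what a text-to-audio time mapping could look like in practice, the following Python sketch stores a begin and end time for each text unit and uses that list for two of the uses named above: finding the unit to highlight at a given playback time, and finding where a word is spoken. The names `TimedUnit`, `unit_at`, and `find_word`, the choice of words as text units, and the sample timings are all assumptions made for illustration; they are not taken from the patent, which does not specify a data format here.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class TimedUnit:
    """One text unit (assumed here to be a word) with begin/end times in seconds."""
    text: str
    begin: float
    end: float


def unit_at(mapping, t):
    """Return the index of the unit being spoken at time t, or None.

    Assumes `mapping` is sorted by begin time and that units do not overlap.
    A player could call this on each tick to decide which word to highlight.
    """
    begins = [u.begin for u in mapping]
    i = bisect_right(begins, t) - 1  # last unit that starts at or before t
    if i >= 0 and t < mapping[i].end:
        return i
    return None  # t falls before the first unit, in a pause, or after the end


def find_word(mapping, word):
    """Return the begin time of every unit matching `word`, ignoring case.

    Searching the text this way yields positions to seek to in the audio.
    """
    target = word.lower()
    return [u.begin for u in mapping if u.text.lower() == target]


# Hypothetical mapping, of the kind a mapping agent might output.
mapping = [
    TimedUnit("four", 0.0, 0.4),
    TimedUnit("score", 0.4, 0.9),
    TimedUnit("and", 1.0, 1.2),
    TimedUnit("seven", 1.2, 1.7),
]
```

Because the mapping is a small list of words and numbers kept separate from the audio, it can be edited or retransmitted without touching the recording, which is consistent with the low-bandwidth and easy-modification effects claimed in the summary.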

Description

Technical field

[0001] The invention relates to the field of audio analysis, in particular to audio, such as speech, that has a corresponding textual representation. More specifically, it relates to a process for creating text-to-audio mappings.

Background

[0002] The first technological advance in language-based communication was the development of simple vocalizations, which at the time could convey meaning only in temporal isolation. Later, people combined these initial vocalizations in temporal sequence to form streams of speech. Later, people invented drawing simple symbols or images on cave walls or other suitable surfaces, but these conveyed meaning only in spatial isolation. Later generations linked these symbols or images with spoken language in time. Later, people combined these independent language-related graphics in sequential order in space to form written language, or "text". Specifically, our innovative ancestors began to order the sequential space of morphologica...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G11B20/10, G11B27/10, G11B27/00
CPC: G10L13/043, G10L19/167, G10L13/00
Inventor 埃里克·路易斯·汉森
Owner 埃里克・路易斯・汉森