Device for transferring speech recognition to video

A speech recognition and conversion device technology, applied in speech recognition, speech analysis, pulse modulation TV signal transmission, etc.

Status: Inactive
Publication date: 2008-06-18

ZTE CORP

Problems solved by technology

[0003] With the development of science and technology, users' demands on multimedia will become increasingly broad: they will want not only text and sound but also visual content, which makes the development of new services particularly important. Among these, the conversion from speech recognition to video is a topic worth researching and developing; however, few technical results have been reported in this area.


Examples

Example 1

[0020] First, a first embodiment of the present invention is described with reference to Figure 3 and Figure 4. Figure 3 is a flow chart of the voice-to-video conversion method according to the first embodiment of the present invention, and Figure 4 is a schematic diagram of a voice-to-video conversion method according to an embodiment of the present invention.

[0021] As shown in Figure 3, the method for converting speech recognition to video according to the first embodiment of the present invention includes the following steps:
Step S302: when starting, the media server establishes a corresponding identification code according to the type of video resource;
Step S304: after receiving a request from the application server, the media server sets up a connection channel for the audio stream and receives the audio stream;
Step S306: the speech recognition module of the media server recognizes the audio data and outputs the recognized data to the conversion processing program;
Step S...
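The three steps visible in this excerpt can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration and does not come from the patent: the function names, the "<type>-<index>" code format, the representation of the audio stream as an iterable of byte chunks, and the recognizer passed in as a plain callable. The steps after S306 are truncated in the source and are not sketched here.

```python
from typing import Callable, Dict, Iterable, List


def build_identification_codes(resources: Dict[str, List[str]]) -> Dict[str, str]:
    """Step S302 (sketch): assign one code per video resource, grouped by type.

    The "<type>-<index>" format is a hypothetical choice, not the patent's.
    """
    codes: Dict[str, str] = {}
    for resource_type in sorted(resources):
        for index, path in enumerate(resources[resource_type], start=1):
            codes[f"{resource_type}-{index}"] = path
    return codes


def receive_audio_stream(chunks: Iterable[bytes]) -> bytes:
    """Step S304 (sketch): collect the audio chunks arriving on the channel."""
    return b"".join(chunks)


def recognize(audio: bytes, recognizer: Callable[[bytes], str]) -> str:
    """Step S306 (sketch): run the recognizer and pass its output onward."""
    return recognizer(audio)


if __name__ == "__main__":
    codes = build_identification_codes(
        {"news": ["news_a.mp4"], "sports": ["goal.mp4", "race.mp4"]}
    )
    audio = receive_audio_stream([b"\x00\x01", b"\x02"])
    # A stub recognizer stands in for a real ASR engine.
    text = recognize(audio, lambda data: f"{len(data)} bytes heard")
    print(codes)
    print(text)
```

Building the code table once at startup, as Step S302 describes, means later requests only need a lookup rather than a scan of the video resources.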

Example 2

[0043] A second embodiment of the present invention is described below with reference to Figure 5. Figure 5 is a block diagram of an apparatus 500 for converting speech recognition to video according to an embodiment of the present invention.

[0044] As shown in Figure 5, the apparatus 500 for converting speech recognition to video according to the embodiment of the present invention includes:
an identification code establishment module 502, used to establish a corresponding identification code according to the type of video resource when the media server is started;
an audio stream receiving module 504, connected to the identification code establishment module 502 and used to set up a connection channel for the audio stream and receive the audio stream after the media server receives a request from the application server;
a speech recognition module 506, connected to the audio stream receiving module 504 and used to recognize the audio stream data, and th...

Abstract

The invention discloses a device for converting speech recognition to video, which comprises: an identification code establishment module, used to establish a corresponding identification code according to the type of video resource when a media server is started; an audio stream receiving module, connected to the identification code establishment module and used to set up a connection channel for the audio stream and receive the audio stream after the media server receives a request from an application server; a speech recognition module, connected to the audio stream receiving module and used to recognize the audio data and output the recognized data to a conversion processing module; the conversion processing module, connected to the speech recognition module and the identification code establishment module and used to convert the data received from the speech recognition module and compare the converted data with the identification code established by the identification code establishment module, thereby realizing the video conversion; and a video stream output module, connected to the conversion processing module and used to output the converted video stream to a terminal device over the network.
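The comparison performed by the conversion processing module can be sketched in Python as below. This is only an illustration under stated assumptions: the class and function names, the `uri` field, and the rule that turns recognized text into a code (lower-casing and joining words with underscores) are all hypothetical, since the abstract does not say how the recognized data is converted or what an identification code looks like.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class VideoResource:
    code: str  # identification code established at startup
    uri: str   # location of the video stream (hypothetical field)


class ConversionProcessor:
    """Sketch of the conversion processing module described in the abstract."""

    def __init__(self, table: Dict[str, VideoResource]) -> None:
        self._table = table

    @staticmethod
    def to_code(recognized_text: str) -> str:
        # Hypothetical conversion rule: lower-case and join words with "_".
        return "_".join(recognized_text.lower().split())

    def convert(self, recognized_text: str) -> Optional[VideoResource]:
        # Compare the converted data with the established identification codes.
        return self._table.get(self.to_code(recognized_text))


def output_video_stream(resource: Optional[VideoResource]) -> str:
    # Stand-in for the video stream output module: report what would be sent.
    if resource is None:
        return "no matching video resource"
    return f"streaming {resource.uri}"


if __name__ == "__main__":
    table = {"weather_report": VideoResource("weather_report", "videos/weather.mp4")}
    processor = ConversionProcessor(table)
    print(output_video_stream(processor.convert("Weather Report")))
    print(output_video_stream(processor.convert("unknown phrase")))
```

Keeping the code table separate from the processor mirrors the abstract's wiring, in which the conversion processing module is connected to both the speech recognition module and the identification code establishment module.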

Description

Technical field

[0001] The present invention relates to the fields of media server applications and speech recognition, and in particular to a device for converting speech recognition to video using a media server.

Background

[0002] The next-generation network is a service-driven network. The media server is an independent device that provides dedicated media resource functions and is also an important device in the packet network. Its position in the system is shown in Figure 1, which is a schematic diagram of the service-driven network. Under the control of the application server, the media server provides the media resource functions required by various services on the softswitch, including playback, recording, dual-tone multi-frequency (DTMF) digit collection, fax, conferencing, text-to-speech (TTS) synthesis, automatic speech recognition (ASR) and other funct...

Application Information

IPC(8): G10L21/06, G10L15/00, H04N7/24, G10L21/10

Inventors: 王东, 郑罡, 张嵩

Owner: ZTE CORP
