Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Device for transferring speech recognition to video

A speech recognition and conversion device technology, applied in speech recognition, speech analysis, pulse modulation TV signal transmission, etc.

Inactive Publication Date: 2008-06-18
ZTE CORP
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] With the development of science and technology, users' needs for multimedia life will become more and more extensive, not only needing text and sound, but also visual sensory needs, which forces the development of new services to be particularly important. Among them, from speech recognition to time-frequency The conversion technology is a worthy research and development topic, however, there are few technical achievements related to this

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Device for transferring speech recognition to video
  • Device for transferring speech recognition to video
  • Device for transferring speech recognition to video

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0020] First, refer to image 3 and Figure 4 A first embodiment of the present invention is described. image 3 is a flow chart of the voice-to-video conversion method according to the first embodiment of the present invention, Figure 4 is a schematic diagram of a voice-to-video conversion method according to an embodiment of the present invention.

[0021] Such as image 3 As shown, the conversion method from speech recognition to video according to the first embodiment of the present invention includes the following steps: Step S302: the media server establishes a corresponding identification code according to the type of video resource when starting; Step S304: the media server receives the application After the server's request, set up the connection channel of the audio stream and receive the audio stream; Step S306: the voice recognition module of the media server recognizes the audio data, and outputs the recognized data to the conversion processing program; Step S...

no. 2 example

[0043] The following will refer to Figure 5 A second embodiment of the present invention is described. Figure 5 It is a block diagram of an apparatus 500 for converting speech recognition to video according to an embodiment of the present invention.

[0044] Such as Figure 5 As shown, the conversion device 500 from speech recognition to video according to the embodiment of the present invention includes: an identification code establishment module 502, which is used to establish a corresponding identification code according to the type of video resource when the media server is started; an audio stream receiving module 504, Connected to the identification code establishment module 502, used to set up the connection channel of the audio stream and receive the audio stream after the media server receives the request of the application server; the speech recognition module 506, connected to the audio stream receiving module 504, used to identify the audio stream data, and th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a transforming device from speech identification to video, which comprises: an identifying code establishing module which is used for establishing a corresponding identifying code according to the types of video resource when a media server is started; an audio stream receiving module which is connected with the identifying code establishing module and is used for establishing a connecting channel of the audio stream and receiving the audio stream after the media server receives the request of an application server; a speech identifying module which is connected with the audio stream receiving module and is used for identifying audio data and outputting the identified data to a transformation processing module; the transformation processing module which is connected with the speech identifying module and the identifying code establishing module and is used for transforming the received data of the speech identifying code and making comparison between the transformed data with an identifying code established by the identifying code establishing module, thus realizing video transformation; a video stream output module which is connected with the transformation processing module and used for outputting the transformed video stream to a terminal unit through network.

Description

technical field [0001] The present invention relates to the fields of media server applications and speech recognition, and in particular, to a device for converting speech recognition to video using a media server. Background technique [0002] The next-generation network is a service-driven network. The media server is an independent device that provides dedicated media resource functions, and is also an important device in the packet network. Its position in the system is as follows: figure 1 As shown, among them, figure 1 It is a schematic diagram of the service-driven network. Under the control of the application server, the media server provides the media resource functions required by various services on the softswitch, including: playback, recording, dual-tone multi-frequency (dual-tone multi-frequency, DTMF) number collection, fax, conference, speech synthesis (text to speech, TTS) and automatic speech recognition (automatic speech recognition, ASR) and other funct...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/06G10L15/00H04N7/24G10L21/10
Inventor 王东郑罡张嵩
Owner ZTE CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products