Speech summarization program

A speech summarization technology, applied in speech analysis, speech recognition, and related fields, that addresses problems such as language barriers and unintelligible accents in video conferences.

Active Publication Date: 2017-11-28
IBM CORP
10 Cites, 8 Cited by

AI Technical Summary

Problems solved by technology

Like any conversation, video conferencing can be hampered by language barriers, unintelligible accents, rapid speech, or a participant in a multi-person meeting arriving late and missing what was previously discussed.

Embodiment Construction

[0007] Embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

[0008] Figure 1 shows a speech summarization system 100 according to an embodiment of the invention. In this example embodiment, the speech summarization system 100 includes a computing device 110, a video camera 114, a microphone 112, a computing device 120, a video camera 124, a microphone 122, and a network 108.

[0009] Network 108 may be the Internet, which represents a worldwide collection of networks and gateways that support communication between devices connected to the Internet. Network 108 may include, for example, wired, wireless, or fiber optic connections. In other embodiments, network 108 may be implemented as an intranet, local area network, or wide area network. In general, network 108 may be any combination of connections and protocols that support communication between computing device 110 and computing device 120.

[0010] Microp...

Abstract

Embodiments of the present invention disclose a method, system, and computer program product for speech summarization. A computer receives audio and video components from a video conference. The computer determines which participant is speaking based on comparing images of the participants with template images of speaking and non-speaking faces. The computer determines the voiceprint of the speaking participant by applying a Hidden Markov Model to a brief recording of the voice waveform of the participant and associates the determined voiceprint with the face of the speaking participant. The computer recognizes and transcribes the content of statements made by the speaker, determines the key points, and displays them over the face of the participant in the video conference.
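
The steps in the abstract can be pictured as a small pipeline. The Python below is a minimal sketch under stated assumptions, not the patented method: faces are represented as flat lists of pixel intensities, template comparison uses a mean absolute difference, and key points are chosen by a simple word-frequency score. The Hidden Markov Model voiceprint step and the speech recognition step are omitted. All names (`is_speaking`, `key_points`, `summarize_frame`, `STOPWORDS`) are hypothetical and do not come from the patent.

```python
from collections import Counter
import re

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "to", "of", "and", "is", "we", "in", "for", "on", "by"}


def mean_abs_diff(image, template):
    """Average per-pixel absolute difference between two equal-sized images."""
    if len(image) != len(template):
        raise ValueError("image and template must have the same size")
    return sum(abs(p - q) for p, q in zip(image, template)) / len(image)


def is_speaking(face, speaking_template, silent_template):
    """True if the face is closer to the speaking template (assumed metric)."""
    return mean_abs_diff(face, speaking_template) < mean_abs_diff(face, silent_template)


def key_points(transcript, top_n=3):
    """Pick the top_n most frequent non-stopword terms as stand-in key points."""
    words = re.findall(r"[a-z']+", transcript.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_n)]


def summarize_frame(faces, speaking_template, silent_template, transcript):
    """Map each participant detected as speaking to the key points for overlay."""
    points = key_points(transcript)
    return {
        name: points
        for name, face in faces.items()
        if is_speaking(face, speaking_template, silent_template)
    }


# Toy data: "alice" resembles the speaking template, "bob" the silent one.
faces = {"alice": [10, 200, 10], "bob": [10, 10, 10]}
result = summarize_frame(
    faces,
    speaking_template=[0, 255, 0],
    silent_template=[0, 0, 0],
    transcript="Budget review: the budget is due Friday",
)
print(result)
```

In this toy run only the participant whose face is closer to the speaking template receives an overlay. A production system would also need the voiceprint association described in the abstract to attribute each transcript to the correct speaker when several faces are visible.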

Description

Technical field

[0001] The present invention relates generally to speech analysis, and more particularly to determining key points made by a speaker during a video conference.

Background

[0002] Videoconferencing is often used for business or personal purposes as an efficient and convenient method of communication that avoids the need to physically travel to one location for a face-to-face conversation. Video conferencing is becoming increasingly popular because a single video conference can simultaneously connect hundreds of people from anywhere on the planet to a real-time, face-to-face conversation. However, like any conversation, video conferencing can be hampered by language barriers, unintelligible accents, rapid speech, or a participant in a multi-person meeting arriving late and missing what was previously discussed.

Summary of the invention

[0003] Embodiments of the invention disclose methods, systems and computer progra...

Application Information

IPC(8): H04L12/18, H04N7/15
CPC: G10L25/57, G10L17/00, H04L12/1831, G10L25/87, H04N7/147, H04L12/1827, G10L21/10, H04N7/15, G10L15/26, G10L17/02
Inventor 陈叶青, 聂文娟, 吴婷, 杨昭
Owner IBM CORP