Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

40 results about "Speech to speech translation" patented technology

Speech-enabled language translation system and method enabling interactive user supervision of translation and speech recognition accuracy

A system and method for a highly interactive style of speech-to-speech translation is provided. The interactive procedures enable a user to recognize, and if necessary correct, errors in both speech recognition and translation, thus providing robust translation output than would otherwise be possible. The interactive techniques for monitoring and correcting word ambiguity errors during automatic translation, search, or other natural language processing tasks depend upon the correlation of Meaning Cues and their alignment with, or mapping into, the word senses of third party lexical resources, such as those of a machine translation or search lexicon. This correlation and mapping can be carried out through the creation and use of a database of Meaning Cues, i.e., SELECT. Embodiments described above permit the intelligent building and application of this database, which can be viewed as an interlingua, or language-neutral set of meaning symbols, applicable for many purposes. Innovative techniques for interactive correction of server-based speech recognition are also described.
Owner:ZAMA INNOVATIONS LLC

Enhanced speech-to-speech translation system and methods

A speech translation system and methods for cross-lingual communication that enable users to improve and modify content and usage of the system and easily abort or reset translation. The system includes a speech recognition module configured for accepting an utterance, a machine translation module, an interface configured to communicate the utterance and proposed translation, a correction module and an abort action unit that removes any hypotheses or partial hypotheses and terminates translation. The system also includes modules for storing favorites, changing language mode, automatically identifying language, providing language drills, viewing third party information relevant to conversation, among other things.
Owner:META PLATFORMS INC

System and methods for maintaining speech-to-speech translation in the field

A method and apparatus are provided for updating the vocabulary of a speech translation system for translating a first language into a second language including written and spoken words. The method includes adding a new word in the first language to a first recognition lexicon of the first language and associating a description with the new word, wherein the description contains pronunciation and word class information. The new word and description are then updated in a first machine translation module associated with the first language. The first machine translation module contains a first tagging module, a first translation model and a first language module, and is configured to translate the new word to a corresponding translated word in the second language. Optionally, the invention may be used for bidirectional or multi-directional translation
Owner:META PLATFORMS INC

System for automated translation of speech

The present invention allows subscribers to an online information service to participate in real-time conferencing or chat sessions in which a message originating from a subscriber in accordance with a first language is translated to one or more languages before it is broadcast to the other conference areas. Messages in a first language are translated automatically to one or more other languages through language translation capabilities resident at online information service host computers. Access software that subscribers use for participating in conference is integrated with speech recognition and speech generation software such that a subscriber may speak the message he or she would like to share with other participants and may hear the messages from the other participants in the conference. Speech-to-speech translation may be accomplished as a message spoken into a computer microphone in accordance with a first language may be recited by a remote computer in accordance with a second language.
Owner:META PLATFORMS INC

Mobile wireless display providing speech to speech translation and avatar simulating human attributes

An apparatus includes a monocular display with a wireless communications interface, user input device, transmitter, and controller, and may provide a video link to and control and management of a host device and other devices, such as a cell phone, computer, laptop, or media player. The apparatus may receive speech and digitize it. The apparatus may compare the digitized speech in a first language to a table of digitized speech in a second language to provide translation or, alternatively, may compare the digitized speech to a table of control commands. The control commands allow user interaction with the apparatus or other remote devices in a visual and audio manner. The control signals control a “recognized persona” or avatar stored in a memory to provide simulated human attributes to the apparatus, network or third party communication device. The avatar may be changed or upgraded according to user choice.
Owner:KOPIN CORPORATION

Speech-to-speech translation system with user-modifiable paraphrasing grammars

The present invention discloses a speech-to-speech translation device which allows one or more users to input a spoken utterance in one language, translates the utterance into one or more second languages, and outputs the translation in speech form. Additionally, the device allows for translation both directions, recognizing inputs in the one or more second languages and translating them back into the first language. The device recognizes and translates utterances in a limited domain as in a phrase book translation system, so the translation accuracy is essentially 100%. By limiting the domain the system increases the accuracy of the speech recognition component and thus the accuracy of the overall system. However unlike other phrase book systems, the device also allows wide variations and paraphrasing in the input, so that the user is much more likely to find the desired phrase from the stored list of phrases. The device paraphrases the input to a basic canonical form and performs the translation on that canonical form, ignoring the non-essential variations in the surface form of the input. The device can provide visual and / or auditory feedback to confirm the recognized input and makes the system usable for non-bilingual users with absolute confidence.
Owner:EHSANI FARZAD +2

System for automated translation of speech

The present invention allows subscribers to an online information service to participate in real-time conferencing or chat sessions in which a message originating from a subscriber in accordance with a first language is translated to one or more languages before it is broadcast to the other conference areas. Messages in a first language are translated automatically to one or more other languages through languages translation capabilities resident at online information service host computers. Access software that subscribers use for participating in conference is integrated with speech recognition and speech generation software such that a subscriber may speak the message he or she would like to share with other participants and may hear the messages from the other participants in the conference. Speech-to-speech translation may be accomplished as a message spoken into a computer microphone in accordance with a first language may be recited by a remote computer in accordance with a second language.
Owner:META PLATFORMS INC

Method and apparatus for cross-lingual communication

A system and method for a highly interactive style of speech-to-speech translation is provided. The interactive procedures enable a user to recognize, and if necessary correct, errors in both speech recognition and translation, thus providing robust translation output than would otherwise be possible. The interactive techniques for monitoring and correcting word ambiguity errors during automatic translation, search, or other natural language processing tasks depend upon the correlation of Meaning Cues and their alignment with, or mapping into, the word senses of third party lexical resources, such as those of a machine translation or search lexicon. This correlation and mapping can be carried out through the creation and use of a database of Meaning Cues, i.e., SELECT. Embodiments described above permit the intelligent building and application of this database, which can be viewed as an interlingua, or language-neutral set of meaning symbols, applicable for many purposes. Innovative techniques for interactive correction of server-based speech recognition are also described.
Owner:ZAMA INNOVATIONS LLC

Disfluency detection for a speech-to-speech translation system using phrase-level machine translation with weighted finite state transducers

A computer-implemented method for creating a disfluency translation lattice includes providing a plurality of weighted finite state transducers including a translation model, a language model, and a phrase segmentation model as input, performing a cascaded composition of the weighted finite state transducers to create a disfluency translation lattice, and storing the disfluency translation lattice to a computer-readable media.
Owner:IBM CORP

System and methods for maintaining speech-to-speech translation in the field

A method and apparatus are provided for updating the vocabulary of a speech translation system for translating a first language into a second language including written and spoken words. The method includes adding a new word in the first language to a first recognition lexicon of the first language and associating a description with the new word, wherein the description contains pronunciation and word class information. The new word and description are then updated in a first machine translation module associated with the first language. The first machine translation module contains a first tagging module, a first translation model and a first language module, and is configured to translate the new word to a corresponding translated word in the second language. Optionally, the invention may be used for bidirectional or multi-directional translation.
Owner:META PLATFORMS INC

Methods and apparatus for context adaptation of speech-to-speech translation systems

A technique for context adaptation of a speech-to-speech translation system is provided. A plurality of sets of paralinguistic attribute values is obtained from a plurality of input signals. Each set of the plurality of sets of paralinguistic attribute values is extracted from a corresponding input signal of the plurality of input signals via a corresponding classifier of a plurality of classifiers. A final set of paralinguistic attribute values is generated for the plurality of input signals from the plurality of sets of paralinguistic attribute values. Performance of at least one of a speech recognition module, a translation module and a text-to-speech module of the speech-to-speech translation system is modified in accordance with the final set of paralinguistic attribute values for the plurality of input signals.
Owner:IBM CORP

Voice prompts for use in speech-to-speech translation system

Techniques for employing improved prompts in a speech-to-speech translation system are disclosed. By way of example, a technique for use in indicating a dialogue turn in an automated speech-to-speech translation system comprises the following steps / operations. One or more text-based scripts are obtained. The one or more text-based scripts are synthesizable into one or more voice prompts. At least one of the one or more voice prompts is synthesized for playback from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to a speaker interacting with the speech-to-speech translation system, the audible message indicating a dialogue turn in the automated speech-to-speech translation system.
Owner:IBM CORP

Using separate recording channels for speech-to-speech translation systems

InactiveUS20090306957A1Highly accurate language detectionHighly accurate language detection and speech recognitionNatural language translationSpeech recognitionSpeech to speech translationLoudspeaker
A system and method for speech-to-speech translation using a translation system includes designating separate input channels for each of a plurality of speakers. In response to speech from a first channel in a first language, the speech from the first channel is translated to a second language, and in response to speech from the second channel in a second language, the speech from the second channel is translated to the first language. Translating the speech from the first channel and translating the speech from the second channel are performed concurrently.
Owner:IBM CORP

Method and apparatus for cross-lingual communication

A system and method for a highly interactive style of speech-to-speech translation is provided. The interactive procedures enable a user to recognize, and if necessary correct, errors in both speech recognition and translation, thus providing robust translation output than would otherwise be possible. The interactive techniques for monitoring and correcting word ambiguity errors during automatic translation, search, or other natural language processing tasks depend upon the correlation of Meaning Cues and their alignment with, or mapping into, the word senses of third party lexical resources, such as those of a machine translation or search lexicon. This correlation and mapping can be carried out through the creation and use of a database of Meaning Cues, i.e., SELECT. Embodiments described above permit the intelligent building and application of this database, which can be viewed as an interlingua, or language-neutral set of meaning symbols, applicable for many purposes. Innovative techniques for interactive correction of server-based speech recognition are also described.
Owner:ZAMA INNOVATIONS LLC

Mobile wireless display providing speech to speech translation and avatar simulating human attributes

An apparatus includes a monocular display with a wireless communications interface, user input device, transmitter, and controller, and may provide a video link to and control and management of a host device and other devices, such as a cell phone, computer, laptop, or media player. The apparatus may receive speech and digitize it. The apparatus may compare the digitized speech in a first language to a table of digitized speech in a second language to provide translation or, alternatively, may compare the digitized speech to a table of control commands. The control commands allow user interaction with the apparatus or other remote devices in a visual and audio manner. The control signals control a recognized persona or avatar stored in a memory to provide simulated human attributes to the apparatus, network or third party communication device. The avatar may be changed or upgraded according to user choice.
Owner:KOPIN CORPORATION

Disfluency detection for a speech-to-speech translation system using phrase-level machine translation with weighted finite state transducers

A computer-implemented method for creating a disfluency translation lattice includes providing a plurality of weighted finite state transducers including a translation model, a language model, and a phrase segmentation model as input, performing a cascaded composition of the weighted finite state transducers to create a disfluency translation lattice, and storing the disfluency translation lattice to a computer-readable media.
Owner:IBM CORP

Apparatus and method for integrated phrase-based and free-form speech-to-speech translation

An apparatus and method that integrates both phrase-based and free-form speech-to-speech translation approaches using probability models. The starting step of the method is to receive vocal communication in a source language. Then store the received vocal communication. Then decipher the content of the vocal communication. Then locate in a multilingual dictionary module the corresponding translation of the deciphered vocal communication provided a preset sentence exists in a speech recognition module for the vocal communication. Then translate the vocal communication into the target language provided there is no corresponding translation located in the multilingual dictionary module. Then synthesize the translated target language when there is no corresponding translation for the vocal communication in the multilingual dictionary module. Then store the sound of the translated target language. Then play the sound of the translated target language.
Owner:IBM CORP

System and methods for maintaining speech-to-speech translation in the field

A field maintainable class-based translation system and apparatus with components that ease use by linguistically untrained users is disclosed. The apparatus includes modules for recovering errors, extending and customizing language coverage and increasing the speed of effective communication.
Owner:META PLATFORMS INC

System and Methods for Maintaining Speech-To-Speech Translation in the Field

Extensions to an apparatus for speech translation improve the effectiveness of communication. An apparatus for speech translation capable of repairing errors and expanding its vocabulary. System and methods improve the ease by which common, linguistically untrained users can better recover from errors, extend and customize the language coverage of their speech translation device and increase the speed of effective communication, including stopping translation by shaking, correction of pronunciations using pseudo phonetics, translation favorites, translation modes, small dictionary based on user repair and boosting, single action speech translators, speech translation dialog language learning, translation from telephone conversations, targeted ads based on translated speech, backchanneling based on speech, and web presentation based on speech.
Owner:META PLATFORMS INC

Apparatus And Method For Integrated Phrase-Based And Free-Form Speech-To-Speech Translation

An apparatus and method that integrates both phrase-based and free-form speech-to-speech translation approaches using probability models. The starting step of the method is to receive vocal communication in a source language. Then store the received vocal communication. Then decipher the content of the vocal communication. Then locate in a multilingual dictionary module the corresponding translation of the deciphered vocal communication provided a preset sentence exists in a speech recognition module for the vocal communication. Then translate the vocal communication into the target language provided there is no corresponding translation located in the multilingual dictionary module. Then synthesize the translated target language when there is no corresponding translation for the vocal communication in the multilingual dictionary module. Then store the sound of the translated target language. Then play the sound of the translated target language.
Owner:IBM CORP

Direct speech-to-speech translation via machine learning

The present disclosure provides systems and methods that train and use machine-learned models such as, for example, sequence-to-sequence models, to perform direct and text-free speech-to-speech translation. In particular, aspects of the present disclosure provide an attention-based sequence-to-sequence neural network which can directly translate speech from one language into speech in another language, without relying on an intermediate text representation.
Owner:GOOGLE LLC

Speech-to-speech translation method and device

The invention discloses a speech-to-speech translation method. The speech-to-speech translation method comprises the steps that the corresponding relation of the content between different languages is stored in a translation reference database, and a speech-to-speech translation device obtains a source speech of a first user; the speech-to-speech translation device carries out matching on the translation reference database according to the source speech, and obtains an objective speech according to the matching result; the speech-to-speech translation device provides the objective speech for a second user. The invention further and correspondingly discloses the speech-to-speech translation device. According to the speech-to-speech translation method and device, when speech translation or text translation is carried out, matching is carried out on the translation reference database firstly, and when the objective speech corresponding to the source speech, or an objective text corresponding to a source text is stored in the translation reference database, the speech translation time can be shortened, the frequency of conversation delay can be reduced, and user experience can be improved.
Owner:CHINA MOBILE COMM GRP CO LTD

Rapid automatic user training with simulated bilingual user actions and responses in speech-to-speech translation

A system and method for automatic user training in speech-to-speech translation includes integrating an automatic user response system configured to be responsive to a plurality of training items and selecting a training item from the plurality of training items. For the selected training item, in response to an utterance in a first language, the utterance is translated into a second language, and a response to the utterance in the second language is generated. A simulated action corresponding with the response in accordance with a user speaking the second language is also generated. The response and simulated action are output as a learning exercise for learning operations of the automatic user response system.
Owner:IBM CORP

Method, device and voice translation equipment for achieving speech-to-speech translation

The invention discloses a method, device and speech translation equipment for achieving speech-to-speech translation. The method comprises the following steps that a voice recognition engine set, a text translation engine set and a voice synthesis engine set are constructed; for each translation service, a voice recognition engine, a text translation engine and a voice synthesis engine which support the translation service are respectively selected from the voice recognition engine set, the text translation engine set and the voice synthesis engine set, and at least two sets of engine combinations are formed; characteristic information of each set of engine combination is obtained; according to the characteristic information, priority ranking is performed on the engine combinations of eachtranslation service to generate an engine combination selection set of the translation service for being selectively used in subsequent translation. The method, device and speech translation equipment for achieving the speech-to-speech translation have the advantages that the free combination of multiple engines is achieved, the advantages of different engines are fully used, thereby greatly improving the translation performance, providing better translation services for users, and greatly improving the user experience.
Owner:SHENZHEN WATER WORLD CO LTD

Speech-to-speech translation method, device and equipment and control equipment

The invention provides a speech-to-speech translation method, device and equipment and control equipment. The method comprises the steps that audio data and a language text obtained by speech processing equipment based on the conversion of the audio data are acquired; the audio data and the language text are sent to a control terminal, so that the control terminal verifies the language text according to the audio data, and when it is determined that the language text has errors through the verification, a correction rule is generated; the correction rule sent by the control terminal is received, and according to the correction rule, the language text is calibrated. The speech-to-speech translation method, device and equipment and the control equipment have the advantages that the accuracyof a translation result of the language text is improved, the language text can be calibrated in real time according to the correction rule, and the normal operation of a meeting is not affected.
Owner:COLORFULCLOUDS PACIFIC TECH CO LTD +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products