Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

297 results about "Speech translation" patented technology

Speech translation is the process by which conversational spoken phrases are instantly translated and spoken aloud in a second language. This differs from phrase translation, which is where the system only translates a fixed and finite set of phrases that have been manually entered into the system. Speech translation technology enables speakers of different languages to communicate. It thus is of tremendous value for humankind in terms of science, cross-cultural exchange and global business. so when you type something in you can press a button and it will read it out and pronounce it for you so you know how to say it in a different language.

Enhanced speech-to-speech translation system and methods

A speech translation system and methods for cross-lingual communication that enable users to improve and modify content and usage of the system and easily abort or reset translation. The system includes a speech recognition module configured for accepting an utterance, a machine translation module, an interface configured to communicate the utterance and proposed translation, a correction module and an abort action unit that removes any hypotheses or partial hypotheses and terminates translation. The system also includes modules for storing favorites, changing language mode, automatically identifying language, providing language drills, viewing third party information relevant to conversation, among other things.
Owner:META PLATFORMS INC

System and methods for maintaining speech-to-speech translation in the field

A method and apparatus are provided for updating the vocabulary of a speech translation system for translating a first language into a second language including written and spoken words. The method includes adding a new word in the first language to a first recognition lexicon of the first language and associating a description with the new word, wherein the description contains pronunciation and word class information. The new word and description are then updated in a first machine translation module associated with the first language. The first machine translation module contains a first tagging module, a first translation model and a first language module, and is configured to translate the new word to a corresponding translated word in the second language. Optionally, the invention may be used for bidirectional or multi-directional translation
Owner:META PLATFORMS INC

System for automated translation of speech

The present invention allows subscribers to an online information service to participate in real-time conferencing or chat sessions in which a message originating from a subscriber in accordance with a first language is translated to one or more languages before it is broadcast to the other conference areas. Messages in a first language are translated automatically to one or more other languages through language translation capabilities resident at online information service host computers. Access software that subscribers use for participating in conference is integrated with speech recognition and speech generation software such that a subscriber may speak the message he or she would like to share with other participants and may hear the messages from the other participants in the conference. Speech-to-speech translation may be accomplished as a message spoken into a computer microphone in accordance with a first language may be recited by a remote computer in accordance with a second language.
Owner:META PLATFORMS INC

Mobile wireless display providing speech to speech translation and avatar simulating human attributes

An apparatus includes a monocular display with a wireless communications interface, user input device, transmitter, and controller, and may provide a video link to and control and management of a host device and other devices, such as a cell phone, computer, laptop, or media player. The apparatus may receive speech and digitize it. The apparatus may compare the digitized speech in a first language to a table of digitized speech in a second language to provide translation or, alternatively, may compare the digitized speech to a table of control commands. The control commands allow user interaction with the apparatus or other remote devices in a visual and audio manner. The control signals control a “recognized persona” or avatar stored in a memory to provide simulated human attributes to the apparatus, network or third party communication device. The avatar may be changed or upgraded according to user choice.
Owner:KOPIN CORPORATION

Speech translation apparatus and computer program product

A translation direction specifying unit specifies a first language and a second language. A speech recognizing unit recognizes a speech signal of the first language and outputs a first language character string. A first translating unit translates the first language character string into a second language character string that will be displayed on a display device. A keyword extracting unit extracts a keyword for a document retrieval from the first language character string or the second language character string, with which a document retrieving unit performs a document retrieval. A second translating unit translates a retrieved document into its opponent language, which will be displayed on the display device.
Owner:KK TOSHIBA

Speech-to-speech translation system with user-modifiable paraphrasing grammars

The present invention discloses a speech-to-speech translation device which allows one or more users to input a spoken utterance in one language, translates the utterance into one or more second languages, and outputs the translation in speech form. Additionally, the device allows for translation both directions, recognizing inputs in the one or more second languages and translating them back into the first language. The device recognizes and translates utterances in a limited domain as in a phrase book translation system, so the translation accuracy is essentially 100%. By limiting the domain the system increases the accuracy of the speech recognition component and thus the accuracy of the overall system. However unlike other phrase book systems, the device also allows wide variations and paraphrasing in the input, so that the user is much more likely to find the desired phrase from the stored list of phrases. The device paraphrases the input to a basic canonical form and performs the translation on that canonical form, ignoring the non-essential variations in the surface form of the input. The device can provide visual and / or auditory feedback to confirm the recognized input and makes the system usable for non-bilingual users with absolute confidence.
Owner:EHSANI FARZAD +2

Audio renderings for expressing non-audio nuances

Methods, systems, computer program products, and methods of doing business by adapting audio renderings of non-audio messages (for example, e-mail messages that are processed by a text-to-speech translator) to reflect various nuances of the non-audio information. Audio cues are provided for this purpose, which are sounds that are “mixed” in with the audio rendering as a separate (background) audio stream. Audio cues may reflect information such as the topical structure of a text file, or changes in paragraphs. Or, audio cues may be used to signal nuances such as changes in the color or font of the source text. Audio cues may also be advantageously used to reflect information about the translation process with which the audio rendering of a text file was created, such as using varying background tones to convey the degree of certainty in the accuracy of translating text to audio using a text-to-speech translation system, or of translating audio to text using a voice recognition system, or of translating between languages, and so forth. Stylesheets, such as those encoded in the Extensible Stylesheet Language (“XSL”), may optionally be used to customize the audio cues. For example, a user-specific stylesheet customization may be performed to override system-wide default audio cues for a particular user, enabling her to hear a different background sound for messages on a particular topic than other users will hear.
Owner:CERENCE OPERATING CO

Speech translation device and computer readable medium

A translation device which has both advantages of a table look-up translation device and advantages of a machine translation device by leading the user's utterance through a sentence template suitable for the user's intent of speech is realized. Since the translation device searches for sentence templates suitable for the user's intent of speech with an orally inputted keyword and displays retrieved sentences, the user's utterance can be lead. In addition, the user is free from a troublesome manipulation for replacing a word since an expression uttered by the user is inserted into a replaceable portion (slot) within the sentence template, and the translation device translates a resulting sentence with the replaced expression embedded in the slot.
Owner:RAKUTEN GRP INC

Apparatus, method and computer program product for translating speech, and terminal that outputs translated speech

In a speech translation apparatus, a correspondence storage unit stores therein identifiers of terminals and usage languages used in the terminals associated with each other. A receiving unit receives a source speech from one of the terminals. A translating unit acquires usage languages from the correspondence storage unit, and generates a translated speech by using each of the acquired usage languages as a target language. When the translated speech is generated in any one of the target languages, a determining unit determines whether it has been generated in all the target languages. If the translated speech has been generated in all the target languages, an output unit outputs the translated speech. A sending unit sends the translated speech to each of the terminals.
Owner:TOSHIBA DIGITAL SOLUTIONS CORP

Instant Translation System

A method of, and corresponding headset computer for, performing instant speech translation including, establishing a local network including a link between a first and a second headset computer in which preferred language settings of each headset computer are exchanged, transmitting captured speech in a first language from a first headset computer to a network-based speech recognition service to recognize and transcribe the captured speech as text, receiving the text at the first headset computer, broadcasting the text over the local network to at least the second headset computer, receiving the text at the second headset computer, transmitting the received text from the second headset computer to a network-based text translation service to translate the text to a text in a second language, receiving the text in the second language at the second headset computer from the network-based text translation service, and displaying the translated text at the second headset computer.
Owner:KOPIN CORPORATION

System and methods for maintaining speech-to-speech translation in the field

A method and apparatus are provided for updating the vocabulary of a speech translation system for translating a first language into a second language including written and spoken words. The method includes adding a new word in the first language to a first recognition lexicon of the first language and associating a description with the new word, wherein the description contains pronunciation and word class information. The new word and description are then updated in a first machine translation module associated with the first language. The first machine translation module contains a first tagging module, a first translation model and a first language module, and is configured to translate the new word to a corresponding translated word in the second language. Optionally, the invention may be used for bidirectional or multi-directional translation.
Owner:META PLATFORMS INC

Speech translation system between Mandarin and various dialects and method thereof

The invention relates to a speech translation system between the Mandarin and various dialects and a method thereof for solving the problems of poor correctness, tedium and stiffness of translated dialects. By selecting correct dialect speeches and carrying out tone regulation through a selection rule, the invention achieves the technical effect of improving the accuracy and the simulation degreeof the dialect speeches.
Owner:INVENTEC CORP

Speech translation device and computer readable medium

A translation device which has both advantages of a table look-up translation device and advantages of a machine translation device by leading the user's utterance through a sentence template suitable for the user's intent of speech is realized. Since the translation device searches for sentence templates suitable for the user's intent of speech with an orally inputted keyword and displays retrieved sentences, the user's utterance can be lead. In addition, the user is free from a troublesome manipulation for replacing a word since an expression uttered by the user is inserted into a replaceable portion (slot) within the sentence template, and the translation device translates a resulting sentence with the replaced expression embedded in the slot.
Owner:RAKUTEN INC

Multi-Lingual Conference Call

The present invention relates to apparatus arranged to facilitate translation of speech within a conference call. The apparatus receives speech data in a first language, converts the speech data in the first language into meta-data; and the meta-data is then translated into speech data in a second language. The apparatus may be distributed across a network or situated on a single node in the network.
Owner:RPX CLEARINGHOUSE

Mobile phone speech instant translation system and method

InactiveCN101867632ASupport communicationImplement translation conversionSpeech recognitionSpecial data processing applicationsSpeech translationSpeech identification
The invention relates to mobile phone speech instant translation technology, which belongs to a speech identification system and an instant translation method. The key points of the system and the method are that: software with a speech translation function is arranged on a preset mobile phone; a translation service unit translates the information acquired by the speech identification system and sends a translation result to an opposite terminal; and meanwhile, the speech information of the opposite terminal is processed in the same way. Thus, users who speak different languages can communicate with each other on the mobile phone by using the speech translation technology.
Owner:刘越 +1

System and methods for maintaining speech-to-speech translation in the field

A method and apparatus are provided for updating the vocabulary of a speech translation system for translating a first language into a second language including written and spoken words. The method includes adding a new word in the first language to a first recognition lexicon of the first language and associating a description with the new word, wherein the description contains pronunciation and word class information. The new word and description are then updated in a first machine translation module associated with the first language. The first machine translation module contains a first tagging module, a first translation model and a first language module, and is configured to translate the new word to a corresponding translated word in the second language. Optionally, the invention may be used for bidirectional or multi-directional translation.
Owner:META PLATFORMS INC

Multi-to-multi speech conversion method based on text encoder under non-parallel text conditions

The invention discloses a multi-to-multi speech conversion method based on a text encoder under non-parallel text conditions. The method includes a training part and a conversion part, a conditional variation self-encoder, the generative adversarial network (VAWGAN) and a text encoder (Text-Encoder) are used to implement a speech conversion system, sentence embedding representing semantics is added to the VAWGAN, the speech quality and personality similarity of the converted speech can be better improved, and high-quality speech conversion is realized. The method is advantaged in that dependence on parallel texts is relieved, speech conversion under the non-parallel text conditions is realized, not any alignment process is required in the training process, versatility and practicability ofthe speech conversion system are improved, the conversion system of multiple source / target speaker pairs can be integrated into one transformation model, that is, multi-talker-to-multi-talker conversion is realized, and the method has good application prospects in the fields of film dubbing, speech translation, speech synthesis and the like.
Owner:NANJING UNIV OF POSTS & TELECOMM

Network provided information using text-to-speech and speech recognition and text or speech activated network control sequences for complimentary feature access

A real-time networked telephony or computer system has a feature complex and / or applications that offer a class of features to a subscriber, including call information, and permits the subscriber to manage incoming and existing calls through available features accessed using spoken utterances. A speech processing unit coupled to the system interprets a subscriber's spoken utterances without requiring the subscriber to train the system to recognize his or her voice. The interpretation of spoken utterances is enabled by a system state database that is maintained at the speech processing unit and comprises a database of the possible system slates, including possible call flows for a call, and a database associated with the system state database comprising context-specific grammar that a subscriber may recite at respective points in the call flow. The speech processing unit may also convert message signals from the network to speech which is read to the subscriber using a text to speech translator. The network can identify the voice or subscriber voice, or language used and will thereafter recognize all further commands using specific grammar for that language as well as perform text-to-speech conversion using the identified language. Use of the features can be applied to update of grammars, profiles and templates, etc. by transmitting results of transactions.
Owner:SOUND VIEW INNOVATIONS

Device and method for prosody generation at visual synthesis

A device for prosody generation at visual synthesis. A number of half-syllables are stored together with registered movement patterns in a face. When synthesizing speech, a number of half-syllables are put together into words and sentences. The words and sentences are given a stress and pattern of intonation corresponding to the intended language. In the face, a number of points and their movement patterns are further registered. In connection with the generation of words and sentences, the movement patterns of the different points are amplified depending on a given stress and sentence intonation. The given movement patterns are after that applied to a model, which is applied to a real face at which a life-like animation is obtained, at for instance a translation of a person's speech in a first language to a second language.
Owner:INTELLECTUAL VENTURES I LLC +1

Apparatus for bi-directional sign language/speech translation in real time and method

Provided is an apparatus for bi-directional sign language / speech translation in real time and method that may automatically translate a sign into a speech or a speech into a sign in real time by separately performing an operation of recognizing a speech externally made through a microphone and outputting a sign corresponding to the speech, and an operation of recognizing a sign sensed through a camera and outputting a speech corresponding to the sign.
Owner:HYUNDAI MOTOR CO LTD +1

Voice prompts for use in speech-to-speech translation system

Techniques for employing improved prompts in a speech-to-speech translation system are disclosed. By way of example, a technique for use in indicating a dialogue turn in an automated speech-to-speech translation system comprises the following steps / operations. One or more text-based scripts are obtained. The one or more text-based scripts are synthesizable into one or more voice prompts. At least one of the one or more voice prompts is synthesized for playback from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to a speaker interacting with the speech-to-speech translation system, the audible message indicating a dialogue turn in the automated speech-to-speech translation system.
Owner:IBM CORP

Many-to-many speaker conversion method based on i vector in non-parallel text condition

The invention discloses a many-to-many speaker conversion method based on an i vector in a non-parallel text condition. The method comprises a training stage and a conversion stage. A speech conversion system is realized by combining VAWGAN with an i vector, so that the personality similarity and speech quality of the speech after conversion are improved well and a high-quality speech conversion method is realized. In addition, the dependence on the parallel text is eliminated and the speech conversion under the non-parallel text condition is realized. Moreover, no alignment process is neededby the training process. A plurality of source-target speaker conversion systems are integrated into one conversion model, so that multi-speaker-to-multi-speaker conversion is realized. The many-to-many speaker conversion method has the broad application prospects in fields of cross-linguage speech conversion, film dubbing, and speech translation and the like.
Owner:NANJING UNIV OF POSTS & TELECOMM

System, apparatus and method for providing global communications

A system, apparatus and method for transmitting and receiving messages over a wireless communications network, the communicating device including a computer platform having storage for one or more programs, a user interface that includes a visual display for displaying at least alphanumeric characters and a microphone for inputting speech of a user of the computerized communicating device a trackball module, the trackball module for inputting at least alphanumeric characters, a sensor for obtaining biodata from a user of the computerized paging device, and a speech translation program resident and selectively executable on the computer platform, whereupon initiating a message for transmission, the speech translation software interpreting the words of the user and translating them into a digital text format, the speech translation program may include an electronic dictionary, the electronic dictionary identifies a word by comparing an electronic signature of the word to a plurality of electronic signatures stored in the electronic dictionary.
Owner:TAORMINA ROBERT

Speech translation method and device, and device for speech translation

Embodiments of the invention provide a speech translation method and device, and a device for speech translation. The method specifically includes the steps of obtaining a text that corresponds to a speech recognition result that has undergone punctuation adding processing; obtaining a target clause from the text; translating the target clause and outputting the obtained first translation result;and when a current pause corresponding to the speech recognition result is detected, performing a second translation on the text corresponding to the speech recognition result that has undergone punctuation adding processing and between a previous pause and the current pause, and outputting the obtained second translation result so as to replace the first translation result with the second translation result. In the embodiments of the invention, the lag of the translation result with respect to a speech signal can be effectively reduced by the first translation result, and the quality of the translation result finally provided to a user can be enhanced by the second translation result.
Owner:BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

Voice translation method and device

The invention discloses a voice translation method. The method includes the steps of extracting vocal print characteristics of first voice data when receiving the first voice data, determining a language category corresponding to the extracted vocal print characteristics, obtaining a pre-stored second language when the language category corresponding to the extracted vocal print characteristics is a first language, and converting the first voice data of the first language into second voice data corresponding to the second language. The invention further discloses a voice translation device. By extracting vocal print characteristics, accurately distinguishing different languages and automatically converting voice of one language into voice of another language, the effectiveness of communication is improved.
Owner:ZTE CORP

Speech translation device and method

A speech translation device includes a speech input unit, a speech recognition unit, a machine translation unit, a parameter setting unit, a speech synthesis unit, and a speech output unit, and a speech volume value of speech data to be outputted is determined from plural likelihoods obtained by the speech recognition / machine translation. With respect to a word with a low likelihood, the speech volume value is made small and is made hard to transmit to the user, and on the other hand, with respect to a word with a high likelihood, the speech volume value is made large and is especially emphasized and is transmitted to the user.
Owner:KK TOSHIBA

Speech translation apparatus, method and program

Speech translation apparatus includes first generation unit generating first text representing speech recognition result, and first prosody information, second generation unit generating first para-language information, first association unit associating each first portion of first text with corresponding first portion of first para-language information, translation unit translating first text into second texts, second association unit associating each second portion of first para-language information with corresponding second portion of each second text, third generation unit generating second prosody-information items, fourth generation unit generating second para-language-information items, computation unit computing degree-of-similarity between each first para-language information and corresponding one of second para-language-information items to obtain degrees of similarity, selection unit selecting, from second prosody-information items, maximum-degree-of-similarity prosody information corresponding to maximum degree, fifth generation unit generating prosody pattern of one of second texts which corresponds to maximum-degree-of-similarity prosody information, and output unit outputting one of second texts which corresponds to maximum-degree-of-similarity prosody information.
Owner:KK TOSHIBA

Artificial intelligence-based speech translation method, device, terminal device and cloud end server

InactiveCN106935240AEnable real-time translationMeet translation needsNatural language translationSpeech recognitionSpeech translationTerminal equipment
The present invention provides an artificial intelligence-based speech translation method, an artificial intelligence-based speech translation device, a terminal device and a cloud end server. The artificial intelligence-based speech translation method includes the following steps that: speech of a source language inputted by a user is received through a terminal device; the speech of the source language is sent to the cloud end server; an audio file of a target language which is sent by the cloud end server is received; and the audio file of the target language is played. With the artificial intelligence-based speech translation method, device, terminal device and cloud end server of the invention adopted, the real-time translation of speech can be realized, the translation requirements of outbound travel scenes can be satisfied, and translation accuracy is higher.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Voice translation method and voice translation apparatus

The present invention provides a speech sound translating method and apparatus, according to one aspect of the invention, provided is a speech sound translating method, comprising: recognizing the speech sound as a text which has at least one long sentence comprising a plurality of single sentences; cutting at least one long sentence into a plurality of single sentences; translating each of a plurality of single sentences into a sentence of target language kind. The invention can insert a long sentence cutting module between a speech sound recognition module and a machine translation module, can recognize the long sentence in the text and can automatically cut the long sentences into a plurality of simple and complete single sentences, thereby reducing the difficulty of the machine translation and improving the quality of the machine translation. Moreover, the invention also provides an interface interactive with the user, enables the user to conveniently modify the automatic cutting result, meanwhile, the modifying operation of the user is recorded for updating the cutting model in real time, and the automatic cutting effect is improved continuously.
Owner:KK TOSHIBA

Speech translation method and apparatus

A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.
Owner:NUANCE COMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products