Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

1179 results about "Recognition speech" patented technology

Speech recognition. Speech recognition is the inter-disciplinary sub-field of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT).

Systems and methods for non-traditional payment using biometric data

Facilitating transactions using non-traditional devices and biometric data to activate a transaction device is disclosed. A transaction request is formed at a non-traditional device, and communicated to a reader, wherein the non-traditional device may be configured with an RFID device. The RFID device is not operable until a biometric voice analysis has been executed to verify that the carrier of the RFID equipped non-traditional device is the true owner of account information stored thereon. The non-traditional device provides a conduit between a user and a verification system to perform biometric voice analysis of the user. When the verification system has determined that the user is the true owner of one or more accounts stored at the verification system, a purchase transaction is facilitated between the verification system. Transactions may further be carried out through a non-RF device such as a cellular telephone in direct communication with an acquirer / issuer or payment processor
Owner:LIBERTY PEAK VENTURES LLC

Retraining and updating speech models for speech recognition

ActiveUS20030036903A1Improve speech recognitionEfficiently and inexpensively adaptingSpeech recognitionVoice dataSpeech identification
A technique is provided for updating speech models for speech recognition by identifying, from a class of users, speech data for a predetermined set of utterances that differ from a set of stored speech models by at least a predetermined amount. The identified speech data for similar utterances from the class of users is collected and used to correct the set of stored speech models. As a result, the corrected speech models are a closer match to the utterances than were the set of stored speech models. The set of speech models are subsequently updated with the corrected speech models to provide improved speech recognition of utterances from the class of users. For example, the corrected speech models may be processed and stored at a central database and returned, via a suitable communications channel (e.g. the Internet) to individual user sites to update the speech recognition apparatus at those sites.
Owner:SONY CORP +1

Speech recognition apparatus, navigation apparatus including a speech recognition apparatus, and speech recognition method

A speech recognition apparatus includes a speech recognition dictionary and a speech recognition unit. The speech recognition dictionary includes comparison data used to recognize a voice input. The speech recognition unit is adapted to calculate the score for each comparison data by comparing voice input data generated based on the voice input with each comparison data, recognize the voice input based on the score, and produce the recognition result of the voice input. The speech recognition apparatus further includes data indicating score weights associated with particular comparison data, used to weight the scores calculated for the particular comparison data. After the score is calculated for each comparison data, the score weights are added to the scores of the particular comparison data, and the voice input is recognized based on total scores including the added score weights.
Owner:ALPINE ELECTRONICS INC

System and method for accessing data via Internet, wireless PDA, smartphone, text to voice and voice to text

An intermediary system and method are disclosed for providing users access to enterprise data via the Internet, Wireless PDA, VoIP Phone, Wireless Phone, and GSM / EDGE SmartPhone, and other communication devices. The intermediary system allows users to access enterprise data based on the user's role in the enterprise, the user's assigned privileges, or the user's object permissions. The intermediary system tailors the enterprise data for the user based on the type of communication device of the user, the point in time the user communicates with the system, or the location in a network where the user is communicating with the system. Depending on the above criteria, the user is given a “view” of the enterprise data that relates more directly to the user's current needs, duties, and tasks.
Owner:IKNOWWRE

Method of providing voicemails to a wireless information device

Voicemail is received at a voicemail server and converted to an audio file format; it is then sent or streamed over a wide area network to a voice to text transcription system comprising a network of computers. One of the networked computers plays back the voice message to an operator and the operator intelligently transcribes the actual message from the original voice message by entering the corresponding text message (actually a succinct version of the original voice message, not a verbose word-for-word conversion) into the computer to generate a transcribed text message. The transcribed text message is then sent to the wireless information device from the computer. Because human operators are used instead of machine transcription, voicemails are converted accurately, intelligently, appropriately and succinctly into text messages (SMS / MMS).
Owner:SPINVOX LTD

Techniques for combining voice with wireless text short message services

Combining speech with existing text wireless Short Message Service (SMS), to create a multimodal SMS service. A user of standard SMS text service is provided with a link within each message that allows adding or retrieval of a voice message associated with an SMS text message. A speech server in the wireless network may be used for recognition of voice commands, rendering of text into speech, and recording or playing back voice recordings. The messages may be retrieved or sent using any wireless handset. Additional services that may be used include message lists, prerecorded messages, integration with voicemail systems, image and video messages, cross-functional applications with Multimedia Messaging Services (MMS) and Enhanced Message Services (EMS), and extensions of the combination of voice with text SMS to other computing devices such as PDAs and PCs.
Owner:AAWAAZ INC

Speech recognition system to selectively utilize different speech recognition techniques over multiple speech recognition passes

Method and apparatus for multi-pass speech recognition. An input device receives spoken input. A processor performs a first pass speech recognition technique on the spoken input and forms first pass results. The first pass results include a number of alternative speech expressions, each having an assigned score related to the certainty that the corresponding expression correctly matches the spoken input. The processor selectively performs a second pass speech recognition technique on the spoken input according to the first pass results. Preferably, the second pass attempts to correctly match the spoken input to only those expressions which were identified during the first pass. Otherwise, if one of the expressions identified by the first pass is assigned a score higher than a predetermined threshold (e.g., 95%), the second pass is not performed. Because the second pass is performed only when necessary, the invention recognizes speech with a faster average speed for a given accuracy in comparison to prior systems. Alternately, the first pass results identify a characteristic of the spoken input. The characteristic can be the gender of the speaker or a type of telephone the speaker is calling from. In which case, the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the characteristic identified by the first pass. Because the selected second pass technique is specific to the characteristic of the spoken input, the second pass technique can perform speech recognition faster for a given accuracy than a technique which is not specific.
Owner:NUANCE COMM INC

Method of generating a sms or mms text message for receipt by a wireless information device

A spoken message that a user wishes to have converted to a SMS or MMS message is received at a voicemail server and converted to an audio file format; it is then sent or streamed over a wide area network to a voice to text transcription system comprising a network of computers. One of the networked computers plays back the voice message to an operator and the operator intelligently transcribes the actual message from the original voice message by entering the corresponding text message (actually a succinct version of the original voice message, not a verbose word-for-word conversion) into the computer to generate a transcribed text message. The transcribed text message is then sent to the wireless information device from the computer as a SMS or MMS text message. Because human operators are used instead of machine transcription, voicemails are converted accurately, intelligently, appropriately and succinctly into text messages (SMS / MMS).
Owner:SPINVOX LTD

Onboard information device

Onboard information device mounted on vehicle to provide information for supporting passenger includes voice acquisition unit that continually detects and acquires voice the passenger utters while the onboard information device is operating; voice recognition unit that recognizes speech contents of the voice the voice acquisition unit acquires; vehicle state detector that detects vehicle state including environmental state in the vehicle, surrounding state of the vehicle or operating state of the vehicle; output controller that creates display data or voice data from the speech contents recognized by the voice recognition unit according to the vehicle state the vehicle state detector detects, and that controls output of the display data or voice data; and output unit that outputs the display data or voice data the output controller creates. It can perform effective conversation support in real time according to recognition result obtained by continually recognizing speech contents of the passenger.
Owner:HYUNDAI MOTOR CO LTD +1

Methods for Identifying Messages and Communicating with Users of a Multimodal Message Service

Methods for combining speech with existing text wireless Short Message Service (SMS) to create a multimodal SMS service are provided. A user of standard SMS text service is provided with a link within each message that allows adding or retrieval of a voice message associated with an SMS text message. A speech server in the wireless network may be used for recognition of voice commands, rendering of test into speech, and recording or playing back voice recordings. An automatic and unique identification of each message is created to allow retrieval of the message without manually selecting the message. Additional services include message lists, integration with voicemail systems, image and video messages, cross-functional applications with Multimedia Messaging Services (MMS) and Enhanced Message Services (EMS), and extensions of the combination of voice with text SMS to other computing devices such as PDAs and PCs.
Owner:MULTIMODAL MEDIA LLC

Voice-to-text chat conversion for remote video game play

A multi-player networked video game playing system including for example video game consoles analyzes speech to vary the font size and / or color of associated text displayed to other users. If the amplitude of the voice is high, the text displayed to other users is displayed in a larger than normal font. If the voice sounds stressed or is aggressive words are used, the text displayed to other users is displayed using a special font such as red color. Other analysis may be performed on the speech in context to vary the font size, color, font type and / or other display attributes.
Owner:NINTENDO CO LTD

Method and apparatus for transmitting data packets over voice channel

A communication system includes a first station in voice communication with a base station and a second station in data communication with the base station. The second station monitors the voice communication and identifies predetermined periods (e.g., preferably silent periods) of the voice communication. The second station transmits data packets during the predetermined periods.
Owner:NEC CORP

Configuring A Speech Engine For A Multimodal Application Based On Location

Methods, apparatus, and products are disclosed for configuring a speech engine for a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application. The multimodal application is operatively coupled to a speech engine. Configuring a speech engine for a multimodal application based on location includes: receiving a location change notification in a location change monitor from a device location manager, the location change notification specifying a current location of the multimodal device; identifying, by the location change monitor, location-based configuration parameters for the speech engine in dependence upon the current location of the multimodal device, the location-based configuration parameters specifying a configuration for the speech engine at the current location; and updating, by the location change monitor, a current configuration for the speech engine according to the identified location-based configuration parameters.
Owner:NUANCE COMM INC

Automatic transmission of voice-to-text converted voice message

A voice messaging system includes an input device to accept a destination electronic messaging address, a voice-to-text converter to convert or transcribe a received voice message into a converted text message, and a processor to operate an electronic messaging program and to prepare the text message for automatic electronic transmission to the destination electronic messaging address. A method is also provided to convert a voice message into a text message and to transmit the same to a destination electronic messaging address. The method includes inputting a destination electronic messaging address. A received voice message is converted or transcribed into a text message and prepared as a text file. The prepared text file is automatically transmitted to the destination electronic messaging address. A log file including the text of the voice messages may be maintained with updates at a predetermined time interval. A database of the converted text messages may be generated and maintained to provide a searchable mechanism of archived messages.
Owner:GOOGLE LLC

Systems And Methods For Haptic Augmentation Of Voice-To-Text Conversion

Systems and methods for providing voice-to-text haptic augmentation in a user interface are disclosed. For example, one disclosed system for converting audible speech information to a haptic effect includes a microphone and a processor in communication with the microphone, the processor configured to receive an audio signal associated with a voice from the microphone, determine a characteristic of the audio signal, and generate an actuator signal based at least in part on the characteristic, the actuator signal configured to cause an actuator to output a haptic effect.
Owner:IMMERSION CORPORATION

Semantic language modeling and confidence measurement

A system and method for speech recognition includes generating a set of likely hypotheses in recognizing speech, rescoring the likely hypotheses by using semantic content by employing semantic structured language models, and scoring parse trees to identify a best sentence according to the sentence's parse tree by employing the semantic structured language models to clarify the recognized speech.
Owner:NUANCE COMM INC

Semiautomated Relay Method and Apparatus

A system for providing voice-to-text captioning service comprising a relay processor that receives voice messages generated by a hearing user during a call, the processor programmed to present the voice messages to a first call assistant, a call assistant device used by the first call assistant to generate call assistant generated text corresponding to the hearing user's voice messages, the processor further programmed to run automated voice-to-text transcription software to generate automated text corresponding to the hearing user's voice messages, use the call assistant generated text, the automated text and the hearing user's voice messages to train the voice-to-text transcription software to more accurately transcribe the hearing user's voice messages to text, determine when the accuracy exceeds an accuracy requirement threshold, during the call and prior to the automated text exceeding the accuracy requirement threshold, transmitting the call assistant generated text to the assisted user's device for display to the assisted user and subsequent to the automated text exceeding the accuracy requirement threshold, transmitting the automated text to the assisted user's device for display to the assisted user.
Owner:ULTRATEC INC

Recognizing speech, and processing data

An example embodiment of the invention includes a speech recognition processing unit for specifying speech segments for speech data, recognizing a speech in each of the speech segments, and associating a character string of obtained recognition data with the speech data for each speech segment, based on information on a time of the speech, and an output control unit for displaying / outputting the text prepared by sorting the recognition data in each speech segment. Sometimes, the system further includes a text editing unit for editing the prepared text, and a speech correspondence estimation unit for associating a character string in the edited text with the speech data by using a technique of dynamic programming.
Owner:NUANCE COMM INC

Systems and methods for dynamic re-configurable speech recognition

Speech recognition models are dynamically re-configurable based on user information, application information, background information such as background noise and transducer information such as transducer response characteristics to provide users with alternate input modes to keyboard text entry. Word recognition lattices are generated for each data field of an application and dynamically concatenated into a single word recognition lattice. A language model is applied to the concatenated word recognition lattice to determine the relationships between the word recognition lattices and repeated until the generated word recognition lattices are acceptable or differ from a predetermined value only by a threshold amount. These techniques of dynamic re-configurable speech recognition provide for deployment of speech recognition on small devices such as mobile phones and personal digital assistants as well environments such as office, home or vehicle while maintaining the accuracy of the speech recognition.
Owner:NUANCE COMM INC

Utilizing user transmitted text to improve language model in mobile dictation application

In embodiments of the present invention improved capabilities are described for utilizing user transmitted text to improve language modeling in converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility; generating speech-to-text results for the captured speech utilizing the speech recognition facility; transmitting the text results from the speech recognition facility to the mobile communications facility; entering the text results into a text field on the mobile communications facility; monitoring for a user selected transmission of the entered text results through a communications application on the mobile communications facility; and receiving the user selected transmitted text at the speech recognition facility and using it to improve the performance of the speech recognition facility.
Owner:VLINGO CORP

Computerized device with voice command input capability

A computerized device with voice command capability processed remotely includes a low power processor, executing a loose algorithmic model to recognize a wake word prefix in a voice command, the loose model having a low false rejection rate but suffering a high false acceptance rate, and a second processor which can operate in at least a low power / low clock rate mode and a high power / high clock rate mode. When the first processor determines the presence of the wake word, it causes the second processor to switch to the high power / high clock rate mode and to execute a tight algorithmic model to verify the presence of the wake word. By using the two processors in this manner, the average overall power required by the computerized device is reduced, as is the amount of waste heat generated by the system.
Owner:GENERAC POWER SYSTEMS

Sending a communications header with voice recording to send metadata for use in speech recognition and formatting in mobile dictation application

In embodiments of the present invention improved capabilities are described for sending a communications header with voice recording to send metadata for use in speech recognition and formatting when converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting a communications header to a speech recognition facility from the mobile communication facility via a wireless communications facility, wherein the communications header includes at least one of device name, network provider, network type, audio source, a display parameter for the wireless communications facility, geographic location, and phone number information; transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility; generating speech-to-text results for the captured speech utilizing the speech recognition facility based at least in part on the communications header; transmitting the text results from the speech recognition facility to the mobile communications facility; and entering the text results into a text field on the mobile communication facility.
Owner:VLINGO CORP

Transaction support apparatus

PCT No. PCT / GB96 / 02176 Sec. 371 Date Apr. 3, 1998 Sec. 102(e) Date Apr. 3, 1998 PCT Filed Sep. 4, 1996 PCT Pub. No. WO97 / 11435 PCT Pub. Date Mar. 27, 1997A transaction support apparatus is used by one or more human transaction operators, each provided with a telephone. It includes a speech recognition device having: a speech input coupled to receive a speech signal input to a telephone by a transactiob operator, a speech recognition processor arranged to recognize predetermined transaction parameters within the speech signal; and a parameter output at which the speech recognition device is arranged to make values of the parameters thus recognized available.
Owner:BRITISH TELECOMM PLC

Method and device for recognizing personalized speeches

The invention provides a method and a device for recognizing personalized speeches. The method includes: A, determining whether a speech to be recognized belongs to an authorized user or not; if yes, using a speech recognition module corresponding to the authorized user to recognize the speech to be recognized; and if not, executing a step B; B, determining what dialect type the speech to be recognized belongs to, and using a speech recognition module, corresponding to the dialect type which the speech to be recognized belongs to, to recognize the speech to be recognized. Precision in recognizing speeches of various users can be improved by the use of the method and device.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Vowel recognition system and method in speech to text applictions

InactiveUS20100217591A1Accurate speech-to-text conversionAccurate conversionSpeech recognitionVowel recognitionRecognition algorithm
The present invention provides systems, software and methods method for accurate vowel detection in speech to text conversion, the method including the steps of applying a voice recognition algorithm to a first user speech input so as to detect known words and residual undetected words; and detecting at least one undetected vowel from the residual undetected words by applying a user-fitted vowel recognition algorithm to vowels from the known words so as to accurately detect the vowels in the undetected words in the speech input, to enhance conversion of voice to text.
Owner:SHPIGEL AVRAHAM

Speech to text system using controlled vocabulary indices

A synthesis of automated speech recognition (voice to text) technology and a knowledge-based analysis of the concepts and contexts of the free text therefrom enable a directed-vocabulary look up index to be used in conjunction with the speech recognition technology thus enabling medical dictation to be transcribed in real time without elaborate training of the dictator or the speech recognition technology. Thus, caregivers can create and review Computer-Based Patient Records in the necessary timeframe consistent with good patient care. The Computer-Based Patient Records can be linked to other applications such as prescription cross checking, lab test results, payer regulations, etc.
Owner:MEDCOM INFORMATION SYST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products