Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

1174 results about "Recognition speech" patented technology

Speech recognition. Speech recognition is the inter-disciplinary sub-field of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT).

Retraining and updating speech models for speech recognition

ActiveUS20030036903A1Improve speech recognitionEfficiently and inexpensively adaptingSpeech recognitionVoice dataSpeech identification
A technique is provided for updating speech models for speech recognition by identifying, from a class of users, speech data for a predetermined set of utterances that differ from a set of stored speech models by at least a predetermined amount. The identified speech data for similar utterances from the class of users is collected and used to correct the set of stored speech models. As a result, the corrected speech models are a closer match to the utterances than were the set of stored speech models. The set of speech models are subsequently updated with the corrected speech models to provide improved speech recognition of utterances from the class of users. For example, the corrected speech models may be processed and stored at a central database and returned, via a suitable communications channel (e.g. the Internet) to individual user sites to update the speech recognition apparatus at those sites.
Owner:SONY CORP +1

Speech recognition system to selectively utilize different speech recognition techniques over multiple speech recognition passes

Method and apparatus for multi-pass speech recognition. An input device receives spoken input. A processor performs a first pass speech recognition technique on the spoken input and forms first pass results. The first pass results include a number of alternative speech expressions, each having an assigned score related to the certainty that the corresponding expression correctly matches the spoken input. The processor selectively performs a second pass speech recognition technique on the spoken input according to the first pass results. Preferably, the second pass attempts to correctly match the spoken input to only those expressions which were identified during the first pass. Otherwise, if one of the expressions identified by the first pass is assigned a score higher than a predetermined threshold (e.g., 95%), the second pass is not performed. Because the second pass is performed only when necessary, the invention recognizes speech with a faster average speed for a given accuracy in comparison to prior systems. Alternately, the first pass results identify a characteristic of the spoken input. The characteristic can be the gender of the speaker or a type of telephone the speaker is calling from. In which case, the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the characteristic identified by the first pass. Because the selected second pass technique is specific to the characteristic of the spoken input, the second pass technique can perform speech recognition faster for a given accuracy than a technique which is not specific.
Owner:NUANCE COMM INC

Method of generating a sms or mms text message for receipt by a wireless information device

A spoken message that a user wishes to have converted to a SMS or MMS message is received at a voicemail server and converted to an audio file format; it is then sent or streamed over a wide area network to a voice to text transcription system comprising a network of computers. One of the networked computers plays back the voice message to an operator and the operator intelligently transcribes the actual message from the original voice message by entering the corresponding text message (actually a succinct version of the original voice message, not a verbose word-for-word conversion) into the computer to generate a transcribed text message. The transcribed text message is then sent to the wireless information device from the computer as a SMS or MMS text message. Because human operators are used instead of machine transcription, voicemails are converted accurately, intelligently, appropriately and succinctly into text messages (SMS / MMS).
Owner:SPINVOX LTD

Computerized device with voice command input capability

A computerized device with voice command capability processed remotely includes a low power processor, executing a loose algorithmic model to recognize a wake word prefix in a voice command, the loose model having a low false rejection rate but suffering a high false acceptance rate, and a second processor which can operate in at least a low power / low clock rate mode and a high power / high clock rate mode. When the first processor determines the presence of the wake word, it causes the second processor to switch to the high power / high clock rate mode and to execute a tight algorithmic model to verify the presence of the wake word. By using the two processors in this manner, the average overall power required by the computerized device is reduced, as is the amount of waste heat generated by the system.
Owner:GENERAC POWER SYSTEMS

Sending a communications header with voice recording to send metadata for use in speech recognition and formatting in mobile dictation application

In embodiments of the present invention improved capabilities are described for sending a communications header with voice recording to send metadata for use in speech recognition and formatting when converting voice to text on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting a communications header to a speech recognition facility from the mobile communication facility via a wireless communications facility, wherein the communications header includes at least one of device name, network provider, network type, audio source, a display parameter for the wireless communications facility, geographic location, and phone number information; transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility; generating speech-to-text results for the captured speech utilizing the speech recognition facility based at least in part on the communications header; transmitting the text results from the speech recognition facility to the mobile communications facility; and entering the text results into a text field on the mobile communication facility.
Owner:VLINGO CORP

Method and device for recognizing personalized speeches

The invention provides a method and a device for recognizing personalized speeches. The method includes: A, determining whether a speech to be recognized belongs to an authorized user or not; if yes, using a speech recognition module corresponding to the authorized user to recognize the speech to be recognized; and if not, executing a step B; B, determining what dialect type the speech to be recognized belongs to, and using a speech recognition module, corresponding to the dialect type which the speech to be recognized belongs to, to recognize the speech to be recognized. Precision in recognizing speeches of various users can be improved by the use of the method and device.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products