Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

141 results about "Voice Tag" patented technology

Voice tags are used in automated speech recognition in a voice command device, allowing the user to "speak" commands. For example, using voice commands with an automated device, such as an IVR telephone prompt or to dial a contact on a mobile phone.

Application of Voice Tags in a Social Media Context

According to a present invention embodiment, a system utilizes a voice tag to automatically tag one or more entities within a social media environment, and comprises a computer system including at least one processor. The system analyzes the voice tag to identify one or more entities, where the voice tag includes voice signals providing information pertaining to one or more entities. One or more characteristics of each identified entity are determined based on the information within the voice tag. One or more entities appropriate for tagging within the social media environment are determined based on the characteristics and user settings within the social media environment of the identified entities, and automatically tagged. Embodiments of the present invention further include a method and computer program product for utilizing a voice tag to automatically tag one or more entities within a social media environment in substantially the same manner described above.
Owner:IBM CORP

Search capabilities for voicemail messages

Methods, systems, and products for voicemail searching that include storing, in association with voicemail messages, voiceprints of callers who leave voicemail messages for voicemail users in a voicemail system; storing caller speech tags in association with the voiceprints; identifying, in dependence upon caller voiceprints, callers who leave new voicemail messages; receiving, from a particular voicemail user, search keywords entered as speech and converted to text through automated speech recognition; and selecting, in dependence upon the search keywords and the caller speech tags, one or more selected voicemail messages from a multiplicity of voicemail messages for the particular voicemail user.
Owner:IBM CORP

Smart voice interaction method and system

The invention discloses a smart voice interaction method and system. The method includes receiving voice data; performing voice recognition on the voice data and acquiring a voice recognition result;performing recognition rejection judgment on the voice recognition result according to a pre-constructed recognition rejection judgment model based on a semantic layer and acquiring a model output result; determining whether the voice data is man-machine interaction voice data or not according to the model output result; if the voice data is man-machine interaction voice data, performing semanticunderstanding on the voice recognition result and generating an interaction result according to a semantic understanding result, wherein the interaction result includes a response text. By utilizing the method and system provided by the invention, influence on man-machine interaction by noise-containing voice data can be reduced and error response of the man-machine interaction system is reduced.
Owner:IFLYTEK CO LTD

Generating and relating text to audio segments

A method, apparatus and system for generating speech minutes. The method comprises the steps of displaying status indicators of respective audio (speech) stream chunks received and text information thereof on a GUI display and establishing the tagging between each audio stream chunk and the corresponding text information by dragging and dropping the status signs of the respective speech stream chunks onto the corresponding text information on the GUI, such that the speech stream, the text information and the corresponding tagging relation form voice tagged meeting minutes.
Owner:NUANCE COMM INC

Audio caller ID for mobile telephone headsets

A caller ID feature on the mobile telephone identifies the telephone number of an incoming call and correlates the number with any corresponding voice tag stored in the personal directory. The voice tag associated with the incoming caller is delivered from the telephone to the headset where it is played to provide the user with an audio caller identification. In the absence of a voice tag, voice-synthesized numerals corresponding to the telephone number of the incoming call can be provided to the headset as an audio caller ID.
Owner:LOGITECH EURO SA

Apparatus and method for voice-tagging lexicon

A voice-tag editor develops voice-tag “sounds like” pairs for a voice-tagging lexicon. The voice-tag editor is receptive of alphanumeric characters input by a user. The alphanumeric characters are indicative of a voice tag and / or “sounds like” text. The voice-tag editor is configured to allow the user to view and edit the alphanumeric characters. A text parser connected to the voice-tag editor generates normalized text corresponding to the “sounds like” text. The normalized text serves as recognition text for the voice tag and is displayed by the voice-tag editor. A storage mechanism is connected to the editor. The storage mechanism updates the lexicon with the alphanumeric characters which represent voice-tag “sounds like” pairs.
Owner:PANASONIC CORP

Program for voice talking, voice talking method, and voice talking apparatus

A computer program, which is used to voice talking to cause an information terminal to execute voice talking, managing an ID of the information terminal and a first address over a first network, stores instructions for execution on a computer system enabling the computer system to perform determining that the information terminal moves over a second network which is different from the first network, acquiring a second address over the second network, and transmitting a request for re-registering a combination of the second address and the ID instead of the first address into another device having a combination of the first address and the ID registered therein.
Owner:KK TOSHIBA

Facilitation of speech recognition in user interface

Items are represented to a user through a user interface with each item having a respective perceivable range value and associated label by which the item can be addressed. To address a particular item, the user speaks its label at a loudness indicative of its perceived range. A loudness-to-range function of the interface determines on the basis of the loudness of the user input, a range gate expected to encompass the range value of the addressed item. A speech recogniser is used to recognise the spoken label and thus the addressed item, the label search space of the recogniser being restricted to exclude the labels of items having a range value outside of the determined range gate. In one embodiment, the user interface is an audio interface in which the items are represented in an audio field through corresponding synthesized sound sources, the depth at which each sound source is rendered in the audio field being the range value associated with the corresponding item.
Owner:SAMSUNG ELECTRONICS CO LTD

System and method for a remotely accessible web-based personal address book

A computer implemented method for providing a remotely accessible web-based address book includes the following steps. First, a user registers with a web-server and sets up an account. The web-server is configured to generate, store and provide access services to web-based address books. Next, the user uploads personal address book information and contacts in the account. Next, the web-server generates a personal web-based address book for the user based on the address book information and contacts and then adds voice tags and text tags to each entry in the user's personal web-based address book. Next, the web-server cross-correlates and matches the uploaded names and contact information of the user's personal contacts with information in other users' profiles stored in a central directory database. If a match exists between one of the uploaded user's personal contacts and a pre-existing user's profile in the central directory database, the web-server updates the pre-existing user's profile in the central directory database. If a match does not exist, the web-server generates a new user's profile in the central directory database. Next, the user accesses the personal web-based address book by placing a phone-call via a voice transmitting connection. Next, the web-server verifies the user's identity. Next, the user selects a personal contact in the user's personal web-based address book and the web-server places a phone-call to the selected personal contact.
Owner:HUMANBOOK

User interface identification and service tags for a document processing system

A tag-based user interface scheme for digitizing and processing hardcopy documents utilizes a sticker that includes a printed data code representative of a user identity code and a service code. When the sticker is applied to a hardcopy document and scanned, the sticker is located, the data code is parsed, and a desired service is performed based upon the information stored in the data code.
Owner:XEROX CORP

Voice recognition apparatus and voice recognition method

InactiveUS20180308483A1Voice recognition is able to be performed efficientlyEfficient executionSound input/outputSpeech recognitionSpeech soundVoice data
Disclosed is a voice recognition apparatus including: an audio input unit configured to receiving a voice; a communication module configured to transmit voice data received from the audio input unit to a server system which performs voice recognition processing, and receive recognition result data on the voice data from the server system; and a controller configured to control the audio input unit and the communication module, wherein, when first voice data is received, the controller perform control to transmit the first voice data to the server system, and wherein, when the first voice data is a conversation command, the controller performs control to receive a first audible answer message including a first question from the server system and output the first audible answer message. In this manner, voice recognition may be performed efficiently.
Owner:LG ELECTRONICS INC

Method for Parsing Natural Language Text

A parser for natural language text is provided. The parser is trained by accessing a corpus of labeled utterances. The parser extracts details of the syntactic tree structures and part of speech tags from the labeled utterances. The details extracted from the tree structures include Simple Links which are the key to the improved efficiency of this new approach. The parser creates a language model using the details that were extracted from the corpus. The parser then uses the language model to parse utterances.
Owner:NEW ROBERT D

System and method of using POS tagging for symbol assignment

Systems and methods for automatically discovering and assigning symbols for identified text in a software application include identifying text for which symbol assignment is desired. The words within the identified text and selected surrounding words defining an observation sequence are subjected to a part of speech tagging algorithm to electronically determine one or more most likely part of speech tags for the identified text. Context relations between the identified text and selected surrounding keywords may also be identified. The identified text, part of speech tag(s) and / or determined relations are then analyzed to map the identified text to one or more identified word senses. Related word senses may also be analyzed to determine if any related word senses have symbols. One of the determined symbols may then be associated with the identified text such that the symbol is thereafter displayed in conjunction with or instead of the text in the application.
Owner:DYNAVOX SYST

Automated voice and speech labeling

A system and method for voice and speech analysis which correlates a speaker signal source and a normalized signal comprising measurements of input acoustic data to a database of language, dialect, accent, and / or speaker attributes in order to create a transcription of the input acoustic data.
Owner:SRC INC

Locating digital images in a portable electronic device

The present invention provides systems and methods for the creation and use of voice tags in an electronic device. When tags are created an image handling unit receives a user selection of a voice tag that may be provided for locating at least one digital image, a sound recording unit records sound emitted by said user, which sound is stored as a sound file to be used as a tag for locating images. When image files are to be located the image handling unit receives the selection of searching for digital images using name tags from the user, the sound recording unit records sound emitted by said user, a voice recognition unit compares the sound with stored sound files and indicates a sound file corresponding to the received sound. The image file associated with the indicated sound file is then located.
Owner:SONY ERICSSON MOBILE COMM AB

Voice recognition method and system based on deep learning

InactiveCN109147768AReduce waiting time for repliesReduce workloadSpeech recognitionData setAcoustic model
The application discloses a voice recognition method and system based on deep learning. The method includes the following steps: acquiring a training data set, wherein the training data set includes atraining voice data set, a voice label and dialogue text information; training the training data set through a training process, and establishing an acoustic model and a language model; acquiring voice query request data; carrying out voice recognition on the voice query request data according to the acoustic model, the language model and a preset dictionary; and finally, outputting a voice recognition text result of the voice query request data. Through the voice recognition method based on deep learning provided by the application, voice consulting content input by customers can be accurately identified, the workload of the manual customer service staff needing to listen to all the consulting requests is reduced, and the time for customers to wait for response is reduced.
Owner:YUNNAN POWER GRID +1

Voice state data generating device, voice state visualizing device, voice state data editing device, voice data reproducing device, and voice communication system

A speech situation data creating device for providing the user with data with a good convenience for the user when the user uses speech data collected from sound sources and recorded with time.A direction / speaker identifying section (3) of a control unit (1) observes a variation of direction data acquired from speech communication data and sets single-direction data and combination direction data on a combination of directions in speaker identification data if no variation of the direction data indicating a single direction or direction data indicating directions over a predetermined time occurs. If any variation of the direction data occurs within a predetermined time, the direction / speaker identifying section (3) reads speech feature value data Sc from a speaker speech DB (53), identifies the speaker by comparing the speech feature value data Sc with the speech feature value analyzed by a speech data analyzing section (2), sets speaker name data in the speaker identification data if the speaker is identified, and sets direction undetection data in the speaker identification data if the speaker is not identified. A speech situation data creating section (4) creates speech situation data according to the variation with time of the speaker identification data.
Owner:YAMAHA CORP

Chinese speech recognition system and method

A Chinese speech recognition system and method is disclosed. Firstly, a speech signal is received and recognized to output a word lattice. Next, the word lattice is received, and word arcs of the word lattice are rescored and reranked with a prosodic break model, a prosodic state model, a syllable prosodic-acoustic model, a syllable-juncture prosodic-acoustic model and a factored language model, so as to output a language tag, a prosodic tag and a phonetic segmentation tag, which correspond to the speech signal. The present invention performs rescoring in a two-stage way to promote the recognition rate of basic speech information and labels the language tag, prosodic tag and phonetic segmentation tag to provide the prosodic structure and language information for the rear-stage voice conversion and voice synthesis.
Owner:NAT CHIAO TUNG UNIV

Voice taxi calling method, voice taxi calling device and voice taxi calling system

The invention belongs to the technical field of mobile terminals, and discloses a voice taxi calling method comprising the steps that voice information of a user is detected in real time; when the mobile terminal responds to preset awakening information included in the voice information of the user under the standby state, a taxi calling software client side is awakened; and when the taxi calling software client side responds to destination information included in the voice information of the user, current position information of the mobile terminal is acquired, and the current position information and the destination information are transmitted to a taxi calling software server so that the taxi calling software server is enabled to start the taxi calling flow. The voice information of the user is identified, the awakening information and the destination information are acquired from the voice information of the user, the taxi calling software client side is awakened according to the awakening information and the current position information of the mobile terminal is acquired, and the destination information and the current position information are transmitted to the taxi calling software server so as to start the taxi calling flow. The taxi calling service can be realized by inputting the destination information for one time through the voice information.
Owner:LETV HLDG BEIJING CO LTD +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products