Patents

Literature

Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.

195 results about "Word model" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

To be a model is to be so gorgeous that you’re photographed for a living. The word model, which can be a noun, verb, or adjective, comes from the Latin word modulus, meaning “measure,” or “standard.” If you are a model student, you do everything as the school and teachers wish: you are the standard.

Training and using pronunciation guessers in speech recognition

InactiveUS7467087B1Reduce frequencyReliable phonetic spellingSpeech recognitionSpeech synthesisWord modelSpeech identification

The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur. Preferably the ratio of the weight to word frequency increases as word frequencies decreases. Acoustic phoneme models for use in speech recognition with phonetic spellings generated by a pronunciation guesser that makes errors are trained against word models whose phonetic spellings have been generated by a pronunciation guesser that makes similar errors. As a result, the acoustic models represent blends of phoneme sounds that reflect the spelling errors made by the pronunciation guessers. Speech recognition enabled systems are made by storing in them both a pronunciation guesser and a corresponding set of such blended acoustic models.

Training and using pronunciation guessers in speech recognition

Training and using pronunciation guessers in speech recognition

Training and using pronunciation guessers in speech recognition

Owner:CERENCE OPERATING CO

Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building

InactiveUS20080228463A1Accuracy of recognitionImprove abilitiesNatural language translationSpeech recognitionCorpus restiformeWord model

Calculates a word n-gram probability with high accuracy in a situation where a first corpus), which is a relatively small corpus containing manually segmented word information, and a second corpus, which is a relatively large corpus, are given as a training corpus that is storage containing vast quantities of sample sentences. Vocabulary including contextual information is expanded from words occurring in first corpus of relatively small size to words occurring in second corpus of relatively large size by using a word n-gram probability estimated from an unknown word model and the raw corpus. The first corpus (word-segmented) is used for calculating n-grams and the probability that the word boundary between two adjacent characters will be the boundary of two words (segmentation probability). The second corpus (word-unsegmented), in which probabilistic word boundaries are assigned based on information in the first corpus (word-segmented), is used for calculating a word n-grams.

Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building

Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building

Word boundary probability estimating, probabilistic language model building, kana-kanji converting, and unknown word model building

Owner:INT BUSINESS MASCH CORP

System and method enabling acoustic barge-in

ActiveUS7392188B2Suppressing speech recognition errorReduce the possibilitySpeech recognitionCommunications systemWord model

A system and method enabling acoustic barge-in during a voice prompt in a communication system. An acoustic prompt model is trained to represent the system prompt using the specific speech signal of the prompt. The acoustic prompt model is utilized in a speech recognizer in parallel with the recognizer's active vocabulary words to suppress the echo of the prompt within the recognizer. The speech recognizer may also use a silence model and traditional garbage models such as noise models and out-of-vocabulary word models to reduce the likelihood that noises and out-of-vocabulary words in the user utterance will be mapped erroneously onto active vocabulary words.

System and method enabling acoustic barge-in

System and method enabling acoustic barge-in

System and method enabling acoustic barge-in

Owner:TELEFON AB LM ERICSSON (PUBL)

System and method for generating a phonetic baseform for a word and using the generated baseform for speech recognition

InactiveUS6389395B1Machines/enginesSpeech recognitionSample sequenceRecognition algorithm

Out-of-vocabulary word models for a speech recognizer vocabulary are generated by forming phonemic transcriptions (phonetic baseforms) of user's utterances in terms of existing reference phonemes by using a speech recognition algorithm to match input sub-word feature sample sequences to suitably-constrained allowable sequences of existing reference phoneme features. The resultant new-vocabulary-word phonetic baseform models are stored for subsequent speech recognition using the same recognition algorithm.

System and method for generating a phonetic baseform for a word and using the generated baseform for speech recognition

System and method for generating a phonetic baseform for a word and using the generated baseform for speech recognition

System and method for generating a phonetic baseform for a word and using the generated baseform for speech recognition

Owner:CISCO TECH INC

System and method enabling acoustic barge-in

ActiveUS20050027527A1Suppressing speech recognition errorReduce the possibilitySpeech recognitionCommunications systemWord model

A system and method enabling acoustic barge-in during a voice prompt in a communication system. An acoustic prompt model is trained to represent the system prompt using the specific speech signal of the prompt. The acoustic prompt model is utilized in a speech recognizer in parallel with the recognizer's active vocabulary words to suppress the echo of the prompt within the recognizer. The speech recognizer may also use a silence model and traditional garbage models such as noise models and out-of-vocabulary word models to reduce the likelihood that noises and out-of-vocabulary words in the user utterance will be mapped erroneously onto active vocabulary words.

System and method enabling acoustic barge-in

System and method enabling acoustic barge-in

System and method enabling acoustic barge-in

Owner:TELEFON AB LM ERICSSON (PUBL)

Isolated word training and detection

ActiveUS20160189706A1Volume/mass flow measurementPower supply for data processingWord modelData library

Methods, systems, and apparatuses are described for isolated word training and detection. Isolated word training devices and systems are provided in which a user may provide a wake-up phrase from 1 to 3 times to train the device or system. A concatenated phoneme model of the user-provided wake-up phrase may be generated based on the provided wake-up phrase and a pre-trained phoneme model database. A word model of the wake-up phrase may be subsequently generated from the concatenated phoneme model and the provided wake-up phrase. Once trained, the user-provided wake-up phrase may be used to unlock the device or system and / or to wake up the device or system from a standby mode of operation. The word model of the user-provided wake-up phrase may be further adapted based on additional provisioning of the wake-up phrase.

Isolated word training and detection

Isolated word training and detection

Isolated word training and detection

Owner:AVAGO TECH INT SALES PTE LTD

Intelligent device awakening method and device, server and intelligent equipment

ActiveCN107704275AReduce consumptionMeet individual needsInput/output for user-computer interactionBootstrappingPersonalizationWord model

The invention discloses an intelligent equipment awakening method and device, a server and intelligent equipment. The method comprises the following steps of: obtaining a voice signal sent by the intelligent equipment; recognizing the voice signal so as to determine a recognition result corresponding to the voice signal; and distributing an awakening instruction to the intelligent equipment when determining that a matching degree between the recognition result and any awakening word model corresponding to the intelligent equipment is greater than a threshold value. According to the method, voice signals are recognized by utilizing the server and awakening control is carried out on the intelligent equipment, so that the consumption of hardware performance in the awakening process is reduced, the users can awaken the intelligent equipment by utilizing self-defined awakening words, the awakening words are enriched, the awakening flexibility is improved, individualized demands of the userscan be better satisfied and the user experience is improved.

Intelligent device awakening method and device, server and intelligent equipment

Intelligent device awakening method and device, server and intelligent equipment

Intelligent device awakening method and device, server and intelligent equipment

Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Automatic generation of statistical language models for interactive voice response applications

InactiveUS20080071533A1Speech recognitionApplication softwareContent word

A Statistical Language Model (SLM) that can be used in an ASR for Interactive Voice Response (IVR) systems in general and Natural Language Speech Applications (NLSAs) in particular can be created by first manually producing a brief description in text for each task that can be performed in an NLSA. These brief descriptions are then analyzed, in one embodiment, to generate spontaneous speech utterances based pre-filler patterns and a skeletal set of content words. The pre-filler patterns are in turn used with Part-of-Speech (POS) tagged conversations from a spontaneous speech corpus to generate a set of pre-filler phrases. The skeletal set of content words is used with an electronic lexico-semantic database and with a thesaurus-based content word extraction process to generate a more extensive list of content words. The pre-filler phrases and content words set, thus generated, are combined into utterances using a lexico-semantic resource based process. In one embodiment, a lexico-semantic statistical validation process is used to correct and / or add the automatically generated utterances to the database of expected utterances. The system requires a minimum amount of human intervention and no prior knowledge regarding the expected user utterances, and the WWW is used to validate the word models. The system requires a minimum amount of human intervention and no prior knowledge regarding the expected user utterances in response to a particular prompt.

Automatic generation of statistical language models for interactive voice response applications

Automatic generation of statistical language models for interactive voice response applications

Automatic generation of statistical language models for interactive voice response applications

Owner:LYMBA CORP

Commodity target word oriented emotional tendency analysis method

InactiveCN107544957ASemantic description is accurateAccurate predictionSpecial data processing applicationsData setBag-of-words model

The invention discloses a commodity target word oriented emotional tendency analysis method, which belongs to the field of the analysis processing of online shopping commodity reviews. The method comprises the following four steps that: 1: corpus preprocessing: carrying out word segmentation on a dataset, and converting a category label into a vector form according to a category number; 2: word vector training: training review data subjected to the word segmentation through a CBOW (Continuous Bag-of-Words Model) to obtain a word vector; 3: adopting a neural network structure, and using an LSTM(Long Short Term Memory) network model structure to enable the network to pay attention to whole-sentence contents; and 4: review sentence emotion classification: taking the output of the neural network as the input of a Softmax function to obtain a final result. By use of the method, semantic description in a semantic space is more accurate, the data is trained through the neural network so as to optimize the weight and the offset parameter in the neural network, parameters trained after continuous iteration make a loss value minimum, at the time, the trained parameters are used for traininga test set, and therefore, higher accuracy can be obtained.

Commodity target word oriented emotional tendency analysis method

Commodity target word oriented emotional tendency analysis method

Commodity target word oriented emotional tendency analysis method

Owner:NORTH CHINA ELECTRIC POWER UNIV (BAODING)

Knowledge graph completion method based on entity description and relationship path

PendingCN111026875AImprove computing efficiencyImprove accuracySpecial data processing applicationsSemantic tool creationBag-of-words modelWord model

The invention relates to a knowledge graph completion method based on entity description and relationship path, which comprises the steps of S1, establishing a continuous bag-of-words model based on entity text description, and performing vector representation on entity description of entities in a knowledge graph by utilizing the continuous bag-of-words model to obtain a description-based vector;S2, according to a conversion-based model between the entity vector and the relation vector and between the entity vector and the relation path, obtaining a conversion-based model, establishing a score function of the relation triad (h, r, t) and the path triad (h, p, t) and a loss function of the score function, learning vector representation of an entity, a relation and a path by minimizing theloss function, and learning vector representation based on a structure; and S3, adopting the learned entity vector representation, obtaining representation results in the vector space in different tasks, and complementing the knowledge graph or mining the potential relationship.

Knowledge graph completion method based on entity description and relationship path

Knowledge graph completion method based on entity description and relationship path

Knowledge graph completion method based on entity description and relationship path

Owner:RENMIN UNIVERSITY OF CHINA

Service intelligent navigation method and system

ActiveCN101499277AAchieve understandingRealize intelligent navigationSpeech recognitionSpecial data processing applicationsWord modelData mining

The invention provides a service intelligence navigation method, including the steps of: carrying out grammar matching based on metadata for word segmentation result of query proposed by a user; finding out a matched word model mode according to the word segmentation result and acquiring the service classification to which the matched word model mode belongs; carrying out matching search to the word segmentation result and descriptive information or business information and obtaining a first candidate service classification list; acquiring service from the only candidate service classification of the first candidate service classification list and returning the service to the user; if the candidate service classifications are two or more than two, carrying out fuzzy reading of non-login string in the query and obtaining a second candidate service classification list; carrying out fusion operation according to the first candidate service classification list and the second candidate service classification list and obtaining common candidate service classification; if the common candidate service classification is only one, returning the service to the user; if the common candidate service classification does not exist or is more than one, then showing the navigation is a failure.

Service intelligent navigation method and system

Service intelligent navigation method and system

Service intelligent navigation method and system

Owner:INST OF COMPUTING TECH CHINESE ACAD OF SCI

Voice wake-up method and system thereof

ActiveCN108198548ASpeech recognitionStreaming dataData stream

The invention discloses a voice wake-up method and system. The method comprises the steps of step 1) acquiring input real-time audio stream data, detecting the input real-time audio stream data by using a voice detection algorithm, if voice data is found, starting a wake-up word detection algorithm, otherwise, stopping the wake-up word detection algorithm, and continuing to detect; step 2) using the wake-up word detection algorithm to analyze and detect an input voice data stream, and calculating the similarity between the voice data and a given wake-up word model; if the similarity is greaterthan a given threshold T1, considering that wake-up words are detected, starting a wake-up word confirmation algorithm, introducing the voice data of assumed wake-up words, and otherwise, continuously detecting the wake-up words until the voice data stream ends; step 3) using the wake-up word confirmation algorithm to analyze and calculate the obtained voice data stream which is judged as the wake-up words by the wake-up word detection algorithm, and calculating the confidence coefficient of the wake-up words on a voice segment.

Voice wake-up method and system thereof

Voice wake-up method and system thereof

Owner:苏州奇梦者科技有限公司

Emotion analysis method for Chinese texts based on computer information processing technology

InactiveCN104965822AHigh precisionSpecial data processing applicationsInformation processingRestricted Boltzmann machine

The invention discloses an emotion analysis method for Chinese texts based on computer information processing technology. Comments on Chinese products are subjected to word segmentation. By utilizing a bag-of-words model, vector representations of product comments are generated. The vector of every comment is inputted to a visible unit of a limited Boltzmann machine (RBM) in deep learning. Sentimental characteristics of Chinese texts are extracted by the RBM and the extracted emotional characteristics are inputted to a SVM for text emotion classification. The emotion analysis method for the Chinese texts based on computer information processing technology is capable of improving relevance of emotional semantics of characteristics while the SVM is capable of improving accuracy of emotion classification of comments on Chinese products.

Emotion analysis method for Chinese texts based on computer information processing technology

Emotion analysis method for Chinese texts based on computer information processing technology

Emotion analysis method for Chinese texts based on computer information processing technology

Owner:CENT SOUTH UNIV

Prosodic structure forming method based on prosodic phrase

InactiveCN101650942AImprove recognition accuracyImprove accuracySpecial data processing applicationsSpeech synthesisWord modelStudy methods

The invention provides a novel prosodic structure boundary division forming method based on prosodic phrases. The method combines machine learning with rules to greatly improve the accuracy of the prediction of Chinese text prosodic structure boundary. Prosodic phrase boundaries are firstly identified on the premise that input files goes through word segmentation and part of speech tagging, then prosodic word boundaries are formed by combining prosodic phrase boundary information, and finally a plurality rules are artificially added to carry out integral modification. In prosodic phrase and prosodic word boundary identification, characteristics are respectively designed and selected for establishing a characteristic template, and a prosodic phrase model and a prosodic word model are established by utilizing the maximum entropy algorithm for respectively identifying prosodic boundaries of two stages. In addition, aiming at the errors in identification of a maximum entropy model, an optimal rule is selected by utilizing an error-driven rule learning method to further improve the accuracy. Based on the method, the prosodic structure boundary division forming method based on prosodic phrases is provided, and the method can effectively improve the accuracy of prosodic structure prediction and the naturalness of speed synthesis.

Prosodic structure forming method based on prosodic phrase

Prosodic structure forming method based on prosodic phrase

Prosodic structure forming method based on prosodic phrase

Owner:BEIJING UNIV OF POSTS & TELECOMM

Generation and deletion of pronunciation variations in order to reduce the word error rate in speech recognition

InactiveUS20060143008A1Save memory spaceReduce in quantitySpeech recognitionWord modelSpeech identification

Disclosed is a speech recognition method which is based on a dynamic extension of the word models in combination with an evaluation of the pronunciation variations.

Generation and deletion of pronunciation variations in order to reduce the word error rate in speech recognition

Generation and deletion of pronunciation variations in order to reduce the word error rate in speech recognition

Owner:SCHNEIDER TOBIAS +5

System and Method for Correcting Speech

InactiveUS20090220926A1Speech recognitionElectrical appliancesSpoken languageWord model

A method and device for correcting mispronunciations of a user, the method comprising the following steps: providing a database comprising a plurality of records each of which comprising at least a textual and a vocal representation of a specific word; training a speech recognition module to recognize spoken utterances of said user comprising the words represented by said records; generating word models for each recognized spoken word; associating each word model with a respective database record; after training said speech recognition module with sufficient words receiving spoken utterance from said user; extracting a sequence of words from said spoken utterance and generating a word model for each extracted word; comparing said word models to the word models associated with said database records; constructing an audible output comprising vocal representations obtained from records which their word models matched word models generated for said extracted word, wherein said word models comprises features extracted from data of the words spoken by said user.

System and Method for Correcting Speech

System and Method for Correcting Speech

System and Method for Correcting Speech

Owner:RECHLIS GADI

An intelligent operation and maintenance statement similarity matching method based on natural language processing

InactiveCN109902159AAvoid the phenomenon of "curse of dimensionality"Improve accuracySpecial data processing applicationsSemantic tool creationCurse of dimensionalitySentence similarity

The invention discloses an intelligent operation and maintenance statement similarity matching method based on a natural language processing technology. The method mainly comprises two parts of data processing in knowledge base construction and sentence similarity matching based on deep learning. Compared with the prior art, the method has the advantages that (1) the operation and maintenance management knowledge is subjected to word segmentation by utilizing the specific word library and the HMM to find the new word model, so that the text word segmentation accuracy is improved, and the moreperfect text word library is established; (2) word vectors are trained through a deep learning method, so that the phenomenon of'dimensionality disaster 'represented by the word vectors can be avoided, information of vocabulary contexts can be fully mined, and relations between words can be obtained; And (3) on the basis of the sentence vectors configured with the weights, not only can the importance measure of each word be obtained, but also the information of the sentence vectors can be richer through the combination of the word vectors, and the accuracy of matching on the basis of forming the sentence vectors can be guaranteed through a cosine similarity matching algorithm.

An intelligent operation and maintenance statement similarity matching method based on natural language processing

An intelligent operation and maintenance statement similarity matching method based on natural language processing

An intelligent operation and maintenance statement similarity matching method based on natural language processing

Owner:华融融通(北京)科技有限公司

Training speech recognition word models from word samples synthesized by Monte Carlo techniques

ActiveUS7133827B1Rapid channel normalizationSlow normalizationSpeech recognitionAlgorithmWord model

A new word model is trained from synthetic word samples derived by Monte Carlo techniques from one or more prior word models. The prior word model can be a phonetic word model and the new word model can be a non-phonetic, whole-word, word model. The prior word model can be trained from data that has undergone a first channel normalization and the synthesized word samples from which the new word model is trained can undergo a different channel normalization similar to that to be used in a given speech recognition context. The prior word model can have a first model structure and the new word model can have a second, different, model structure. These differences in model structure can include, for example, differences of model topology; differences of model complexity; and differences in the type of basis function used in a description of such probability distributions.

Training speech recognition word models from word samples synthesized by Monte Carlo techniques

Training speech recognition word models from word samples synthesized by Monte Carlo techniques

Training speech recognition word models from word samples synthesized by Monte Carlo techniques

Owner:CERENCE OPERATING CO

Remote sensing image land utilization scene classification method based on two-dimension wavelet decomposition and visual sense bag-of-word model

ActiveCN103413142ASolve the lack of considerationImprove utilizationCharacter and pattern recognitionDecompositionClassification methods

The invention relates to a remote sensing image land utilization scene classification method based on two-dimension wavelet decomposition and a visual sense bag-of-word model. The method comprises the steps that a remote sensing image land utilization scene classification training set is built; scene images in the training set are converted to grayscale images, and two-dimension decomposition is conducted on the grayscale images; regular-grid sampling and SIFT extracting are conducted on the converted grayscale images and sub-images formed after two-dimension decomposition, and universal visual word lists of the converted grayscale images and the sub-images are independently generated through clustering; visual word mapping is conducted on each image in the training set to obtain bag-of-word characteristics; the bag-of-word characteristics of each image in the training set and corresponding scene category serial numbers serve as training data for generating a classification model through an SVM algorithm; images of each scene are classified according to the classification model. The remote sensing image land utilization scene classification method well solves the problems that remote sensing image texture information is not sufficiently considered through an existing scene classification method based on a visual sense bag-of-word model, and can effectively improve scene classification precision.

Remote sensing image land utilization scene classification method based on two-dimension wavelet decomposition and visual sense bag-of-word model

Remote sensing image land utilization scene classification method based on two-dimension wavelet decomposition and visual sense bag-of-word model

Remote sensing image land utilization scene classification method based on two-dimension wavelet decomposition and visual sense bag-of-word model

Owner:INST OF REMOTE SENSING & DIGITAL EARTH CHINESE ACADEMY OF SCI

Model training method for unspecified person alone word, recognition system and recognition method

InactiveCN101114449AImprove the ability to rejectReduce the number of matching operationsSpeech recognitionWord modelArea coverage

The invention discloses a speaker independent isolated words model training method, an identifying system and an identifying method, wherein, the speaker independent isolated words model training method comprises getting a plurality of training modes procedures of the speaker independent isolated words using the dynamic time wrapping algorithm, and a centre training model getting procedure, combining a plurality of training models of the speaker independent isolated words into a centre training model of the speaker independent isolated words. The invention decreases the calculated amount when matching the operation by combining a plurality of training models of the speaker independent isolated words into a centre training mode. At the same time, the invention directly identifies the identifying result smaller than the rejection threshold as out-of-vocabulary words when in identification and process, and effectively improves the rejecting identification capability of out-of-vocabulary words. In addition, by limiting the searching area coverage and loosing the matching jumping-off, the invention increases the recognition rate and decreases the calculated amount.

Model training method for unspecified person alone word, recognition system and recognition method

Model training method for unspecified person alone word, recognition system and recognition method

Model training method for unspecified person alone word, recognition system and recognition method

Owner:大连三曦智能科技有限公司

A multi-example keyword detection method based on a multi-task neural network

ActiveCN108538285AEasy to detectImprove low resource conditionsSpeech recognitionData setMulti language

The invention provides a multi-example keyword detection method based on a multi-task neural network, which belongs to the technical field of voice signal processing. The method comprises the steps of: training a bottleneck deep neural network on a multi-language data set; extracting fbank features from target data set audio frame by frame and extracting bottleneck features of the target data set;based on a training set, training an HMM model for each keyword by using the bottleneck features of the keywords, acquiring a frame-level station tag thereof and training a filler word model by usingthe bottleneck features of all non-keywords; performing multi-task DNN acoustic model training by using the bottleneck features; acquiring the acoustic score of test set audio and obtaining keyword detection results through Viterbi decoding. The multi-task technology can effectively improve low resource conditions and remarkably improve multi-example keyword detection performance.

A multi-example keyword detection method based on a multi-task neural network

Owner:TSINGHUA UNIV

Speech processing apparatus and method using confidence scores

InactiveUS7165031B2Speech recognitionWord modelSpeech identification

A speech recognition method and apparatus is disclosed in which outputs a confidence score indicative of the posterior probability of an utterance being correctly matched to a word model. The confidence score for the matching of an utterance to a word model is determined directly from the generated values indicative of the goodness of match between the utterance and stored word models utilizing the following equation:confidence=exp⁡(-2σ⁢S⁡(x|w))∑words⁢exp⁡(2σ⁢S⁡(x|w))where S(x|w) is the match score for the correlation between a signal x and word w and σ is an experimentally determined constant.

Speech processing apparatus and method using confidence scores

Speech processing apparatus and method using confidence scores

Speech processing apparatus and method using confidence scores

Owner:CANON KK

Voice awakening method and device based on artificial intelligence

ActiveCN106653022ASolve the problem of excessive data usageImprove wakeup precisionSpeech recognitionWord modelArtificial Intelligence System

The invention provides a voice awakening method and a voice awakening device based on artificial intelligence, wherein the method comprises the following steps: clustering various phonemes so as to select out garbage phonemes for expressing all phonemes; in accordance with an awakening word which is set in advance, constructing a counter-awakening word similar to the awakening word; and constructing a decoding network in accordance with the garbage phonemes, the counter-awakening word and the awakening word, so that voice awakening can be implemented by virtue of the decoding network. Since the garbage phoneme is obviously less than a garbage word in data volume, the problem of the prior art that a garbage word model occupies an excessively high data volume is solved; meanwhile, since each word is constituted by a finite number of phonemes, the garbage phoneme, in comparison with the garbage word, can cover all words more comprehensively, so that awakening precision is enhanced and a probability of incorrect awakening is reduced.

Voice awakening method and device based on artificial intelligence

Voice awakening method and device based on artificial intelligence

Voice awakening method and device based on artificial intelligence

Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Speech Recognition Apparatus And Speech Recognition Method

InactiveUS20070203700A1Reduces EventsImprove recognition efficiencySpeech recognitionSpoken languageWord model

A speech recognition apparatus and speech recognition method are provided for reducing such events as erroneous recognition and disabled recognition and improving a recognition efficiency. The speech recognition apparatus generates a word model based on a dictionary memory and a sub-word sound model, and matches the word model with a speech input signal in accordance with a predetermined algorithm to perform a speech recognition for the speech input signal, wherein the apparatus comprises main matching means, operative when matching the word model with the speech input signal along a processing path indicated by the algorithm, for limiting the processing path based on a course command to select the word model most approximate to the speech input signal, local template storing means for previously typifying local sound features of spoken speeches for storage as local templates; and local matching means for matching each of component sections of the speech input signal with the local templates stored in the local template storing means to definitely determine a sound feature for each of the component sections, and generating the course command in accordance with the result of the definite determination.

Speech Recognition Apparatus And Speech Recognition Method

Speech Recognition Apparatus And Speech Recognition Method

Speech Recognition Apparatus And Speech Recognition Method

Owner:PIONEER CORP

Communication Device Having Speaker Independent Speech Recognition

InactiveUS20070203701A1Substation equipmentSpeech recognitionWord modelSpeech input

Techniques for performing speech recognition in a communication device with a voice dialing function is provided. Upon receipt of a voice input in a speech recognition mode, input feature vectors are generated from the voice input. Also, a likelihood vector sequence is calculated from the input feature vectors indicating the likelihood in time of an utterance of phonetic units. In a warping operation, the likelihood vector sequence is compared to phonetic word models and word model match likelihoods are calculated for that word models. After determination of a best-matching word model, the corresponding number to the name synthesized from the best-matching word model is dialed in a dialing operation.

Communication Device Having Speaker Independent Speech Recognition

Communication Device Having Speaker Independent Speech Recognition

Communication Device Having Speaker Independent Speech Recognition

Owner:INTELLECTUAL VENTURES FUND 21

Multi-channel topic model-based editable garment image search method

InactiveCN107870992AHigh precisionOvercome the "Semantic Gap" problemCharacter and pattern recognitionSpecial data processing applicationsBag-of-words modelSample image

The invention discloses a multi-channel topic model-based editable garment image search method. The method comprises the steps of firstly, finding a main body region of a garment commodity in a picture by using an object detection method, extracting multiple descriptors from the main body region and quantizing the descriptors into vectors in bag-of-word forms by using a bag-of-word model; secondly, according to search conditions, editing and modifying weights of visual words, fusing the vectors in the bag-of-word forms into retrieval features capable of describing high-level semanteme of the garment commodity by using a pre-trained multi-channel topic model, and establishing indexes; and during online detection, calculating vector similarity of a to-be-queried commodity sample image and animage of a database, and taking the commodity with the highest similarity as a search result. The information of vision, commodity text attributes and the like of the to-be-queried commodity can be re-edited; the demand of a user on the commodity is described more accurately; and the user-expected commodity is searched for through the multi-channel topic model.

Multi-channel topic model-based editable garment image search method

Multi-channel topic model-based editable garment image search method

Multi-channel topic model-based editable garment image search method

Owner:SHANGHAI JIAO TONG UNIV

Literature retrieval method based on semantic small-word model

InactiveCN101017504AImprove index recallReduce deliverySpecial data processing applicationsSmall worldsSocial network

This invention discloses one file index method based on language meanings small world, which comprises the following steps: firstly using latent meanings index to extract file property vector to maintain file property to lower its dimensions and to reduce the information memory volume; then using supportive vector machine to sort all common files to form sort information to mark the sort interest proportion; finally using social network small world with small linkage point with high proportion interest of certain file sort to form network topological structure with small property.

Literature retrieval method based on semantic small-word model

Literature retrieval method based on semantic small-word model

Owner:HUAZHONG UNIV OF SCI & TECH

Unsupervised writer style adaptation for handwritten word spotting

ActiveUS20090252417A1Character and pattern recognitionHide markov modelWord model

A word spotting system includes a semi-continuous hidden Markov model configured to model a handwritten word of interest. A writing segments extractor is configured to extract writing segments generally comprising images of handwritten character strings from a received document image. A word model adaptation processor is configured to adjust a shared pool of Gaussians of the semi-continuous hidden Markov model respective to the extracted writing segments. A modeler is configured to model extracted writing segments using the semi-continuous hidden Markov model with the adjusted shared pool of Gaussians to identify whether each modeled writing segment matches the handwritten word of interest.

Unsupervised writer style adaptation for handwritten word spotting

Unsupervised writer style adaptation for handwritten word spotting

Unsupervised writer style adaptation for handwritten word spotting

Owner:XEROX CORP

System and method for extracting entities of interest from text using n-gram models

ActiveUS20080040298A1Quick analysisDigital computer detailsNatural language data processingWord modelDocumentation

A document (or multiple documents) is analyzed to identify entities of interest within that document. This is accomplished by constructing n-gram or bi-gram models that correspond to different kinds of text entities, such as chemistry-related words and generic English words. The models can be constructed from training text selected to reflect a particular kind of text entity. The document is tokenized, and the tokens are run against the models to determine, for each token, which kind of text entity is most likely to be associated with that token. The entities of interest in the document can then be annotated accordingly.

System and method for extracting entities of interest from text using n-gram models

System and method for extracting entities of interest from text using n-gram models

System and method for extracting entities of interest from text using n-gram models

Owner:IBM CORP

Phonetic segmentation-based isolate word recognition method

InactiveCN101819772AEasy to identifyImprove recognition response timeSpeech recognitionSyllableSmall probability

The invention discloses a phonetic segmentation-based isolate word recognition method. In the method, a continuous HMM model obtained by voice training takes syllable or semi-syllable as a unit set, trained unit set models are spliced into whole word models according to the syllables or semi-syllables of isolate words in a word list, and a Viterbi algorithm is adopted in recognition. The method has the advantages of improving recognition performance because each segment of HMM model in recognition results (accurately recognized isolate words) can be better matched with each segment of characteristic parameters of voices to be recognized, and shortening recognition response time because the recognition of the isolate words with relatively smaller probability values can be directly refused every time recognition operation is performed on the tail state of one syllable or semi-syllable.

Phonetic segmentation-based isolate word recognition method

Phonetic segmentation-based isolate word recognition method

Phonetic segmentation-based isolate word recognition method

Owner:NO 709 RES INST OF CHINA SHIPBUILDING IND CORP

Popular searches

Acoustic model Speech sound Lettering Acoustics Subvocal recognition Lexical frequency Large size Human language Contextual information Language model

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com