Patents

Literature

Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.

711 results about "Recurrent neural nets" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Recurrent neural networks (RNNs) are a kind of neural net often used to model sequence data. They maintain a hidden state which can "remember" certain aspects of the sequence it has seen. RNNs can be trained using backpropagation through time, although efficient training remains an open problem.

Dialog device with dialog support generated using a mixture of language models combined using a recurrent neural network

ActiveUS20170316775A1Digital data information retrievalNatural language data processingAlgorithmDisplay device

A dialog device comprises a natural language interfacing device (chat interface or a telephonic device), and a natural language output device (the chat interface, a display device, or a speech synthesizer outputting to the telephonic device). A computer stores natural language dialog conducted via the interfacing device and constructs a current utterance word-by-word. Each word is chosen by applying a plurality of language models to a context comprising concatenation of the stored dialog and the current utterance thus far. Each language model outputs a distribution over the words of a vocabulary. A recurrent neural network (RNN) is applied to the distributions to generate a mixture distribution. The next word is chosen using the mixture distribution. The output device outputs the current natural language utterance after it has been constructed by the computer.

Dialog device with dialog support generated using a mixture of language models combined using a recurrent neural network

Dialog device with dialog support generated using a mixture of language models combined using a recurrent neural network

Dialog device with dialog support generated using a mixture of language models combined using a recurrent neural network

Owner:CONDUENT BUSINESS SERVICES LLC

Method and System for Detecting Actions in Videos

ActiveUS20170255832A1Longer temporal durationImprove performanceImage enhancementImage analysisComputer visionRecurrent neural nets

A method and system detects actions of an object in a scene by first acquiring a video of the scene as a sequence of images, wherein each image includes pixels, wherein the video is partitioned into chunks. The object in the video is tracked. For each object and each chunk of the video, trajectories of the pixels within a bounding box located over the object are tracked, and cropped trajectories and cropped images for one or more images in the chunk are produced using the bounding box. Then, the cropped trajectories and cropped images are passed to a recurrent neural network (RNN) that outputs a relative score for each action of interest.

Method and System for Detecting Actions in Videos

Method and System for Detecting Actions in Videos

Method and System for Detecting Actions in Videos

Owner:MITSUBISHI ELECTRIC RES LAB INC

Image captioning utilizing semantic text modeling and adversarial learning

ActiveUS20180373979A1Accurately reflectCharacter and pattern recognitionMachine learningText modelingDigital image

The present disclosure includes methods and systems for generating captions for digital images. In particular, the disclosed systems and methods can train an image encoder neural network and a sentence decoder neural network to generate a caption from an input digital image. For instance, in one or more embodiments, the disclosed systems and methods train an image encoder neural network (e.g., a character-level convolutional neural network) utilizing a semantic similarity constraint, training images, and training captions. Moreover, the disclosed systems and methods can train a sentence decoder neural network (e.g., a character-level recurrent neural network) utilizing training sentences and an adversarial classifier.

Image captioning utilizing semantic text modeling and adversarial learning

Image captioning utilizing semantic text modeling and adversarial learning

Image captioning utilizing semantic text modeling and adversarial learning

Owner:ADOBE SYST INC

Method and system for deep nerve translation based on character encoding

ActiveCN106126507AReduce feature dimensionExcellent translation resultNatural language translationSpecial data processing applicationsNerve networkEncoder decoder

The invention provides a method and a system for deep nerve translation based on character encoding. A combined nerve network model is established by using an RNN to cover the whole translation process, and translation tasks are directly completed from the perspective of an encoder-decoder framework. The method comprises the following steps: A, word vector generation: performing word segmentation on character-level input data through neural network modeling and generating a word vector; B, language model generation: establishing grammar rules by utilizing the characteristic of memory of the recurrent neural network in time; C, word alignment model generation: obtaining the probability of translating multiple words in a source language statement into target language words; D, output: translating an inputted source language into a target language; E, translation model combination: establishing a deep nerve translation model (RNN-embed) based on character encoding in combination with neural network models in the four steps and accelerating model training by using CPU parallel computation.

Method and system for deep nerve translation based on character encoding

Method and system for deep nerve translation based on character encoding

Method and system for deep nerve translation based on character encoding

Owner:HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL

Voice identification method using long-short term memory model recurrent neural network

InactiveCN106328122ASpeech recognitionNerve networkFrequency spectrum

The invention discloses a voice identification method using a long-short term memory model recurrent neural network. The voice identification method comprises training and identification. The training process comprises steps of introducing voice data and text data to generate a commonly-trained acoustic and language mode, and using an RNN sensor to perform decoding to form a model parameter. The identification process comprises steps of converting voice input to a frequency spectrum graph through Fourier conversion, using the recursion neural network of the long-short term memory model to perform orientational searching decoding and finally generating an identification result. The voice identification method adopts the recursion neural network (RNNs) and adopts connection time classification (CTC) to train RNNs through an end-to-end training method. These LSTM units combining with the long-short term memory have good effects and combines with multi-level expression to prove effective in a deep network; only one neural network model (end-to-end model) exits from a voice characteristic (an input end) to a character string (an output end) and the neural network can be directly trained by a target function which is a some kind of a proxy of WER, which avoids to cost useless work to optimize an individual target function.

Voice identification method using long-short term memory model recurrent neural network

Voice identification method using long-short term memory model recurrent neural network

Voice identification method using long-short term memory model recurrent neural network

Owner:SHENZHEN WEITESHI TECH

Apparatus for calculating a posterior probability of phoneme symbol, and speech recognition apparatus

InactiveUS6041299ADigital computer detailsSpeech recognitionFeature extractionSpeech identification

There are disclosed an apparatus for calculating a posteriori probabilities of phoneme symbols and a speech recognition apparatus using the apparatus for calculating a posteriori probabilities of phoneme symbols. A feature extracting section extracts speech feature parameters from a speech signal of an uttered speech sentence composed of an inputted character series, and a calculating section calculates a a posteriori probability of a phoneme symbol of the speech signal, by using a bidirectional recurrent neural network. The bidirectional recurrent neural network includes (a) an input layer for receiving the speech feature parameters extracted by the feature extracting means and a plurality of hypothetical phoneme symbol series signals, (b) an intermediate layer of at least one layer having a plurality of units, and (c) an output layer for outputting a a posteriori probability of each phoneme symbol. The input layer includes (a) a first input neuron group having a plurality of units, for receiving a plurality of speech feature parameters and a plurality of phoneme symbol series signals, (b) a forward module, and (c) a backward module.

Apparatus for calculating a posterior probability of phoneme symbol, and speech recognition apparatus

Apparatus for calculating a posterior probability of phoneme symbol, and speech recognition apparatus

Apparatus for calculating a posterior probability of phoneme symbol, and speech recognition apparatus

Owner:A T R ONSEI HONYAKU TSUSHIN KENKYUSHO KK

Video description method based on deep learning and text summarization

ActiveCN105279495AImprove accuracyGood varietyCharacter and pattern recognitionNeural learning methodsFrame sequenceData set

The invention discloses a video description method based on deep learning and text summarization. The video description method comprises the following steps: through a traditional image data set, training a convolutional neural network model according to an image classification task; extracting a video frame sequence of a video, utilizing the convolutional neural network model to extract convolutional neural network characteristics to form a <video frame sequence, text description sequence> pair which is used as the input of a recurrent neural network model, and training to obtain the recurrent neural network model; describing the video frame sequence of the video to be described through the recurrent neural network model obtained by training to obtain description sequences; and through a method that graph-based vocabulary centrality is used as the significance of the text summarization, sorting the description sequences, and outputting a final description result of the video. An event which happens in one video and object attributes associated with the event are described through a natural language so as to achieve a purpose that video contents are described and summarized.

Video description method based on deep learning and text summarization

Video description method based on deep learning and text summarization

Video description method based on deep learning and text summarization

Owner:广州葳尔思克自动化科技有限公司

Bidirectional long short-term memory unit-based behavior identification method for video

InactiveCN106845351AImprove accuracyGuaranteed accuracyCharacter and pattern recognitionNeural architecturesTime domainTemporal information

The invention discloses a bidirectional long short-term memory unit-based behavior identification method for a video. The method comprises the steps of (1) inputting a video sequence and extracting an RGB (Red, Green and Blue) frame sequence and an optical flow image from the video sequence; (2) respectively training a deep convolutional network of an RGB image and a deep convolutional network of the optical flow image; (3) extracting multilayer characteristics of the network, wherein characteristics of a third convolutional layer, a fifth convolutional layer and a seventh fully connected layer are at least extracted, and the characteristics of the convolutional layers are pooled; (4) training a recurrent neural network constructed by use of a bidirectional long short-term memory unit to obtain a probability matrix of each frame of the video; and (5) averaging the probability matrixes, finally fusing the probability matrixes of an optical flow frame and an RGB frame, taking a category with a maximum probability as a final classification result, and thus realizing behavior identification. According to the method, the conventional artificial characteristics are replaced with multi-layer depth learning characteristics, the depth characteristics of different layers represent different pieces of information, and the combination of multi-layer characteristics can improve the accuracy rate of classification; and the time information is captured by use of the bidirectional long short-term memory, many pieces of time domain structural information are obtained and a behavior identification effect is improved.

Bidirectional long short-term memory unit-based behavior identification method for video

Bidirectional long short-term memory unit-based behavior identification method for video

Bidirectional long short-term memory unit-based behavior identification method for video

Owner:SUZHOU UNIV

Convolutional recurrent neural networks for small-footprint keyword spotting

ActiveUS20180261213A1Mathematical modelsSound input/outputAlgorithmSmall footprint

Described herein are systems and methods for creating and using Convolutional Recurrent Neural Networks (CRNNs) for small-footprint keyword spotting (KWS) systems. Inspired by the large-scale state-of-the-art speech recognition systems, in embodiments, the strengths of convolutional layers to utilize the structure in the data in time and frequency domains are combined with recurrent layers to utilize context for the entire processed frame. The effect of architecture parameters were examined to determine preferred model embodiments given the performance versus model size tradeoff. Various training strategies are provided to improve performance. In embodiments, using only ˜230 k parameters and yielding acceptably low latency, a CRNN model embodiment demonstrated high accuracy and robust performance in a wide range of environments.

Convolutional recurrent neural networks for small-footprint keyword spotting

Convolutional recurrent neural networks for small-footprint keyword spotting

Convolutional recurrent neural networks for small-footprint keyword spotting

Owner:BAIDU USA LLC

Deep learning-based short-term traffic flow prediction method

ActiveCN107230351AImprove accuracyOvercoming featureDetection of traffic movementForecastingShort-term memoryStudy methods

The present invention discloses a deep learning method-based short-term traffic flow prediction method. The influence of the traffic flow rate change of the neighbor points of a prediction point, the time characteristic of the prediction point and the influence of the periodic characteristic of the prediction point on the traffic flow rate of the prediction point are considered simultaneously. According to the deep learning method-based short-term traffic flow prediction method of the invention, a convolutional neural network and a long and short-term memory (LSTM) recurrent neural network are combined to construct a Conv-LSTM deep neural network model; a two-way LSTM model is used to analyze the traffic flow historical data of the point and extract the periodic characteristic of the point; and a traffic flow trend and a periodic characteristic which are obtained through analysis are fused, so that the prediction of traffic flow can be realized. With the method of the invention adopted, the defect of the incapability of an existing method to make full use of time and space characteristics can be eliminated, the time and space characteristics of the traffic flow are fully extracted, and the periodic characteristic of the data of the traffic flow is fused with the time and space characteristics, and therefore, the accuracy of short-term traffic flow prediction results can be improved.

Deep learning-based short-term traffic flow prediction method

Deep learning-based short-term traffic flow prediction method

Deep learning-based short-term traffic flow prediction method

Owner:FUZHOU UNIV

Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

InactiveCN104952448AEnhancement effect is goodImprove performanceSpeech recognitionShort-term memoryData mining

The invention discloses a method and a system for enhancing features by the aid of bidirectional long-term and short-term memory recurrent neural networks. The method includes steps of training inputted features with noise and corresponding features without noise by the aid of bidirectional long-term and short-term memory recurrent neural network models; enhancing the features by the aid of the bidirectional long-term and short-term memory recurrent neural network models. The method and the system have the advantages that the long-term and short-term memory recurrent neural network models can be built in two directions by the aid of the inputted features, and accordingly models of contexts of current frames can be effectively built; long-term and short-term memory cells and certain control variables are introduced into the neural networks, and accordingly the models can be built in the dependence on the long contexts; excellent enhancement effects can be realized by the models owing to long-term dependence when the features are enhanced by the aid of the models, and accordingly the performance of voice recognition systems and audio event classification systems can be improved.

Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks

Owner:张爱英 +1

Design method of hardware accelerator based on LSTM recursive neural network algorithm on FPGA platform

InactiveCN108090560AImprove forecastImprove performanceNeural architecturesPhysical realisationNeural network hardwareLow resource

The invention discloses a method for accelerating an LSTM neural network algorithm on an FPGA platform. The FPGA is a field-programmable gate array platform and comprises a general processor, a field-programmable gate array body and a storage module. The method comprises the following steps that an LSTM neural network is constructed by using a Tensorflow pair, and parameters of the neural networkare trained; the parameters of the LSTM network are compressed by adopting a compression means, and the problem that storage resources of the FPGA are insufficient is solved; according to the prediction process of the compressed LSTM network, a calculation part suitable for running on the field-programmable gate array platform is determined; according to the determined calculation part, a softwareand hardware collaborative calculation mode is determined; according to the calculation logic resource and bandwidth condition of the FPGA, the number and type of IP core firmware are determined, andacceleration is carried out on the field-programmable gate array platform by utilizing a hardware operation unit. A hardware processing unit for acceleration of the LSTM neural network can be quicklydesigned according to hardware resources, and the processing unit has the advantages of being high in performance and low in power consumption compared with the general processor.

Design method of hardware accelerator based on LSTM recursive neural network algorithm on FPGA platform

Design method of hardware accelerator based on LSTM recursive neural network algorithm on FPGA platform

Design method of hardware accelerator based on LSTM recursive neural network algorithm on FPGA platform

Owner:SUZHOU INST FOR ADVANCED STUDY USTC

Method and system for detecting voice spoofing attack of speakers on basis of deep learning

ActiveCN105869630AHigh precisionReduce the differenceSpeech recognitionAlgorithmGlobal information

The invention discloses a method and a system for detecting voice spoofing attack of speakers on the basis of deep learning. The method includes constructing audio-frequency training sets, initializing deep feed-forward neural networks and deep recurrent neural networks and respectively training the deep feed-forward neural networks and the deep recurrent neural networks by the aid of multi-frame feature vectors and single-frame vector sequences of the training sets; respectively leading frame level and sequence level feature vectors of to-be-tested audio frequencies into two trained linear differential analysis models in test phases, weighting two obtained result grades to obtain scores and comparing the scores to predefined threshold values so as to discriminate voice spoofing. The method and the system have the advantages that local features can be captured, and global information can be grasped; the linear differential analysis models are used as classifiers in identification and verification phases, the voice spoofing attack can be judged by means of grade fusion, and accordingly the voice spoofing detection accuracy can be greatly improved.

Method and system for detecting voice spoofing attack of speakers on basis of deep learning

Method and system for detecting voice spoofing attack of speakers on basis of deep learning

Owner:AISPEECH CO LTD

Systems and methods for malicious code detection

ActiveUS20180285740A1Reduce data sizePlatform integrity maintainanceNeural architecturesNeural network systemData mining

There is provided a neural network system for detection of malicious code, the neural network system comprising: an input receiver configured for receiving input text from one or more code input sources; a convolutional neural network unit including one or more convolutional layers, the convolutional unit configured for receiving the input text and processing the input text through the one or more convolutional layers; a recurrent neural network unit including one or more long short term memory layers, the recurrent neural network unit configured to process the output from the convolutional neural network unit to perform pattern recognition; and a classification unit including one or more classification layers, the classification unit configured to receive output data from the recurrent neural network unit to perform a determination of whether the input text or portions of the input text are malicious code or benign code.

Systems and methods for malicious code detection

Systems and methods for malicious code detection

Systems and methods for malicious code detection

Owner:ROYAL BANK OF CANADA

Language identification using recurrent neural networks

ActiveUS20180067918A1Semantic analysisSpeech analysisAlgorithmMachine learning

Systems and processes for language identification using recurrent neural networks are provided. An example method includes, at an electronic device, receiving a first typed character of a character sequence and determining a character context of the first typed character based on the first typed character and a second typed character of the character sequence. The method further includes determining a confidence level that the character sequence is associated with a language of a plurality of languages based on the character context of the first typed character, and determining whether the confidence level exceeds a threshold, in accordance with a determination that the confidence level exceeds the threshold, providing the language as a candidate language, and in accordance with a determination that the confidence level does not exceed the threshold, forgoing providing the language as a candidate language.

Language identification using recurrent neural networks

Language identification using recurrent neural networks

Language identification using recurrent neural networks

Owner:APPLE INC

Efficient word encoding for recurrent neural network language models

InactiveUS20170091169A1Digital data information retrievalBiological neural network modelsRecurrent neural network language modelsUser input

Systems and processes for word encoding are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a user input, determining a first similarity between a representation of the user input and a first acoustic model of a plurality of acoustic models, and determining a second similarity between the representation of the user input and a second acoustic model of the plurality of acoustic models. The method further includes determining whether the first similarity is greater than the second similarity. In accordance with a determination that the first similarity is greater than the second similarity, the first acoustic model may be selected; and in accordance with a determination that the first similarity is not greater than the second similarity, the second acoustic model may be selected.

Efficient word encoding for recurrent neural network language models

Efficient word encoding for recurrent neural network language models

Efficient word encoding for recurrent neural network language models

Owner:APPLE INC

Adaptive acoustic channel equalizer & tuning method

InactiveUS6434084B1SurveyConstructionsCommunication unitOriginal data

A method and apparatus for data communication in an oil well environment, wherein the method comprises detecting an acoustic signal transmitted along an acoustic channel, the acoustic signal being distorted from transmission through the acoustic channel, generating a transmitted data signal in response to the acoustic signal, inputting the transmitted data signal to an adaptive equalizer and adaptively equalizing the transmitted data signal to produce an equalized data signal related to the transmitted data signal by a mathematical function. The detecting step may include positioning an acoustic receiver in a communication unit along the acoustic channel. The communication unit may be positioned downhole and the adaptive equalizer may be positioned remotely relative to the communication unit or may be placed in the communication unit. The adaptive equalizer may be a frequency domain filter, a neural net adaptive equalizer or a nonlinear recurrent neural net equalizer. The acoustic signal may comprise a plurality of discrete transmissions which may be a training sequence for training the adaptive equalizer and may comprise a first discrete transmission transmitted repeatedly.The method of data communication in an oil well environment may comprise the steps of transmitting an acoustic signal from a first location along an acoustic channel, detecting the acoustic signal at a second location along the acoustic channel, generating a transmitted data signal in response to the acoustic signal, inputting the transmitted data signal to an adaptive equalizer and adaptively equalizing the transmitted data signal to produce an equalized data signal related to the transmitted data signal by a mathematical function. The transmitting step may further comprise positioning an acoustic transmitter in a first communication unit along the acoustic channel downhole or elsewhere. The method may further comprise acquiring data, generating an original data signal in response to the acquired data and inputting the original data signal to the acoustic transmitter. The acoustic signal may comprise a series of acoustic training signals for training the adaptive equalizer. The acoustic training signals may be transmitted at a predetermined time. A stored training signal may include a series of stored training data signals corresponding to the series of acoustic training signals. At least a portion of the stored training signals may be cross-correlated to the transmitted data signal. The acoustic signal may comprise a notification signal for notifying the adaptive equalizer of a training session.

Adaptive acoustic channel equalizer & tuning method

Adaptive acoustic channel equalizer & tuning method

Adaptive acoustic channel equalizer & tuning method

Owner:WELLDYNAMICS INC

Legged mobile robot and its motion teaching method, and storage medium

InactiveUS7024276B2Programme-controlled manipulatorComputer controlNerve networkSerial pattern

Learning-type motion control is performed using a hierarchical recurrent neural network. A motion pattern provided through human teaching work is automatically time-serially segmented with the hierarchical recurrent neural network, and the motion control of a machine body is carried out with a combination of the segmented data, whereby various motion patterns can be produced. With the time-serial segmentation, local time-serial patterns and an overall pattern as a combination of the local time-serial patterns are produced. For those motion patterns, indices for static stability and dynamic stability of the machine body, e.g., ZMP stability criteria, are satisfied and hence control stability is ensured.

Legged mobile robot and its motion teaching method, and storage medium

Legged mobile robot and its motion teaching method, and storage medium

Legged mobile robot and its motion teaching method, and storage medium

Owner:SONY CORP

Behavior identification method based on recurrent neural network and human skeleton movement sequences

ActiveCN104615983AEasy accessHigh precision recognition rateCharacter and pattern recognitionNeural architecturesHuman bodyVideo monitoring

The invention discloses a behavior identification method based on a recurrent neural network and human skeleton movement sequences. The method comprises the following steps of normalizing node coordinates of extracted human skeleton posture sequences to eliminate influence of absolute space positions, where a human body is located, on an identification process; filtering the skeleton node coordinates through a simple smoothing filter to improve the signal to noise ratio; sending the smoothed data into the hierarchic bidirectional recurrent neural network for deep characteristic extraction and identification. Meanwhile, the invention provides a hierarchic unidirectional recurrent neural network model for coping with practical real-time online analysis requirements. The behavior identification method based on the recurrent neural network and the human skeleton movement sequences has the advantages of designing an end-to-end analyzing mode according to the structural characteristics and the motion relativity of human body, achieving high-precision identification and meanwhile avoiding complex computation, thereby being applicable to practical application. The behavior identification method based on the recurrent neural network and the human skeleton movement sequence is significant to the fields of intelligent video monitoring based on the depth camera technology, intelligent traffic management, smart city and the like.

Behavior identification method based on recurrent neural network and human skeleton movement sequences

Behavior identification method based on recurrent neural network and human skeleton movement sequences

Behavior identification method based on recurrent neural network and human skeleton movement sequences

Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling

ActiveUS20180349349A1Eliminate separationPerform word prediction more quickly, efficiently, and accuratelySemantic analysisSpeech analysisAlgorithmMorpheme

The present disclosure generally relates to systems and processes for morpheme-based word prediction. An example method includes receiving a current word; determining a context of the current word based on the current word and a context of a previous word; determining, using a morpheme-based language model, a likelihood of a prefix based on the context of the current word; determining, using the morpheme-based language model, a likelihood of a stem based on the context of the current word; determining, using the morpheme-based language model, a likelihood of a suffix based on the context of the current word; determining a next word based on the likelihood of the prefix, the likelihood of the stem, and the likelihood of the suffix; and providing an output including the next word.

Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling

Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling

Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling

Owner:APPLE INC

Dynamic Bandwidth Allocation Method of Ethernet Passive Optical Network

InactiveUS20100254707A1Fast convergenceAccurate predictionMultiplex system selection arrangementsTime-division optical multiplex systemsRecurrent neural netsPipeline scheduling

A dynamic bandwidth allocation method of an Ethernet passive optical network, comprises a predictor and a rule of QoS-promoted dynamic bandwidth allocation (PQ-DBA); the predictor predicts a client behavior and numbers of various kinds of packets by using a pipeline scheduling predictor consisted of a pipelined recurrent neural network (PRNN), and a learning rule of the extended recursive least squares (ERLS); the present invention establishes a better QoS traffic management for the OLT-allocated ONU bandwidth and client packets sent by priority.

Dynamic Bandwidth Allocation Method of Ethernet Passive Optical Network

Dynamic Bandwidth Allocation Method of Ethernet Passive Optical Network

Dynamic Bandwidth Allocation Method of Ethernet Passive Optical Network

Owner:CHUNGHWA TELECOM CO LTD

Recurrent neural network-based discrete emotion recognition method

ActiveCN105469065AModeling implementationRealize precise identificationCharacter and pattern recognitionDiscrete emotionsSupport vector machine classifier

The invention provides a recurrent neural network-based discrete emotion recognition method. The method comprises the following steps: 1, carrying out face detecting and tracking on image signals in a video, extracting key points of faces to serve as deformation features of the faces after obtaining the face regions, clipping the face regions and normalizing to a uniform size and extracting the appearance features of the faces; 2, windowing audio signals in the video, segmenting audio sequence units out and extracting audio features; 3, respectively carrying out sequential coding on the three features obtained by utilizing a recurrent neural network with long short-term memory models to obtain emotion representation vectors with fixed lengths, connecting the vectors in series and obtaining final emotion expression features; and 4, carrying out emotion category prediction by utilizing the final emotion expression features obtained in the step 3 on the basis of a support vector machine classifier. According to the method, dynamic information in the emotion expressing process can be fully utilized, so that the precise recognition of emotions of participators in the video is realized.

Recurrent neural network-based discrete emotion recognition method

Recurrent neural network-based discrete emotion recognition method

Recurrent neural network-based discrete emotion recognition method

Owner:北京中科欧科科技有限公司

Global semantic word embeddings using bi-directional recurrent neural networks

ActiveUS20190355346A1Improve performanceLess training dataMathematical modelsNatural language analysisData-driven learningData-driven

Systems and processes for operating a digital assistant are provided. In accordance with one or more examples, a method includes, receiving training data for a data-driven learning network. The training data include a plurality of word sequences. The method further includes obtaining representations of an initial set of semantic categories associated with the words included in the training data; and training the data-driven learning network based on the plurality of word sequences included in the training data and based on the representations of the initial set of semantic categories. The training is performed using the word sequences in their entirety. The method further includes obtaining, based on the trained data-driven learning network, representations of a set of semantic embeddings of the words included in the training data; and providing the representations of the set of semantic embeddings to at least one of a plurality of different natural language processing tasks.

Global semantic word embeddings using bi-directional recurrent neural networks

Global semantic word embeddings using bi-directional recurrent neural networks

Global semantic word embeddings using bi-directional recurrent neural networks

Owner:APPLE INC

Deep learning model and system predicting blood flow characteristic in blood vessel path of blood vessel tree

ActiveCN106980899AAccurate predictionQuick forecastNeural learning methodsPattern recognitionAlgorithm

The invention discloses a deep learning model and system predicting a blood flow characteristic in a blood vessel path of a blood vessel tree. The deep learning model comprises a neural network arranged for different points in the blood vessel path, receives at least one, as input, selected from image, structure and function characteristics of the points in the blood vessel path, and predicts the blood flow characteristic, as output, of the points of a point sequence in the blood vessel path,; and the deep learning model is established via a recursive neural network or by combining a multilayer neural network with a recursive neural network sequentially. The deep learning model can predict the blood flow characteristic (as a blood flow reservation fraction) in the whole blood vessel path, and the computing efficiency is improved greatly.

Deep learning model and system predicting blood flow characteristic in blood vessel path of blood vessel tree

Deep learning model and system predicting blood flow characteristic in blood vessel path of blood vessel tree

Deep learning model and system predicting blood flow characteristic in blood vessel path of blood vessel tree

Owner:BEIJING CURACLOUD TECH CO LTD

Computer vision-based express parcel violent sorting identification method

ActiveCN106897670ALow priceFacilitate large-scale deploymentCharacter and pattern recognitionHuman bodyFeature extraction

The present invention discloses a computer vision-based express parcel violent sorting identification method. The method comprises the following steps of: a depth camera-based pose estimation: a human pose estimation problem is converted into a problem of classifying depth image pixels captured by a depth camera, and human body pose estimation is realized by using a random forest method; human body three-dimensional pose relative spatial-temporal feature extraction: relative spatial-temporal positions of geometric elements of points, lines and surfaces formed by joints in three-dimensional poses and the measures of the change of the relative spatial-temporal positions are extracted and are adopted as the feature representations of the poses; and recurrent neural network-based violent sorting identification: modeling training is performed on the pose spatial-temporal relative features which are continuous in time and are extracted from the human body three-dimensional poses through a long and short memory model (LSTM), so that the identification of express parcel violent sorting behaviors can be realized.

Computer vision-based express parcel violent sorting identification method

Computer vision-based express parcel violent sorting identification method

Computer vision-based express parcel violent sorting identification method

Owner:NANJING UNIV OF POSTS & TELECOMM

Generation method of image description from structured text

ActiveCN106503055AActive implementation of technical effectsGood varietyStill image data indexingSpecial data processing applicationsLexical analysisThe Internet

The invention discloses a generation method of an image description from a structured text. The generation method comprises the steps of downloading pictures from the internet to form a picture training set; conducting morphological analysis on descriptions which correspond to the pictures in the picture training set to form the structured text; using an existing neural network model to extract convolution neural network characteristics of the pictures in the training set, and using <, picture characteristics and structured text < as inputs to form a multitasking recognition model; using the structured text extracted from the training set and a description which corresponds to the structured text as inputs of a recurrent neural network, and conducting training to obtain a parameter of a recurrent neural network model; inputting the convolution neural network characteristics of an image ready to be described, and obtaining a predicted structured text through the multitasking recognition model; inputting the predicted structured text, and obtaining the image description through the recurrent neural network model. Compared with the prior art, a better image description effect, accuracy and sentence variety can be generated through the method, and the generation method of the image description from the structured text can be effectively popularized in an application of image retrieval.

Generation method of image description from structured text

Generation method of image description from structured text

Generation method of image description from structured text

Owner:哈尔滨米兜科技有限公司

Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks

ActiveUS20170053646A1Effective estimateReduce errorsBiological modelsSpeech recognitionSpoken languageRecurrent neural nets

A spoken language understanding (SLU) system receives a sequence of words corresponding to one or more spoken utterances of a user, which is passed through a spoken language understanding module to produce a sequence of intentions. The sequence of words are passed through a first subnetwork of a multi-scale recurrent neural network (MSRNN), and the sequence of intentions are passed through a second subnetwork of the multi-scale recurrent neural network (MSRNN). Then, the outputs of the first subnetwork and the second subnetwork are combined to predict a goal of the user.

Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks

Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks

Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks

Owner:MITSUBISHI ELECTRIC RES LAB INC

A socio-emotional classification method based on multimodal fusion

InactiveCN109508375AEfficient extractionImprove accuracySemantic analysisNeural architecturesShort-term memoryClassification methods

The invention provides a social emotion classification method based on multimodal fusion, which relates to information in the form of audio, visual and text. Most of affective computing research onlyextracts affective information by analyzing single-mode information, ignoring the relationship between information sources. The present invention proposes a 3D CNN ConvLSTM (3D CNN ConvLSTM) model forvideo information, and establishes spatio-temporal information for emotion recognition tasks through a cascade combination of a three-dimensional convolution neural network (C3D) and a convolution long-short-term memory recurrent neural network (ConvLSTM). For text messages, use CNN-RNN hybrid model is used to classify text emotion. Heterogeneous fusion of vision, audio and text is performed by decision-level fusion. The deep space-time feature learned by the invention effectively simulates the visual appearance and the motion information, and after fusing the text and the audio information,the accuracy of the emotion analysis is effectively improved.

A socio-emotional classification method based on multimodal fusion

A socio-emotional classification method based on multimodal fusion

A socio-emotional classification method based on multimodal fusion

Owner:CHONGQING UNIV OF POSTS & TELECOMM

Dual Stage Attention Based Recurrent Neural Network for Time Series Prediction

ActiveUS20180060665A1Image enhancementDigital data information retrievalDual stageTimestamp

Systems and methods for time series prediction are described. The systems and methods include encoding driving series into encoded hidden states, the encoding including adaptively prioritizing driving series at each timestamp using input attention, the driving series including data sequences collected from sensors. The systems and methods further includes decoding the encoded hidden states to generate a predicting model, the decoding including adaptively prioritizing encoded hidden states using temporal attention. The systems and methods further include generating predictions of future events using the predicting model based on the data sequences. The systems and methods further include generating signals for initiating an action to devices based on the predictions.

Dual Stage Attention Based Recurrent Neural Network for Time Series Prediction

Dual Stage Attention Based Recurrent Neural Network for Time Series Prediction

Dual Stage Attention Based Recurrent Neural Network for Time Series Prediction

Owner:NEC CORP

Road network state prediction method based on depth space-time convolution circulation network

InactiveCN107180530AAchieve forecastDetection of traffic movementNeural architecturesState predictionRoad networks

The invention discloses a road network state prediction method based on a depth space-time convolution circulation network. The method comprises steps of step 1, establishing a sample set and dividing a training set and a test set; step 2, modeling a road network state prediction model, wherein the road network state prediction module comprises a convolution neural network and a recurrent neural network, the convolution neural network is used for extracting space features of the road network and the recurrent neural network is used for extracting sequential rules of the evolution of the road network; and step 3, carrying out road network state prediction in a certain period in the future. According to the invention, through the depth convolution neural network, relations between sections in the road network are extracted; through the recurrent neural network, the sequential rules of the revolution of the road network are extracted; and quite precise prediction of road network traffic states can be obtained by fully considering the space-time information of the road network.

Road network state prediction method based on depth space-time convolution circulation network

Road network state prediction method based on depth space-time convolution circulation network

Road network state prediction method based on depth space-time convolution circulation network

Owner:BEIHANG UNIV

Popular searches

Output device Concatenation Analog device Human language Utterance Language model Telephone equipment Combined use Speech synthesis Speech sound

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com