Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

711 results about "Recurrent neural nets" patented technology

Recurrent neural networks (RNNs) are a kind of neural net often used to model sequence data. They maintain a hidden state which can "remember" certain aspects of the sequence it has seen. RNNs can be trained using backpropagation through time, although efficient training remains an open problem.

Voice identification method using long-short term memory model recurrent neural network

The invention discloses a voice identification method using a long-short term memory model recurrent neural network. The voice identification method comprises training and identification. The training process comprises steps of introducing voice data and text data to generate a commonly-trained acoustic and language mode, and using an RNN sensor to perform decoding to form a model parameter. The identification process comprises steps of converting voice input to a frequency spectrum graph through Fourier conversion, using the recursion neural network of the long-short term memory model to perform orientational searching decoding and finally generating an identification result. The voice identification method adopts the recursion neural network (RNNs) and adopts connection time classification (CTC) to train RNNs through an end-to-end training method. These LSTM units combining with the long-short term memory have good effects and combines with multi-level expression to prove effective in a deep network; only one neural network model (end-to-end model) exits from a voice characteristic (an input end) to a character string (an output end) and the neural network can be directly trained by a target function which is a some kind of a proxy of WER, which avoids to cost useless work to optimize an individual target function.
Owner:SHENZHEN WEITESHI TECH

Bidirectional long short-term memory unit-based behavior identification method for video

The invention discloses a bidirectional long short-term memory unit-based behavior identification method for a video. The method comprises the steps of (1) inputting a video sequence and extracting an RGB (Red, Green and Blue) frame sequence and an optical flow image from the video sequence; (2) respectively training a deep convolutional network of an RGB image and a deep convolutional network of the optical flow image; (3) extracting multilayer characteristics of the network, wherein characteristics of a third convolutional layer, a fifth convolutional layer and a seventh fully connected layer are at least extracted, and the characteristics of the convolutional layers are pooled; (4) training a recurrent neural network constructed by use of a bidirectional long short-term memory unit to obtain a probability matrix of each frame of the video; and (5) averaging the probability matrixes, finally fusing the probability matrixes of an optical flow frame and an RGB frame, taking a category with a maximum probability as a final classification result, and thus realizing behavior identification. According to the method, the conventional artificial characteristics are replaced with multi-layer depth learning characteristics, the depth characteristics of different layers represent different pieces of information, and the combination of multi-layer characteristics can improve the accuracy rate of classification; and the time information is captured by use of the bidirectional long short-term memory, many pieces of time domain structural information are obtained and a behavior identification effect is improved.
Owner:SUZHOU UNIV

Deep learning-based short-term traffic flow prediction method

The present invention discloses a deep learning method-based short-term traffic flow prediction method. The influence of the traffic flow rate change of the neighbor points of a prediction point, the time characteristic of the prediction point and the influence of the periodic characteristic of the prediction point on the traffic flow rate of the prediction point are considered simultaneously. According to the deep learning method-based short-term traffic flow prediction method of the invention, a convolutional neural network and a long and short-term memory (LSTM) recurrent neural network are combined to construct a Conv-LSTM deep neural network model; a two-way LSTM model is used to analyze the traffic flow historical data of the point and extract the periodic characteristic of the point; and a traffic flow trend and a periodic characteristic which are obtained through analysis are fused, so that the prediction of traffic flow can be realized. With the method of the invention adopted, the defect of the incapability of an existing method to make full use of time and space characteristics can be eliminated, the time and space characteristics of the traffic flow are fully extracted, and the periodic characteristic of the data of the traffic flow is fused with the time and space characteristics, and therefore, the accuracy of short-term traffic flow prediction results can be improved.
Owner:FUZHOU UNIV

Design method of hardware accelerator based on LSTM recursive neural network algorithm on FPGA platform

The invention discloses a method for accelerating an LSTM neural network algorithm on an FPGA platform. The FPGA is a field-programmable gate array platform and comprises a general processor, a field-programmable gate array body and a storage module. The method comprises the following steps that an LSTM neural network is constructed by using a Tensorflow pair, and parameters of the neural networkare trained; the parameters of the LSTM network are compressed by adopting a compression means, and the problem that storage resources of the FPGA are insufficient is solved; according to the prediction process of the compressed LSTM network, a calculation part suitable for running on the field-programmable gate array platform is determined; according to the determined calculation part, a softwareand hardware collaborative calculation mode is determined; according to the calculation logic resource and bandwidth condition of the FPGA, the number and type of IP core firmware are determined, andacceleration is carried out on the field-programmable gate array platform by utilizing a hardware operation unit. A hardware processing unit for acceleration of the LSTM neural network can be quicklydesigned according to hardware resources, and the processing unit has the advantages of being high in performance and low in power consumption compared with the general processor.
Owner:SUZHOU INST FOR ADVANCED STUDY USTC

Adaptive acoustic channel equalizer & tuning method

A method and apparatus for data communication in an oil well environment, wherein the method comprises detecting an acoustic signal transmitted along an acoustic channel, the acoustic signal being distorted from transmission through the acoustic channel, generating a transmitted data signal in response to the acoustic signal, inputting the transmitted data signal to an adaptive equalizer and adaptively equalizing the transmitted data signal to produce an equalized data signal related to the transmitted data signal by a mathematical function. The detecting step may include positioning an acoustic receiver in a communication unit along the acoustic channel. The communication unit may be positioned downhole and the adaptive equalizer may be positioned remotely relative to the communication unit or may be placed in the communication unit. The adaptive equalizer may be a frequency domain filter, a neural net adaptive equalizer or a nonlinear recurrent neural net equalizer. The acoustic signal may comprise a plurality of discrete transmissions which may be a training sequence for training the adaptive equalizer and may comprise a first discrete transmission transmitted repeatedly.The method of data communication in an oil well environment may comprise the steps of transmitting an acoustic signal from a first location along an acoustic channel, detecting the acoustic signal at a second location along the acoustic channel, generating a transmitted data signal in response to the acoustic signal, inputting the transmitted data signal to an adaptive equalizer and adaptively equalizing the transmitted data signal to produce an equalized data signal related to the transmitted data signal by a mathematical function. The transmitting step may further comprise positioning an acoustic transmitter in a first communication unit along the acoustic channel downhole or elsewhere. The method may further comprise acquiring data, generating an original data signal in response to the acquired data and inputting the original data signal to the acoustic transmitter. The acoustic signal may comprise a series of acoustic training signals for training the adaptive equalizer. The acoustic training signals may be transmitted at a predetermined time. A stored training signal may include a series of stored training data signals corresponding to the series of acoustic training signals. At least a portion of the stored training signals may be cross-correlated to the transmitted data signal. The acoustic signal may comprise a notification signal for notifying the adaptive equalizer of a training session.
Owner:WELLDYNAMICS INC

Behavior identification method based on recurrent neural network and human skeleton movement sequences

The invention discloses a behavior identification method based on a recurrent neural network and human skeleton movement sequences. The method comprises the following steps of normalizing node coordinates of extracted human skeleton posture sequences to eliminate influence of absolute space positions, where a human body is located, on an identification process; filtering the skeleton node coordinates through a simple smoothing filter to improve the signal to noise ratio; sending the smoothed data into the hierarchic bidirectional recurrent neural network for deep characteristic extraction and identification. Meanwhile, the invention provides a hierarchic unidirectional recurrent neural network model for coping with practical real-time online analysis requirements. The behavior identification method based on the recurrent neural network and the human skeleton movement sequences has the advantages of designing an end-to-end analyzing mode according to the structural characteristics and the motion relativity of human body, achieving high-precision identification and meanwhile avoiding complex computation, thereby being applicable to practical application. The behavior identification method based on the recurrent neural network and the human skeleton movement sequence is significant to the fields of intelligent video monitoring based on the depth camera technology, intelligent traffic management, smart city and the like.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI

Computer vision-based express parcel violent sorting identification method

ActiveCN106897670ALow priceFacilitate large-scale deploymentCharacter and pattern recognitionHuman bodyFeature extraction
The present invention discloses a computer vision-based express parcel violent sorting identification method. The method comprises the following steps of: a depth camera-based pose estimation: a human pose estimation problem is converted into a problem of classifying depth image pixels captured by a depth camera, and human body pose estimation is realized by using a random forest method; human body three-dimensional pose relative spatial-temporal feature extraction: relative spatial-temporal positions of geometric elements of points, lines and surfaces formed by joints in three-dimensional poses and the measures of the change of the relative spatial-temporal positions are extracted and are adopted as the feature representations of the poses; and recurrent neural network-based violent sorting identification: modeling training is performed on the pose spatial-temporal relative features which are continuous in time and are extracted from the human body three-dimensional poses through a long and short memory model (LSTM), so that the identification of express parcel violent sorting behaviors can be realized.
Owner:NANJING UNIV OF POSTS & TELECOMM

Generation method of image description from structured text

The invention discloses a generation method of an image description from a structured text. The generation method comprises the steps of downloading pictures from the internet to form a picture training set; conducting morphological analysis on descriptions which correspond to the pictures in the picture training set to form the structured text; using an existing neural network model to extract convolution neural network characteristics of the pictures in the training set, and using <, picture characteristics and structured text < as inputs to form a multitasking recognition model; using the structured text extracted from the training set and a description which corresponds to the structured text as inputs of a recurrent neural network, and conducting training to obtain a parameter of a recurrent neural network model; inputting the convolution neural network characteristics of an image ready to be described, and obtaining a predicted structured text through the multitasking recognition model; inputting the predicted structured text, and obtaining the image description through the recurrent neural network model. Compared with the prior art, a better image description effect, accuracy and sentence variety can be generated through the method, and the generation method of the image description from the structured text can be effectively popularized in an application of image retrieval.
Owner:哈尔滨米兜科技有限公司

Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks

A spoken language understanding (SLU) system receives a sequence of words corresponding to one or more spoken utterances of a user, which is passed through a spoken language understanding module to produce a sequence of intentions. The sequence of words are passed through a first subnetwork of a multi-scale recurrent neural network (MSRNN), and the sequence of intentions are passed through a second subnetwork of the multi-scale recurrent neural network (MSRNN). Then, the outputs of the first subnetwork and the second subnetwork are combined to predict a goal of the user.
Owner:MITSUBISHI ELECTRIC RES LAB INC

Dual Stage Attention Based Recurrent Neural Network for Time Series Prediction

Systems and methods for time series prediction are described. The systems and methods include encoding driving series into encoded hidden states, the encoding including adaptively prioritizing driving series at each timestamp using input attention, the driving series including data sequences collected from sensors. The systems and methods further includes decoding the encoded hidden states to generate a predicting model, the decoding including adaptively prioritizing encoded hidden states using temporal attention. The systems and methods further include generating predictions of future events using the predicting model based on the data sequences. The systems and methods further include generating signals for initiating an action to devices based on the predictions.
Owner:NEC CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products