Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text classification method of gating loop unit based on residual jump connection

A cyclic unit and skip connection technology, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, etc., can solve the problem that the accuracy cannot meet the requirements, the word order information is not fully considered, and the hyperparameter adjustment is cumbersome, etc. question

Active Publication Date: 2021-12-17
CHINA THREE GORGES UNIV
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

There are some imperfections in the existing text classification technology, such as: FastText method, its disadvantage is that word order information is not fully considered, TextCNN method, its disadvantage is that on the one hand, it cannot establish longer sequence information, on the other hand, its hyperparameters Adjustment is extra cumbersome
However, with the deepening of the network, the traditional RNN will have the problem of gradient disappearance
Although long-term short-term memory (LSTM) and gated recurrent unit (GRU) methods can capture long-term contextual information, the complexity of their gate structures makes training slow
Although the simple recurrent unit (SRU) method can reduce the training time of the neural network, its accuracy cannot meet the demand

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method of gating loop unit based on residual jump connection
  • Text classification method of gating loop unit based on residual jump connection
  • Text classification method of gating loop unit based on residual jump connection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0187] In order to better display the experimental results in the present invention, the data set used is PTB, and the experimental results are shown in Table 1, Table 2 and Table 3. The data set contains 9998 different word vocabulary, plus special symbols for rare words and sentence end markers, a total of 10000 words. The source code to complete the training and testing of the PTB dataset is based on Pytorch's official language model example. In order to make the comparison more sufficient, here I choose to use the recurrent neural network (RNN), long short-term memory network (LSTM), gated recurrent unit (GRU), Transformer, simple recurrent unit (SRU), high-speed simple recurrent unit on this data set (H-SRU), the residual gated recurrent unit (R-GRU), and the gated recurrent unit (RT-GRU) based on the residual skip connection provided by the present invention were compared. And in order to better compare the pros and cons of each network, the parameter settings of the cy...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text classification method of a gating loop unit based on residual jump connection. The method is characterized in that on the basis of an existing gating loop unit, residual information is introduced by utilizing jump connection and gating limitation of an expressway network is combined, and meanwhile, an unsaturated activation function and batch standardization are used to improve a definition formula of the gating loop unit, so that the gating loop unit can capture an ultra-long-term dependency relationship while coping with the gradient disappearance problem, and residual information is limited not to occupy a main position by utilizing gating of the expressway network. According to the method, the trained network model is utilized to automatically classify the Chinese texts and classify the positive / negative evaluation of the movie review, so that the labor cost is reduced. A plurality of groups of contrast experiments prove that the method has a certain applicability in a text classification task.

Description

technical field [0001] The invention relates to a text classification method of a gated recurrent unit based on a residual skip connection. Background technique [0002] In recent years, deep learning has been widely used in scientific and technological fields such as astronomy, geography, and industry. Time-series data is ubiquitous in our daily lives, from stock market trends, climate data in different cities, product sales over time, energy usage, and more. Among them, the text data on the network is also increasing day by day, and these data are mainly used in fields such as public opinion monitoring, information retrieval, and news text classification. If the data can be effectively classified, it will be more conducive to dig out valuable information, so the management and integration of text data is particularly important. Text classification is a basic task in natural language processing tasks. By using computer-related theoretical knowledge and skills, the purpose...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06N3/04G06N3/08
CPCG06F16/353G06N3/084G06N3/048G06N3/044Y02D10/00
Inventor 孙水发胡枫裴伟董方敏吴义熔梅周俊森
Owner CHINA THREE GORGES UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products