Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Shallow layer model and depth model combination-based question text classification method

A deep model, text classification technology, applied in semantic analysis, special data processing applications, instruments, etc., to achieve the effect of improving performance, improving classification performance, and improving accuracy

Inactive Publication Date: 2018-09-28
KUNMING UNIV OF SCI & TECH
View PDF5 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a question text classification method based on the combination of a shallow model and a deep model. Aiming at the problems existing when a single deep model faces unbalanced training data, the traditional shallow model has strong memory characteristics for features, effectively improve the accuracy of question classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Shallow layer model and depth model combination-based question text classification method
  • Shallow layer model and depth model combination-based question text classification method
  • Shallow layer model and depth model combination-based question text classification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] Embodiment 1: as Figure 1-4 As shown, the question text classification method based on the combination of the shallow model and the deep model, the specific steps of the method are as follows:

[0041] Step1. Crawl the question corpus of 5 categories of economy and finance, laws and regulations, sports, medical and health, and electronic digital, and then preprocess the corpus text;

[0042] Further, the specific steps of the step Step1 are as follows:

[0043] Step1.1. First, manually write a crawler program to crawl five categories of question corpus on Baidu Zhizhi: economics and finance, laws and regulations, sports, medical and health, and electronic digital;

[0044] Step1.2. Filter and deduplicate the crawled corpus to obtain non-repetitive question corpus, and store it in the database;

[0045] The present invention crawls 5,000 pieces of corpus on each of the five categories of economics and finance, laws and regulations, sports, medical and health, and elec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a shallow layer model and depth model combination-based question text classification method, and belongs to the technical field of computer natural language processing. The method comprises the following steps of: firstly extracting a feature word set of a question text, obtaining corresponding feature word weights by utilizing normalized word vectors after vectorization,and taking the corresponding feature word weights as a part of input of a shallow layer linear model; convoluting the question text by a convolutional network by using multiple convolution cores withdifferent window sizes, rearranging the feature vectors extracted by different convolution cores with same-length convolution windows, respectively inputting the feature vectors into corresponding recurrent neural networks, and finally taking syntax semantic feature vectors, obtained by linking outputs of the plurality of recurrent neural networks together, of the question as another part of inputof the shallow layer linear model; and finally obtaining a classification result of the question by the shallow layer model according to an input spliced by the feature word vectors and an output ofa depth model. The method is capable of overcoming the defects of single depth models and effectively enhancing the question classification correctness.

Description

technical field [0001] The invention relates to a question text classification method based on the combination of a shallow model and a deep model, and belongs to the technical field of computer natural language processing. Background technique [0002] Question text classification belongs to short text classification and plays an important role in automatic question answering systems. Question text classification mainly classifies questions by analyzing their content. In the early days, there were rule-based methods, which used the corresponding relationship between keywords or grammatical patterns of questions and question types to classify questions. This method works well for question classification with obvious interrogative words or question category feature words, but not for more complex questions or question sentence texts without obvious category feature words, and the flexibility of the method is not enough. High, the workload is large, and the subjectivity of q...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
CPCG06F40/289G06F40/30
Inventor 黄青松余慧郭勃刘利军冯旭鹏
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products