Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Text Intent Recognition Method and System Based on Projected Gradient Descent and Label Smoothing

A projection gradient descent and recognition method technology, applied in neural learning methods, biological neural network models, semantic analysis, etc., can solve problems such as difficulty in adapting to limited training samples, weak semantic coding ability, high model complexity, etc., and achieve good generalization Effect, Strong Resilience, Scale-Up Effect

Active Publication Date: 2020-12-08
ZHEJIANG UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the problem in the field of intent recognition, the existing text classification model lacks a good trade-off between model complexity and model generalization performance. Too few parameters tend to make the semantic coding ability weak, and the accuracy rate is low in the case of many classification categories. ; Too many parameters make the model too complex and difficult to adapt to the situation of limited training samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Text Intent Recognition Method and System Based on Projected Gradient Descent and Label Smoothing
  • A Text Intent Recognition Method and System Based on Projected Gradient Descent and Label Smoothing
  • A Text Intent Recognition Method and System Based on Projected Gradient Descent and Label Smoothing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0095] In order to verify the implementation effect of the present invention, comparison and ablation experiments were carried out on two large-scale public data sets IFLYTEK and TNEW. IFLYTEK is a long text classification data set, which contains a total of 17,000 long text annotation data about app application descriptions, including various application topics related to daily life, a total of 119 categories: "Taxi": 0, "Navigation": 1,"Free WIFI": 2,...,"Receipt": 117,"Others": 118, each category can be regarded as a type of intent in the question answering system. The data set is divided into three parts: training set, verification set, and test set, with 12133, 2599, and 2600 long texts respectively.

[0096] TNEW is a short text classification dataset from the news section of Toutiao. It extracts 15 categories of news, including tourism, education, finance, military, etc. The data set is also divided into three parts: training set, verification set, and test set, with 5...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text intent recognition method and system based on projection gradient descent and label smoothing, and relates to the field of natural language processing question answering systems. Including (1) obtaining the initial vector encoding through the embedding layer; (2) using the projected gradient descent algorithm to add disturbances that satisfy the L2 constraint in the embedding layer to form an adversarial sample; (3) using the Transformer network to encode contextual semantic information; (4) using labels Smoothly scale the real intent category; (5) Input the encoder output features into the classifier, and calculate the cross entropy between the smoothed label; (6) Optimize the objective function; (7) After the model is trained, predict the intent category and output. In the classification task, the model of the present invention can fully encode the semantic vector of the input intention; at the same time, add disturbance to the text embedding layer to form an adversarial sample, and perform label sliding on the final classification target, which can significantly improve the robustness and generalization of the model ability.

Description

technical field [0001] The invention relates to the field of natural language processing question answering systems, in particular to a text intent classification method and system based on projection gradient descent and label smoothing. Background technique [0002] With a large number of publicly available online question answering corpora, question answering systems have received attention from researchers in both industry and academia. Q&A systems are usually based on intelligent products that meet the needs of B-end companies, which can significantly improve work efficiency and relieve customer service personnel. Its greatest hidden value is to automatically accumulate standardized data in actual scenarios, reduce costs and improve efficiency in mining customer service value information, and can also be used for future precision marketing and product upgrades. A typical application of a question answering system is to question and answer knowledge in a certain field, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/30G06N3/04G06N3/08
CPCG06F40/30G06N3/084G06N3/045
Inventor 徐叶琛赵洲
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products