Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Public opinion classification optimization method for long text

A classification optimization and long text technology, applied in the field of long text public opinion classification optimization, can solve the problems of reducing the possibility of positive or negative and not easy to find

Pending Publication Date: 2022-03-08
时趣互动(北京)科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This is because, on the one hand, in the longer content, most of the paragraphs are indeed neutral and objective statements, while a small number of text fragments expressing public opinion tendencies are only mixed in, and it is not easy to find even manual reading; on the other hand On the one hand, when the bert model classifies text public opinion, it only gives the judgment of public opinion tendency from the whole article, which can be considered as a weighted average of the public opinion tendency of the entire text, and the longer the article, the more likely it is to lower the judgment as Positive or Negative Likelihood
As a result, when classifying public opinion on long texts, the important public opinion fragments contained in it are ignored, and the overall public opinion judgment is given as neutral.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Public opinion classification optimization method for long text
  • Public opinion classification optimization method for long text
  • Public opinion classification optimization method for long text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0037] see Figure 1-2 , the present invention provides a technical solution:

[0038] The public opinion classification optimization method for long texts includes the following steps:

[0039] a. Use the traditional bert fine-tuned model to judge the public opinion on the input text. For the text judged as neutral public opinion, judge whether the length of the text exceeds the set length threshold. The threshold is 300, that is, whether the text has 300 charac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a long text public opinion classification optimization method, which comprises the following steps of: a, performing public opinion judgment on an input text by using a traditional bert fine-tuning model, and judging whether the length of the text which is judged to be a neutral public opinion exceeds a set length threshold value or not; b, if not, maintaining an original public opinion judgment result, and if yes, performing more detailed public opinion analysis; and c, sending the current text to the pre-trained and fine-tuned bert models at the same time to obtain semantic vectors of each character in the current text before and after fine tuning. According to the method and the device, the character semantic change of the bert model before and after fine tuning is applied to the public opinion classification task for the long text; by identifying the text fragments with the public opinion tendency, the probability that the whole text fragments are judged to be neutral is reduced, and the detailed public opinion tendency of the user is better identified.

Description

technical field [0001] The invention relates to the technical field of text public opinion classification, in particular to a long text public opinion classification optimization method. Background technique [0002] When classifying public opinion on texts with many texts and long texts, the Bert model commonly used in the industry often gives a "neutral" judgment. This is because, on the one hand, in the longer content, most of the paragraphs are indeed neutral and objective statements, while a small number of text fragments expressing public opinion tendencies are only mixed in, and it is not easy to find even manual reading; on the other hand On the one hand, when the bert model classifies text public opinion, it only gives the judgment of public opinion tendency from the whole article, which can be considered as a weighted average of the public opinion tendency of the entire text, and the longer the article, the more likely it is to lower the judgment as Positive or ne...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35
CPCG06F16/35
Inventor 唐亮曹特磊赵伟
Owner 时趣互动(北京)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products