Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Novel network media platform variant comment adversarial text generation method

A network media and variant technology, applied in the field of variant comment adversarial text generation on a new network media platform, can solve the problems of insufficient training data for variant text and low classification accuracy of variant text, and achieve high fidelity effects

Active Publication Date: 2021-08-20
NORTHWESTERN POLYTECHNICAL UNIV
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problem that the variant text classification accuracy of the variant text classification method based on the deep neural network is not high due to insufficient training data, the present invention analyzes the variant text rules and utilizes feature word extraction, word sequence randomization, word Vector and text generation techniques enable different forms of variant adversarial text generation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel network media platform variant comment adversarial text generation method
  • Novel network media platform variant comment adversarial text generation method
  • Novel network media platform variant comment adversarial text generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] Embodiments of the present invention will be described in detail below, and the examples are illustrative, intended to be used to illustrate the invention.

[0060] Based on summarizing the variant text variant rules commonly used by the new network media platform, the present invention is first extracted, and the classification label text is extracted; and the feature words are generated based on variant vocabulary based on various rules, and this foundation The variant text generation based on variant rules is carried out; then the word text is trained by Word2Vec word vector, and the word list is obtained from the word vector to achieve neural network word vector. Variant text generation; Finally, a variant text generation method of combined variant rules and word vector similar words is achieved by probability randomization. Specific process figure 1 Indicated.

[0061] 1. Labeling text feature word extraction

[0062] Variant text generally tends to perform variants su...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a novel network media platform variant comment adversarial text generation method, which comprises the following steps of: on the basis of summarizing variant text variant rules commonly used by a novel network media platform, firstly, extracting feature words from classified annotated texts; carrying out variant vocabulary generation based on various rules on the feature words, and carrying out variant text generation based on variant rules on the basis of the variant vocabulary generation; then training the annotated text through a word2vec word vector method to obtain word vectors of all vocabularies, obtaining a similar word list of all vocabularies according to the word vectors, and achieving variant text generation based on neural network word vectors; and finally, achieving a variant text generation method combining variant rules and word vector similar words through a probability randomization method. According to the method, massive variant texts in different forms can be generated, conventional text filtering can be resisted, and high fidelity is achieved.

Description

Technical field [0001] The present invention relates to a natural language processing review text generation technology, and is specifically a new type of network media platform variant comment against text generation method. Background [0002] New network media platforms, such as shake, fast hand, Netease cloud music, etc. have hundreds of millions of account users, in which some bad users are used to avoid review of spam comments produced by variant methods such as homogenesis. These variant spam comments with negative emotional or poor metaphors have seriously polluted network environments, causing negative impact on platform users, analyzing and correctly identifying these variant spam comments. It is of great significance for platform health development. [0003] The existing variant garbage text classification method mainly includes two categories, one is based on variant word identification and normalization, by extracting variant word characteristics identifying text var...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F40/284G06K9/62
CPCG06F16/35G06F40/284G06F18/22G06F18/2411Y02D10/00
Inventor 刘春刘峥殷茗
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products