Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

AI-based objectified attribute text automatic classification method and system

A technology of automatic classification and classification method, which is applied in text database clustering/classification, unstructured text data retrieval, natural language data processing, etc. It can solve the problem of large manual workload, no description, calculation of text similarity and value, etc problems, to solve fatigue and work interest, avoid development, and reduce the workload of data processing

Pending Publication Date: 2021-06-15
北京星汉博纳医药科技有限公司
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The above-mentioned patent also has the following disadvantages: in most algorithms of machine learning, it is not possible to directly use text as a feature value for training. In the current feature engineering description, it does not explain how to calculate the similarity and value between texts. In this way, it is difficult to achieve the specified goal through model training; the algorithm mentions "manually labeled real label data" and uses it as positive sample data for training. If you want the results to be ideal, you must go through a lot of manual labeling. Yes, this kind of manual workload is huge, and it takes a lot of time to implement the algorithm. If there is a new classification situation, manual participation is required, and it takes the same or more time to label, which is obviously not very realistic;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • AI-based objectified attribute text automatic classification method and system
  • AI-based objectified attribute text automatic classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention.

[0030] In describing the present invention, it should be understood that the terms "upper", "lower", "front", "rear", "left", "right", "top", "bottom", "inner", " The orientation or positional relationship indicated by "outside", etc. is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the referred device or element must have a specific orientation, so as to Specific orientation configurations and operations, therefore, are not to be construed as limitations on the invention.

[0031] 1. Reference figure...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of data analysis and data mining, and particularly relates to an AI-based objectified attribute text automatic classification method and system, and the method comprises the following core steps: building a character coding library, carrying out character decomposition on all text data which are stored in a library in history, numbering single characters uniquely in the library, and integers being used as self-increasing numbers according to numbering rules; preprocessing standard attribute data, extracting stored standard data as to-be-trained data, and limiting the length of a character string to be 60 Chinese characters, such as drug general names, drug specifications, drug production enterprises, approved numbers and the like, and clearly expressing fields of data attribute characteristics. Through the invention, the subject attribute category described by a section of data can be quickly judged, and then whether the attribute category is consistent with the subject design or not is judged; in addition, attribute classification judgment can be carried out on a plurality of adjacent data, and the position of the main body description information can be positioned in the webpage.

Description

technical field [0001] The present invention relates to the technical field of data analysis and data mining, in particular to an AI-based automatic classification method and system for object-oriented attribute texts. Background technique [0002] In just five years, the number of people using the Internet has increased by 83%. Taking Weibo as an example, the monthly active users of Weibo increased to 462 million at the end of 2018, and the average daily text publishing volume reached 130 million. In the face of massive amounts of data, simple manual management and induction of different types of information will cost a lot of time and money. More and more applications are adopting automatic text classification technology, including spam comment recognition, pornography recognition, news classification, sentiment analysis, etc. Text classification technology is in a period of rapid development under the background of big data. [0003] After searching, the Chinese patent ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F16/951G06F40/126G06K9/62G06N20/00
CPCG06F16/35G06F16/951G06F40/126G06N20/00G06F18/214
Inventor 王建伟
Owner 北京星汉博纳医药科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products