A Classification Method of Enterprise Industry

A classification method and industry technology, applied in the fields of instruments, calculation, and character and pattern recognition, that addresses problems such as low classification accuracy.

Active Publication Date: 2020-11-24
广州探迹科技有限公司

AI Technical Summary

Problems solved by technology

In addition, a single classifier model depends too heavily on how well the training samples cover the possible descriptions, so it is less accurate when classifying a new sample whose description has never appeared before.




Embodiment Construction

[0045] The accompanying drawings are for illustrative purposes only and should not be construed as limiting this patent. To better illustrate this embodiment, certain components in the drawings may be omitted, enlarged, or reduced, and they do not represent the size of the actual product. Those skilled in the art will understand that some well-known structures and their descriptions may be omitted from the drawings. The present invention is described in further detail below with reference to the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0046] The main innovation of the enterprise industry classification method of the present invention is to use word vectors and a semi-supervised graph splitting clustering method to extract the main business keywords of an enterprise, eliminate junk words, and construct a keyword library; use the extracted keywords as feat...
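The keyword-extraction step described above can be illustrated with a minimal sketch. The excerpt does not spell out the algorithm, so everything here is an assumption: "graph splitting" is read as cutting low-similarity edges of a word-similarity graph and taking connected components, and "semi-supervised" is read as keeping only clusters that contain a hand-labelled seed keyword and no labelled junk word. The function names, the threshold, and the toy vectors are hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def split_graph(vectors, threshold):
    """Link words whose similarity is >= threshold; return connected components."""
    words = list(vectors)
    neighbours = {w: set() for w in words}
    for a, b in combinations(words, 2):
        if cosine(vectors[a], vectors[b]) >= threshold:
            neighbours[a].add(b)
            neighbours[b].add(a)
    seen, clusters = set(), []
    for w in words:
        if w in seen:
            continue
        stack, component = [w], set()
        while stack:  # depth-first walk over one component
            cur = stack.pop()
            if cur in component:
                continue
            component.add(cur)
            stack.extend(neighbours[cur] - component)
        seen |= component
        clusters.append(component)
    return clusters


def build_keyword_library(vectors, seeds, junk_seeds, threshold=0.8):
    """Assumed rule: keep clusters with a labelled keyword and no labelled junk word."""
    library = set()
    for cluster in split_graph(vectors, threshold):
        if cluster & seeds and not cluster & junk_seeds:
            library |= cluster
    return library


# Toy 2-D "word vectors" (hypothetical values, not taken from the patent).
toy_vectors = {
    "software": (0.9, 0.1),
    "programming": (0.85, 0.2),
    "catering": (0.1, 0.9),
    "restaurant": (0.15, 0.95),
    "limited": (-0.7, -0.7),
    "company": (-0.65, -0.75),
}
keyword_library = build_keyword_library(
    toy_vectors, seeds={"software", "catering"}, junk_seeds={"limited"}
)
print(sorted(keyword_library))
```

In this toy run, unlabelled words such as "programming" join the library because they cluster with a seed, while "company" is dropped because its cluster contains the junk seed "limited". A real system would use trained word vectors and may split the graph differently.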



Abstract

The invention discloses an enterprise industry classification method. The method uses a semi-supervised graph splitting clustering algorithm to extract the main business keywords of an enterprise, and uses the extracted keywords as features to train a cascade classifier based on gradient boosting decision trees. The cascade classifier classifies enterprises by industry, which removes the burden of manual classification. The specific method is:
1) Use word vectors and the semi-supervised graph splitting clustering algorithm to extract the main business keywords of the enterprise, eliminate junk words, and construct the keyword library;
2) Input the extracted keywords as features to train the cascade classifier; the classifier at each level classifies the enterprises, and enterprises left unclassified are passed to the classifier at the next level.
The invention can automatically construct, update, and classify keywords, handles industry classification for tens of millions of enterprises, and reduces the need for manual labeling.
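The cascade control flow in step 2 can be sketched as follows. This is only an illustration under stated assumptions: each level is represented by a toy keyword-vote classifier standing in for a trained gradient boosting decision tree, and a level "leaves an enterprise unclassified" when its confidence falls below a hypothetical threshold. The function names, industry labels, and thresholds are invented for the example.

```python
from collections import Counter


def make_keyword_stage(keyword_to_industry, min_confidence):
    """Build one cascade level: a toy keyword-vote classifier.

    Stand-in for a trained gradient boosting decision tree (assumption).
    Returns None when the level declines to classify.
    """
    def classify(keywords):
        votes = Counter(
            keyword_to_industry[k] for k in keywords if k in keyword_to_industry
        )
        if not votes:
            return None
        industry, count = votes.most_common(1)[0]
        confidence = count / len(keywords)
        return industry if confidence >= min_confidence else None
    return classify


def cascade_classify(keywords, stages):
    """Return (industry, level) from the first level that accepts, else (None, None)."""
    for level, stage in enumerate(stages, start=1):
        industry = stage(keywords)
        if industry is not None:
            return industry, level
    return None, None


# Hypothetical two-level cascade: a strict first level, a more lenient second level.
stages = [
    make_keyword_stage(
        {"software": "IT", "programming": "IT", "restaurant": "Catering"}, 0.6
    ),
    make_keyword_stage(
        {"software": "IT", "hosting": "IT", "restaurant": "Catering",
         "takeaway": "Catering"}, 0.3
    ),
]

print(cascade_classify(["software", "programming", "consulting"], stages))
print(cascade_classify(["takeaway", "delivery", "packaging"], stages))
print(cascade_classify(["steel", "mining"], stages))
```

The first enterprise is accepted at level 1, the second falls through to level 2, and the third is left unclassified by every level. Replacing the toy stages with trained models that expose a confidence score would keep the same loop.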

Description

Technical field

[0001] The present invention relates to the field of data classification methods and, more specifically, to an enterprise industry classification method that extracts industry keywords and, where an enterprise's business scope overlaps with multiple industry descriptions, fuses semi-supervised graph splitting clustering with cascaded gradient boosting decision trees.

Background technique

[0002] The industry classification standard issued by the National Bureau of Statistics of the People's Republic of China in 2013 defines 20 first-level industries and 96 second-level industries. The industry label of an enterprise is an important field; there are tens of millions of enterprises across the country, and many new enterprises are incubated every day. How to quickly classify enterprises by industry is therefore an important problem. In the previous industry classification norms, the industry to which an enterpr...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06K9/62
CPC: G06F18/23213, G06F18/241, G06F18/214
Inventor: 陈开冉, 吴璐璐
Owner: 广州探迹科技有限公司