Enterprise classification method and system based on big data deep learning and electronic equipment
An enterprise classification technology based on deep learning, applied in neural learning methods, text database clustering/classification, text database querying, etc. It addresses the problems of low classification efficiency, the lack of self-learning iteration, and the inability to classify enterprises accurately. Its effects are high learning ability and accuracy, reduced manual intervention, and improved recognition and completion ability.
Examples
Embodiment 1
[0042] Referring to Figure 1, this embodiment provides a method for classifying enterprises based on big data deep learning, comprising the following steps:
[0043] S1: Obtain comprehensive information about the enterprise to form a big data set;
[0044] S2: Based on a CRF word segmentation model and a probabilistic graphical model, extract the keyword set of enterprise components, perform preprocessing, and train the corresponding word vector model; then, for the constructed word vector model, use a density clustering algorithm to predict several feature keyword sets, and remove noise words or update the noise thesaurus;
[0045] S3: Use the FastText text classification model to perform TF-IDF screening on the word set, use the LDA model to perform topic analysis on the big data set and extract keywords about the enterprise, and, based on expert-recommended thresholds, use the density clustering algorithm to predict several sets of subject terms;
[004...
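The TF-IDF screening mentioned in step S3 can be illustrated with a minimal sketch. It covers only plain TF-IDF scoring and thresholding; it does not reproduce the FastText or LDA stages, and the function names, the toy corpus, and the threshold value of 0.2 are all assumptions made for illustration, not details taken from the patent.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            # Term frequency times the log of inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def screen_keywords(docs, threshold):
    """Keep, per document, the terms whose tf-idf exceeds the threshold."""
    return [
        sorted(term for term, score in doc_scores.items() if score > threshold)
        for doc_scores in tfidf_scores(docs)
    ]


# Hypothetical, already-segmented enterprise descriptions (assumed data).
corpus = [
    ["company", "software", "cloud", "software"],
    ["company", "steel", "casting"],
    ["company", "cloud", "storage"],
]
# The threshold is an illustrative assumption, not a value from the patent.
kept = screen_keywords(corpus, threshold=0.2)
print(kept)
```

In this toy corpus the word "company" appears in every document, so its inverse document frequency is zero and it is screened out, while words specific to one description survive.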
Embodiment 2
[0066] Referring to Figure 2 and Figure 3, Embodiment 2 provides an enterprise classification system that applies the enterprise classification method based on big data deep learning of Embodiment 1, including:
[0067] A corpus text module, configured to obtain comprehensive information about enterprises to form a big data set; the corpus text module obtains the comprehensive enterprise information, then samples and organizes corpus texts from the cleaned big data, and this large body of corpus texts constitutes the big data set;
[0068] A feature keyword generation module, configured to extract the enterprise component keyword set based on the CRF word segmentation model and the probabilistic graphical model, perform preprocessing, train the corresponding word vector model, and, for the constructed word vector model, use the density clustering algorithm to predict several feature ke...
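The density clustering step of the feature keyword generation module can be sketched as follows. The text does not say which density clustering algorithm is used, so this sketch assumes DBSCAN, a common choice; the two-dimensional vectors stand in for trained word vectors and, like the `eps` and `min_pts` values, are made up for illustration. Words that fall in no dense region are labelled as noise, which is one plausible reading of how noise words could be identified.

```python
import math


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise."""
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points)
                if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may be relabelled later as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:
                queue.extend(nbrs)  # j is a core point, keep expanding
    return labels


# Toy 2-D stand-ins for word vectors (assumed data, not from the patent).
words = ["software", "cloud", "server", "steel", "casting", "forging", "the"]
vectors = [
    (0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
    (5.0, 5.0), (5.1, 5.0), (5.0, 5.1),
    (9.0, 0.0),
]
labels = dbscan(vectors, eps=0.5, min_pts=2)
noise_words = [w for w, label in zip(words, labels) if label == -1]
print(labels, noise_words)
```

Each non-negative label here would correspond to one candidate feature keyword set, and the words labelled -1 are candidates for removal or for addition to a noise thesaurus.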
Embodiment 3
[0074] Embodiment 3 provides an electronic device including a processor and a memory. The memory stores at least one instruction, at least one program, a code set, or an instruction set, which is loaded and executed by the processor to implement the enterprise classification method based on big data deep learning of Embodiment 1.