A Classifier-based Information Classification Method for Shopping Guide Web Pages

A classification method and classifier technology, applied in the field of information classification of shopping guide webpages, can solve the problems of manpower and time, achieve accurate weight values ​​and reduce manual participation

Inactive Publication Date: 2017-08-08
北京中搜云商网络技术有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] If you want to be a good shopping guide website, shopping guide web pages are indispensable, but there are a lot of shopping guide articles on the Internet, how to meet the needs of users in a short period of time has become a problem
[0005] It is one of the feasible solutions to realize the screening by classifying information on shopping guide webpages. However, the traditional manual classification method consumes a lot of manpower and time, and the demand for machine classification has to be put on the agenda.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Classifier-based Information Classification Method for Shopping Guide Web Pages

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0065] For shopping guide data such as 3C digital, set sub-categories including:

[0066] "Information, new products, evaluation, shopping guide, market, knowledge, experience", the whole process includes:

[0067] (1) First, through the information gain calculation process, a batch of weight words that can be used for calculation is obtained;

[0068] (2) Then train this batch of weighted words and training data to obtain the weight value of the weighted words under each category, that is, each category gets a weight vector;

[0069] (3) Finally, in the formal process, the weight vector is dot-multiplied to obtain the final classification.

[0070] Assuming that step (1) has been completed and a batch of weighted words (see the first column of the table below), set in step (2):

[0071] The maximum threshold is: 2

[0072] The minimum threshold is: 0.8

[0073] The training stop condition is:

[0074] (1) The number of training sessions exceeds 100; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a shopping guide webpage information classifying method achieved based on a classifier. The method includes (1) processing shopping guide webpage data to generate a weight vector word list; (2) training shopping guide webpage to obtain weight vector of the word list under each class; (3) conducting computing through the weight vector to achieve automatic classification of the shopping guide webpage. The method is high in efficiency and simple, replaces manual classification and achieves information automatic classification aiming at the shopping guide webpage through program. Filtering is conducted from a data source, training and classification are only conducted on the shopping guide webpage, and the obtained weight words are more credible. In a formal process, manual participation is greatly reduced, even an automatic classification result can be directly used without manual check, and the classification accuracy can reach over 80%.

Description

technical field [0001] The invention belongs to an information classification method, and in particular relates to an information classification method of a shopping guide webpage realized based on a classifier. Background technique [0002] With the development of society, people's lives are becoming richer and richer both materially and spiritually. In comparison, the time available every day is very short, and the rapid development of the Internet has also made more and more More and more consumers are more willing to directly select products online instead of wasting time on long outdoor journeys. Therefore, many traditional companies have to turn to e-commerce. For a while, online shopping has become a new trend. Vocabulary is full of major websites and forums, and what follows is that the comparison of prices and quantities of major e-commerce companies is more affordable. [0003] However, due to the large number of e-commerce companies, the variety of product models...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06Q30/02
CPCG06F16/353G06Q30/02
Inventor 杨佳吴尉林
Owner 北京中搜云商网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products