Customs declaration commodity intelligent classification method based on historical data mining

A technology for classifying customs declaration commodities by mining historical declaration data, applied in database indexing, data processing applications, structured data retrieval, etc. It addresses the lack of a decision-tree-based intelligent classification method for customs declaration commodities.

Active Publication Date: 2019-11-19
BEIJING JIAOTONG UNIV

AI Technical Summary

Problems solved by technology

Although the decision tree classification algorithm has been applied to some extent in other fields, there is still a lack of a complete and effective intelligent classification method based on decision trees in the field of customs declaration commodity classification.



Examples


Embodiment approach

[0060] Take a real data set from a domestic customs office as an example. It covers 1,204 items and 4,806 sub-items, with 75,251,542 historical records in total, spanning all 12 months of both 2015 and 2016. From the 12 months of 2015, 35 million records were selected to form the training set; from each of the 12 months of 2016, 1,000 records were randomly selected to form the test set.
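The split described above can be sketched as follows. This is a minimal illustration, not the patent's procedure: the record layout (dicts with `year` and `month` keys), the function name `split_by_year`, and the uniform random sampling of the training subset are all assumptions, since the text does not say how the 35 million training records were chosen.

```python
import random
from collections import defaultdict

def split_by_year(records, train_year=2015, test_year=2016,
                  test_per_month=1000, train_size=None, seed=0):
    """Split records into a training set (one year) and a test set
    (a fixed number of random records per month of another year).

    `records` is an iterable of dicts with hypothetical keys
    'year' and 'month'; the real data layout is not given in the text.
    """
    rng = random.Random(seed)
    train = []
    test_pool = defaultdict(list)  # month -> candidate test records
    for rec in records:
        if rec["year"] == train_year:
            train.append(rec)
        elif rec["year"] == test_year:
            test_pool[rec["month"]].append(rec)

    # Assumption: the training subset is drawn uniformly at random.
    if train_size is not None and train_size < len(train):
        train = rng.sample(train, train_size)

    test = []
    for month in sorted(test_pool):
        pool = test_pool[month]
        # Take at most `test_per_month` records from each month.
        test.extend(rng.sample(pool, min(test_per_month, len(pool))))
    return train, test
```

With the figures in the text, the call would use `train_size=35_000_000` and `test_per_month=1000`, giving a test set of 12,000 records.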

[0061] Step S1: analyze the relationship between the category code of a customs declaration commodity and its commodity name, specification and model, and describe the customs declaration commodity classification problem.

[0062] Both the training set and the test set above consist of 3 fields: commodity category code, commodity name, and specification model. The commodity code contains 10 digits in total and corresponds to the HS ("Harmonized System") adopted by China's customs. The classification work in the customs declaration ...
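A record with these three fields might be modeled as below. This is a sketch under assumptions: the class name `DeclarationRecord`, its field names, and the `heading` property are hypothetical, and the sample values are illustrative placeholders rather than data from the patent. The four-digit prefix is exposed because the abstract describes judging the first four digits of the code separately.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class DeclarationRecord:
    """One historical declaration: the three fields named in the text."""
    code: str  # 10-digit commodity category code
    name: str  # commodity name
    spec: str  # specification and model

    def __post_init__(self):
        # The text states that the commodity code has 10 digits.
        if len(self.code) != 10 or not (self.code.isascii() and self.code.isdigit()):
            raise ValueError(f"expected a 10-digit code, got {self.code!r}")

    @property
    def heading(self) -> str:
        """First four digits of the commodity code."""
        return self.code[:4]

# Illustrative placeholder values, not taken from the source data.
example = DeclarationRecord("8471300000", "laptop computer", "14 inch")
```

Storing the code as a string rather than an integer keeps any leading zeros intact.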



Abstract

The invention provides an intelligent classification method for customs declaration commodities based on historical data mining. The method comprises the following steps: analyzing the relationship between the category code of a customs declaration commodity and its name, specification and model, and describing the customs declaration commodity classification problem; preprocessing the historical information of customs declaration commodities to remove useless parts of speech; designing an inverted index and a search algorithm for judging the first four digits of the commodity code; performing feature selection on the preprocessed commodities based on word frequency, and constructing a feature matrix with the one-hot method; constructing a commodity classification model with a decision tree algorithm, based on the characteristics of different types of commodities; and using the classification model to classify customs declaration commodities and obtain the classification result, namely the commodity code. The invention can judge the category of a commodity well, classify commodities effectively, and obtain the commodity code based on the HS classification catalog. The method has high generalization performance, can improve the customs clearance efficiency of enterprises, and reduces the trade risks caused by misclassified commodities.
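The steps listed in the abstract can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the patented implementation: inputs are assumed to be already tokenized and preprocessed, the prefix search is a simple co-occurrence vote, and the tree uses Gini-based binary splits (the abstract does not name a specific decision tree algorithm). All function names and the toy codes are hypothetical.

```python
from collections import Counter, defaultdict

def build_inverted_index(samples):
    """Map each token to a Counter of the 4-digit prefixes it appeared with."""
    index = defaultdict(Counter)
    for tokens, code in samples:
        for tok in set(tokens):
            index[tok][code[:4]] += 1
    return index

def guess_prefix(index, tokens):
    """Assumed search rule: each query token votes with its prefix counts."""
    votes = Counter()
    for tok in set(tokens):
        votes.update(index.get(tok, {}))
    return votes.most_common(1)[0][0] if votes else None

def select_features(samples, top_k):
    """Keep the top_k most frequent tokens as the feature vocabulary."""
    freq = Counter(tok for tokens, _ in samples for tok in tokens)
    return [tok for tok, _ in freq.most_common(top_k)]

def one_hot(tokens, vocab):
    """Binary row: 1 if the vocabulary token occurs in the description."""
    present = set(tokens)
    return [1 if tok in present else 0 for tok in vocab]

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def build_tree(X, y, features=None):
    """Return a label (leaf) or a tuple (feature, subtree_if_0, subtree_if_1)."""
    if features is None:
        features = list(range(len(X[0])))
    majority = Counter(y).most_common(1)[0][0]
    if len(set(y)) == 1 or not features:
        return majority
    best = None  # (weighted Gini impurity, feature index)
    for f in features:
        left = [lab for row, lab in zip(X, y) if row[f] == 0]
        right = [lab for row, lab in zip(X, y) if row[f] == 1]
        if not left or not right:
            continue  # this feature does not separate the rows
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
        if best is None or score < best[0]:
            best = (score, f)
    if best is None:
        return majority
    f = best[1]
    rest = [g for g in features if g != f]
    parts = {0: ([], []), 1: ([], [])}
    for row, lab in zip(X, y):
        parts[row[f]][0].append(row)
        parts[row[f]][1].append(lab)
    return (f, build_tree(*parts[0], rest), build_tree(*parts[1], rest))

def predict(tree, row):
    """Walk the tree until a leaf label (a code string) is reached."""
    while isinstance(tree, tuple):
        f, if_zero, if_one = tree
        tree = if_one if row[f] else if_zero
    return tree

# Toy data with made-up codes, for illustration only.
samples = [
    (["cotton", "shirt"], "1111000001"),
    (["silk", "shirt"], "1111000002"),
    (["steel", "pipe"], "2222000001"),
    (["cotton", "shirt", "white"], "1111000001"),
]
index = build_inverted_index(samples)
vocab = select_features(samples, top_k=4)
X = [one_hot(tokens, vocab) for tokens, _ in samples]
y = [code for _, code in samples]
tree = build_tree(X, y)
```

For brevity the sketch trains a single tree on all samples. How the prefix search and the per-category models are combined is not spelled out in this excerpt, so that wiring is left open here.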

Description

Technical field

[0001] The invention relates to the field of customs declaration commodity classification, and in particular to an intelligent classification method for customs declaration commodities based on historical data mining.

Background technique

[0002] The HS ("Harmonized System") classification catalog adopted by China's customs divides commodities into 21 categories and 97 chapters, and the chapters are further divided into items and subheadings. Based on the Harmonized System, the total number of tax items in China's "Tariff" has reached 8,547 (2017 edition). Traditionally, commodity classification is carried out by professional classifiers. When a large number of commodities must be classified, this manual process is time-consuming and inefficient, so automatically and intelligently classifying customs declaration goods according to their characteristics and giving the code...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/2458, G06F16/22, G06F17/27, G06Q50/26
CPC: G06Q50/26, G06F16/22, G06F16/2462, G06F16/2465
Inventors: 万怀宇, 林友芳, 王涛, 杜少华, 王强
Owner: BEIJING JIAOTONG UNIV