Document security level division method based on decision tree

A security level and decision tree technology, applied in computer security devices, instruments, electronic digital data processing, etc., can solve problems such as difficult judgment work and huge judgment workload for judgment personnel, so as to reduce the risk of document leakage and improve the accuracy of judgment. , the effect of reducing workload

Pending Publication Date: 2021-07-23
STATE GRID CORP OF CHINA +1
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method mainly has the following significant defects. The number of files to be judged is the sum of the number of files in the computer of the detected object. When manual judgment is used, due to the large number of such files, the judgment workload of the judgment personnel is huge, and it is difficult to complete the judgment in a short time. Work

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Document security level division method based on decision tree
  • Document security level division method based on decision tree
  • Document security level division method based on decision tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0045] build training set

[0046] Assume that there are 9 keywords crawled by the system, namely k 1 、k 2 、k 3 ,...,k9 , the judgment result adopts multi-classification, assuming that the judgment result is divided into four categories, namely "high sensitivity", "medium sensitivity", "low sensitivity" and "not sensitive".

[0047] Build the sample collection as follows:

[0048]

[0049] Table 1 Training set sample collection

[0050] If the keyword exists, it will be recorded as 1, and if the keyword does not exist, it will be recorded as 0. There are four levels of confidentiality, which are represented by numbers from 1 to 4 respectively. Table 1 is further digitized and can be represented as Table 2.

[0051]

[0052] Table 2 Digital abstraction of the training set sample set

[0053] Due to the space factor, only part of the data of the 30 samples are shown. For this sample, the CART classification algorithm is used to realize the establishment of the decisi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a document security level division method based on a decision tree, which comprises the following specific steps: step 1, traversing all conditions in a sample set, calculating Gini indexes of the sample set under different conditions, and selecting a condition corresponding to a minimum value as a first segmentation point, so as to divide a leaf node and remaining child nodes; step 2, continuing to calculate the Gini index of the new sample set for the remaining conditions for the child nodes, performing further subdivision, selecting a value with the minimum Gini index as a second segmentation point, and continuing to screen out leaf nodes and the child nodes; step 3, repeating the process until division of all conditions is realized, and realizing establishment of a decision tree; and step 4, re-capturing the sample for determination, and verifying the new sample according to the decision tree established by the training set to realize determination of the document security level. According to the method, automatic judgment of the document security level is realized, the workload of judgment personnel is greatly reduced, the judgment accuracy is improved, and a powerful guarantee is provided for a company to reduce the document leakage risk.

Description

technical field [0001] The invention relates to the field of document security management, in particular to the field of document security level determination for deploying a security detection system, specifically a method for dividing document security levels based on a decision tree. Background technique [0002] As a carrier of various types of information, documents usually carry a large amount of confidential information. In institutions and departments involving key secrets, such as state-owned military enterprises, government agencies, and large companies, the confidentiality of documents is very important. [0003] The degree of confidentiality of an information document is determined by the content information it carries. According to national standards, documents can be classified into "top secret", "confidential", and "secret". Documents with different levels of confidentiality correspond to different management methods. Similarly, in enterprise management, diff...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F21/60G06F21/62G06N20/00
CPCG06F21/604G06F21/6209G06F2221/2113G06N20/00
Inventor 吴佩霖何涛冯浩余娅胡率赵锦辉谭俊邓国如卫莹冯伟东王红卫王敬靖代荡荡
Owner STATE GRID CORP OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products