The invention provides an automatic classification method based on Chinese
privacy policy terms, which belongs to the technical field of
natural language processing, and comprises the following steps:
data processing: obtaining privacy policies of a plurality of applications as a
data set, manually labeling to obtain a
data set with labels, and then cleaning the
data set to obtain a training sample data set; data training: performing
feature selection on the training sample data set, selecting effective features capable of identifying different clause categories, and establishing a detection model; and judging whether the
privacy policy text has integrity or not. According to the automatic classification method based on the Chinese
privacy policy terms, through automatic classification based on the privacy policy terms, the privacy policy content is quickly classified to each classification
category attribute, so that a user can read and understand conveniently, and integrity detectionof the privacy policy terms is realized; and the user can quickly identify whether the privacy policy is complete or not.