The invention discloses an extraction type document automatic abstracting method based on context semantic
perception. The extraction type document automatic abstracting method mainly solves the problem that a traditional
algorithm lacks the recognition degree of sentences in different contexts. The method comprises: firstly, using an LDA
topic model for calculating topic probability distributionin a document, and then determining the similarity between each
sentence and a topic word; extracting semantic features of sentences by using a CNN model, further calculating the similarity between each
sentence and the features, finally adding values of topic similarity and feature similarity of each
sentence to obtain a final sentence
score, and taking a proper number of sentences as abstracts according to
score ranking. According to the method, a
topic model and a
deep learning model are introduced, a topic abstracting method is determined, sentence meanings in different contexts can be analyzed more accurately, and a calculation reference method is provided for other automatic document abstracting methods.