Unsupervised learning-based text automatic abstract method, system and device, and medium
An unsupervised learning and automatic summarization technology, applied in the field of text summarization, can solve the problem of high data acquisition cost, achieve the effect of solving high acquisition cost, ensuring accuracy and readability, and reducing cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0072] This embodiment provides a method for automatic text summarization based on unsupervised learning, which is realized by using a generation network, a classification and discrimination network, and an authenticity discrimination network. The specific descriptions of the generation network, classification and discrimination network, and authenticity discrimination network are as follows:
[0073] 1) The input of the generation network is the original text (long text) to be processed, and the output is a shorter text. When the generated network is trained strong enough, the output text can be regarded as a summary (short text) of the original text; during testing, the generated network is the only network used, and the structure of the generated network is as follows: figure 1 shown.
[0074] 2) The classification discriminant network is the first discriminant network, whose input is the original text (long text) and the abstract (short text) that generates the network out...
Embodiment 2
[0117] Such as Figure 9 As shown, the present embodiment provides a text automatic summarization system based on unsupervised learning, the system includes an acquisition module 901, a building module 902, a first pre-training module 903, a second pre-training module 904, and a third pre-training module 905, confrontation training module 906 and text summary module 907, the specific functions of each module are as follows:
[0118] The obtaining module 901 is used to obtain a training set, randomly scramble the original text and the abstract in the training set, obtain the original text set and the abstract set, and obtain a data set of text classification;
[0119] The building module 902 is used to build a generation network, a classification and discrimination network, and an authenticity discrimination network.
[0120] The first pre-training module 903 is used to pre-train the generation network by using the original text set.
[0121] The second pre-training module 90...
Embodiment 3
[0128] This embodiment provides a computer device, which may be a server, a computer, etc., such as Figure 10 As shown, it includes a processor 1002 connected through a system bus 1001, a memory, an input device 1003, a display 1004 and a network interface 1005. The processor is used to provide calculation and control capabilities. The memory includes a non-volatile storage medium 1006 and internal Memory 1007, the non-volatile storage medium 1006 stores an operating system, computer programs and databases, the internal memory 1007 provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium, and the processor 1002 executes memory storage During the computer program, realize the text automatic summarization method of above-mentioned embodiment 1, as follows:
[0129] Obtain the training set, randomly scramble the original text and the abstract in the training set, obtain the original text collection and the abstrac...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com