Short text automatic abstracting method and system based on double encoders
An automatic summarization and double-coding technology, applied in the field of information processing, can solve problems such as insufficient summarization precision and insufficient utilization of semantic information
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0090] For verifying effect of the present invention, carry out experimental verification according to the step described above, experimental verification result is as follows Figure 4 shown.
[0091] Step 1: The news corpus data set provided by Sogou Labs, which contains a total of 679,978 news-headline data pairs from entertainment, culture, education, military, society, finance, etc. The preprocessing of the data set removes the text with a length less than 5, and replaces messy characters such as English, special characters, and emoticons; the data is divided into three levels according to the semantic similarity between the abstract and the original text to select high-quality experimental data pairs. 1 means least relevant and 3 is most relevant. The text-abstract semantic similarity is 1 in the interval (0,0.4), 2 in the interval [0.4,0.65), and 3 in the interval [0.65,1). In this paper, the semantic correlation algorithm formula is designed as follows:
[0092] ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com