A Text Segmentation Method Based on Topic Information
A cutting method and technology of subject information, applied in the field of text cutting based on subject information, can solve the problems of inconvenient research and inconvenient reading, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0036] The present invention will be further explained below with reference to the drawings and examples.
[0037] Such as figure 1 As shown, a text cutting method based on topic information can be divided into the following five processes:
[0038] Step 1. Preprocess the input text and training set to obtain a series of sentences composed of words; it includes two steps.
[0039] 101. For the input text, divide it according to the ending punctuation mark. The ending punctuation mark refers to all the symbols that can be used at the end of a Chinese sentence; a series of separate sentences are obtained, and each sentence occupies a separate line. For the training set, its format It is: sentence-topic label, in which sentence and topic label are both Chinese text. Perform the above operations on the sentence part.
[0040] 102. Separate individual sentences, and remove numbers, stop words, punctuation marks, and non-Chinese special symbols. Obtain a series of sentences composed of wo...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com