Lucene full-text retrieval based Chinese word segmentation method
A Chinese word segmentation and full-text technology, applied in the field of power system, can solve the problems of fuzzy information, difficult quantitative and accurate analysis, redundancy, etc., to improve efficiency, clear word segmentation results, and improve the level of power grid marketing services.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0017] A Chinese word segmentation method based on lucene full-text search, figure 1 It is a flow chart of the Chinese word segmentation method based on lucene full-text search. The method includes the following steps:
[0018] 1. Store the dictionary in the database as one word per line. In addition to the main dictionary containing commonly used words and the quantifier dictionary of commonly used quantifiers that come with the program, users can add extended dictionaries and stop word dictionaries as needed.
[0019] 2. Cache the dictionary in the database in the server in the form of a tree. The dictionaries in the cache are divided into three types: the main dictionary, the stop word dictionary and the measure word dictionary. The extended word dictionaries added by users are stored in the main dictionary.
[0020] 3. Enter the text information that needs word segmentation;
[0021] 4. The input text is matched verbatim with the three dictionary trees of quantifiers, ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com