Chinese word segmentation algorithm based on reverse maximum matching
A reverse maximum matching, Chinese word segmentation technology, applied in computing, special data processing applications, instruments, etc., can solve the problems of inaccurate identification of unregistered words, low word segmentation accuracy, low performance, etc., to improve speed and improve relevance. and accuracy, improving efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0022] The present invention will be further described below in conjunction with the accompanying drawings, so that those of ordinary skill in the art can implement it after referring to this specification.
[0023] Such as figure 1 Shown, a kind of Chinese word segmentation algorithm based on reverse maximum matching of the present invention comprises the following steps:
[0024] Step 1, initialize the number of word segmentation dictionaries and the stop word dictionary StopWord in the memory, wherein the word segmentation dictionary database includes a data structure dictionary WordDictionary storing all word segmentation data structures, and a data directory dictionary WordList storing all word segmentation and word segmentation index positions. A single Chinese character is stored in the first layer of the data structure dictionary as the index directory of the data structure dictionary; the index position and the word of all word objects with the single Chinese characte...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com