Maximum entropy classification model and Thai grammar rule correction-based Thai sentence segmentation method
A classification model, a technique of maximum entropy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] Embodiment 1: as Figure 1-2 As shown, a Thai sentence segmentation method based on maximum entropy classification model and Thai grammar rule correction, the specific steps of the method are as follows:
[0045] Step1. Collect and preprocess the Thai sentence segmentation corpus to construct a Thai text corpus; perform Thai word segmentation and part-of-speech tagging on the Thai text corpus, and construct a structured Thai text corpus required for Thai sentence segmentation research;
[0046] Step1.1. Use web crawler technology to collect Thai texts of Thai news and e-books from the Internet, and perform preprocessing operations on the obtained Thai texts to filter, deduplicate and denoise, thereby constructing a Thai text corpus;
[0047] Step1.2. Use the Thai word segmentation tool and Thai part-of-speech tagging tool to perform Thai word segmentation and part-of-speech tagging on the Thai text corpus, and perform manual proofreading to build a structured Thai text ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com