Microblog text normalizing, word segmenting and part-speech tagging method and system
A part-of-speech tagging and microblogging technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of error propagation, task error rate increase, low efficiency, etc., to improve performance and improve overall performance Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0067] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.
[0068] Such as figure 1 As shown, a microblog text normalization and word segmentation and part-of-speech tagging method, including the following steps:
[0069] Step 1, constructing annotated corpus, and dividing the annotated corpus in the annotated corpus into training set, development set and test set;
[0070] Step 2, using the SVM model to train and learn to construct a microblog dictionary, that is, standardization candidate set;
[0071] Step 3, using the training set, development set and Weibo dictionary, use the BeamSearch method to train and learn a joint model based on Weibo text normalization, word segmentation, and part-of-speech tagging;
[0072] Step 4, use the joint model to perform text normaliz...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com