Mongolian text sentiment analysis method based on multi-size CNN and LSTM model
The invention applies sentiment analysis and multi-scale techniques in the field of artificial intelligence. It addresses three problems: text sentiment analysis is not performed in real time, Mongolian corpus resources are scarce, and the local and global information of a text cannot be extracted at the same time.
Detailed Description of the Embodiments
[0046] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.
[0047] As shown in Figure 1, the Mongolian text sentiment analysis method of the present invention, based on a multi-size CNN and LSTM model, proceeds as follows:
[0048] Step 1: Preprocess the Chinese and Mongolian sentiment text corpora.
[0049] Before model training, the sentiment text corpus is preprocessed. The present invention uses byte pair encoding (BPE) to segment the corpus. In each iteration, BPE replaces the most frequent pair of adjacent characters in a string with a character that does not occur in that string. Segmenting Mongolian vocabulary into stems and affixes in this way keeps high-frequency words intact in the dictionary, while low-frequency words are divided into finer-grained subunits, thereby alleviating ...
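The BPE segmentation described in paragraph [0049] can be illustrated with a minimal sketch. It is not the patent's implementation: it assumes the common subword variant of BPE, in which the most frequent adjacent pair is merged into a new symbol rather than replaced by an unused character. The function names (`learn_bpe`, `segment`), the end-of-word marker `</w>`, the Latin-script toy word counts, and the number of merges are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def get_pair_counts(vocab):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in vocab.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs


def merge_pair(pair, vocab):
    """Merge every occurrence of `pair` into a single symbol."""
    merged = {}
    for symbols, freq in vocab.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        key = tuple(out)
        merged[key] = merged.get(key, 0) + freq
    return merged


def learn_bpe(word_freqs, num_merges):
    """Learn up to `num_merges` merge rules from a word-frequency dict."""
    # Each word starts as a sequence of characters plus an end-of-word marker.
    vocab = {tuple(word) + ("</w>",): freq for word, freq in word_freqs.items()}
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(vocab)
        if not pairs:  # every word is already a single symbol
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        vocab = merge_pair(best, vocab)
    return merges, vocab


def segment(word, merges):
    """Apply the learned merges, in order, to split a word into subunits."""
    symbols = list(word) + ["</w>"]
    for pair in merges:
        i = 0
        while i < len(symbols) - 1:
            if (symbols[i], symbols[i + 1]) == pair:
                symbols[i:i + 2] = [symbols[i] + symbols[i + 1]]
            i += 1
    return symbols


# Toy corpus statistics (hypothetical, not Mongolian data).
toy_counts = {"low": 5, "lower": 2, "newest": 6, "widest": 3}
merges, vocab = learn_bpe(toy_counts, num_merges=10)
print(segment("newest", merges))  # frequent word: few, large subunits
print(segment("lowest", merges))  # unseen word: falls back to smaller subunits
```

With a small merge budget, frequent words collapse into a few large symbols while rare or unseen words remain split into smaller pieces, which is the behaviour the paragraph attributes to BPE.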