Text classification method and system
A text classification and text data technology, applied in the field of deep learning, can solve problems such as poor text classification effect, Bert model does not take into account the relationship between words, etc., to achieve good text classification effect and good fine-tuning effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] The introduction of the following technical solutions takes the commonly used THUCNews data set in the field of Chinese text classification as an example, but the model obtained by the present invention is not limited to the field of Chinese text classification, and the use of the THUCNews data set is not a feature of the present invention.
[0026] Step 1, prepare the pre-training model dataset. Download public Chinese datasets from the Internet and perform data cleaning.
[0027] In the second step, word vector encoding, text vector encoding and position vector encoding are respectively performed on the dataset data, and word vector encoding is added.
[0028] For the data set obtained above, use jieba word segmentation or other tools to segment the text, thereby increasing the word encoding vector of the model; in addition, the word encoding, text sentence encoding, and position encoding are processed in the same way as the original Bert. The resulting vector contai...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com