Theme and semantic meaning-based dialogue corpus keyword extraction method
A keyword extraction technique for dialogue corpora in the field of natural language processing. It addresses the poor effectiveness and low accuracy of keyword extraction methods that ignore semantics and topics.
Examples
Embodiment 1
[0094] Embodiment 1: As shown in Figures 1-4, the topic- and semantics-based keyword extraction method for dialogue corpora comprises the following steps:
[0095] Step 1. Crawl a Chinese corpus and talk-show dialogue material, then preprocess both the dialogue material and the Chinese corpus;
[0096] Step 2. Combine the preprocessed dialogue corpus with the Chinese corpus to obtain word vectors and a topic model;
[0097] Step 3. Combine the word semantic weight, the word semantic clustering weight, and the part-of-speech weight into a final weight for each word, and extract keywords from the dialogue material according to that weight; this semantics-based extraction is referred to as the KSel method;
[0098] Step 4. Extract keywords with the TF-IDF method by computing term frequency and inverse document frequency;
[0099] Step 5. The keywords extracted by the TF-IDF method and the KSel method are used as nodes, and the grap...
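The TF-IDF extraction in Step 4 can be illustrated with a minimal sketch. It is not the patent's implementation: the function name `tfidf_keywords`, the `top_k` parameter, the unsmoothed IDF variant `log(N / df)`, and the sample tokens are all assumptions made for illustration, and the input is assumed to be already segmented into tokens by the preprocessing in Step 1.

```python
import math
from collections import Counter


def tfidf_keywords(docs, top_k=3):
    """Rank the words of each document by TF-IDF.

    docs  : list of token lists (assumed already segmented and cleaned).
    top_k : hypothetical parameter, number of keywords kept per document.
    Returns one list of (word, score) pairs per document, best first.
    """
    n_docs = len(docs)

    # Document frequency: in how many documents each word occurs.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    results = []
    for tokens in docs:
        tf = Counter(tokens)
        total = len(tokens)
        # Term frequency times inverse document frequency.
        # Assumption: unsmoothed IDF, so a word found in every
        # document scores exactly 0.
        scores = {
            word: (count / total) * math.log(n_docs / df[word])
            for word, count in tf.items()
        }
        # Sort by descending score, then alphabetically for stable ties.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        results.append(ranked[:top_k])
    return results


# Hypothetical, already-tokenized dialogue snippets.
sample_docs = [
    ["host", "guest", "joke", "joke"],
    ["host", "guest", "music"],
    ["host", "news"],
]
keywords = tfidf_keywords(sample_docs, top_k=2)
```

In this sample, "host" occurs in every snippet and so receives a score of zero, while "joke" ranks first in the first snippet because it is both frequent there and absent elsewhere. A ranking based on frequency alone carries no notion of meaning or topic, which is the gap the semantic weights of the KSel method in Step 3 are meant to cover.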