Domain entity disambiguation method fusing word vectors and a topic model
A method combining a topic model with word vector technology, applied in the fields of natural language processing and deep learning, that addresses the inability to distinguish between the different meanings of polysemous words
Examples
Embodiment 1
[0065] Embodiment 1: As shown in Figures 1-4, a domain entity disambiguation method that fuses word vectors and topic models comprises the following specific steps:
[0066] Step1. First, use Word2vec to train a word vector model on an encyclopedia corpus in the tourism domain;
[0067] The specific sub-steps of Step1 are:
[0068] Step1.1. From the Chinese offline database of Wikipedia, extract the pages under the tourism category, extract the summary information of each page, and save it as text;
[0069] Step1.2. Manually write a crawler program to crawl tourism-domain text from travel websites and encyclopedia entries, and combine it with the Wikipedia text;
[0070] The present invention takes into account that web page structures differ, so the positions and tags to be crawled differ from site to site, and no ready-made program exists; a separate crawler program therefore needs to be written for each crawling task. ...
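The per-site extraction described in Step1.2 and paragraph [0070] might be sketched as follows. This is only an illustration under stated assumptions, not the patent's crawler: the site names, the `SITE_RULES` table, and the helpers `extract_text` and `build_corpus` are hypothetical, and fetching pages over the network is left out so that only the "different tags for different page structures" idea is shown. It uses Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# Hypothetical per-site rules: which tag/class holds the body text on each site.
# Real sites would each need their own rule, found by inspecting the page.
SITE_RULES = {
    "travel-site-a": {"tag": "div", "class": "intro"},
    "encyclopedia-b": {"tag": "p", "class": "summary"},
}


class TextExtractor(HTMLParser):
    """Collects the text inside elements matching one (tag, class) rule."""

    def __init__(self, tag, css_class):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.depth = 0    # > 0 while inside a matching element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag != self.tag:
            return
        classes = (dict(attrs).get("class") or "").split()
        # Enter on a class match; also count nested same-name tags
        # so the matching end tag is paired correctly.
        if self.depth > 0 or self.css_class in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)


def extract_text(site, html):
    """Apply the rule registered for `site` to one HTML page."""
    rule = SITE_RULES[site]
    parser = TextExtractor(rule["tag"], rule["class"])
    parser.feed(html)
    parser.close()
    return "".join(parser.chunks).strip()


def build_corpus(wiki_summaries, crawled_pages):
    """Combine Wikipedia summaries with text extracted from crawled pages."""
    corpus = list(wiki_summaries)
    for site, html in crawled_pages:
        text = extract_text(site, html)
        if text:  # skip pages where the rule matched nothing
            corpus.append(text)
    return corpus
```

Keeping the rules in a table means that supporting another site is a matter of adding one entry rather than writing a new parser, although pages with irregular markup would likely still need dedicated handling.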