Name disambiguation method and system based on LightGBM classification and representation learning
A name and binary classification technology, applied in the information field, can solve problems such as limitations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0040] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below through specific embodiments and accompanying drawings.
[0041]The invention is oriented to scientific literature data, and proposes a disambiguation algorithm based on a supervised learning algorithm and representation learning for the phenomenon of authors having the same name in the literature. Among them, the supervised learning part adopts the LightGBM (hereinafter referred to as LGB) binary classification model. Specifically, the meta-information and inter-paper association information of papers in the training set are extracted through feature engineering, and the LGB algorithm is used to train a binary classification model to determine whether any two papers belong to the same author. The representation learning part refers to the word2vec text semantic representation method and the meta-path-b...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com