An incomplete patent automatic indexing method
An automatic indexing and patented technology, applied in metadata text retrieval, instrumentation, unstructured text data retrieval, etc., can solve the problems of high cost of human resources, low efficiency of manual indexing methods, unsatisfactory accuracy and duplicate checking rate And other issues
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0028] In view of the lack of specific knowledge discovery and mining models and methods for pharmaceutical patents, this embodiment provides a self-labeled classification method for pharmaceutical English patents, combined with figure 2 , the method consists of the following steps:
[0029] step one:
[0030] Aiming at the small amount of manual indexing data, artificial indexing data plus Thomson Reuters data are used as the experimental set. The indexing results are shown in Table 1. The experimental set is divided into training set and training set according to the ratio of 8:2. For the verification set, here we do not impose too many completeness constraints on the patent itself, and only require the patent itself to have any of the three items of abstract, claims, and instructions as training data.
[0031] Indexing results of the training set in Table 1
[0032] NME
DDD
NCP
NAM
BLA
NFP
BTN
NUS
NDT
NCF
MIPs
NSP
...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com