Method for extracting novel field words
A method for extracting new domain-specific words, applied to new word discovery in natural language processing. It addresses the problem that existing word segmentation algorithms cannot cover the new words of a domain, and it aims to ensure the word-formation rate of the extracted words.
Detailed Description of Embodiments
[0025] The present invention is described in further detail below with reference to specific embodiments and the accompanying drawings. Except for the content specifically mentioned below, the processes, conditions, and experimental methods used to implement the invention are common general knowledge in this field, and the invention places no special limitation on them.
[0026] The method of the present invention for extracting new domain words, which combines word2vec with Bootstrapping iteration, comprises the following steps:
[0027] Step 1: Obtain corpora from several domains and remove the control characters from them to obtain cleanly formatted domain text;
[0028] Step 2: Split the domain text into sentences at punctuation marks to obtain the domain single-sentence set S;
[0029] Step 3: Initialize the n-gram model and segment the sentences in the domain single-sentence set S into strings to obtain the string set W0;
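Steps 1 to 3 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the set of punctuation marks used for splitting, the choice to treat Unicode categories Cc and Cf as "control characters", and the n-gram length range (2 to `max_n`) are all assumptions made for illustration, since the text does not specify them.

```python
import re
import unicodedata
from typing import List, Set

# Assumed delimiter set: common Chinese and Western punctuation plus newlines.
# The source only says "punctuation marks", so this choice is hypothetical.
SENTENCE_DELIMS = re.compile(r"[。！？；，、：!?;,:\n]+")


def remove_control_chars(text: str) -> str:
    """Step 1 (sketch): drop control and format characters, keeping newlines."""
    return "".join(
        ch for ch in text
        if ch == "\n" or unicodedata.category(ch) not in ("Cc", "Cf")
    )


def split_sentences(text: str) -> List[str]:
    """Step 2 (sketch): split at punctuation to build the single-sentence set S."""
    return [part.strip() for part in SENTENCE_DELIMS.split(text) if part.strip()]


def ngram_strings(sentences: List[str], max_n: int = 4) -> Set[str]:
    """Step 3 (sketch): collect character n-grams (2 <= n <= max_n) as the set W0.

    N-grams are taken within a sentence only, so no candidate string
    spans a punctuation boundary.
    """
    candidates: Set[str] = set()
    for sentence in sentences:
        for n in range(2, max_n + 1):
            for start in range(len(sentence) - n + 1):
                candidates.add(sentence[start:start + n])
    return candidates


if __name__ == "__main__":
    raw = "深度学习\x00模型。机器学习！"
    clean = remove_control_chars(raw)
    S = split_sentences(clean)
    W0 = ngram_strings(S, max_n=3)
    print(S)
    print(sorted(W0))
```

Splitting before n-gram generation keeps the candidate set smaller and avoids strings that straddle two sentences; the later filtering steps, which are truncated here, would then operate on W0.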
[...