Concept identification method and device based on collaborative learning

A recognition method and concept technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as limited concept quality, and achieve the effect of improving the quality of concept recognition and improving quality

Active Publication Date: 2013-09-25
NEC (CHINA) CO LTD
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in many cases, the available training data is partially labeled (that is, only some of the concepts included are marked), and at this time, the same method as that used to construct a sequence classifier based on fully labeled training data is still used. Labeled training data to build a sequence classifier, so that the constructed sequence classifier recognizes concepts of limited quality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Concept identification method and device based on collaborative learning
  • Concept identification method and device based on collaborative learning
  • Concept identification method and device based on collaborative learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to reduce the requirements for training data during concept recognition and improve the quality of concept recognition, especially improve the quality of concept recognition when constructing a sequence classifier based on partially labeled training data, the embodiment of the present invention provides a concept recognition based on collaborative learning Methods and devices.

[0025] Among them, the Co-learning approach is a variant of the Co-training approach. The so-called Co-training method is to train two classifiers based on two relatively independent feature sets, and the Co-learning approach Instead of dividing the features into two relatively independent sets, the training data set is divided into multiple groups to build multiple classifiers from multiple perspectives.

[0026] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0027] as attached figure 2 As shown, in t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a concept identification method and a device based on collaborative learning to improve the quality of the concept identification, and particularly relates to improve the quality when building sequence categorizers to carry out the concept identification based on training data of part marks. The method comprises the following steps: dividing the training dataset into at least two subsets, defining the training data contained in the training dataset as a text document with marker words, carrying out collaborative learning based on the training data contained by the subsets and according to the feature word assembly extracted by the training dataset, building at least two sequence categorizers, adopting each acquired sequence categorizer to carry out the concept identification to the present text document respectively, and confirming the concept contained in the present text document according to the concept identified by each sequence categorizer.

Description

technical field [0001] The invention relates to the technical field of artificial intelligence, in particular to a concept recognition method and device based on collaborative learning. Background technique [0002] With the development of information retrieval (Information Retrieval, IR) technology, semantic information retrieval (Semantic Information Retrieval, Semantic IR) has great development potential compared with traditional keywords-based information retrieval (Keywords-Based IR) technology. Among them, Concept Detection and Concept Disambiguation play an important role in semantic information retrieval technology. The so-called concept recognition refers to finding a character string used to represent a concept or a concept in multiple concepts from the text. [0003] as attached figure 1 As shown, in the prior art, the specific process of using the machine learning method for text document concept recognition is as follows: [0004] A large number of marked tex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 李建强陈宽桐刘春辰
Owner NEC (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products