A method and device for concept recognition based on collaborative learning

A concept recognition technology, applied in special data processing applications, instruments, electrical digital data processing, etc., that addresses the problem of limited concept quality and improves the quality of concept recognition.

Active Publication Date: 2016-08-03
NEC (CHINA) CO LTD
Cites: 4; Cited by: 0

AI Technical Summary

Problems solved by technology

However, in many cases the available training data is only partially labeled (that is, only some of the concepts it contains are marked). In that case, the sequence classifier is still built from the partially labeled training data with the same method used for fully labeled training data, so the resulting sequence classifier recognizes concepts with limited quality.

Embodiment Construction

[0024] In order to reduce the requirements on training data during concept recognition and to improve the quality of concept recognition, especially when a sequence classifier is built from partially labeled training data, the embodiments of the present invention provide a concept recognition method and device based on collaborative learning.

[0025] The Co-learning approach is a variant of the Co-training approach. The Co-training method trains two classifiers on two relatively independent feature sets. The Co-learning approach does not require the features to be divided into two relatively independent sets; instead, the training data set is divided into multiple groups, so that multiple classifiers are built from multiple perspectives.
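The data-side split that distinguishes Co-learning from Co-training can be illustrated with a minimal sketch. The function name `partition_training_data` and the round-robin assignment are assumptions made for illustration only; the text does not state how the groups are chosen.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def partition_training_data(examples: Sequence[T], num_groups: int) -> List[List[T]]:
    """Split training examples into disjoint groups, one per classifier.

    Hypothetical helper: round-robin assignment is an assumption, chosen
    only because it is simple and deterministic.
    """
    if num_groups < 2:
        raise ValueError("co-learning needs at least two groups")
    if len(examples) < num_groups:
        raise ValueError("need at least one example per group")
    groups: List[List[T]] = [[] for _ in range(num_groups)]
    for index, example in enumerate(examples):
        # Every group keeps the full feature set; only the examples differ.
        groups[index % num_groups].append(example)
    return groups


documents = ["doc_a", "doc_b", "doc_c", "doc_d", "doc_e"]
groups = partition_training_data(documents, 2)
```

Each group would then be used to train its own classifier over the same features, in contrast to Co-training, where the examples are shared and the features are split.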

[0026] Preferred embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0027] as attached figu...

Abstract

The invention discloses a concept recognition method and device based on collaborative learning, which improve the quality of concept recognition, in particular when sequence classifiers are built from partially labeled training data. The method comprises the following steps: dividing a training data set into at least two subsets, where each item of training data in the training data set is a text document with labeled words; performing collaborative learning based on the training data contained in each subset and on the feature word set extracted from the training data set, to build at least two sequence classifiers; using each resulting sequence classifier to perform concept recognition on the current text document; and determining the concepts contained in the current text document according to the concepts recognized by each sequence classifier.
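The overall flow of the abstract (split the training set, build one classifier per subset, run every classifier on the current document, combine the results) might be sketched as follows. This is a toy illustration under stated assumptions, not the patented method: `ToyTagger` is a hypothetical dictionary lookup standing in for a real sequence classifier, the exchange of information between classifiers during collaborative learning is omitted because this excerpt does not describe it, and the `min_votes` combination rule is an assumption.

```python
from collections import Counter
from typing import Dict, List, Optional, Sequence, Set, Tuple

# A training document is a list of (word, label) pairs. The label is a concept
# name, or None when the word is unlabeled (partially labeled data).
LabeledDoc = List[Tuple[str, Optional[str]]]
Concept = Tuple[str, str]  # (surface word, concept name)


class ToyTagger:
    """Hypothetical stand-in for a sequence classifier: memorizes labeled words."""

    def __init__(self, docs: Sequence[LabeledDoc]) -> None:
        self.lexicon: Dict[str, str] = {}
        for doc in docs:
            for word, label in doc:
                if label is not None:
                    self.lexicon[word.lower()] = label

    def detect(self, words: Sequence[str]) -> Set[Concept]:
        return {(w, self.lexicon[w.lower()]) for w in words if w.lower() in self.lexicon}


def build_taggers(training_set: Sequence[LabeledDoc], num_subsets: int = 2) -> List[ToyTagger]:
    """Divide the training set into subsets and build one tagger per subset."""
    if num_subsets < 2:
        raise ValueError("at least two subsets are required")
    subsets = [training_set[i::num_subsets] for i in range(num_subsets)]
    return [ToyTagger(subset) for subset in subsets]


def detect_concepts(taggers: Sequence[ToyTagger], words: Sequence[str],
                    min_votes: int = 1) -> Set[Concept]:
    """Keep a concept if at least min_votes taggers found it (assumed rule)."""
    votes: Counter = Counter()
    for tagger in taggers:
        votes.update(tagger.detect(words))
    return {concept for concept, count in votes.items() if count >= min_votes}


training_set: List[LabeledDoc] = [
    [("NEC", "Organization"), ("builds", None), ("servers", None)],
    [("Beijing", "Location"), ("is", None), ("large", None)],
    [("NEC", "Organization"), ("in", None), ("Beijing", None)],  # "Beijing" left unlabeled
]
taggers = build_taggers(training_set, num_subsets=2)
found = detect_concepts(taggers, ["NEC", "opened", "a", "lab", "in", "Beijing"])
```

With `min_votes=1` the result is the union of what the individual taggers find, so a concept missed by one tagger because of partial labeling can still be recovered by another; a higher threshold trades recall for agreement.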

Description

Technical field

[0001] The invention relates to the technical field of artificial intelligence, and in particular to a concept recognition method and device based on collaborative learning.

Background art

[0002] With the development of Information Retrieval (IR) technology, Semantic Information Retrieval (Semantic IR) has greater development potential than traditional keyword-based information retrieval (Keywords-Based IR). Concept Detection and Concept Disambiguation play an important role in semantic information retrieval. Concept recognition refers to finding, in a text, the character strings that represent a concept or one of several concepts.

[0003] As shown in Figure 1, in the prior art, the specific process of using a machine learning method for concept recognition in text documents is as follows:

[0004] A large number of marked ...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 李建强, 陈宽桐, 刘春辰
Owner: NEC (CHINA) CO LTD