Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semi-supervised Chinese and English implicit discourse relation identification method and system

A relationship recognition and semi-supervised technology, applied in biological neural network models, semantic analysis, natural language translation, etc., can solve problems such as limiting the performance of Chinese and English implicit textual relationship recognition

Active Publication Date: 2021-08-13
EAST CHINA JIAOTONG UNIVERSITY
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In view of the above situation, it is necessary to solve the lack of a method for identifying implicit discourse relations in the prior art using manually annotated discourse relationship datasets in both Chinese and English, which limits the identification of Chinese and English implicit discourse relations to a certain extent. The problem of performance improvement

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-supervised Chinese and English implicit discourse relation identification method and system
  • Semi-supervised Chinese and English implicit discourse relation identification method and system
  • Semi-supervised Chinese and English implicit discourse relation identification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0070] These and other aspects of embodiments of the invention will become apparent with reference to the following description and drawings. In these descriptions and drawings, some specific implementations of the embodiments of the present invention are specifically disclosed to represent some ways of implementing the principles of the embodiments of the present invention, but it should be understood that the scope of the embodiments of the present invention is not limited by this limit. On the contrary, the embodiments of the present...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a semi-supervised Chinese and English implicit discourse relation identification method and system. The method comprises the following steps: step 1, preparing a data set; step 2, model construction: on the basis of the implicit discourse relation identification model based on the attention mechanism, respectively constructing a Chinese implicit discourse relation identification model and an English implicit discourse relation identification model; step 3, performing unsupervised model training; step 4, supervised model training; and 5, outputting a prediction result. According to the semi-supervised Chinese and English implicit chapter relationship recognition method provided by the invention, a Chinese and English implicit chapter relationship recognition model is jointly trained based on a large number of unlabeled chapter relationship data sets and a small number of manually labeled chapter relationship data sets in two languages; the unlabeled data set and the labeled data set in the two languages can be fully and effectively utilized, so that the Chinese and English implicit discourse relation recognition performance is improved at the same time.

Description

technical field [0001] The invention relates to the technical field of computer language processing, in particular to a semi-supervised Chinese-English implicit discourse relationship recognition method and system. Background technique [0002] Implicit discourse relation recognition aims to automatically infer semantic relations between two arguments (sentences or clauses) that lack discourse connectives, e.g., transition and causality. In different languages, there are usually a small number of human-annotated textual relationship datasets, for example, the Chinese CDTB dataset and the English PDTB dataset. Since it is very difficult to manually label chapter relational datasets, the scale of both the CDTB dataset and the PDTB dataset is relatively small. Among them, there are about 5,500 instances of implicit discourse relations marked in the CDTB dataset, while the current largest PDTB dataset only contains about 16,000 instances. Although these manually annotated corp...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/30G06F40/42G06N3/04
CPCG06F40/30G06F40/42G06N3/044
Inventor 邬昌兴胡明昆俞亮
Owner EAST CHINA JIAOTONG UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products