
Dialogue text-oriented event extraction method and system

An event extraction technology for dialogue text, applied in unstructured text data retrieval, text database clustering/classification, computer components, etc. It addresses problems such as variable text length, poor adaptability and portability, and severe data sparseness.

Pending Publication Date: 2021-05-18
INST OF INFORMATION ENG CAS

AI Technical Summary

Problems solved by technology

[0006] 1. Dialogue text has weak contextual coherence: its wording is colloquial, its language is disorderly and complex, its length is not fixed, and its event elements are scattered across different time slices, all of which make event extraction more difficult;
[0007] 2. In traditional event extraction methods based on pattern recognition, template definitions are rigid and most extraction targets are news texts, so adaptability and portability to dialogue text are poor;
[0008] 3. Traditional machine learning-based event extraction methods suffer from severe data sparseness and low accuracy.



Examples


Embodiment 1

[0027] As shown in Figure 1, an embodiment of the present invention provides a dialogue text-oriented event extraction method, including the following steps:

[0028] Step S1: Periodically acquire dialogue text sets;

[0029] Step S2: Filter the dialogue text set twice to obtain an event-related dialogue text set;

[0030] Step S3: Create an event template. In the event-related dialogue text set, classify event categories according to the event template, using the trigger words in the event template, to obtain candidate events; then identify event elements for the candidate events to realize event extraction from the dialogue text set.
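The trigger-word part of Step S3 can be illustrated with a minimal sketch. The template categories, trigger words, element slots, and the function name `find_candidate_events` are all hypothetical; the source does not list its templates, and this is plain word matching rather than the patent's actual procedure.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class EventTemplate:
    """Hypothetical event template: a category, its trigger words, and element slots."""
    category: str
    trigger_words: Set[str]
    element_slots: List[str] = field(default_factory=list)


# Illustrative templates only; the source does not enumerate its event types.
TEMPLATES = [
    EventTemplate("meeting", {"meet", "appointment"}, ["time", "place", "participants"]),
    EventTemplate("transaction", {"pay", "transfer", "buy"}, ["amount", "payer", "payee"]),
]


def find_candidate_events(dialogue_texts: List[str],
                          templates: List[EventTemplate]) -> List[Dict[str, object]]:
    """Assign a candidate event category to each text that contains a trigger word."""
    candidates = []
    for text in dialogue_texts:
        tokens = set(re.findall(r"\w+", text.lower()))
        for template in templates:
            hits = tokens & template.trigger_words
            if hits:
                candidates.append({
                    "text": text,
                    "category": template.category,
                    "triggers": sorted(hits),
                })
    return candidates


candidates = find_candidate_events(
    ["Let's meet at the cafe", "ok lol", "I will transfer the money"], TEMPLATES
)
for candidate in candidates:
    print(candidate["category"], candidate["triggers"])
```

Each candidate would then go on to event element identification, which fills the template's element slots.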

[0031] The invention proposes a dialogue text-oriented event extraction method that filters out event-irrelevant dialogue texts using a meaningless-text library, empirical rules, and an SVM binary classification model. It establishes templates for five major event types for dialogue text, determines candidate events according to these templates, and uses machine learning to identify the event category and the event elements contained ...
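A two-pass filter of this kind might be sketched as follows. The split into passes is an assumption: here the meaningless-text library and empirical rules form the first pass, and a binary classifier forms the second. The library entries, the minimum-token rule, and `keyword_stub` are hypothetical; the stub merely stands in for a trained SVM binary model, which could be supplied as any callable returning True for event-related text.

```python
import re
from typing import Callable, Iterable, List

# Hypothetical meaningless-text library; the source does not list its entries.
MEANINGLESS_TEXTS = {"ok", "lol", "haha", "hmm"}


def passes_rules(text: str, min_tokens: int = 2) -> bool:
    """First pass: drop library matches and texts shorter than an assumed minimum."""
    normalized = text.strip().lower()
    if normalized in MEANINGLESS_TEXTS:
        return False
    return len(re.findall(r"\w+", normalized)) >= min_tokens


def filter_dialogue_texts(texts: Iterable[str],
                          is_event_related: Callable[[str], bool]) -> List[str]:
    """Apply the rule-based pass, then the binary classifier pass."""
    first_pass = [t for t in texts if passes_rules(t)]
    return [t for t in first_pass if is_event_related(t)]


def keyword_stub(text: str) -> bool:
    """Stand-in for a trained SVM binary classifier (not an SVM itself)."""
    return any(word in text.lower() for word in ("meet", "transfer", "arrive"))


texts = ["ok", "haha", "hi", "nice weather today", "We meet at the station at nine"]
print(filter_dialogue_texts(texts, keyword_stub))
```

Running the cheap rules first keeps obviously empty messages away from the classifier, which is one plausible reason for filtering in two stages.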

Embodiment 2

[0095] As shown in Figure 7, an embodiment of the present invention provides a dialogue text-oriented event extraction system, including the following modules:

[0096] A dialogue text set acquisition module, configured to periodically acquire a dialogue text set;

[0097] A dialogue text set filtering module, configured to filter the dialogue text set twice to obtain an event-related dialogue text set;

[0098] An event extraction module, configured to create an event template; in the event-related dialogue text set, classify event categories according to the event template, using its trigger words, to obtain candidate events; and identify event elements for the candidate events to realize event extraction from the dialogue text set.
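As a rough illustration of how the three modules could be wired together, the sketch below treats each module as a callable. The class name `EventExtractionSystem`, its method `run_once`, and the lambda stubs are hypothetical and not taken from the source; periodic acquisition is assumed to be handled by an external scheduler that calls `run_once` once per period.

```python
from typing import Callable, Dict, List


class EventExtractionSystem:
    """Hypothetical composition of the acquisition, filtering, and extraction modules."""

    def __init__(self,
                 acquire: Callable[[], List[str]],
                 filter_texts: Callable[[List[str]], List[str]],
                 extract: Callable[[List[str]], List[Dict[str, str]]]) -> None:
        self.acquire = acquire
        self.filter_texts = filter_texts
        self.extract = extract

    def run_once(self) -> List[Dict[str, str]]:
        """Process one acquisition period: acquire, filter, then extract events."""
        texts = self.acquire()
        related = self.filter_texts(texts)
        return self.extract(related)


# Stub modules for demonstration only.
system = EventExtractionSystem(
    acquire=lambda: ["ok", "we meet at noon"],
    filter_texts=lambda ts: [t for t in ts if len(t.split()) > 1],
    extract=lambda ts: [{"category": "meeting", "text": t} for t in ts if "meet" in t],
)
print(system.run_once())
```

Passing the modules in as callables keeps each stage replaceable, for example when swapping the filtering stub for a trained classifier.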


Abstract

The invention relates to a dialogue text-oriented event extraction method and system. The method comprises the steps of: S1, periodically obtaining a dialogue text set; S2, filtering the dialogue text set twice to obtain an event-related dialogue text set; S3, creating an event template and, in the event-related dialogue text set, dividing event categories according to the event template through the trigger words in the event template to obtain candidate events; and performing event element identification on the candidate events to realize event extraction from the dialogue text set. The method and system combine pattern recognition with machine learning for event extraction, which saves the cost of compiling event templates, reduces data sparsity, and improves event extraction accuracy.

Description

Technical field

[0001] The invention belongs to the field of text recognition and machine learning, and in particular relates to a dialogue text-oriented event extraction method and system.

Background

[0002] Event extraction refers to extracting event information from unstructured text, specifically extracting event components such as event trigger words, people, and places, and finally presenting them as structured text. As one of the important research directions in the field of information extraction, event extraction is widely used in public opinion analysis, information retrieval, and other fields.

[0003] Event extraction usually adopts one of two approaches: methods based on pattern recognition and methods based on machine learning. Pattern recognition-based event extraction uses a pattern matching algorithm to match sentences against templates in order to recognize and extract events. Pattern matching cons...


Application Information

IPC(8): G06F16/35, G06F40/205, G06F40/284, G06F40/295, G06F40/30, G06K9/62
CPC: G06F16/35, G06F40/205, G06F40/284, G06F40/295, G06F40/30, G06F18/2411
Inventor: 林海伦, 刘璐, 刘建坤, 周永彬
Owner INST OF INFORMATION ENG CAS