Method for constructing an event-based Chinese coreference corpus
A corpus construction method applied to building an event-based Chinese coreference corpus. It addresses the lack of an existing Chinese coreference corpus, and it yields an annotation scheme with fewer categories, a clear structure, and improved performance.
Examples
Embodiment 1
[0044] Referring to Figure 1, this method for constructing an event-based Chinese coreference corpus mainly includes the following steps:
[0045] (1) Select the CEC2.0 corpus as the basis for construction;
[0046] (2) Determine the targets of coreference annotation and the annotation methods;
[0047] (3) Formulate the corresponding annotation specifications for each specific reference target;
[0048] (4) Preprocess the CEC2.0 corpus text;
[0049] (5) Automatically annotate event elements and event references;
[0050] (6) Refine the automatic annotation results through manual annotation;
[0051] (7) Set up consistency checks to ensure the quality of the corpus annotation.
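The source does not say how the consistency check in step (7) is carried out. One common way to quantify agreement between two annotators is Cohen's kappa, sketched below as an assumption rather than as the method of this embodiment. The function name and the sample labels are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labeled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, from each annotator's own label distribution.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels: the reference target chosen for six mentions.
annotator_1 = ["object", "time", "event", "object", "environment", "event"]
annotator_2 = ["object", "time", "event", "time", "environment", "event"]
kappa = cohen_kappa(annotator_1, annotator_2)
```

A value near 1 indicates strong agreement. Texts whose score falls below a project-chosen threshold could be sent back for another round of manual annotation.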
Embodiment 2
[0053] This embodiment is basically the same as Embodiment 1, with the following specifics:
[0054] Step (1), selecting the CEC2.0 corpus as the basis for construction, comprises:
[0055] (1-1). Select CEC2.0 as the base corpus for construction;
[0056] (1-2). Check the accuracy of the event and event-element annotations against the CEC2.0 annotation specification;
[0057] (1-3). Add the missing annotations to incompletely annotated texts, and correct incorrectly annotated texts.
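Part of the check in steps (1-2) and (1-3) could be automated. The sketch below is a minimal illustration that assumes the corpus is stored as XML with an Event tag whose children are Denoter, Time, Location, Participant and Object. These tag names, the eid attribute, and the sample text are assumptions and are not taken from the CEC2.0 specification.

```python
import xml.etree.ElementTree as ET

# Assumed tag inventory; adjust to the actual annotation specification.
ELEMENT_TAGS = {"Time", "Location", "Participant", "Object"}
ALLOWED_CHILDREN = ELEMENT_TAGS | {"Denoter"}


def check_events(xml_text):
    """Return (event id, problem) pairs for events that need manual review."""
    root = ET.fromstring(xml_text)
    problems = []
    for event in root.iter("Event"):
        eid = event.get("eid", "?")
        # An event without a trigger word is treated as incompletely annotated.
        if event.find("Denoter") is None:
            problems.append((eid, "missing Denoter"))
        # Any child tag outside the assumed inventory is flagged as incorrect.
        for child in event:
            if child.tag not in ALLOWED_CHILDREN:
                problems.append((eid, f"unexpected tag {child.tag}"))
    return problems


# Hypothetical sample, not actual corpus content.
sample = """<Content>
  <Event eid="e1"><Participant>Information Office</Participant><Denoter>release</Denoter><Object>information</Object></Event>
  <Event eid="e2"><Time>that day</Time><Place>the scene</Place></Event>
</Content>"""

problems = check_events(sample)
# problems == [("e2", "missing Denoter"), ("e2", "unexpected tag Place")]
```

The flagged events would then be supplemented or corrected by hand, as step (1-3) describes.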
[0058] Step (2), determining the reference targets and the annotation method, comprises:
[0059] (2-1). The reference targets are divided into two categories: references to event elements (object, environment and time) and references to events. References to event elements are annotated in two ways: one for existing elements and one for default elements;
[0060] (2-2). To facilitate computer processing, all types of re...
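The classification in (2-1) can be pictured as a small data model. The sketch below is illustrative only. The class and field names are hypothetical, and reading "default element" as an element with no surface mention in the text is an assumption.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Target(Enum):
    """The reference targets named in (2-1)."""
    OBJECT = "object"
    ENVIRONMENT = "environment"
    TIME = "time"
    EVENT = "event"


class ElementStatus(Enum):
    """The two kinds of element reference named in (2-1)."""
    EXISTING = "existing"
    DEFAULT = "default"  # assumed: the element is not written out in the text


@dataclass(frozen=True)
class ReferenceLabel:
    target: Target
    antecedent_id: str                       # hypothetical id of the referent
    mention: str = ""                        # surface text; may be empty
    status: Optional[ElementStatus] = None   # only used for element references

    def __post_init__(self):
        is_element = self.target is not Target.EVENT
        if is_element and self.status is None:
            raise ValueError("element references need an existing/default status")
        if not is_element and self.status is not None:
            raise ValueError("event references carry no element status")


# Hypothetical labels for illustration.
element_ref = ReferenceLabel(Target.OBJECT, "o1", "it", ElementStatus.EXISTING)
default_ref = ReferenceLabel(Target.TIME, "t1", status=ElementStatus.DEFAULT)
event_ref = ReferenceLabel(Target.EVENT, "e1", "the accident")
```

Rejecting invalid combinations at construction time keeps automatically generated labels and manually entered ones in the same shape.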
[0061] Example 1: Attribute labeling of object elements
[0062]
[0063] Shanghai Municipal Government Information Office
[0064] 15:45 on the 12th release
[0065] information
[0066]
[0067]
[0068] say
[0069]