Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

United labeling method for syntax of Tibet language and semantic roles

A technology of semantic roles and syntax, applied in the field of joint annotation of Tibetan syntax and semantic roles, can solve problems such as the unsatisfactory performance of the Tibetan syntax analysis system and the difficulty in obtaining Tibetan deep syntax, and achieve the effect of improving performance.

Inactive Publication Date: 2013-12-11
MINZU UNIVERSITY OF CHINA
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But at present, it is difficult to obtain the results of deep syntactic analysis of Tibetan
Existing Tibetan syntax analysis systems also perform unsatisfactory in the general domain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • United labeling method for syntax of Tibet language and semantic roles

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the technical means, creative features, goals and effects achieved by the present invention easy to understand, the present invention will be further described below in conjunction with specific embodiments.

[0027] The present invention comprises the following steps:

[0028] a) Distinguish between single and complex sentences: divide long sentences into several short sentences;

[0029] b) Semantic role markers: case markers, including grammatical role components, nominalization or non-predicate verb block markers, removing non-marked content;

[0030] According to the needs of case marking and semantic role labeling in Tibetan, clarify the semantic role of Tibetan. The core semantic roles are Arg0-5, Arg0 represents the agent of the action (agent case), Arg1 represents the impact of the action (result case), Arg2-5 will have different semantic meanings depending on the predicate verb, add some additional semantic roles , such as ArgM-LOC(bit cell)...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method of processing minority characters into Chinese language, and in particular relates to a united labeling method for syntax of Tibet language and semantic roles. The united labeling method comprises the following steps of: a) distinguishing a simple sentence and a compound sentence; b) labeling semantic roles; c) recognizing a predicate; d) classifying verb semantics; e) labeling a syntactic structure; f) editing and revising semantic role labeling results. According to the united labeling method, the syntax of Tibet language and semantic features are extracted, on the one hand, semantic role information such as a performer, a receiver, time, a place and a way expressed in the sentence can be labeled by directly utilizing grammatical labels of the Tibet language; on the other hand, a syntax analytical process can be reacted upon by the predicate semantic role labeling result so that the influence of the syntax labeling which is not well-determined can be reduced, and accordingly the performance of a sentence processing system can be improved.

Description

technical field [0001] The invention relates to a method for processing ethnic minority characters into Chinese, in particular to a method for joint labeling of Tibetan syntax and semantic roles. Background technique [0002] The research content in the field of Tibetan information processing is flourishing, breakthroughs have been made in the processing of characters, words and phrases, and the research on sentence processing has begun. [0003] Semantic analysis is one of the most challenging topics in the field of computational linguistics, and it is also the main bottleneck restricting the large-scale application of language information technology. Semantic analysis is to deduce the actual semantics of the sentence according to the sentence structure and the meaning of the content words in the sentence, which is the main goal of sentence processing. [0004] The task of semantic role labeling is to find out the corresponding semantic role components of the predicates in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27
Inventor 邱莉榕
Owner MINZU UNIVERSITY OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products