Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Statistical machine translation method based on predicate argument structure (PAS)

A predicate argument structure, statistical machine translation technology, applied in the direction of instruments, calculations, special data processing applications, etc., can solve problems such as there is no very good solution

Active Publication Date: 2015-05-13
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for global reordering, that is, reordering that takes the overall structure of the sentence into account, current machine translation models do not have a very good solution

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Statistical machine translation method based on predicate argument structure (PAS)
  • Statistical machine translation method based on predicate argument structure (PAS)
  • Statistical machine translation method based on predicate argument structure (PAS)

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0034] 1. Perform automatic word segmentation, automatic word alignment, syntactic analysis and bilingual joint semantic role labeling for bilingual sentences in the bilingual corpus. The specific implementation is as follows:

[0035] Segment the source language sentence and the target language sentence in the bilingual sentence pair, and obtain the word segmentation results of the source language end and the target language end. If the source language or the target language does not contain Chinese, word segmentation is not required. If Chinese is included in the source language or the target language, word segmentation for Chinese is required. In the embodiment of the present invention, the Chinese word is automatically segmented with the lexical analysis tool Urheen. The Urheen lexer tool can be downloaded for free at:

[0036] http: / / www.openpr.org.cn / index.php / NLP-Toolkit-for-Natural-Language-Processing / .

[0037]After obtaining the word segmentation results of the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a statistical machine translation method based on a predicate argument structure (PAS). The statistical machine translation method comprises the following steps of: carrying out word segmentation, automatic word alignment, syntactic analysis and bilingual combined semantic role labeling on bilingual sentences in a bilingual corpora; extracting PAS conversion rules of the bilingual sentences according to results of the bilingual combined semantic role labeling so as to model the relationship between PASs of two languages; matching a plurality of semantic role labeling results of sentences to be translated by using the PAS conversion rules and carrying out corresponding translation; and structuring a translation hypergraph according to results of matching and translation based on the PAS conversion rules to finally generate a translation result.

Description

technical field [0001] The invention relates to the technical field of natural language processing, and is a novel statistical machine translation method based on a predicate argument structure (abbreviated as PAS). Background technique [0002] The current statistical machine translation method is mainly a process of automatically learning translation rules from bilingual corpora and using these rules to translate test sentences. Statistical machine translation models have experienced word-based, phrase-based, and syntactic structure-based translation models, and the translation quality has also made great progress. However, current translation models only consider the hierarchical structure properties of sentences at best, and do not model the semantic knowledge in sentences. [0003] At the same time, reordering has always been an important and difficult topic in machine translation research. Current translation models model local reordering well. However, current mach...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/28G06F17/27
Inventor 宗成庆翟飞飞张家俊周玉
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products