Multilingual information extraction method adopting hierarchical pipeline filter system structure

A pipeline filter and system structure technology, applied in the direction of instruments, special data processing applications, electrical digital data processing, etc., can solve the problems that there are no mature technical solutions based on pipeline filters, so as to improve research and development efficiency and improve reusable Effect

Active Publication Date: 2010-06-23
华建机器翻译有限公司
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] However, there is no mature technical solution to adopt the archite

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multilingual information extraction method adopting hierarchical pipeline filter system structure
  • Multilingual information extraction method adopting hierarchical pipeline filter system structure
  • Multilingual information extraction method adopting hierarchical pipeline filter system structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Currently, under the software development method based on components and architecture, software development has been transformed into a process of "component development + architecture-based component assembly". This is because in some specific fields, there are similarities in architecture between different systems and different versions of the same system, and there are even many common components, which is very conducive to software reuse.

[0038] In order to adapt to the above-mentioned changes in the field of software development, the multilingual information extraction method provided by the present invention adopts such as figure 1Shown is a pipe-filter-based architecture. In this architecture, the work to be processed is encapsulated into filters (ie components), and the information interaction relationship is established between multiple filters through pipelines. However, although the pipeline filter style is very suitable for software architecture design in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multilingual information extraction method adopting a hierarchical pipeline filter system structure. In the method, the linguistic material to be processed is identified by a multilingual automatic identifying member; then four simple named entities, which are time, date, percent and amount of money, are identified by a simple named entity identifying member; a person name and a place name are extracted by a person name and place name identifying member; then, participialization is performed by a lingual independent participializing member; part-of-speech tagging is performed by a part-of-speech tagging member; an organization name is identified by an organization name identifying member; and the longest noun phrase is identified with a longest noun phrase identifying member. The method provides a practical basic framework for an information extraction system, so that the problems of reusing and generalization of a plurality of overlapped algorithms are solved successfully; reusability, maintainability and extensibility of software is improved; and the research and development efficiency of the information extraction system is improved.

Description

technical field [0001] The invention relates to a method for realizing information extraction, in particular to a method for extracting multilingual information using a hierarchical pipeline filter architecture, and belongs to the technical field of natural language processing (NLP). Background technique [0002] Information extraction is a technology that studies how to extract specific factual information from text and present it in a structured form. In the field of natural language processing (NLP), in order to complete the task of information extraction with high efficiency and high quality, it is necessary to design and develop an information extraction system specially. The main function of the information extraction system is to extract specific factual information from the text, and then perform structural processing, integrate them together, and turn them into a unified organizational form. The input to the information extraction system is the original text, and t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
Inventor 黄河燕
Owner 华建机器翻译有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products