Method for identifying and extracting water conservancy space relation words

A technology of spatial relations and relational words, applied in special data processing applications, instruments, unstructured text data retrieval, etc., can solve the problems of high cost, time-consuming and labor-intensive rules, etc. The effect of saving manpower and time

Pending Publication Date: 2019-12-03
HOHAI UNIV
View PDF1 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, manually writing rules is time-consuming and labor-intensive, and it is necessary to repeatedly write rules in various fields, which is too expensive

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying and extracting water conservancy space relation words

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Such as figure 1 As shown, a method for identifying and extracting water conservancy spatial relation words, comprising the following steps:

[0034] (1) Acquisition of spatial relationship seed sets based on quantitative statistical features;

[0035] (11) Data preprocessing; perform word segmentation and part-of-speech tagging on entities and co-occurrence sentences to form a word set, and filter stop words such as "is", "de", "ba", "le";

[0036] (12) Feature selection and statistics; obtain the distribution of spatial relational words in sentences by counting 7 features: (a) part of speech POS; (b) position LOC of relational words to water conservancy object entities; (c) left of spatial relational words When there are conjunctions or prepositions, the position LCCP (left and right or in the middle of two entities); (d) the distance DIS1 from the spatial relation word to entity 1; (e) the distance DIS2 from the spatial relation word to the end of the sentence; (f) ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying and extracting water conservancy spatial relationship words. The method comprises the following steps: acquiring a spatial relationship seed set basedon quantitative statistical characteristics; constructing an original syntax mode; generalizing a syntax mode, and generalizing a plurality of original syntax modes for expressing the same kind of spatial relationship into one mode, so that the number of modes is reduced, and the abstraction degree is improved; and based on the generalized syntax mode, realizing extraction of the spatial relationship. According to the method, the spatial relationship extraction problem in the field of water conservancy is concerned, automatic identification of the spatial relationship, construction of a spatial relationship word set, acquisition of a spatial relationship syntactic mode and extraction of a spatial relationship tuple are realized by utilizing a weak supervision method, and a large amount ofmanpower and time are saved. Water conservancy data resource extraction oriented to the spatial relationship is realized, free texts in the water conservancy field are converted into structured data,and large-scale and professional spatial relationship supplementation is carried out on the atlas, so that more accurate query service is provided for users.

Description

technical field [0001] The invention relates to the technical field of water conservancy business, in particular to a method for identifying and extracting water conservancy space relational words. Background technique [0002] With the rapid development of Internet technology, the water conservancy business has accumulated a large amount of water conservancy data with spatial relationships, including a large number of official documents. Natural language text is an important source of spatial data, so extracting spatial relationship data from text is an important research direction in the field of water conservancy. [0003] The main purpose of information extraction is to extract specific factual information from text, that is, to convert unstructured natural language text into structured or semi-structured data and store it, which can help people acquire knowledge conveniently and quickly, and can also be used for detailed The mining and analysis of NLP plays an importan...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F16/35
CPCG06F16/35Y02A10/40
Inventor 冯钧相颖夏佩佩陆佳民朱跃龙
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products