A method, device and system for extracting Chinese entity association relationship

An association relationship and entity technology, which is applied in the fields of unstructured text data retrieval, text database clustering/classification, instruments, etc. The effect of reducing the amount of computation

Active Publication Date: 2022-04-08
ZHONGKE DINGFU BEIJING TECH DEV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] This application provides a method, device and system for extracting associations of Chinese entities to solve the problem in the prior art that associations cannot be accurately extracted from unstructured Chinese texts

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method, device and system for extracting Chinese entity association relationship
  • A method, device and system for extracting Chinese entity association relationship
  • A method, device and system for extracting Chinese entity association relationship

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] In order to enable those skilled in the art to better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the accompanying drawings.

[0033] Structured information is information managed by databases that we usually come into contact with, including records of production, business, transactions, and customer information. Unstructured information, the technical term is content, which covers a wider range of information and can be divided into: operational content: such as contracts, invoices, letters and procurement records; departmental content: such as document processing, electronic forms, briefing files and emails ; Web content: information in formats such as HTML and XML; multimedia content: such as sound, video, graphics, etc.

[0034] Massive information appearing on the Internet can be roughly divided into three type...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This application discloses a method, device, and system for extracting associations of Chinese entities. According to the relationship properties of relational words in Chinese texts, the target implementing entity and target receiving entity related to the relational words in the text are extracted, and then according to the relationship The target implementing entity and the target receiving entity corresponding to the word and the relational word generate the Chinese entity association relationship corresponding to the relational word in the text. The technical solution provided by the embodiment of the present application divides the unstructured Chinese text into different words and sentences according to different relational properties, and further reduces the location range of the target implementing entity and target receiving entity of each relational word, so as to improve the search efficiency. Accuracy and search speed, reduce the amount of calculation. In addition, the technical solution in the embodiment of the present application also uses division rules on the Chinese grammatical level to largely filter out some redundant erroneous relative words and erroneous entities, improving the accuracy of extracting relative words and entities .

Description

technical field [0001] The present application relates to the technical field of natural language processing, in particular to a method, device and system for extracting associations of Chinese entities. Background technique [0002] With the rapid development of the Internet and the rapid improvement of the economic level, if you want to take the lead when formulating corporate strategies, you must have a keen sense of smell, grasp more relevant information, and grasp as much as possible between enterprises and enterprises. The relationship between enterprises and individuals can assist decision makers to make the most reasonable plan. [0003] Existing enterprise association identification technologies generally rely more on standardized and structured collected data. However, this method has great limitations, such as slow update of text information sources, high delay, etc., and the structuring of data will take more time to screen and organize information, and it may n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/36G06F16/35G06F40/253
CPCG06F40/253
Inventor 李德彦晋耀红吴相博
Owner ZHONGKE DINGFU BEIJING TECH DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products