Enterprise subject matching method based on part-of-speech tagging of government affair text data

A technology of part-of-speech tagging and text data, which is applied in text database query, unstructured text data retrieval, data processing applications, etc., and can solve the time-consuming and labor-intensive problems of enterprise entities

Pending Publication Date: 2021-06-25
浙江非线数联科技股份有限公司
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The technical problem to be solved by the present invention is to solve the time-consuming and labor-intensive problem in the existing big data processing of government affairs mainly by manually checking the matching of the two parties’ text enterprise entities. According to the enterprise naming pattern extraction by known enterprise naming rules, the matching of the enterprise naming pattern, according to the matching result of the enterprise naming pattern, determines the matching result of the enterprise subject in the text. When collecting data, the matching of enterprise entities can be realized directly from the government affairs text data, which greatly improves the matching efficiency and realizes the intelligent intercommunication of government affairs big data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Enterprise subject matching method based on part-of-speech tagging of government affair text data
  • Enterprise subject matching method based on part-of-speech tagging of government affair text data
  • Enterprise subject matching method based on part-of-speech tagging of government affair text data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] Combine below Figure 1-Figure 4 The present invention will be further described with specific embodiments.

[0046] A method for matching business entities based on part-of-speech tagging of government affairs text data, comprising the following steps:

[0047] (1) Customize the naming of enterprise entities: use the enterprise entity recognition module to extract the subject vocabulary of enterprise names; based on the self-owned enterprise property field lexicon and geographical thesaurus, by determining the leftmost and rightmost boundaries of the enterprise name, the text can be obtained Business name subject vocabulary in .

[0048] (2) Use the ns+nn pattern extraction module to obtain the enterprise name to be matched from the government text: for the enterprise name subject vocabulary obtained by the enterprise entity recognition module in step (1), extract it according to the characteristics of its own thesaurus used in the recognition Matching pattern (ns+nn...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an enterprise subject matching method based on part-of-speech tagging of government affair text data. The method comprises the steps: extracting enterprise names in the government affair text data, extracting enterprise naming modes according to known enterprise naming rules, matching the enterprise naming modes, According to the matching result of the enterprise naming modes, carrying out part-of-speech tagging of the government affair text data, carrying out part-of-speech tagging of the government affair text data. according to the enterprise main body matching method, when the to-be-matched government affair data is processed, matching of the enterprise entities can be directly achieved from the government affair text data, the matching efficiency is greatly improved, and intelligent intercommunication of the government affair big data is achieved.

Description

technical field [0001] The invention relates to the field of computer application technology, in particular to an enterprise subject matching method based on part-of-speech tagging of government affairs text data. Background technique [0002] With the continuous advancement of national informatization construction, data resource sharing and integration work has been carried out in many regions. However, for government departments, there are still multiple systems working together and using complex interactive methods for data sharing. The status quo is prone to data updates not being timely, and when a system is out of service, the data of other systems is not updated. Enterprise information is the core content of multiple systems. However, due to the large number of enterprise information attributes and the need for changes, such as Enterprise name, the information of an enterprise name has been changed many times, and there may be manual mis-entry scenarios during the cha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F40/284G06F16/33G06Q10/10G06Q50/26
CPCG06F40/295G06F40/284G06F16/3344G06Q10/103G06Q50/26
Inventor 张聪吴地龙吴天飞
Owner 浙江非线数联科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products