Organization name acquiring method and device

A technology of name and suffix name, applied in the field of obtaining organization name, can solve the problems of Beijing### loss, incomplete name of organization name, low recognition accuracy, etc., to achieve the effect of improving accuracy

Active Publication Date: 2017-11-17
ZHONGKE DINGFU BEIJING TECH DEV
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] At present, the names of many institutions recognized by the entity recognition system are incomplete names. For example, ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Organization name acquiring method and device
  • Organization name acquiring method and device
  • Organization name acquiring method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0057] see figure 1 , an embodiment of the present invention provides a method for obtaining an institution name, the method comprising:

[0058] Step 101: mark the organization name included in the unstructured text file by word segmentation system and entity recognition system.

[0059] Step 102: Determine whether the institution name is a full name of an entity institution according to a suffix model, the suffix model including at least one suffix name of an entity institution.

[0060] Step 103: When the name of the institution is not the full name of the entity institution, obtain the words before the name of the institution that meet the preset conditions.

[0061] Step 104: Combine the obtained words and the name of the institution to form the full name of the entity institution.

[0062]In the embodiment of the present invention, after marking the name of the institution, by determining whether the name of the institution is a full name, if it is not a full name, obt...

Embodiment 2

[0064] see diagram 2-1 , an embodiment of the present invention provides a method for obtaining an organization name, the method is used to obtain the organization name included in an unstructured text file, including:

[0065] Step 201: mark the organization name included in the unstructured text file by word segmentation system and entity recognition system.

[0066] Both the word segmentation system and the entity recognition system can adopt existing systems. Unstructured text files are corporate official documents, and unstructured text files include text and other content. For example, see Figure 2-2 The unstructured text file of "Beijing ### Co., Ltd." is shown, and the unstructured text file is composed of characters.

[0067] In this step, the unstructured text file is input into the word segmentation system, and the words in the unstructured text file are segmented through the word segmentation system, and the part of speech of each word is marked; then the segm...

Embodiment 3

[0106] see image 3 , an embodiment of the present invention provides a method for obtaining an organization name, the method is used to obtain the organization name included in an unstructured text file, including:

[0107] Step 301: mark the organization name included in the unstructured text file by word segmentation system and entity recognition system.

[0108] Both the word segmentation system and the entity recognition system can adopt existing systems. Unstructured text files are corporate official documents, and unstructured text files include text and other content. For example, see Figure 2-2 The unstructured text file of "Beijing ### Co., Ltd." is shown, and the unstructured text file is composed of words.

[0109] In this step, the unstructured text file is input into the word segmentation system, and the words in the unstructured text file are segmented through the word segmentation system, and the part of speech of each word is marked; then the segmented uns...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an organization name acquiring method and device, and belongs to the field of information extraction and text mining. The method comprises the following steps: marking an organization name included in a non-structured text file through a word segmentation system and an entity identification system; determining whether the organization name is the full name of the entity mechanism according to a suffix model which comprises a suffix name of at least one entity mechanism; acquiring words which are in front of the organization name and meet preset conditions if the organization name is not the full name of the entity mechanism; and forming the full name of the entity mechanism through the obtained words and the organization name. The device comprises a marking module, a determining module, an acquiring module and a forming module. With the adoption of the method and the device, the organization name identification accuracy can be improved.

Description

technical field [0001] The invention relates to the fields of information extraction and text mining, in particular to a method and device for acquiring organization names. Background technique [0002] Most enterprises will produce a large number of corporate documents during their operation, which contain a lot of useful information that is helpful for understanding the company. In order to facilitate users to quickly understand the company, useful information can be extracted from corporate documents and displayed to users. [0003] Corporate documents often include useful information such as the name of the organization. The name of the organization is often the name of the company. For example, Beijing ### Co., Ltd. is a name of the organization. In order for users to quickly understand the company, it is often necessary to obtain the name of the organization from the company's official documents. At present, the name of the organization in the corporate document can ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06K9/62
CPCG06F40/295G06F40/289G06F18/214
Inventor 任宁席丽娜吴云鹤
Owner ZHONGKE DINGFU BEIJING TECH DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products