Word2Vec-BiLSTM-CRF-based named entity recognition method in the legal field

A word2vec-bilstm-crf, legal technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as inability to apply legal fields, poor extraction effect, etc.

Active Publication Date: 2021-05-14
江苏网进科技股份有限公司
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Since the named entities of patents are relatively simple and unified in the legal field, this method can realize the extraction of patent terms, but the effect of this extraction method cannot be applied to the legal field with complex named entities, and there is no effective identification method to mine entities in the legal field , the extraction effect is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word2Vec-BiLSTM-CRF-based named entity recognition method in the legal field
  • Word2Vec-BiLSTM-CRF-based named entity recognition method in the legal field
  • Word2Vec-BiLSTM-CRF-based named entity recognition method in the legal field

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0015] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0016] Please refer to figure 1 , the present invention provides a method for extracting entities in the legal field based on Word2Vec-BiLSTM-CRF, which specifically includes the following steps:

[0017] : Obtain the original data in the legal field and preprocess the data, and obtain the training corpus data, including the following ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Word2Vec-BiLSTM-CRF-based named entity recognition method in the legal field, and the method specifically comprises the following steps: obtaining original data in the legal field, and carrying out the preprocessing of the data, so as to obtain training corpus data; inputting the obtained training corpus data into a Word2Vec algorithm and combining the Word2Vec algorithm with a CBOW model so as to obtain word vectors aiming at the legal field; labeling the training corpus data obtained through preprocessing in combination with template matching and Chinese corpus pause and other modes, obtaining labeled corpus, serving Bi-LSTM as a coding layer of a model, and combining the obtained labeled corpus and obtained word vectors to serve as input and output of the coding layer to obtain text semantic information features; and taking the text semantic information features obtained by the Bi-LSTM layer as input of the CRF, and finally outputting an identification result of the named entity. The method has the advantages that rich entities in the legal document can be identified, fine-grained description of the entities in the legal field and data structuring in the legal field are realized, and the method is of great significance in further mining the relationship among different entities in the legal field.

Description

technical field [0001] The invention relates to the field of named entity recognition, in particular to an entity extraction method in the legal field based on the Word2Vec-BiLSTM-CRF model. Background technique [0002] In the field of law, whether in the investigation process of a case or in court proceedings, there are many types and complex named entities involved. The most common of these entities are the elements of the case, such as people (suspects, victims), time, place, motivation, events, etc. These different case elements have different characteristics and manifestations in different contexts of criminal charges. [0003] There are many kinds of entities in the legal domain, and the representation forms of these entities are different. It is of great significance to use a unified method to identify these named entities with different representation forms, realize the fine-grained description of entities in the legal field, structure the data in the legal field,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06Q50/18G06N3/04G06N3/08
CPCG06F40/295G06Q50/18G06N3/084G06N3/044G06N3/045Y02D10/00
Inventor 李参宏
Owner 江苏网进科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products