Information extraction model training method and device, information extraction method and device and medium

An information extraction and model training technology, applied in character and pattern recognition, instruments, computer components, etc. It addresses the problems that the distribution of information units is unbalanced, that the model cannot learn the distribution well, and that information units cannot be accurately identified. The stated effects are high information extraction ability and the reduction or enhancement of the model's concentration (attention) on information units.

Pending Publication Date: 2022-03-08
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

AI Technical Summary

Problems solved by technology

[0002] In information extraction scenarios, such as extracting information from an image (for example, a formula or text in the image), the information is usually extracted with a trained model. In some cases, however, the distribution of information units across different feature classifications may be extremely unbalanced. During training, the model then fails to learn the information units of the sparsely represented feature classifications, and as a result it cannot accurately identify information units.
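The imbalance described above can be measured before training. The following is a minimal sketch, assuming each sample information unit carries a single feature-classification label; the label names (`digit`, `operator`, `integral_sign`) and both helper functions are hypothetical and do not come from the application.

```python
from collections import Counter


def class_distribution(labels):
    """Return the fraction of information units in each feature classification."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {cls: n / total for cls, n in counts.items()}


def imbalance_ratio(labels):
    """Ratio between the most and the least frequent feature classification."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


# Hypothetical labels for the information units of a formula data set.
labels = ["digit"] * 90 + ["operator"] * 9 + ["integral_sign"] * 1

dist = class_distribution(labels)
ratio = imbalance_ratio(labels)
print(dist)
print(ratio)
```

A large ratio indicates that some feature classifications contribute very few training examples, which is the situation the paragraph above describes.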

Examples


Embodiment Construction

[0041] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this application will be thorough and complete, and will fully convey the concepts of example embodiments to those skilled in the art.

[0042] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of the embodiments of the application. However, those skilled in the art will appreciate that the technical solutions of the present application may be practiced without one or more of the specific details, or other methods, components, devices, steps, etc. may be employed. In other instances, well-known methods,...


Abstract

The embodiments of the invention provide an information extraction model training method, device, and medium, relating to the technical field of computers and artificial intelligence. The method comprises: acquiring a sample image that includes at least one sample information unit; determining a corresponding sample label for each sample information unit in the sample image, where each sample label represents the actual feature information of that sample information unit; and training a to-be-trained model on the sample images and sample labels with a dynamic loss function to obtain an information extraction model. The dynamic loss function adjusts the attention that the to-be-trained model pays to sample information units of different feature classifications during training. This technical scheme can improve the information extraction accuracy of the information extraction model.
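The abstract does not give the form of the dynamic loss function. As an illustration only, the sketch below substitutes the well-known focal loss, which scales the cross-entropy of each example by `(1 - p_t) ** gamma` so that confidently classified units contribute less and poorly classified ones (often those from rare feature classifications) contribute more. The function names, the `gamma` parameter, and the example probabilities are assumptions, not the loss claimed in the application.

```python
import math


def focal_loss(probs, target, gamma=2.0):
    """Focal-style loss for one information unit (a stand-in, not the patented loss).

    probs:  predicted probability for each feature classification
    target: index of the true feature classification
    gamma:  focusing strength; gamma = 0 reduces to plain cross-entropy
    """
    # Clamp to avoid log(0) when the model assigns zero probability.
    p_t = min(max(probs[target], 1e-12), 1.0)
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


def batch_loss(batch_probs, targets, gamma=2.0):
    """Mean focal-style loss over a batch of information units."""
    losses = [focal_loss(p, t, gamma) for p, t in zip(batch_probs, targets)]
    return sum(losses) / len(losses)


# Hypothetical predictions: one unit the model already handles well, one it does not.
easy = focal_loss([0.95, 0.03, 0.02], target=0)
hard = focal_loss([0.10, 0.60, 0.30], target=0)
plain_easy = focal_loss([0.95, 0.03, 0.02], target=0, gamma=0.0)

print(easy, hard, plain_easy)
```

With `gamma > 0`, the well-classified unit is down-weighted relative to plain cross-entropy, so the gradient signal shifts toward the units the model still gets wrong. Whether the application uses this weighting, a class-frequency weighting, or something else cannot be determined from the text shown here.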

Description

Technical field

[0001] The present application relates to the technical field of computers and artificial intelligence, and in particular to an information extraction model training method, an information extraction method, a device, and a medium.

Background

[0002] In information extraction scenarios, such as extracting information from an image (for example, a formula or text in the image), the information is usually extracted with a trained model. In some cases, however, the distribution of information units across different feature classifications may be extremely unbalanced. During training, the model then fails to learn the information units of the sparsely represented feature classifications, and as a result it cannot accurately identify information units. How to improve the accuracy with which the information extraction model extracts information is therefore an urgent technical problem to be solved.

Conten...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06V10/774, G06V10/764, G06V30/148
CPC: G06F18/214, G06F18/241
Inventor: 边晓航, 辛晓哲
Owner: BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD