Text regularization method and device, electronic equipment and storage medium

A text and regularization technology, which is applied in the direction of electrical digital data processing, digital data information retrieval, special data processing applications, etc., can solve the problems of low regularization efficiency, long running time, and computing resource consumption, so as to improve the regularization efficiency , Lower the reading threshold and prevent useless consumption

Pending Publication Date: 2021-08-13
ONE CONNECT SMART TECH CO LTD SHENZHEN
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, for complex regular expressions, the process of writing and verifying is not easy to implement, and the running time is long, resulting in low regularization efficiency. It is time-consuming and it is not clear whether the corresponding results can be produced after the operation is completed. , so that when the corresponding regularization results cannot be generated, computing resources are wasted in vain

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text regularization method and device, electronic equipment and storage medium
  • Text regularization method and device, electronic equipment and storage medium
  • Text regularization method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are part of the embodiments of the application, not all of them. Based on the implementation manners in this application, all other implementation manners obtained by persons of ordinary skill in the art without creative efforts fall within the scope of protection of this application.

[0033] The terms "first", "second", "third" and "fourth" in the specification and claims of the present application and the drawings are used to distinguish different objects, rather than to describe a specific order . Furthermore, the terms "include" and "have", as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of natural language processing, and particularly discloses a text regularization method and device, electronic equipment and a storage medium. The regularization method comprises the steps: replacing regular meta-characters in a regular expression to obtain a first character string; segmenting the first character string to obtain at least one second character string; generating a first dictionary according to the at least one second character string, and replacing the first character string according to the first dictionary to obtain a third character string; generating a second dictionary according to the first dictionary and a first text, wherein the first text is a text to be regularized by using the regular expression; converting the third character string into a bit operation formula according to the second dictionary, and obtaining an operation result of the bit operation formula; and determining whether to regularize the first text by using the regular expression or not according to the operation result.

Description

technical field [0001] The present invention relates to the technical field of natural language processing, in particular to a text regularization method, device, electronic equipment and storage medium. Background technique [0002] Regular expressions (Regular Expression, abbreviated as regex, regexp or RE), also known as regular expressions, are usually used to retrieve and replace text that conforms to a certain pattern or rule. At present, it is not difficult to write simple regular expressions, and the operating efficiency is relatively high. However, for complex regular expressions, the process of writing and verifying is not easy to implement, and the running time is long, resulting in low regularization efficiency. It is time-consuming and it is not clear whether the corresponding results can be produced after the operation is completed. , so that when the corresponding regularization results cannot be generated, computing resources are wasted. Contents of the in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/157G06F16/9032
CPCG06F16/9032G06F40/157
Inventor 李超
Owner ONE CONNECT SMART TECH CO LTD SHENZHEN
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products