Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Novel conversion algorithm used for regular expression matching

An expression matching and expression technology, applied in complex mathematical operations, etc., can solve problems such as increasing the risk of network attacks

Inactive Publication Date: 2017-09-22
NANJING UNIV
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

While the increasing network bandwidth provides convenience to people, it also greatly increases the risk of network attacks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel conversion algorithm used for regular expression matching
  • Novel conversion algorithm used for regular expression matching
  • Novel conversion algorithm used for regular expression matching

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The specific implementation of the present invention is described in detail below in conjunction with accompanying drawing, the core train of thought of the present invention is exactly by the basic construction rule of regular expression character and symbol in the proposed algorithm, and the regular expression of traversal linked list form, obtains regular expression finite state machine.

[0028] The steps required to convert a regular expression into a non-deterministic finite state machine are as follows:

[0029] Step 1: the present invention uses self-written software program to randomly generate a regular expression in the PCRE rule set, where regular expression a{4}b*(c[ab]?|d)+f is example;

[0030] Step 2: Analyze the regular expression, process the regular expression, and get as figure 1 In the form of the regular expression parsing tree shown, since the parsing tree form produces an unnecessary increase in tree depth and an increase in the number of recurs...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a novel construction algorithm used for a regular expression matching method. The algorithm comprises the following steps that: S1: through software, generating any regular expression based on a PCRE (Perl Compatible Regular Expressions) rule set; S2: analyzing the regular expression, and converting the regular expression into a regular expression tree structure; S3: continuously converting the regular expression parse tree form into a token list form; S4: traversing the regular expression of the token list form, processing each node in the token list by the basic formation rule of the algorithm for characters, and generating a finite state machine of the regular expression; and S5: through the finite state machine of the regular expression, generating a corresponding circuit structure, and realizing a regular expression compiler. By use of the algorithm, any regular expression in the rule can be converted. In addition, compared with a traditional conversion algorithm, the generated finite state machine reduces a great quantity of intermediate node states and simplifies a circuit structure so as to finally obtain a regular expression matching circuit suitable for an FPGA (Field Programmable Gate Array), and the algorithm exhibits certain innovation.

Description

technical field [0001] The invention relates to a conversion method in the field of regular expression matching, in particular to an algorithm for constructing a finite state machine used in the process of converting a regular expression into a matching engine. Background technique [0002] With the rapid growth of Internet traffic and the proliferation of data collected and exchanged through the network, a large number of regular expression patterns are matched with high-bandwidth data, making regular expression matching a major bottleneck in applications. While the increasing network bandwidth provides convenience to people, it also greatly increases the risk of network attacks. Regular expression matching is the process of finding all substrings in a text document. We generally use a pattern to describe a regular expression. Since regular expression matching plays a vital role in intrusion detection systems (NIDSs), open source intrusion detection systems such as SNORT a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/11
CPCG06F17/11
Inventor 王中风于怀竹林军
Owner NANJING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products