Novel method for compressing bit bitmap index

A bitmap and index technology, applied in special data processing applications, instruments, electrical digital data processing, etc., to achieve the effect of good coding efficiency

Active Publication Date: 2014-07-23
TSINGHUA UNIV
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0033] (4) Literal (L), literal chunks that cannot be grouped according to (1) and (2) are classified into this type
[0047] The compression method is the same as COMPAX, but the input code stream is processed first, and similar packets are placed in similar positions, which greatly improves the compression rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel method for compressing bit bitmap index
  • Novel method for compressing bit bitmap index
  • Novel method for compressing bit bitmap index

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0083] The preferred embodiments will be described in detail below in conjunction with the accompanying drawings. It should be emphasized that the following description is only exemplary and not intended to limit the scope of the invention and its application.

[0084] The idea of ​​the present invention to solve the problem is to complete the ICX algorithm: first, the algorithm for effectively compressing the 01 bit sequence, through the designed ICX codebook, converts the 01 strings that meet the characteristics into two types of 31-bit blocks, called F-blocks and L-block, L block is divided into C-L-block, 0-NI-L-block and 1-NI-L block, 0-NI2-L-block and 1-NI2-L block; then, according to ICX codebook and The encoding rules are further encoded on the basis of the codebook encoding FL sequence, specifically including: L-word encoding, F-word encoding, FLF-word encoding, LFL-word encoding, NI-FL word encoding, NI2-FL word encoding.

[0085] The following takes the Internet tr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a novel method for compressing a bit bitmap index in the field of computer networks or big data analysis, namely an ICX algorithm, so that the problems existing in study of compressing bit bitmaps in the prior art are solved. According to the method, a COMPAX algorithm is improved, and a Nearly Identical concept is put forward. Specifically, the method includes the steps that a 01 bit sequence to be compressed is divided into chunks with 31bits as unit, classification marking is carried out on the chunks, the chunks are marked as F-chunks and L-chunks, and the L-chunks are divided into C-L-chunks, 0-NI-L-chunks, 1-NI-L-chunks, 0-NI2-L-chunks and 1-NI2-L-chunks; according to an ICX codebook and coding rules, on the basis of codebook coding FL sequences, coding is further carried out and includes the steps of L-character coding, F-character coding, FLF-character coding, LFL-character coding, NI-FL character coding and NI2-FL character coding. Compared with the COMPAX algorithm, the method for compressing the bit bitmap index improves coding efficiency and is effectively achieved.

Description

technical field [0001] The invention relates to the field of computer network or big data analysis, in particular to a new method for compressing bitmap indexes. Background technique [0002] The rapid development of Internet technology has brought us into the era of information explosion, and massive information content has greatly enriched users. The explosion of the mobile Internet has enabled users to access any content on the Internet from anywhere and at any time, generating more abundant traffic data. [0003] According to Cisco (Cisco) forecast, the user traffic data generated and accumulated by any large Internet company in its daily operations is so huge that it cannot be used in giga (Giga, G) or trillion (Trillion, T) level words. section data to measure. To this end, Cisco has predicted that the data traffic of the network will grow at a rate of 4 times between 2011 and 2016, and in 2016, it will reach 1.3 zetta (Zetta, one trillion trillion) bytes. According...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2237H03M7/30
Inventor 陈震温禹豪马戈曹军威
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products