Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text compression method and device

A text compression and original text technology, applied in structured data retrieval, instrumentation, electrical digital data processing, etc., can solve the problem of low security and achieve the effect of high security, not easy to be cracked, and high confidentiality

Active Publication Date: 2016-12-07
AGRICULTURAL BANK OF CHINA
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This type of text compression is less secure

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text compression method and device
  • Text compression method and device
  • Text compression method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0029] see figure 1 , which shows the flow of Embodiment 1 of the text compression method provided by the present application. In a specific application, the file compression method is applied in a HADOOP cluster, and each machine node in the HADOOP cluster executes the text compression method in parallel.

[0030] like figure 1 As shown, Embodiment 1 of the text compression method may include steps S101 to S103.

[0031] Step S101: In the original text file, determine several sa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text compression method. The method is applied in a HADOOP cluster and a plurality of machine nodes in the HADOOP cluster can use a computing framework MapReduce and execute the text compression method. The method comprises the following steps of: in a Map stage of the MapReduce, extracting sampling phrases from an original text file; in a Reduce stage of the MapReduce, setting a corresponding code for each sampling phrase, wherein a corresponding relationship between the sampling phrases and the codes can serve as a mapping function which is stored in a relationship database; after the mapping function is obtained, integrally compressing the original text file by using the mapping function so as to obtain a compressed file; after a to-be-queried phrase is received, compressing the to-be-queried phrase by using the mapping function so as to obtain a compressed phrase; and searching the compressed phrase in the compressed file. Moreover, the invention provides a text compression device.

Description

technical field [0001] The present application relates to the technical field of file compression processing, and more specifically, to a method and device for storing, compressing and analyzing text data based on HADOOP clusters and relational databases. Background technique [0002] Text compression is to encode a large amount of text data according to a certain method to achieve the purpose of information compression and storage. The compressed data can be restored to the state before compression through decoding without losing information. [0003] An existing text compression method is LZO (Lempel Ziv Oberhumer) compression, which uses a dictionary table to replace repeated character strings in data, thereby realizing compression. This type of text compression is less secure. Contents of the invention [0004] In view of this, the present application provides a text compression method to improve text security. In addition, the present application also provides a tex...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30H03M7/30
CPCG06F16/2282G06F16/258G06F16/284H03M7/30
Inventor 郭会耿鹏
Owner AGRICULTURAL BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products