Data compression algorithm based on semantic analysis

A technology of data compression and semantic analysis, which is applied in the computer field, can solve the problems of ignoring keywords and the regularity of special sentences, etc., and achieve the effect of clear structure, high repeatability and high compression ratio

Inactive Publication Date: 2014-04-16
SHAANXI UNIV OF TECH
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Even the data compression technology based on Huffman coding, which is currently widely used and has a high compression ratio, still has some shortcomings, because this compression technology compresses all characters of th

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data compression algorithm based on semantic analysis
  • Data compression algorithm based on semantic analysis
  • Data compression algorithm based on semantic analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] The technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0020] refer to figure 1 , a data compression algorithm based on semantic analysis, including the following steps:

[0021] 1) Open the C language text file to be compressed, according to the classification of C language for text (keywords, variables, constants, strings, characters, comments, etc.), obtain each key element in turn, ignore the secondary elements, and save them in A doubly linked list LinkSrc;

[0022] 2) Traverse the linked list LinkSrc, count the number of occurrences of each element, and use the basic information (type, frequency, content, length, code, etc.) of the element as a node to newly build a two-way linked list LinkRate;

[0023] 3) Use the nodes in the linked list LinkRate as leaf nodes to form a Huffman tree with the root LinkRoot;

[0024] 4) Copy and save the leaf nodes in LinkRoot as...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data compression algorithm based on semantic analysis. The data compression algorithm is a compression method used for language text files which are generally limited and is aimed to solve the compression problem of a C language source file text, strictness of a grammar format of C language is made use of, C language basic elements serve as basic elements of text compression, coding is carried out in a Huffman coding method, and a C language text compression method, namely a text compression method based on semantic analysis is given. As is shown in experiments, the compression ratio of the method is improved by nearly 1.5 times than that of an original Huffman text compression method.

Description

technical field [0001] The invention belongs to the technical field of computers and relates to a data compression algorithm based on semantic analysis. Background technique [0002] Text data plays a very important role as a medium and means of transmitting information, and has a wide range of applications in many fields. However, the amount of data used to represent these text files is very large, making the storage cost very high, and the role of the memory cannot be maximized, which is well adapted to the needs of high-speed communication in today's network environment. Data compression technology is one of the key technologies in the fields of communications and computers. It compresses data through certain technologies and methods, so that large-capacity files can be stored in a small storage space, which is very important for network resource sharing. , information exchange, high-speed communication, etc. have very important significance, and data compression technol...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H03M7/30
Inventor 洪歧
Owner SHAANXI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products