File fingerprint analyzing method for massive data

A file fingerprint analysis technology for massive data, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. It reduces the error rate, improves computing performance, and simplifies the analysis and processing process.

Active Publication Date: 2013-08-14
UNIV OF ELECTRONICS SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to propose a file fingerprint analysis method for massive data that solves the above-mentioned problems of file comparison over existing massive data.



Examples


Embodiment Construction

[0023] The present invention will be further elaborated below in conjunction with the accompanying drawings and specific embodiments.

[0024] Before introducing the embodiments, some basic concepts and ideas are briefly described.

[0025] Parallel computing model: A parallel computing model is an abstract computing model obtained by starting from the design and analysis of parallel algorithms and abstracting the basic characteristics of various parallel computers (or at least of a certain type of parallel computer). In a broader sense, the parallel computing model provides a hardware and software interface for parallel computing. Under the agreement of this interface, parallel system hardware designers and software designers can develop support mechanisms for parallelism, thereby improving system performance.

[0026] A single computer and a computer system composed of multiple computers are connected to each other using a network. The hardware, software, and o...


Abstract

The invention discloses a file fingerprint analysis method for massive data. The method comprises the following steps: establishing a parallel computing model; generating file fingerprints; transmitting the file fingerprints; storing the file fingerprints; comparing the file fingerprints; and analyzing the comparison result. By means of the parallel computing model, the method uses file fingerprints generated from file system attributes and data contents and makes full use of the parallel computing capabilities of network node computers. This improves overall computing performance, simplifies large-scale massive data analysis and processing in a heterogeneous system, improves massive data processing efficiency, and reduces the error rate. The method is applicable to distributed systems, data centers, cloud storage, and similar fields.
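The generate, compare, and analyze steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the source does not say which file system attributes or which hash function are used, so the choice of file size plus SHA-256 over the content, the thread pool standing in for network node computers, and the function names `file_fingerprint`, `fingerprint_tree`, and `compare_fingerprints` are all assumptions made for illustration. Transmission and storage of the fingerprints are omitted.

```python
import hashlib
import os
from concurrent.futures import ThreadPoolExecutor


def file_fingerprint(path):
    """Hash one file attribute (size, an assumed choice) plus the file content."""
    digest = hashlib.sha256()
    # Assumption: only the size is mixed in as a "file system attribute".
    digest.update(str(os.stat(path).st_size).encode("ascii"))
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def fingerprint_tree(root, workers=4):
    """Fingerprint every file under root; worker threads stand in for nodes."""
    paths = [
        os.path.join(dirpath, name)
        for dirpath, _dirs, files in os.walk(root)
        for name in files
    ]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        digests = list(pool.map(file_fingerprint, paths))
    # Key by path relative to root so two trees can be compared directly.
    return {os.path.relpath(p, root): d for p, d in zip(paths, digests)}


def compare_fingerprints(old, new):
    """Classify files as added, removed, or changed between two fingerprint maps."""
    common = set(old) & set(new)
    return {
        "added": sorted(set(new) - set(old)),
        "removed": sorted(set(old) - set(new)),
        "changed": sorted(k for k in common if old[k] != new[k]),
    }
```

Calling `compare_fingerprints(fingerprint_tree(dir_a), fingerprint_tree(dir_b))` on two directory trees returns the relative paths that were added, removed, or changed. Only the short fingerprints need to be compared, not the file contents themselves.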

Description

Technical field

[0001] The invention belongs to the fields of computer data storage, data management, and data analysis, and specifically relates to a method for analyzing the file structure and content of data, generating file fingerprints, and analyzing file fingerprints under massive data.

Background technique

[0002] With the rapid development of computer storage technology and network technology, the growth rate of data has also doubled. Massive data storage uses cluster applications, grid technology, or distributed file systems so that a large number of different types of storage devices in the network work together through application software, jointly providing data storage and business access functions. Therefore, when faced with a large amount of data in a heterogeneous system, how to quickly compare and identify changes in data and file content, and provide corresponding feedback, has become a bottleneck in deploying large-scale services.

[000...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
Inventor: 唐雪飞, 石砾
Owner: UNIV OF ELECTRONICS SCI & TECH OF CHINA