Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for adjusting distribution of data fragments and data server

A data sharding and data technology, applied in the field of communication, can solve the problems of crowding out the available IO capacity of user data, difficulty in file system coordination, affecting the efficiency of cache usage, etc., and achieve the effect of reducing the transmission bandwidth requirement.

Active Publication Date: 2017-07-07
ALIBABA CLOUD COMPUTING CO LTD
View PDF11 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the cached data is placed in the memory, which is several orders of magnitude smaller than the capacity of the disk memory (TB vs GB), the effect of the cache will decrease as the amount of data increases, and the cache system puts the cache in The hard disk will occupy the available IO capacity of user data. In addition, it is difficult to cooperate with the cache and the underlying file system.
For example, rewriting the data of a certain file will invalidate the cached data of the entire file, which will affect the efficiency of the cache.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for adjusting distribution of data fragments and data server
  • Method for adjusting distribution of data fragments and data server
  • Method for adjusting distribution of data fragments and data server

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] As mentioned in the background technology, if the limitation of cross-computer room traffic is avoided by increasing the bandwidth of the computer room, the cost will be very high, and if the cross-computer room traffic is avoided by setting a cache inside each computer room, it will be limited by memory And the overall storage efficiency will be reduced. Therefore, this application divides the data in the computer room into different data fragments. The data fragments can also be called data blocks. As a physical recording method of data, it is a group of records that are logically arranged continuously , each record consists of multiple copies, and a copy of a data shard is a unit of data transmitted between a data center and an input or output device or other data centers. Based on data fragmentation in different computer rooms, network traffic across data centers is optimized by dynamically adjusting data distribution, thereby maximizing data access performance whil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for adjusting distribution of data fragments. The method comprises the following steps of: when an adjustment moment corresponding to a to-be-processed data fragment arrives, obtaining access frequency information of the to-be-processed data fragment; determining earning data of each data center corresponding to the data fragment; and finally generating optimal distribution information according to the earning data of each data center and the quantity of copies, and adjusting the position, at each data center, of each copy according to the optimal distribution information. According to the method, the distribution condition of data fragments can be dynamically optimized according to the access frequencies and characteristics of the data fragments without additionally arranging an internal memory or a hard disk for storage, so that the requirement for transmission bandwidth between the data center is reduced.

Description

technical field [0001] The invention relates to the field of communication technology, in particular to a method for adjusting the distribution of data slices. The invention also relates to a data server. Background technique [0002] In the environment of cloud computing and big data processing, the distributed file system serves as the underlying storage layer and provides nearly infinitely scalable storage service capabilities to the upper-layer business. However, with the increase in the scale of data centers and the popularity of global deployment, there are more and more accidents where data centers go offline due to various physical problems (such as cutting optical fibers, fires in computer rooms, etc.). Line, will cause serious service availability problems. [0003] In order to improve the availability and continuity of data services, a common practice in the industry is to distribute multiple copies of data to multiple data centers in a certain area, and improve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F3/06H04L29/08
CPCG06F16/13G06F3/0655G06F3/067H04L67/1097G06F16/122G06F16/182G06Q40/125G06F16/278
Inventor 张海勇陆靖姚文辉董乘宇朱家稷
Owner ALIBABA CLOUD COMPUTING CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products