Method and device for segmenting fragments

The method applies a logarithmic algorithm and configuration-file techniques in the computer field. It addresses unbalanced utilization of cluster resources, degraded cluster performance, and low read and write efficiency. It targets the hotspot problem and the small-file problem, and aims to improve cluster resource utilization and read and write performance.

Pending Publication Date: 2019-08-16
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1

AI Technical Summary

Problems solved by technology

[0007] There are two main problems with the existing HBase sharding methods. The first, the fixed-value algorithm, causes hotspots: when the amount of data is small, the data is concentrated on a single data shard on one server, which results in low read and write efficiency and unbalanced utilization of cluster resources.
The second, the cubic algorithm, causes a small-file problem: it generates many small data fragments, which are essentially many small files. Reading, writing, and querying these small files is very inefficient, which degrades cluster performance.
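The contrast between the two rules can be illustrated with a minimal sketch. The exact formulas are not given in this excerpt, so the forms below are assumptions: the fixed rule returns a constant (50 GB, the figure used in the background section), and the cubic rule is assumed to be `base * n^3` capped at an upper bound, with a hypothetical `base` of 256 MB. The function names are illustrative only.

```python
MB = 1024 ** 2
GB = 1024 ** 3


def fixed_threshold(fragment_count: int, fixed_size: int = 50 * GB) -> int:
    """Fixed-value rule: the threshold ignores how many fragments exist."""
    return fixed_size


def cubic_threshold(fragment_count: int,
                    base: int = 256 * MB,   # hypothetical starting size
                    cap: int = 50 * GB) -> int:
    """Cubic rule (assumed form): base * n^3, capped at an upper bound."""
    return min(cap, base * fragment_count ** 3)


if __name__ == "__main__":
    # Print the split threshold in MB for the first few fragment counts.
    for n in (1, 2, 3, 4):
        print(n, fixed_threshold(n) // MB, cubic_threshold(n) // MB)
```

Under these assumptions, a small table never reaches the fixed threshold and stays on one server, while the cubic rule splits very early and leaves behind fragments of only a few hundred megabytes.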


Embodiment Construction

[0038] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0039] Figure 1 shows a method for segmenting fragments according to an embodiment of the present invention. As shown in Figure 1, the method includes:

[0040] Step S101, obtaining the size of the largest fragment of the current target table on the server.

[0041] Preferably, said server is a server in an HBase cluster. ...

Abstract

The invention discloses a method and device for segmenting fragments and relates to the technical field of computers. A specific embodiment of the method comprises the steps of: obtaining the size of the largest fragment of a current target table on a server; and, if that size is determined to be greater than a fragment size threshold, segmenting the fragment, wherein the fragment size threshold is obtained according to the number of fragments of the current target table on the server. The method and device can effectively improve the utilization rate of cluster resources and solve the problem of data hotspots.
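The decision described in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract says only that the threshold is derived from the number of fragments, so the threshold function is passed in as a parameter. The default `log_threshold`, its `base` of 1024 MB, and the name `should_split` are hypothetical, with the logarithmic shape chosen only because the summary mentions a logarithmic algorithm.

```python
import math
from typing import Callable, Sequence

MB = 1024 ** 2


def log_threshold(fragment_count: int, base: int = 1024 * MB) -> float:
    """Hypothetical threshold that grows logarithmically with fragment count."""
    return base * (1 + math.log2(max(fragment_count, 1)))


def should_split(fragment_sizes: Sequence[int],
                 threshold_for: Callable[[int], float] = log_threshold) -> bool:
    """Return True if the largest fragment of the table exceeds the threshold.

    fragment_sizes: sizes in bytes of the target table's fragments on one server.
    threshold_for:  maps the fragment count to a size threshold in bytes.
    """
    if not fragment_sizes:
        return False
    largest = max(fragment_sizes)                    # size of the largest fragment
    threshold = threshold_for(len(fragment_sizes))   # threshold from fragment count
    return largest > threshold


if __name__ == "__main__":
    print(should_split([2048 * MB]))             # one 2 GB fragment
    print(should_split([1500 * MB, 100 * MB]))   # two fragments, largest 1.5 GB
```

Passing the threshold rule as a callable keeps the comparison step separate from the choice of formula, so a fixed or cubic rule could be substituted for comparison.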

Description

Technical field

[0001] The present invention relates to the field of computer technology, and in particular to a method and device for segmenting fragments.

Background

[0002] HBase is a highly reliable, high-performance, column-oriented, scalable distributed storage system that can be used to build large-scale structured storage clusters on inexpensive servers. It has been widely used in the field of big data. Its defining characteristic is that the cluster can be scaled horizontally and a large amount of data can be distributed evenly across the servers of the cluster, which yields very high read and write performance. However, the existing HBase data fragment management methods cannot use cluster resources effectively and suffer from hotspots.

[0003] There are two general methods for HBase sharding:

[0004] The first, the fixed-value algorithm, uses a fixed fragment size. For example: set each shard to be fixed at 50G, wh...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F3/06
CPC: G06F3/0607, G06F3/0644, G06F3/067
Inventor: 张楠
Owner: BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD