MVCC multi-version folding tree implementation system and method based on ClickHouse database

An implementation system and method applied in database indexing, electronic digital data processing, structured data retrieval, and related fields. It addresses the inability of the built-in multi-version folding tree to meet MVCC storage requirements, with the effects of reducing data reads, reducing complexity, and offering good application and popularization value.

Pending Publication Date: 2022-04-15
上海沄熹科技有限公司

AI Technical Summary

Problems solved by technology

[0004] When ClickHouse's built-in multi-version folding tree is used to store MVCC data, it does not meet our needs for multi-version data: our version number column must be evaluated in two different cases, whereas the built-in folding tree simply keeps the latest row according to the version number.

Embodiment

[0043] As shown in Figure 1, the MVCC multi-version folding tree implementation system based on the ClickHouse database of the present invention includes a multi-version folding tree definition declaration module, a multi-version folding tree merging module, and a multi-version folding tree query module.

[0044] Multi-version folding tree definition declaration module: adds a new folding tree, sets an integer column as the deletion mark column, and sets the wall_time and logic_time columns as the version number column. The deletion mark column is an integer column chosen in the table creation statement: a value of 1 indicates that the row is inserted, and a value of -1 indicates that the row is deleted. The version number column is a tuple column holding the two values wall_time and logic_time. It is ordered by comparing wall_time first and, if those values are equal, comparing logic_time. This column is used to mark multi-version data.
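The ordering rule in [0044] can be illustrated with a minimal Python sketch. It is not the patented implementation and not ClickHouse code: the names `Version`, `Row`, `key`, and the sample values are hypothetical, chosen only to show that a (wall_time, logic_time) tuple compared lexicographically gives exactly the "wall_time first, then logic_time" order, and how the deletion mark values 1 and -1 sit alongside it.

```python
from typing import NamedTuple


class Version(NamedTuple):
    """Version number column: a (wall_time, logic_time) tuple."""
    wall_time: int
    logic_time: int


class Row(NamedTuple):
    """Hypothetical row layout used only for this illustration."""
    key: str
    sign: int          # deletion mark column: 1 = inserted, -1 = deleted
    version: Version


rows = [
    Row("k1", 1, Version(100, 2)),
    Row("k1", -1, Version(100, 5)),   # same wall_time, larger logic_time
    Row("k1", 1, Version(99, 9)),     # smaller wall_time sorts first
]

# Python compares tuples element by element, so sorting by the version
# tuple compares wall_time first and falls back to logic_time on a tie.
ordered = sorted(rows, key=lambda r: r.version)
newest = ordered[-1]
```

Here `newest` is the row with version (100, 5), whose deletion mark is -1, so under this sketch the most recent state of `k1` is "deleted".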

[0045]...

Abstract

The invention discloses an MVCC multi-version folding tree implementation system and method based on a ClickHouse database, and belongs to the technical field of database data storage and processing. The system comprises a multi-version folding tree definition declaration module, a multi-version folding tree merging module, and a multi-version folding tree query module. The definition declaration module adds a new folding tree, sets an integer column as the deletion mark column, and sets the wall_time and logic_time columns as the version number column. The merging module merges the data partitions stored in the folding tree. The system can effectively reduce reads of data from the hard disk and has good application and popularization value.

Description

Technical field

[0001] The invention relates to the technical field of database data storage and processing, and specifically provides a system and method for implementing an MVCC multi-version folding tree based on a ClickHouse database.

Background

[0002] ClickHouse is a column-oriented database with an MPP architecture, mainly used for online analytical processing (OLAP) queries. Its analytical performance exceeds that of most column-oriented databases on the market, especially for single-table analysis.

[0003] MVCC (multi-version concurrency control) data carries an HLC timestamp composed of two integer values, wall_time and logic_time, which control the version information of the data. When wall_time and logic_time are not both 0, the row is business data. When both are 0, the row is write intent data.

[0004] When we use ClickHouse's built-in multi-version folding tree for MVCC data stor...
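The rule stated in [0003] for telling business data from write intent data can be sketched as a small Python function. The function name `classify` and the string labels are hypothetical, introduced only for illustration. The source specifies the condition on wall_time and logic_time but not how the result is represented.

```python
def classify(wall_time: int, logic_time: int) -> str:
    """Classify an MVCC row by its HLC timestamp, following [0003].

    Both components equal to 0 means write intent data; any other
    combination means business data. The labels are illustrative only.
    """
    if wall_time == 0 and logic_time == 0:
        return "write_intent"
    return "business"


examples = {
    (0, 0): classify(0, 0),
    (0, 3): classify(0, 3),
    (1650000000, 0): classify(1650000000, 0),
}
```

Note that a row with only one zero component, such as (0, 3), still counts as business data under this reading, since the two values are not both 0.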

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/22, G06F16/242
CPC: G06F16/221, G06F16/2433, G06F16/2282
Inventors: 李盟, 贾德星
Owner: 上海沄熹科技有限公司