Data parallel processing method and device based on many-core processor

A many-core processor and parallel processing technology, applied to the field of distributed database analysis. It addresses the low processing speed of single-machine databases and the network transmission bottleneck between parallel computing nodes in distributed databases, which slows parallel computing and reduces task processing efficiency.

Active Publication Date: 2020-12-04
BEIJING CERESDATA TECH CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a data parallel processing method and device based on a many-core processor. They address two problems: single-machine databases process data slowly, and distributed databases easily hit the bottleneck of network data transmission speed between parallel computing nodes, which slows parallel computing and reduces task processing efficiency.



Examples


Embodiment Construction

[0026] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0027] Taking advantage of the extensibility of the open source database, the embodiment of the present invention adds a multi-core task scheduling auxiliary process to the open source database service process. This auxiliary process contains three threads that are responsible for multi-core task generation and scheduling. By binding slave core acceleration thre...


Abstract

An embodiment of the invention provides a data parallel processing method and device based on a many-core processor, applied to a stand-alone database. The method includes: each time a data table to be operated on is determined, dividing that table to obtain a plurality of operation tasks; and distributing the divided operation tasks to the slave cores of the many-core processor so that different slave cores process the tasks in parallel. On the one hand, the data table to be processed is sorted and divided into operation tasks suitable for parallel processing, and these tasks are distributed across the many-core processor of the single-machine database. This lets the single-machine database operate in parallel on big data tasks and makes full use of the many-core processor's computing capability. On the other hand, because tasks are processed in parallel among the cores of one processor, the bottleneck of network data transmission speed between parallel computing nodes is eliminated, the operation is accelerated, and task processing efficiency is improved.
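The divide-and-distribute idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the worker threads of a `ThreadPoolExecutor` merely stand in for slave cores, and the names `split_table`, `partial_sum`, `parallel_sum`, and the `id` and `amount` columns are hypothetical. In CPython, threads do not give true CPU parallelism, so the sketch shows only the task structure.

```python
from concurrent.futures import ThreadPoolExecutor


def split_table(rows, key, num_tasks):
    """Sort rows by key, then cut them into at most num_tasks contiguous chunks."""
    ordered = sorted(rows, key=key)
    if not ordered:
        return []
    size = -(-len(ordered) // num_tasks)  # ceiling division
    return [ordered[i:i + size] for i in range(0, len(ordered), size)]


def partial_sum(chunk):
    """One operation task: aggregate a single chunk (hypothetical 'amount' column)."""
    return sum(row["amount"] for row in chunk)


def parallel_sum(rows, num_workers=4):
    """Distribute the tasks to workers (stand-ins for slave cores) and merge results."""
    tasks = split_table(rows, key=lambda r: r["id"], num_tasks=num_workers)
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        partials = list(pool.map(partial_sum, tasks))
    return sum(partials)


# Hypothetical table: 100 rows with ids 1..100 and amount = 10 * id.
rows = [{"id": i, "amount": i * 10} for i in range(1, 101)]
print(parallel_sum(rows))
```

Each chunk is independent of the others, so no data has to cross a network between workers; only the small partial results are merged at the end.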

Description

Technical Field

[0001] The invention relates to the field of in-database analysis for distributed databases, and in particular to a data parallel processing method and device based on many-core processors.

Background

[0002] In-database analysis refers to performing scientific computing, machine learning, regression analysis, and cluster analysis inside the database, with the related algorithms implemented as database-defined functions. Users obtain analysis results directly by calling these functions. The syntax of a database custom function is based on SQL, and all data access and calculation can be completed inside the database, without importing or exporting data across multiple databases. The user calls a built-in database function through the database interface; the function obtains the data by accessing the underlying storage, performs the calculation, and returns the result. As more researchers in business fields get involved...
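The in-database analysis pattern described in the background, where an algorithm is registered as a database function and invoked through SQL, can be sketched as follows. This is an illustration only: it uses Python's standard `sqlite3` module as a stand-in for the unnamed open source database, and the aggregate name `demo_slope`, the `samples` table, and the helper `fit_slope` are hypothetical.

```python
import sqlite3


class DemoSlope:
    """User-defined aggregate: least-squares slope of y against x."""

    def __init__(self):
        self.n = 0
        self.sx = self.sy = self.sxy = self.sxx = 0.0

    def step(self, x, y):
        # Called once per row by SQLite; skip rows with NULLs.
        if x is None or y is None:
            return
        self.n += 1
        self.sx += x
        self.sy += y
        self.sxy += x * y
        self.sxx += x * x

    def finalize(self):
        den = self.n * self.sxx - self.sx ** 2
        if den == 0:
            return None  # slope is undefined for fewer than two distinct x values
        return (self.n * self.sxy - self.sx * self.sy) / den


def fit_slope(points):
    """Load points into an in-memory table and run the regression inside the database."""
    conn = sqlite3.connect(":memory:")
    conn.create_aggregate("demo_slope", 2, DemoSlope)
    conn.execute("CREATE TABLE samples (x REAL, y REAL)")
    conn.executemany("INSERT INTO samples VALUES (?, ?)", points)
    (slope,) = conn.execute("SELECT demo_slope(x, y) FROM samples").fetchone()
    conn.close()
    return slope


# Points on the line y = 2x + 1.
print(fit_slope([(1, 3), (2, 5), (3, 7), (4, 9)]))
```

The caller issues a single SQL statement and receives only the result, so the rows never leave the database.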


Application Information

IPC(8): G06F16/27, G06F16/25, G06F16/22, G06F9/50, G06F9/48, G06F9/38
CPC: G06F9/3885, G06F9/485, G06F9/5083, G06F16/2282, G06F16/252, G06F16/27
Inventor: 杨刚, 郑磊, 李鸿伟
Owner: BEIJING CERESDATA TECH CO LTD