Coflow collaborative job flow scheduling perception data flow division method and device

A technology of data flow and job flow, applied in the field of information, can solve problems such as unreasonable division and processing of distributed data flow

Active Publication Date: 2019-10-11
HUNAN UNIV
View PDF5 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, distributed data stream processing applications are different from traditional static data processing. They usually involve a collection of parallel task flows executed on distributed machines. There are different logical dependencies and data dependencies between task flows. Simply follow the traditional static data Data flow division corresponding to the processing method It is obviously unreasonable to perform data flow division processing on distributed data streams

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Coflow collaborative job flow scheduling perception data flow division method and device
  • Coflow collaborative job flow scheduling perception data flow division method and device
  • Coflow collaborative job flow scheduling perception data flow division method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047]In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0048] The Coflow collaborative job flow scheduling-aware data flow division method provided by this application can be applied to such as figure 1 shown in the application environment. Among them, the externally sends the DDSP application data flow to be divided to the server, and the server obtains the DDSP application data flow to be divided, and divides the complex computing task flow in the DDSP application into multiple sub-processes through the preset Coflow collaborative job flow scheduling model. Task flow; analyze the data access requirements and data dependencie...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Coflow collaborative job flow scheduling perception data flow division method and device, computer equipment and a storage medium. The method comprises the following steps:obtaining a to-be-divided DDSP application program data stream; extracting a complex calculation task flow of the DDSP application program; according to a preset Coflow collaborative job flow scheduling model, obtaining a Coflow collaborative job flow scheduling model, a complex calculation task flow in a DDSP application program is divided into a plurality of sub-task flows; further analyzing thedata access demand and the data dependence of each sub-task flow; carrying out multi-dimensional data segmentation on the DDSP application program data stream; reducing cross access to the data blocks among different tasks as much as possible and the dependence between the data blocks d; finally, according to the data access requirements of the sub-task flows in each computing node, taking data communication minimization between computing nodes and workload equalization of the computing nodes as optimization objectives, and allocating the segmented data blocks to the most appropriate computing nodes, so that the communication overhead between the distributed computing nodes is effectively reduced, the utilization rate of the data blocks is increased, the access speed is increased, and themethod is suitable for distributed data flow processing.

Description

technical field [0001] The present application relates to the field of information technology, in particular to a data flow division method, device, computer equipment and storage medium with Coflow collaborative job flow scheduling awareness. Background technique [0002] With the rapid development of the Internet, sensor networks, and mobile Internet technologies, various application fields continue to generate large amounts of data sets in the form of streams. Streaming computing is a highly real-time computing model and an effective way of big data computing. Practical application fields such as financial markets, network monitoring, telecommunications, and sensor networks all generate and store massive streaming data sets. DDSP (Distributed Data Stream Processing, distributed data stream processing) is an effective method to improve the performance of large-scale data stream processing. [0003] However, several key challenges are faced in DDSP applications at present:...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F9/50
CPCG06F9/5083
Inventor 李肯立陈建国彭继武胡俊艳阳王东李克勤廖湘科
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products