Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Real-time data persisting method, device and equipment and storage medium

A real-time data and persistence technology, applied in the field of data processing, can solve problems such as difficulty in supporting real-time or near-real-time ETL processing, insufficient real-time performance, throughput and high availability, and unsatisfactory functions, performance and scalability.

Inactive Publication Date: 2018-06-12
SF TECH
View PDF7 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, there are some related tools for synchronizing real-time data flow to Hadoop and other file systems in open source big data components, such as Logstash, Flume, Gobblin, etc. However, these open source components have certain limitations and cannot satisfy the functions, performance and Scalability and other actual needs
Logstash and Flume are relatively popular log collection components, but they have shortcomings in terms of real-time performance, throughput, and high availability, and it is difficult to ensure that data will not be lost; Gobblin is an open-source ETL tool of LinkedIn, which supports the synchronization of multiple data sources. Concurrent tasks such as Hadoop MapReduce have good support. However, it relies on the scheduling and execution of different Job components and MR tasks. The real-time performance is relatively insufficient, and it is difficult to support real-time or near real-time ETL processing.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time data persisting method, device and equipment and storage medium
  • Real-time data persisting method, device and equipment and storage medium
  • Real-time data persisting method, device and equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The present application will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the related invention, but not to limit the invention. In addition, it should be noted that, for the convenience of description, only the parts related to the invention are shown in the drawings.

[0052] It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict. The present application will be described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.

[0053] Please refer to figure 1 , the persistence method of real-time data provided by the embodiment of the present invention includes:

[0054] Step S101, the KafkaSpout (data source end) of Storm (real-time processing framework) reads the mes...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a real-time data persisting method, device and equipment and a storage medium, and relates to a data processing technology. By means of the method, a Storm+Kafka serves as thereal-time processing technology, information cached by a Kafka is read by a KafkaSpout, the content of the information is analyzed by a client side of a file system, the analyzed content of the Kafkainformation is persisted to the file system by the KafkaSpout, and then real-time data persisting is completed through the Storm+Kafka.

Description

technical field [0001] The present disclosure generally relates to data processing technologies, in particular to real-time data processing technologies, and in particular, to a method, apparatus and device, and storage medium for persisting real-time data. Background technique [0002] With the rapid development of IT information technology, the scale of major application systems in the Internet field continues to expand, and the amount of data shows an explosive growth trend. How to quickly integrate the online business data flow into the big data platform for subsequent data warehouse construction and analysis and mining has become a major problem faced by Internet companies. Therefore, seeking to quickly connect real-time business data and big data file systems, and even ETL (Extract-Transform-Load, Extract-Transform-Load) technical solutions to the data warehouse has become an urgent need for the construction of big data platforms. [0003] At present, there are some r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/254
Inventor 陈东沂蔡适择陈敏陈军张强
Owner SF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products