Method and device for real-time summarization and interval summarization of data

A data aggregation and data technology, applied in the computer field, can solve the problems of poor readability of ElasticSearch API code, network resource occupation, system complexity and risk point increase, etc., to achieve complete real-time processing and storage, and minimize data network transmission , reducing the effect of aggregation time

Pending Publication Date: 2021-12-07
西安京迅递供应链科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] -ElasticSearch only stores a small amount of data in the time interval between quasi-real-time storage and data loss when the stage is summarized;
[0005] - The independent deployment of consumer application groups and summary application groups takes up a lot of machine resources, which increases the complexity and risk points of the entire system;
[0006] - Calculation and data are on different machines. Due to the huge amount of data, a large amount of network transmission is required, which leads to the occupation of a large amount of network resources in the stage summary stage, and also increases the time consumption of summary;
[0007] - Poor scalability of both computing resources and storage resources; and
[0008] -ElasticSearch API coding is poorly readable and difficult to get started

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for real-time summarization and interval summarization of data
  • Method and device for real-time summarization and interval summarization of data
  • Method and device for real-time summarization and interval summarization of data

Examples

Experimental program
Comparison scheme
Effect test

example

[0077] Hereinafter, an example of a usage scenario of real-time data aggregation of the present invention is briefly described. In this embodiment, it is necessary to summarize the data of the business table in real time.

[0078] Business table: In this example, the business table takes the waybill table as an example. Because the amount of data related to the waybill is huge and involves many fields, it is split into three mysql tables waybill_m, waybill_c, waybill_e and supported by sub-databases and tables. .

[0079] The main field information of the three tables is shown in the table waybill_m, table waybill_c, and table waybill_e in the second second below.

[0080] Waybill_m:

[0081]

[0082] Waybill_c:

[0083] field Remark WAYBILL_CODE Waybill number VENDOR_ID order number ARRIVE_AREA Destination area name PROVINCE_NAME province name The remaining thirty or so columns....

[0084] Waybill_e

[0085]

[00...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for real-time summarization and interval summarization of data, and relates to the technical field of computers. One specific embodiment of the method comprises the following steps that a predetermined number of RegionServers based on HBase are deployed through an HDFS (Hadoop Distributed File System); and the RegionServers respectively carry out data summarization in parallel according to predetermined fields, and the summarized data of the RegionServers are sent to the client side for carrying out secondary data summarization. According to the embodiment, complete real-time processing and storage are reduced; storage and computing resources are placed on the same machine, and data network transmission is minimized, so that the summarization time is shortened; the resource expansion without perception of the program is realized. Support for the SQL is provided, so that the readability is better; and two modes of real-time summarization and interval summarization are provided to solve scenes with different data volumes.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and device for real-time data collection and interval collection. Background technique [0002] Under the current background of a large amount of business data, the association and aggregation of massive real-time data has always been a difficult problem in the field. The main steps of the solution currently adopted are: aggregate the real-time data generated by each business table into Kafka's Topic; use a consumer application group, and associate the two business table programs by caching or reading back the associated fields through ElasticSearch Then store it in the wide table created by ElasticSearch; periodically start the summary application group to read the wide table data in the time period to be summarized, perform summary processing and store it in the cache or relational database. [0003] In the course of realizing the present invention, the inventor fin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/242G06F16/182
CPCG06F16/2282G06F16/2433G06F16/182
Inventor 许奎
Owner 西安京迅递供应链科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products