Skyline query method orienting to probability data flow

A query method and data flow technology, applied in the field of Skyline query oriented to probabilistic data flow, can solve problems such as the Skyline calculation problem that no one considers probabilistic data flow, and achieve the effect of avoiding bias and improving efficiency.

Inactive Publication Date: 2013-06-12
SCHOOL OF SOFTWARE & MICROELECTRONICS AT WUXI PEKING UNIV
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Existing related work is limited to Skyline query processing on static data sets or traditional deterministic data streams, and no one has considered Skyline calculations on probabilistic data streams

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Skyline query method orienting to probability data flow
  • Skyline query method orienting to probability data flow
  • Skyline query method orienting to probability data flow

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] The present invention provides a Skyline query method oriented to probabilistic data flow, comprising:

[0035] (1) Preparation stage: Construct a state model of objects in a probabilistic data flow environment: treat each tuple in the probabilistic data flow as an object, and the objects observed in the data flow are stored in the buffer before entering the system ;

[0036] (2) Preparatory stage: Immediately after the arrival of the new object, the method of processing the expired object is invoked, the expired object is eliminated from the system and the Skyline probability of the object dominated by the expired object is increased;

[0037] (3) Processing stage: then call the method for determining the identity of the newly arrived object, calculate the Skyline probability of the newly arrived object and insert the newly arrived object into the corresponding queue in its belonging cell;

[0038] (4) Final stage: Finally, call the method of processing dominated by n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Skyline query method orienting to a probability data flow, which comprises the following steps: (1) constructing a state model for an object under a probability data flow condition: taking each tuple in the probability data flow as one object and pre-storing the object observed in the probability data flow in a buffering area before the object enters into a system; (2) after a new object arrives, obsoleting an overdue object from the system and increasing the Skyline probability of the object dominated by the overdue object; (3) calculating the Skyline probability of the new arriving object and inserting the object into a corresponding array in a belonging grid thereof; and (4) treating all the objects dominated by the new arriving object, namely, reducing the Skyline probability of the objects dominated by the new arriving object. On the basis of adopting grid index with higher adaptability, the invention provides heuristic rules, such as probability delimiting, stepwise refinement, obsoleting in advance, selective compensationand the like, for systemically optimizing the algorithm at two aspects of time and space.

Description

technical field [0001] The invention relates to a query processing method for uncertain data streams, in particular to a Skyline query method for probability data streams. Background technique [0002] Skyline query processing technology in multi-dimensional space is a research hotspot in the field of database in recent years. Skyline is widely used in preference query, multi-criteria decision support, and data mining and visualization. A lot of previous work has focused on calculating Skyline on static data sets. In recent years, some research results on calculating Skyline in sliding windows have also appeared. Recently, a data form called probabilistic data flow has gradually attracted people's attention. focus on. Skyline query refers to selecting a subset from a given set S of D-dimensional data objects, and any data object in the subset cannot be dominated by any other data object in S. The so-called dominance relationship means that in the data set S in the D-dimen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 孙圣力刘京陈杭
Owner SCHOOL OF SOFTWARE & MICROELECTRONICS AT WUXI PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products