Distributed data set indexing

An indexing and data value technology, applied in database indexing, structured data retrieval, data comparison, etc.

Active Publication Date: 2019-04-02
SAS INSTITUTE
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This leads to the challenge of enabling efficient index generation for such large data sets, enabling efficient searching of such large data sets across multiple node devices of the grid, and enabling specific pieces of data to be efficiently located and retrieve

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed data set indexing
  • Distributed data set indexing
  • Distributed data set indexing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0115] Various embodiments described herein generally relate to inter-device coordination to improve distributed indexing where parallel processing is used to access data of a dataset. The data of the data set can be divided into multiple super units, and the data in each super unit can be further divided into multiple data units. Within each data unit, data may be organized into a set of data records, each data record comprising the same set of data fields populated with data values. For each superunit, a set of indexes can be generated through which data within the superunit can be accessed more quickly, including a superunit index corresponding to the superunit as a whole and each corresponding to a data unit within the superunit One or more cell indices of one of . Within each unit index, data records within the corresponding data unit may be found present in the data value index within the selected subset of data fields. For each selected data field, the unique value in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An apparatus including a processor to receive search criteria including a data value for a search within a data field; in response to the receipt of the query instructions, and for each data cell within a super cell, perform the specified search by comparing the data value to ranges of values indicated in a corresponding cell index to determine whether the data cell includes a data record meetingthe search criteria, and in response to a determination that the data cell includes such a data record, use a unique values index in the cell index to search the data records of the data cell to identify one or more data records meeting the search criteria; and in response to identifying at least one data record meeting the search criteria, provide an indication that at least the data cell includes at least one data record meeting the search criteria.

Description

[0001] Cross References to Related Applications [0002] This application claims the benefit of priority to the following applications: U.S. Provisional Application Serial No. 62 / 458,162, filed February 13, 2017; U.S. Application Serial No. 15 / 838,110, filed December 11, 2017; December 11, 2017 U.S. Application Serial No. 15 / 838,175, filed December 11, 2017; U.S. Application Serial No. 15 / 838,195, filed December 11, 2017; and U.S. Application Serial No. 15 / 838,211, filed December 11, 2017. The disclosure of U.S. Provisional Application Serial No. 62 / 458,162, U.S. Application Serial No. 15 / 838,110, U.S. Application Serial No. 15 / 838,175, U.S. Application Serial No. 15 / 838,195, and U.S. Application Serial No. 15 / 838,211 is for all purposes in their respective The entirety is hereby incorporated by reference. technical field [0003] Various embodiments described herein generally relate to inter-device coordination to improve distributed indexing of data sets stored by multiple ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22
CPCG06F16/2246G06F16/27G06F16/2255G06F16/2219G06F16/2228G06F9/5072G06F16/21G06F16/23G06F16/245G06F16/381G06F16/2365G06F16/9014G06F16/9027G06F7/08G06F7/20G06F9/5027G06F7/02
Inventor B.P.鲍曼G.L.基纳S.E.克吕格
Owner SAS INSTITUTE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products