Method and device for determining quantile points of sample characteristics in distributed clusters

A technology for distributed clusters and sample features, applied to determining sample feature quantile points. It addresses problems such as the CPU frequently reading sample data from memory.

Active Publication Date: 2022-05-17
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

When determining the quantile point of an attribute item in the sample feature, the CPU needs to frequently read the sample data from the memory



Embodiment Construction

[0089] The solutions provided in this specification will be described below in conjunction with the accompanying drawings.

[0090] Figure 1 is a schematic diagram of an implementation scenario of an embodiment disclosed in this specification. Multiple nodes in a Trusted Execution Environment (TEE) can form a distributed cluster, in which the nodes are divided into master nodes and slave nodes. Figure 1 shows only 3 slave nodes, but this specification does not limit the number of nodes in the distributed cluster. Each node in the TEE includes a CPU and memory. Figure 1 takes the master node as an example and shows its CPU and memory. Memory stores data, the CPU processes data, and the CPU can access data in memory. Specifically, the CPU can read data from memory, process it with an application program running on the CPU, and write data back to memory. The trusted execution environment isolates the CPU and m...


Abstract

The embodiments of this specification provide a method and device for determining quantile points of a sample feature in a distributed cluster, used to determine the feature quantile points of a first attribute item in the sample features. Multiple first arrays, each with a fixed number of items, are obtained. The first arrays come from multiple slave nodes respectively, and each is derived from the sample features of a different batch of samples in the sample set. The first arrays are then merged level by level in a predetermined manner until the last level of array merging. Any level of array merging includes: for the set of feature values contained in the items of two arrays, performing weight-value merging and dummy-item padding to obtain a merged array, and writing the merged array into memory. Using oblivious access, the items of the merged array produced by the last level of merging, other than several dummy items, are read from memory, and the feature quantile points of the first attribute item are determined from the items read.
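The merging flow in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent text: items are modeled as `(feature_value, weight)` pairs, the dummy item is `(inf, 0)`, arrays are merged pairwise, and the helper names `merge_arrays`, `merge_levels`, and `quantiles` are hypothetical. The sketch only shows weight merging and dummy padding; it does not implement oblivious memory access, which would require data-independent access patterns inside the TEE.

```python
import math

# Hypothetical dummy item: zero weight, value sorts after every real item.
DUMMY = (math.inf, 0)


def merge_arrays(a, b):
    """Merge two (value, weight) arrays; output length is always len(a) + len(b)."""
    totals = {}
    for value, weight in a + b:
        if weight > 0:  # ignore dummy items from earlier levels
            totals[value] = totals.get(value, 0) + weight  # weight-value merging
    merged = sorted(totals.items())
    # Dummy-item padding: the length no longer reveals how many values coincided.
    return merged + [DUMMY] * (len(a) + len(b) - len(merged))


def merge_levels(arrays):
    """Merge arrays pairwise, level by level, until a single array remains."""
    while len(arrays) > 1:
        next_level = [merge_arrays(arrays[i], arrays[i + 1])
                      for i in range(0, len(arrays) - 1, 2)]
        if len(arrays) % 2:  # an unpaired array moves up unchanged
            next_level.append(arrays[-1])
        arrays = next_level
    return arrays[0]


def quantiles(merged, qs):
    """Return, for each q, the first value whose cumulative weight reaches q * total."""
    real = [(v, w) for v, w in merged if w > 0]  # drop dummy items
    total = sum(w for _, w in real)
    result = []
    for q in qs:
        running = 0
        for value, weight in real:
            running += weight
            if running >= q * total:
                result.append(value)
                break
    return result


# Three fixed-size (4-item) arrays, as if received from three slave nodes.
node1 = [(1, 1), (2, 1), (3, 2), DUMMY]
node2 = [(2, 1), (4, 1), (5, 1), (6, 1)]
node3 = [(3, 1), (7, 1), DUMMY, DUMMY]

final = merge_levels([node1, node2, node3])
print(quantiles(final, [0.25, 0.5, 0.75]))
```

In this sketch the length of every merged array depends only on the lengths of its inputs, never on the feature values, which is the property the dummy padding is meant to provide.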

Description

Technical field

[0001] One or more embodiments of this specification relate to the technical field of data security, and in particular to a method and device for determining quantile points of sample features in a distributed cluster.

Background

[0002] In application fields that process data, data security has attracted much attention. A Trusted Execution Environment (TEE) can provide an execution environment independent of the operating system, and provides security protection by isolating highly security-sensitive applications from the general software environment. An example is a trusted enclave (Enclave) built on technologies such as Software Guard Extensions (SGX). Trusted execution environment technology usually adopts a hardware isolation mechanism to isolate a secure area, including CPU and memory, in the computing platform, and the encrypted data in the memory is only visible inside the...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04L67/10, H04L67/1097, H04L9/40, G06F21/60, G06N3/04, G06N3/08
CPC: H04L67/10, H04L67/1097, H04L63/0428, H04L63/0218, H04L63/0884, G06F21/602, G06N3/04, G06N3/082, G06F2221/2107
Inventor: 张兴盟, 余超凡, 王磊
Owner: ALIPAY (HANGZHOU) INFORMATION TECH CO LTD