Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data category imbalance processing method, device and system and storage medium

A technology of data category and processing method, applied in the field of data processing, can solve problems such as poor classification effect of minority classes, and achieve the effect of reducing errors

Pending Publication Date: 2021-07-27
INDUSTRIAL AND COMMERCIAL BANK OF CHINA
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] With the development of machine learning, some machine learning methods began to be gradually used in the identification of abnormal consumption behavior, and achieved certain results, but these methods are still not effective in minority class classification when dealing with unbalanced samples.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data category imbalance processing method, device and system and storage medium
  • Data category imbalance processing method, device and system and storage medium
  • Data category imbalance processing method, device and system and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0081] Hereinafter, embodiments of the present disclosure will be described with reference to the drawings. It should be understood, however, that these descriptions are exemplary only, and are not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the present disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. Also, in the following description, descriptions of well-known structures and techniques are omitted to avoid unnecessarily obscuring the concept of the present disclosure.

[0082] The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting of the present disclosure. The terms "comprising", "comprising", etc. used herein indicate the presence of stated features, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a data category imbalance processing method, which is applied to the technical field of data processing, and comprises the following steps: clustering minority class samples to obtain a plurality of clusters; calculating k neighbor samples of the minority class samples in each cluster, obtaining the number of majority class samples in the k neighbor samples of the minority class samples in each cluster, calculating the ratio of the number of the majority class samples in the cluster to the number of the k neighbor samples, and processing the samples in the cluster according to the ratio of the number of the majority class samples in the cluster to the number of the k neighbor samples.. The invention provides data category imbalance processing equipment, a semi-supervised generative adversarial network training method and equipment, an abnormal transaction detection method and equipment, a system and a storage medium.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, and more specifically, to a method and device for processing unbalanced data categories, a training method and device for a semi-supervised generative adversarial network, an abnormal transaction detection method and device, a system, and a storage medium. Background technique [0002] The number of samples of a certain type in the data set is quite different from the number of other samples. For example, in the data set collected for judging credit card fraud, the data of most users is normal, and only a very small part is the data of defrauded users. [0003] With the development of machine learning, some machine learning methods have been gradually used in the identification of abnormal consumption behavior, and have achieved certain results, but these methods are still not effective in minority class classification when dealing with samples with unbalanced distribution. Con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06N20/00
CPCG06N20/00G06N3/08G06N3/045G06F18/23G06F18/214
Inventor 李进进汤仲喆赵旭东沈雪莲
Owner INDUSTRIAL AND COMMERCIAL BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products