Data processing method and device and electronic equipment
A data processing method and device, applied in the field of data processing, address problems such as wasted computing resources and reduced operating efficiency, and improve read, write, and computation efficiency.
Examples
Embodiment 1
[0057] Specifically, to apply the data processing method disclosed in this application to set operations such as intersection and difference, a computing architecture can be built in advance. As shown in Figure 1, it includes a UDF module, a preprocessing module, and an intersection and difference module.
[0058] The UDF (user-defined function) module can implement, based on RoaringBitmap, aggregation functions such as merging integers into a bitmap and merging bitmaps, as well as functions such as bitmap initialization, intersection, union, difference, and cardinality. It also includes combinations of consecutive operations, such as bitmap_and_count (cardinality of the AND result), which saves one round of serialization/deserialization overhead. In addition, it can support some special scenarios, such as bitmap_weight_and_count (weighted AND cardinality), which supports concurrent calculation of the accumulated value of the product of a complex bitmap and weights. ...
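The bitmap functions named in [0058] can be illustrated with a minimal sketch. It assumes plain Python integers as stand-ins for bitmaps (bit i set means integer i is present) rather than RoaringBitmap, and every function name except bitmap_and_count is hypothetical. The fused bitmap_and_count returns a cardinality directly, so the intermediate bitmap never has to be serialized and read back, which is the saving the paragraph describes.

```python
def bitmap_from_ints(values):
    """Aggregate non-negative integers into one bitmap (bit i set <=> i present)."""
    bitmap = 0
    for v in values:
        bitmap |= 1 << v
    return bitmap


def bitmap_and(a, b):
    """Intersection of two bitmaps."""
    return a & b


def bitmap_or(a, b):
    """Union of two bitmaps."""
    return a | b


def bitmap_andnot(a, b):
    """Difference: members of a that are not in b."""
    return a & ~b


def bitmap_count(a):
    """Cardinality: number of set bits."""
    return bin(a).count("1")


def bitmap_and_count(a, b):
    """Fused AND + cardinality: no intermediate bitmap is handed back to the caller."""
    return bin(a & b).count("1")


first = bitmap_from_ints([1, 2, 3, 5])
second = bitmap_from_ints([2, 3, 4])
print(bitmap_and_count(first, second))
```

A compressed bitmap library would replace the integer representation in practice; the sketch only shows how the operations relate to each other.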
Embodiment 2
[0091] Corresponding to the above embodiment, as shown in Figure 4, the application provides a data processing method, the method comprising:
[0092] 410. Acquire a first data set to be processed and a second data set to be processed, where the first data set to be processed includes a plurality of first data to be processed, the second data set to be processed includes a plurality of second data to be processed, and the data to be processed includes a target field and a corresponding field value;
[0093] 420. Determine the buckets respectively corresponding to the first data to be processed and the second data to be processed according to the field values of the target fields they respectively include;
[0094] Preferably, determining the buckets respectively corresponding to the first data to be processed and the second data to be processed according to the field values of the target fields they respectively include comprises:
[0095] 421. Perform ...
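Steps 410 and 420 can be sketched as follows. The source text of step 421 is truncated, so the bucketing rule used here (a CRC32 hash of the field value modulo a fixed bucket count) is an assumption chosen for illustration, not the rule stated in the application. The names NUM_BUCKETS, bucket_of, partition, and the "user_id" field are likewise hypothetical.

```python
import zlib
from collections import defaultdict

NUM_BUCKETS = 4  # hypothetical bucket count


def bucket_of(field_value, num_buckets=NUM_BUCKETS):
    """Map a field value to a bucket id (assumed rule: CRC32 modulo bucket count)."""
    return zlib.crc32(str(field_value).encode("utf-8")) % num_buckets


def partition(records, target_field, num_buckets=NUM_BUCKETS):
    """Group records by the bucket of their target-field value."""
    buckets = defaultdict(list)
    for record in records:
        buckets[bucket_of(record[target_field], num_buckets)].append(record)
    return dict(buckets)


first_set = [{"user_id": "u1"}, {"user_id": "u2"}, {"user_id": "u3"}]
second_set = [{"user_id": "u2"}, {"user_id": "u4"}]

first_buckets = partition(first_set, "user_id")
second_buckets = partition(second_set, "user_id")
```

Because both data sets use the same rule, records with equal field values always land in the same bucket id, so each bucket can later be compared independently of the others.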
Embodiment 3
[0115] Corresponding to the above embodiments, as shown in Figure 5, the present application provides a data processing device, the device comprising:
[0116] An acquisition module 510, configured to acquire a first data set to be processed and a second data set to be processed, where the first data set to be processed includes a plurality of first data to be processed, the second data set to be processed includes a plurality of second data to be processed, and the data to be processed includes a target field and a corresponding field value;
[0117] A dividing module 520, configured to determine the buckets respectively corresponding to the first data to be processed and the second data to be processed according to the field values of the target fields they respectively include;
[0118] A generating module 530, configured to generate a first bitmap corresponding to the bucket according to the corresponding first data to be processed and generate a sec...
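The division of work among the modules above can be sketched end to end. This is a sketch under stated assumptions, not the device described in the application: field values are assumed to be non-negative integers, the bucket is assumed to be the value modulo the bucket count, Python integers stand in for bitmaps, and all function names and the "user_id" field are hypothetical.

```python
from collections import defaultdict


def build_bucket_bitmaps(records, target_field, num_buckets):
    """Dividing + generating: one bitmap per bucket (assumed rule: value % num_buckets)."""
    bitmaps = defaultdict(int)
    for record in records:
        value = record[target_field]
        bitmaps[value % num_buckets] |= 1 << value
    return bitmaps


def popcount(bitmap):
    """Number of set bits in a bitmap."""
    return bin(bitmap).count("1")


def intersection_count(first, second, num_buckets):
    """Sum of per-bucket AND cardinalities."""
    return sum(popcount(first.get(b, 0) & second.get(b, 0))
               for b in range(num_buckets))


def difference_count(first, second, num_buckets):
    """Sum of per-bucket AND-NOT cardinalities (in first, not in second)."""
    return sum(popcount(first.get(b, 0) & ~second.get(b, 0))
               for b in range(num_buckets))


num_buckets = 4
first_set = [{"user_id": i} for i in (1, 2, 3, 8, 9)]
second_set = [{"user_id": i} for i in (2, 3, 4, 9, 10)]

first_bitmaps = build_bucket_bitmaps(first_set, "user_id", num_buckets)
second_bitmaps = build_bucket_bitmaps(second_set, "user_id", num_buckets)

print(intersection_count(first_bitmaps, second_bitmaps, num_buckets))
print(difference_count(first_bitmaps, second_bitmaps, num_buckets))
```

Equal values always share a bucket, so the per-bucket results are disjoint and can be computed independently (for example, in parallel) and then summed.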