Data processing method and device based on spark
A data processing and data technology, applied in the computer field, can solve the problems of data processing efficiency reduction, achieve the effect of saving storage space, saving data processing time, and improving data processing efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0023] According to an embodiment of the present invention, a Spark-based data processing method is provided, such as figure 1 As shown, the method includes:
[0024] S102, acquiring data to be processed;
[0025] S104, extracting the feature identifier of the data to be processed, wherein the feature identifier is used to identify the file type of the data to be processed;
[0026] S106. Write the data to be processed into the target file corresponding to the feature identifier according to the feature identifier.
[0027] Optionally, in this embodiment, the above-mentioned Spark-based data processing method can be applied to the log data writing process, but is not limited to, for example, the above-mentioned data to be processed is the log data obtained after parsing the log file, from which The characteristic identifier of the log data, and write the log data into a corresponding file according to the characteristic identifier, so that the log data with the same characte...
Embodiment 2
[0063] According to an embodiment of the present invention, a Spark-based data processing device for implementing the above-mentioned Spark-based data processing method is also provided, such as image 3 As shown, the device includes:
[0064] 1) an acquisition unit 302, configured to acquire data to be processed;
[0065] 2) Extraction unit 304, configured to extract the feature identifier of the data to be processed, wherein the feature identifier is used to identify the file type of the data to be processed;
[0066] 3) The processing unit 306 is configured to write the data to be processed into the target file corresponding to the feature identifier according to the feature identifier.
[0067] Optionally, in this embodiment, the above-mentioned Spark-based data processing device may be applied in the process of writing log data, but not limited to, for example, the above-mentioned data to be processed is the log data obtained after parsing the log file, from which The c...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com