Mass data quality verification method based on Hadoop
A data quality verification method, applied in the field of big data, that addresses problems such as increased learning costs, slow inspection cycles, and increased investment of labor and money. It reduces labor costs, occupies fewer resources, and facilitates compatible development.
Embodiment Construction
[0035] The present invention will be further described below in conjunction with the drawings.
[0036] The Hadoop-based mass data quality verification method of the present invention includes the following steps:
[0037] Step 1. Develop data quality standards, including:
[0038] Regular rules, namely: rules formulated in the form of custom regular expressions.
[0039] Verification rules, namely: email address verification, mobile phone number verification, license plate number verification, etc.
[0040] Judging rules, namely: judging the content length, whether it is empty, and the data range.
[0041] Content format rules, such as whether to include certain specific content.
[0042] Algorithm rules for specific scenarios, such as credit card number generation rules, or ID card numbers, where the first six digits must represent the administrative division, digits seven to fourteen the date of birth, the seventeenth digit gender, and the last digit meeting the verific...
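The rule categories listed under Step 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patent's implementation: the function names (`regex_rule`, `length_rule`, `range_rule`, `contains_rule`, `id_card_structure`, `validate`), the email and mobile phone patterns, and the odd/even gender convention are all assumptions made for the example. The ID number check covers only the fields named in the text (administrative division, date of birth, gender); the final check character is deliberately not verified, because the description of that rule is cut off in the source.

```python
import re
from datetime import datetime

# Illustrative patterns only; they are assumptions, not taken from the patent.
EMAIL_PATTERN = r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"
MOBILE_PATTERN = r"1[3-9]\d{9}"  # assumed 11-digit mainland-China mobile format


def regex_rule(pattern):
    """Regular rule: build a check from a custom regular expression."""
    compiled = re.compile(pattern)
    return lambda value: compiled.fullmatch(value) is not None


def length_rule(min_len, max_len):
    """Judging rule: content length lies within [min_len, max_len]."""
    return lambda value: min_len <= len(value) <= max_len


def range_rule(low, high):
    """Judging rule: numeric value lies within [low, high]."""
    def check(value):
        try:
            return low <= float(value) <= high
        except (TypeError, ValueError):
            return False
    return check


def contains_rule(fragment):
    """Content format rule: value includes a specific fragment."""
    return lambda value: fragment in value


def id_card_structure(value):
    """Algorithm rule (partial): split an 18-character ID number into fields.

    Returns a dict of fields, or None if the layout or birth date is invalid.
    The last character (check character) is NOT verified here.
    """
    if re.fullmatch(r"\d{17}[\dXx]", value) is None:
        return None
    try:
        birth = datetime.strptime(value[6:14], "%Y%m%d").date()
    except ValueError:
        return None
    return {
        "region": value[:6],          # digits 1-6: administrative division
        "birth": birth.isoformat(),   # digits 7-14: date of birth
        # Assumption: odd seventeenth digit = male, even = female.
        "gender": "male" if int(value[16]) % 2 == 1 else "female",
    }


def validate(record, rules):
    """Return the names of fields that are missing or fail their rule."""
    failed = []
    for field, rule in rules.items():
        value = record.get(field)
        if value is None or not rule(value):
            failed.append(field)
    return failed
```

A usage example: with `rules = {"email": regex_rule(EMAIL_PATTERN), "age": range_rule(0, 150)}`, calling `validate(record, rules)` on one record returns the list of failing field names. In a Hadoop job, a function like `validate` would typically be called once per input record inside the map phase, although the source does not specify how the rules are wired into the job.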