Network protected index data obtaining method based on OCR technology
The acquisition method, applied in the field of network communication, addresses problems such as fixed content, low result accuracy, and reduced efficiency. It enables batch acquisition of data, yields accurate results, and has wide application value.
Embodiment
[0065] A method for obtaining a network-protected Baidu index based on OCR technology, as shown in Figure 1. The specific steps are:
[0066] (1) Target data website login;
[0067] (2) Target data location and acquisition: use the automated testing tool Selenium WebDriver to simulate the operations a user performs on the data platform before the target data is displayed, for example logging in, entering search keywords, and setting the search time range. Load the image of the target data, then simulate mouse movement so that the data values on the curve in the image are dynamically loaded, collected, and stored;
[0068] (3) Target data preprocessing: preprocessing the image of the target data;
[0069] (4) Target data identification and storage: identify and store the target data using improved OCR technology:
[0070] a. Custom font samples: For characters that are prone to failure in recognition and fonts that are not commonly used, expand the segmentatio...
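Steps (2) and (3) above can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `hover_offsets`, `binarize`, and `upscale` are hypothetical. The even spacing of data points along the curve is an assumption. The source does not say which preprocessing operations it uses, so thresholding and enlargement stand in as common OCR preprocessing choices. The Selenium part is omitted because it needs a live browser: in a real session each offset would drive a simulated mouse move over the chart element, and the offset origin should be checked against the Selenium version in use.

```python
from typing import List


def hover_offsets(chart_width: int, n_points: int) -> List[int]:
    """Return one x offset (pixels from the chart's left edge) per data point.

    Assumption: the n_points values are evenly spaced across the chart,
    with the first at the left edge and the last at the right edge.
    """
    if chart_width < 1 or n_points < 1:
        raise ValueError("chart_width and n_points must be positive")
    if n_points == 1:
        return [chart_width // 2]
    step = (chart_width - 1) / (n_points - 1)
    return [round(i * step) for i in range(n_points)]


def binarize(gray: List[List[int]], threshold: int = 128) -> List[List[int]]:
    """Map a grayscale image (values 0-255) to pure black (0) or white (255).

    Assumption: a fixed global threshold is good enough for the tooltip image.
    """
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


def upscale(img: List[List[int]], factor: int = 2) -> List[List[int]]:
    """Enlarge an image by repeating each pixel `factor` times on both axes."""
    if factor < 1:
        raise ValueError("factor must be positive")
    out: List[List[int]] = []
    for row in img:
        wide = [px for px in row for _ in range(factor)]
        # Copy the widened row so the repeated rows are independent lists.
        out.extend(list(wide) for _ in range(factor))
    return out


if __name__ == "__main__":
    # Hover positions for 5 data points on a 101-pixel-wide chart.
    print(hover_offsets(101, 5))
    # Preprocess a tiny 1x4 grayscale "image" before handing it to OCR.
    print(upscale(binarize([[0, 127, 128, 255]]), factor=2))
```

Keeping the offset calculation and the image preprocessing as pure functions lets them be checked without a browser or an OCR engine. The browser automation and the recognition step can then be layered on top.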