Automatic BBS (bulletin board system) page acquisition method
An automatic collection and page technology, applied in special data processing applications, website content management, instruments, etc., can solve problems such as difficulty in making unified rules, real-time update of rules, abnormal data, etc., to achieve efficient solutions, optimize structure, simplify The effect of the acquisition process
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0039] The invention discloses a method for automatically collecting BBS pages, comprising the following steps: step 1, collecting and obtaining all element information of the BBS page; step 2, cross-comparing node elements in a system library; step 3, comparing the number of nodes if the node names are the same; Step 4, after confirming that the node names and numbers are the same, identify the two cross-compared nodes as the current floor node; Step 5, record the XPath of the floor node (XML path language, used to determine the location of a certain part in the XML document), and complete Segmentation of post floors, XPath extraction of floor content, and general information collection.
[0040] Specifically, the present invention comprises the following steps:
[0041] (1) Access the target BBS post page from the Internet and obtain the page byte stream.
[0042] (2) Parse the byte stream into a jdom object, which contains all the html tags corresponding to the Element, an...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com