Web crawler service system for housing library network

A web crawler service system technology, applied in the field of website data mining. It addresses the lack of an effective solution for fast website data mining, and achieves more user-friendly management, higher data value, and reduced storage space.

Status: Inactive. Publication Date: 2014-12-03
ANHUI HUAZHEN INFORMATION SCI & TECH
Cites: 3. Cited by: 9

AI Technical Summary

Problems solved by technology

Building a database starts with data mining. However, in an era of high-speed information dissemination and pervasive spam, how to mine website data quickly and effectively remains a hot topic, and no ideal solution has been found.

Embodiment Construction

[0023] Referring to Figure 1, the web crawler service system for Fangku.com proposed by the present invention includes a website crawler module, a monitoring service module, a management service module, a deployment service module and a scheduling service module. The website crawler module is connected to each of the monitoring, management, deployment and scheduling service modules; the monitoring service module is connected to the management service module; and the management service module is connected to both the deployment service module and the scheduling service module.
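The connections listed in paragraph [0023] can be written down as a small graph. The following is a minimal sketch only: the module identifiers and the `build_topology` helper are illustrative names, and treating each connection as undirected is an assumption, since the text says only that the modules are "connected".

```python
from collections import defaultdict

# Module names follow paragraph [0023]; the identifiers are illustrative.
CRAWLER = "crawler"
MONITORING = "monitoring"
MANAGEMENT = "management"
DEPLOYMENT = "deployment"
SCHEDULING = "scheduling"

# One pair per connection stated in the text.
CONNECTIONS = [
    (CRAWLER, MONITORING),
    (CRAWLER, MANAGEMENT),
    (CRAWLER, DEPLOYMENT),
    (CRAWLER, SCHEDULING),
    (MONITORING, MANAGEMENT),
    (MANAGEMENT, DEPLOYMENT),
    (MANAGEMENT, SCHEDULING),
]


def build_topology(connections):
    """Return an adjacency map: module -> set of connected modules.

    Assumption: connections are undirected.
    """
    graph = defaultdict(set)
    for a, b in connections:
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)


topology = build_topology(CONNECTIONS)
```

In this reading the crawler and management modules each touch all four other modules, while monitoring, deployment and scheduling have no direct links to one another.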

[0024] The website crawler module is composed of multiple website crawlers. Each website crawler corresponds to exactly one website and analyzes that website's page elements. The crawler extracts the website data for semantic analysis and maps it to a preset data entity for storage. In this embodiment, the data m...
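As a rough illustration of the one-crawler-per-website idea in paragraph [0024], the sketch below parses a page with Python's standard `html.parser` and maps selected page elements onto a fixed record type. Everything specific here is an assumption: the `HousingRecord` fields, the `SiteCrawler` class, and the CSS class names are hypothetical, and the semantic analysis step mentioned in the text is left out.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Optional


@dataclass
class HousingRecord:
    """Hypothetical preset data entity; the field names are assumptions."""
    source_site: str
    title: Optional[str] = None
    price: Optional[str] = None
    area: Optional[str] = None


class _ClassTextParser(HTMLParser):
    """Collects the first text found in elements with a watched class."""

    def __init__(self, watched):
        super().__init__()
        self.watched = set(watched)
        self.current = None
        self.found = {}

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if cls in self.watched:
            self.current = cls

    def handle_data(self, data):
        if self.current and data.strip():
            self.found.setdefault(self.current, data.strip())

    def handle_endtag(self, tag):
        # Simplification: nested markup inside a watched element is not handled.
        self.current = None


class SiteCrawler:
    """One crawler per website, configured with that site's element mapping."""

    def __init__(self, site, field_map):
        self.site = site
        self.field_map = field_map  # CSS class on the page -> entity field

    def parse(self, html):
        parser = _ClassTextParser(self.field_map)
        parser.feed(html)
        values = {self.field_map[c]: text for c, text in parser.found.items()}
        return HousingRecord(source_site=self.site, **values)


# Usage with a made-up page layout.
page = ('<div><h1 class="t">Sunny Court</h1>'
        '<span class="p">1,200,000</span><span class="a">89 m2</span></div>')
crawler = SiteCrawler("example-site", {"t": "title", "p": "price", "a": "area"})
record = crawler.parse(page)
```

Keeping the site-specific knowledge in `field_map` means each website gets its own crawler configuration while all crawlers write to the same entity type.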

Abstract

The invention provides a web crawler service system for a housing library network (Fangku.com). The system mines websites quickly and extracts data related to housing estates. It comprises a website crawler module, a monitoring service module, a management service module, a deployment service module and a scheduling service module. The website crawler module consists of multiple website crawlers in one-to-one correspondence with websites; each crawler analyzes the page elements of its website, extracts the website data for semantic analysis and maps the data to preset data entities for storage. The monitoring service module monitors the working condition of every website crawler and judges whether each crawler is working normally and whether its data fetching is correct. The management service module configures the parameters related to crawler operation, upgrades the website crawlers, and manages the start-up and stopping of the service system as well as the life cycles and operation of the website crawlers. The deployment service module allocates and deploys the website crawlers. The scheduling service module contains the scheduling modules of the website crawlers, which manage the crawlers' working modes, working times and stopping.
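The abstract says the monitoring service judges two things per crawler: whether it is working normally and whether its fetched data is correct. The source does not say how either judgment is made, so the sketch below substitutes two simple, assumed criteria: a heartbeat timeout for "working normally" and a required-fields check for "data fetching is correct". All names and thresholds are hypothetical.

```python
from dataclasses import dataclass, field

# Assumed thresholds and rules; the source does not specify any of these.
HEARTBEAT_TIMEOUT = 300.0             # seconds without a heartbeat
REQUIRED_FIELDS = ("title", "price")  # fields a fetched record must contain


@dataclass
class CrawlerStatus:
    """Hypothetical status report sent by one website crawler."""
    site: str
    last_heartbeat: float              # timestamp in seconds
    last_record: dict = field(default_factory=dict)


def check_crawler(status, now):
    """Return a list of problems found for one crawler (empty if healthy)."""
    problems = []
    if now - status.last_heartbeat > HEARTBEAT_TIMEOUT:
        problems.append("no heartbeat")
    missing = [f for f in REQUIRED_FIELDS if not status.last_record.get(f)]
    if missing:
        problems.append("missing fields: " + ", ".join(missing))
    return problems


def monitor(statuses, now):
    """Check every crawler and return site -> list of problems."""
    return {s.site: check_crawler(s, now) for s in statuses}


# Usage with made-up reports; `now` is passed in so the check is deterministic.
reports = [
    CrawlerStatus("site-a", last_heartbeat=990.0,
                  last_record={"title": "Sunny Court", "price": "1,200,000"}),
    CrawlerStatus("site-b", last_heartbeat=100.0,
                  last_record={"title": "Lake View"}),
]
result = monitor(reports, now=1000.0)
```

A management service could act on a non-empty problem list, for example by restarting or upgrading the affected crawler, but that hand-off is not described in the source.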

Description

Technical Field

[0001] The invention relates to the technical field of website data mining, in particular to a web crawler service system for Fangku.com.

Background

[0002] The real estate industry is directly related to people's livelihood. The residential market is entering an era of existing (stock) housing, and many owners of such housing are not professional salespeople, so the sales information they provide is not comprehensive enough. At the same time, housing file management in government departments is still at the paper stage, and the various data related to housing and real estate are scattered across different units and departments, so this useful data is not fully utilized. People choosing housing and enterprises choosing office space face a serious lack of professional and detailed information services.

[0003] In this social environment, it is of great significance to promote real estate informatization, make it easier for home buyers to query informat...

Application Information

IPC(8): G06F17/30
CPC: G06F16/951
Inventor: 贾岩
Owner: ANHUI HUAZHEN INFORMATION SCI & TECH