The invention discloses a distributed method and system for collecting public sentiment data in real time.
The method comprises the following steps: firstly, a class library of public sentiment data websites
is established, and
the data items to be crawled are classified and defined; secondly, a
list of data acquisition websites is transmitted to a data collection server, and the
server allocates corresponding crawlers that crawl the data cyclically, sleeping between crawl rounds;
thirdly, the crawled source web page data is subjected to tag parsing, the target data items are located,
and the target data items are obtained; fourthly, each obtained result data item is encapsulated
into the uniform format of its corresponding class; fifthly, the
encapsulated data is stored in the corresponding
database; sixthly, a monitoring log file is generated. The distributed public sentiment data real-time
collection method and system have an advanced architecture: the factory pattern is used as the main
design pattern of the system, so that new instances can be generated quickly, and core system functions
such as browser access, log generation, data encapsulation, proxy setting and queue setting are
encapsulated. As a result, the extensibility and portability of the system are enhanced, and the
reusability of the code and the maintainability of the system are improved.
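
The six steps and the factory pattern described above can be illustrated with a minimal Python sketch. It is not the implementation of the invention. The names `NewsItem`, `TitleParser`, `NewsCrawler`, `CrawlerFactory` and `collect`, the single "news" website class, the choice of the page title as the target data item, and the use of SQLite as the database are all assumptions made for illustration. The page fetcher is passed in as a function so that the sketch runs without network access.

```python
import logging
import sqlite3
import time
from dataclasses import dataclass, asdict
from html.parser import HTMLParser
from typing import Callable, Dict, Type

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("collector")  # stands in for the monitoring log file


@dataclass
class NewsItem:
    """Uniform format for one website class (hypothetical fields)."""
    url: str
    title: str


class TitleParser(HTMLParser):
    """Tag parsing: locates the <title> element and keeps its text."""

    def __init__(self) -> None:
        super().__init__()
        self._in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


class NewsCrawler:
    """Crawler for the hypothetical 'news' website class."""

    def __init__(self, fetch: Callable[[str], str]) -> None:
        self.fetch = fetch  # injected so the sketch needs no network access

    def crawl(self, url: str) -> NewsItem:
        parser = TitleParser()
        parser.feed(self.fetch(url))
        return NewsItem(url=url, title=parser.title.strip())


class CrawlerFactory:
    """Factory: maps a website class name to a crawler type."""

    _registry: Dict[str, Type[NewsCrawler]] = {"news": NewsCrawler}

    @classmethod
    def create(cls, site_class: str, fetch: Callable[[str], str]) -> NewsCrawler:
        return cls._registry[site_class](fetch)


def collect(sites, fetch, db, rounds=1, sleep_seconds=0.0):
    """Crawl every listed site `rounds` times, sleeping between rounds."""
    db.execute("CREATE TABLE IF NOT EXISTS news (url TEXT, title TEXT)")
    stored = 0
    for round_no in range(rounds):
        for site_class, url in sites:
            item = CrawlerFactory.create(site_class, fetch).crawl(url)
            db.execute("INSERT INTO news VALUES (:url, :title)", asdict(item))
            stored += 1
            log.info("round %d: stored %s", round_no, item)
        db.commit()
        if round_no < rounds - 1:
            time.sleep(sleep_seconds)  # the dormant interval between rounds
    return stored


if __name__ == "__main__":
    # Canned page instead of a real HTTP request (illustrative data only).
    pages = {"http://example.com/a": "<html><head><title>Hello</title></head></html>"}
    connection = sqlite3.connect(":memory:")
    collect([("news", "http://example.com/a")], pages.__getitem__, connection)
```

In this sketch, supporting a further website class would amount to adding one item class, one crawler class and one registry entry, which is the kind of extensibility the factory pattern is intended to provide.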