Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for dynamic aggregation of web news based on domain theme

A dynamic aggregation and news technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve problems such as low information coverage, low information accuracy, and insufficient information pertinence, and achieve good structural characteristics. , good purity, the effect of eliminating noise information

Active Publication Date: 2016-06-15
HEFEI UNIV OF TECH
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to provide a dynamic aggregation method of Web news oriented to domain topics, relying on vertical search engines, meta search engines, domain modeling, information extraction, and content sorting technologies, which can provide users and application systems with The domain theme-oriented Web news dynamic aggregation service solves the problems of low information coverage, low information accuracy rate and difficulty in meeting domain theme-oriented retrieval requirements when search engine technology deals with Web news dynamic aggregation problems. Insufficient Information Diversity and Insufficient Pertinence of Information in Dynamic Aggregation of Web News

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for dynamic aggregation of web news based on domain theme
  • A method for dynamic aggregation of web news based on domain theme
  • A method for dynamic aggregation of web news based on domain theme

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment

[0059] In this embodiment, the designated field is "academic field". For the convenience of description, the first three items of the search results of each general search engine among vertical search engines and meta search engines are taken.

[0060] (1) if figure 1 As shown in S101, a user uses a mobile phone as a user terminal, submits the theme of "big data data mining" to the server through the browser HTTP protocol through the mobile phone.

[0061] (2) if figure 1 As shown in S102, the server receives the subject information of "big data data mining" submitted by the user terminal, and obtains a list of search records based on the vertical search engine module.

[0062] Vertical search engines use timers and incremental crawlers to collect data. The homepages of news websites of 75 colleges and universities affiliated to the Ministry of Education were used as seeds, and incremental crawlers were put into them. Using timer control, these websites are periodically cra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention is applicable to the field of network information processing and provides a field subject-oriented Web news dynamic aggregation method. The method comprises the following steps of: for a field site list pre-defined by a user, according to a subject provided by the user, acquiring a search record list by utilizing a vertical search engine and a meta search engine; performing deduplication on the search record list and identifying a Web news webpage to obtain a news webpage search record list; with a Web information extraction method, acquiring a structured news list from the news webpage search record list; and performing ranking on the structured news list according to a field model to obtain an ordered structured news list, and returning the ordered structured news list to the user as a dynamic aggregation result. According to the field subject-oriented Web news dynamic aggregation method, a multi-source associated Web news set is acquired in real time according to a field and a subject provided by the user, an interactive mechanism where the ranking of the Web news depends on the popularity of the Web news is provided, and the purpose is to provide a convenient and efficient internet information acquisition and sharing mode.

Description

technical field [0001] The invention relates to the field of network information processing, in particular to a field theme-oriented Web news dynamic aggregation method. Background technique [0002] Due to the inherent advantages in the dissemination of news information on the Internet, Web news has increasingly become the main way for people to obtain information. Due to the large amount of Web news information and its fast-changing characteristics, it is difficult to obtain Web news related to domain topics. Internet users and related applications urgently need a domain-oriented Web news dynamic aggregation method. Territory refers to the sphere of ideology or social activity. Such as: ideological field, academic field, life field, scientific field. Theme refers to the user's basic thoughts and interest tendencies reflected through the collection of keywords when expressing thoughts, explaining problems or reflecting social life. Web news refers to the reports of recen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30H04L29/08
CPCG06F16/358H04L67/02
Inventor 吴共庆胡骏刘鹏程王钊胡东辉李磊胡学钢吴信东
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products