Digital thesis retrieval method based on formal concept analyses

A formal concept and paper technology, applied in the field of data mining, can solve the problems that the traditional framework of FCA information retrieval cannot handle large-scale paper retrieval and the accuracy of retrieval results is not high.

Inactive Publication Date: 2013-12-11
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The purpose of the present invention is to aim at the field of academic paper search, the accuracy rate of the retrieval results in the existing academic paper retrieval method combined with FCA theory is not high and the traditional framework of FCA information retrieval cannot handle large-scale paper retrieval, and proposes a new The formal background scale reduction mechanism and the method of acquiring and sorting academic papers based on concept lattice are used to retrieve academic papers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Digital thesis retrieval method based on formal concept analyses
  • Digital thesis retrieval method based on formal concept analyses
  • Digital thesis retrieval method based on formal concept analyses

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] According to the above technical solutions, the present invention will be described in detail below through specific examples.

[0060] This embodiment uses the method proposed by the present invention to establish a digital paper retrieval system based on formal concept analysis. The classification system uses a JAVA development platform and a MySql database. Using 10,000 papers from CNKI (China National Knowledge Network) in the field of computer information retrieval to conduct experiments, the specific steps are as follows:

[0061] The operations in the preprocessing phase are:

[0062] Step 1: For all the keywords in the 10,000 papers in the field of computer information retrieval, calculate the TF-IDF value of each keyword in the 10,000 papers in the field of computer information retrieval in turn, and follow the TF-IDF value from high to low The keywords were sorted sequentially; then, the 40 keywords with the highest TF-IDF values ​​were identified as attribut...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of data mining, and relates to a digital thesis retrieval method, in particular to a digital thesis retrieval method based on formal concept analyses. According to the thesis retrieval method, concept lattice building and searching scale and time are shortened through the mode of sequencing and selection according to intervals, then other cut theses are attached to a selected thesis, and the effect of result losing is eliminated to a large extent; meanwhile, the problem that retrieval results are too dispersed and too large in the thesis retrieval process is solved through a concept lattice rough and approximate retrieval mechanism, and retrieval result recall rate and precision are also guaranteed. The digital thesis retrieval method based on the formal concept analyses provides a usable retrieval mode based on the formal concept analysis regarding to large-scale data.

Description

technical field [0001] The invention relates to a digital paper retrieval method, in particular to a digital paper retrieval method based on formal concept analysis, which belongs to the field of data mining. Background technique [0002] At present, for academic researchers, there are many search engines for academic papers, such as the public GOOGLE SCHOLAR search engine, the commercial ACM search engine, and the free CITESEER search engine. These search engines return their own results according to the user's request, but the results often have the following problems: ① too many results are returned; ② most of the returned results deviate from the request; Not very accurate. Therefore, how to meet the search requests of academic users and efficiently find the academic resources (papers) they need is a key research field in the field of academic search. [0003] Formal Concept Analysis (FCA) was proposed by R.Wille in 1982. Since 1990, FCA has been integrated with relate...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 施重阳牛振东张春霞赵向宇
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products