
Fuzzy cognitive map-based document semantic automatic generation method

The method applies fuzzy cognitive maps and automatic generation technology in the fields of special-purpose data processing applications and electrical digital data processing. It addresses two problems: there is no good solution for automatically generating the semantics of massive document collections, and overly simple vector representations cannot reflect the semantic knowledge of an article. The stated effects are improved precision and convenient application.

Inactive Publication Date: 2011-07-27
SHANGHAI REDNEURONS +1
View PDF0 Cites 7 Cited by

AI Technical Summary

Problems solved by technology

Because the <keyword, article, frequency> vector representation is too simple, traditional search engines suffer from a low precision rate and cannot reflect the semantic knowledge of an article.
[0003] Search based on document semantics has a good precision rate, but there is no good solution for automatically generating the semantics of massive document collections.



Examples


Embodiment 1

[0036] Embodiment 1: see Figure 1. The fuzzy cognitive map-based method for automatically generating document semantics is implemented in the following steps:

[0037] First, preprocess the document: split it into paragraphs, split the paragraphs into sentences, and segment the sentences into words;

[0038] Second, transform each paragraph with atomic semantics into a corresponding atomic cognitive map (E-FCM);

[0039] Third, combine the atomic cognitive maps (E-FCM) of the paragraphs with atomic semantics into a paragraph cognitive map (S-FCM);

[0040] Fourth, combine the atomic cognitive maps (E-FCM) and paragraph cognitive maps (S-FCM) of the same document into a document cognitive map (D-FCM), which represents the knowledge in the document.

[0041] The specific process is shown in Figure 1: S1, open the document; S2, split the document into paragraphs and sentences; S3, perform word segmentation on the document; S4, extract keywords from t...
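
The preprocessing step (paragraph, sentence and word segmentation) can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the function names are hypothetical, and it assumes whitespace-delimited text with blank lines between paragraphs. Chinese text would need a dedicated word segmenter in place of `split_words`.

```python
import re


def split_paragraphs(text):
    """Split a document into paragraphs on blank lines (assumed convention)."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def split_sentences(paragraph):
    """Split a paragraph into sentences after '.', '!' or '?' (a naive rule)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", paragraph) if s.strip()]


def split_words(sentence):
    """Lowercase the sentence and extract alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def preprocess(text):
    """Return the document as paragraphs -> sentences -> word lists."""
    return [
        [split_words(s) for s in split_sentences(p)]
        for p in split_paragraphs(text)
    ]
```

Each inner list of sentences corresponds to one paragraph, which is the unit the later steps turn into an atomic cognitive map.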

Embodiment 2

[0042] Embodiment 2: see Figure 1 and Figures 2 to 6. In this embodiment, the fuzzy cognitive map-based method for automatically generating document semantics converts a paragraph with atomic semantics into an atomic cognitive map in the following steps:

[0043] 1) Use the title of the text paragraph, or the sentence in the paragraph with the largest ratio of the number of words to the number of keywords, as the topic node of the atomic cognitive map, that is, the topic concept;

[0044] 2) Preprocess the content of the paragraph. Sentence and word segmentation yield m sentences and n distinct keywords; the N keywords with the highest probability of occurrence are taken as the N concepts of the atomic cognitive map, where m, n and N are natural numbers;

[0045] 3) For the N concepts in the atomic cognitive map, calculate the connection weight between each pair of concepts;

[0046] The calculation formula is If concept C ...
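
Step 2 above, selecting the N most frequent keywords of a paragraph as concepts, can be sketched as follows. This is an illustrative sketch only: `top_concepts` and the stop-word filter are assumptions, since the source does not say how keywords are distinguished from other words. The connection-weight formula of step 3 is not reproduced in this text, so it is not implemented here.

```python
from collections import Counter


def top_concepts(sentences, stopwords, top_n):
    """Pick the top_n most frequent keywords of a paragraph.

    sentences: list of word lists (output of sentence/word segmentation).
    stopwords: words that are not treated as keywords (an assumption).
    Returns a list of (keyword, relative_frequency) pairs.
    """
    counts = Counter(
        word for sentence in sentences for word in sentence
        if word not in stopwords
    )
    total = sum(counts.values())
    if total == 0:
        return []
    return [(word, count / total) for word, count in counts.most_common(top_n)]
```

The relative frequency stands in for the "probability of occurrence" mentioned in the text.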

Embodiment 3

[0048] Embodiment 3: see Figure 1 and Figures 2 to 6. This embodiment builds on Embodiment 2. The weights of the N concepts in the atomic cognitive map on the topic concept are calculated using arithmetic-sum normalization: the weight of keyword $C_i$ on the topic concept is obtained by a single inference step with the inference formula. The state value of the i-th concept $C_i$ in the text is calculated as $V_{C_i} = \tanh(x_i)$, where $x_i$ is the frequency with which the i-th keyword appears in the text, and $f(\cdot)$ is the normalization function over all keyword weights.
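
The state-value calculation and the arithmetic-sum normalization can be illustrated with a short sketch. The inference formula itself is not reproduced in this text, so the code below only computes tanh(x_i) and then divides each value by the arithmetic sum of all values. Treating that division as the normalization function f is an assumption, and the function names are hypothetical.

```python
import math


def state_values(frequencies):
    """Map each keyword frequency x_i to its state value tanh(x_i)."""
    return {concept: math.tanh(x) for concept, x in frequencies.items()}


def normalize_by_sum(values):
    """Divide every value by the arithmetic sum of all values.

    Assumed form of the normalization function f; returns zeros if the
    sum is zero to avoid division by zero.
    """
    total = sum(values.values())
    if total == 0:
        return {concept: 0.0 for concept in values}
    return {concept: v / total for concept, v in values.items()}


def topic_weights(frequencies):
    """Normalized weights of the keywords on the topic concept (sketch)."""
    return normalize_by_sum(state_values(frequencies))
```

Because tanh saturates near 1 for large inputs, relative frequencies keep the state values better separated than raw counts; the source does not state which scale is intended.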


Abstract

The invention relates to a method for automatically generating document semantics, and in particular to one based on fuzzy cognitive maps. The paragraphs of a document are represented semantically as fuzzy cognitive maps, and the semantic representation of the whole document is obtained by combining these maps. The method comprises the following steps: first, preprocessing the document, including paragraph segmentation, sentence segmentation and word segmentation; second, converting each paragraph with atomic semantics into a corresponding atomic cognitive map; third, combining the atomic cognitive maps of the paragraphs with atomic semantics into a paragraph cognitive map; and fourth, combining the atomic cognitive maps and the paragraph cognitive maps of the same document into a document cognitive map, which represents the knowledge in the document. The method increases the degree of automation of document semantic representation and is convenient to apply to the massive volume of web page text in a Web environment, so the precision rate of Web search can be improved.

Description

Technical field

[0001] The invention relates to a method for automatically generating document semantics, in particular to a method based on a fuzzy cognitive map.

Background

[0002] Traditional search engines based on keyword matching search by matching <keyword, article, frequency> triples. This approach is simple and fast and has a high recall rate. However, because the <keyword, article, frequency> vector representation is too simple, traditional search engines suffer from a low precision rate and cannot reflect the semantic knowledge of articles.

[0003] Search based on document semantics has a good precision rate, but there is no good solution for automatically generating the semantics of massive document collections.

Contents of the invention

[0004] The problem to be solved by the present invention is to propose a method for automatically generating document sema...


Application Information

IPC(8): G06F17/27, G06F17/30
Inventor 邬江兴, 罗兴国, 刘超, 魏晓, 曹伟, 骆祥峰, 斯雪明, 雷咏梅, 贾云杰
Owner SHANGHAI REDNEURONS