
Knowledge network-based text indexing system and method

Knowledge network and text technology, applied in the field of text indexing systems, addresses the low accuracy of existing text indexing technology.

Active Publication Date: 2011-10-05
HYLANDA INFORMATION TECH
7 Cites, 35 Cited by

AI Technical Summary

Problems solved by technology

The system provides several indexes of different dimensions under a unified platform, effectively addressing the low accuracy of existing text indexing technology.



Examples


Embodiment Construction

[0033] The concept of a knowledge network (Knowledge Network) was first proposed in Swedish industry in the mid-1990s. A knowledge network is generally understood as a concept network with weights added, so that the relationships between users and knowledge nodes are expressed quantitatively. Knowledge nodes can be extracted from existing catalog search engines and are characterized by independence, inheritance, variability, and multidimensionality. Independence means that only cognitively independent knowledge elements and knowledge units can constitute knowledge nodes. Inheritance is manifested mainly in two aspects: first, the quantity of knowledge expands through integration, and this growth builds on what is inherited; second, it is manifested in the ...
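To make the "concept network plus weights" idea concrete, the following is a minimal sketch of a weighted graph over knowledge nodes. The class name `KnowledgeNetwork`, its methods, and the example weights are illustrative assumptions and do not come from the patent text.

```python
from dataclasses import dataclass, field


@dataclass
class KnowledgeNetwork:
    """Hypothetical weighted concept network: node -> {related node -> weight}."""
    edges: dict = field(default_factory=dict)

    def relate(self, source, target, weight):
        # Record a weighted, directed relation between two knowledge nodes.
        self.edges.setdefault(source, {})[target] = weight

    def strongest(self, source):
        # Return the neighbour with the highest weight, or None if there is none.
        neighbours = self.edges.get(source, {})
        return max(neighbours, key=neighbours.get) if neighbours else None


net = KnowledgeNetwork()
net.relate("football", "goal", 0.9)      # assumed example weights
net.relate("football", "stadium", 0.4)
print(net.strongest("football"))
```

A plain concept network would store only which nodes are related; the weights are what allow relations to be ranked or compared quantitatively.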


Abstract

The invention discloses a knowledge network-based text indexing system and method. The text indexing system comprises a single-text feature extraction unit, a multi-text word relation extraction unit, a knowledge tree generating unit, a knowledge tree application unit, and a knowledge base storage unit. The text indexing method comprises the following steps: segmenting the words of a text input to the text indexing system and acquiring the text feature words; deducing a class word (TAG) for the text from the positions of the knowledge tree nodes that correspond to the text feature words; and judging the validity of each TAG with a TAG-based judgment-type model, then extracting a reliable TAG word set and using it to reposition the text feature word set, forming a reliable text feature word set. Because the system and method integrate content word extraction, class labeling, and phrase extraction, the three extraction tasks reinforce one another; and because the semantics of words are expressed through nodes of the knowledge network, ambiguity is reduced.
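The steps of the indexing method can be sketched as a small pipeline. This is only an illustration under stated assumptions: the toy `KNOWLEDGE_TREE`, the whitespace-based `segment` function, and the share threshold in `reliable_tags` are all hypothetical stand-ins. In particular, the abstract describes a judgment-type model for TAG validity, which is replaced here by a simple threshold.

```python
from collections import Counter

# Hypothetical toy knowledge tree: node -> parent node (None marks a top-level class).
KNOWLEDGE_TREE = {
    "sports": None,
    "finance": None,
    "football": "sports",
    "goal": "football",
    "striker": "football",
    "stock": "finance",
    "dividend": "stock",
}


def segment(text):
    """Stand-in for word segmentation: lowercase and split on whitespace."""
    return text.lower().split()


def feature_words(tokens):
    """Keep only the tokens that exist as nodes in the knowledge tree."""
    return [t for t in tokens if t in KNOWLEDGE_TREE]


def top_category(word):
    """Walk up from a node to its top-level ancestor, used here as the TAG."""
    while KNOWLEDGE_TREE[word] is not None:
        word = KNOWLEDGE_TREE[word]
    return word


def infer_tags(features):
    """Count how many feature words fall under each candidate TAG."""
    return Counter(top_category(w) for w in features)


def reliable_tags(tag_counts, min_share=0.5):
    """Assumed validity check: keep TAGs that cover at least min_share of the
    feature words. This threshold replaces the patent's judgment-type model."""
    total = sum(tag_counts.values())
    return {t for t, c in tag_counts.items() if total and c / total >= min_share}


def index_text(text):
    """Segment, extract features, infer TAGs, then keep features under reliable TAGs."""
    features = feature_words(segment(text))
    tags = reliable_tags(infer_tags(features))
    reliable_features = [w for w in features if top_category(w) in tags]
    return tags, reliable_features


tags, features = index_text("The striker scored a goal while the stock fell")
print(tags, features)  # {'sports'} ['striker', 'goal']
```

In this toy run, "stock" is dropped from the final feature set because its TAG ("finance") fails the validity check, which mirrors how the reliable TAG set is used to reposition the feature word set.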

Description

Technical field

[0001] The present invention relates to a system and method for text indexing, in particular to a text indexing system and text indexing method based on a knowledge network (Knowledge Network) for use in text information processing, and belongs to the field of text information processing technology.

Background technique

[0002] Text is the most basic and commonly used information carrier. With the increasing popularity of the Internet, the volume of text information is expanding rapidly. For example, hundreds of thousands of web pages are updated on the Internet every day and millions of new web pages are added, making the information on the Internet rich and complex. How to organize and manage this information effectively, and how to find the information users need quickly, accurately, and comprehensively, is a major challenge in the field of text information processing.

[0003] In the work of text information processing, text content word extraction, category labelin...


Application Information

IPC(8): G06F17/30
Inventor: 张伟伟, 张旭成, 孙威, 宋传宝, 陶鹏
Owner: HYLANDA INFORMATION TECH