Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Related knowledge point acquisition method and system

A technology of knowledge points and domain knowledge, which is applied in the field of electronic digital data processing, can solve problems such as poor objectivity, heavy workload, and artificial screening, so as to reduce workload, improve efficiency and accuracy, and save time and labor costs. Effect

Inactive Publication Date: 2016-05-25
PEKING UNIV FOUNDER GRP CO LTD +2
View PDF3 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] For this reason, the technical problem to be solved by the present invention lies in the problems of manual screening, heavy workload, and poor objectivity in obtaining relevant entries in the prior art, thereby proposing a method for determining relevant knowledge points based on semantic vectors

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Related knowledge point acquisition method and system
  • Related knowledge point acquisition method and system
  • Related knowledge point acquisition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0033] In this embodiment, a method for obtaining related knowledge points is provided, through which the relevant knowledge points of all knowledge points in the field are obtained, and then according to the obtained related knowledge points, for the entries in the established field encyclopedia It has very good guiding value to check for leaks and fill in vacancies for further improvement. Knowledge point refers to the basic unit of information transmission. Researching the representation and association of knowledge points plays an important role in improving learning navigation, information recommendation, retrieval, and thesaurus establishment.

[0034] The method of obtaining the relevant knowledge points, the flow chart is as follows figure 1 As shown, the specific process is as follows:

[0035] First, obtain domain knowledge points and obtain all knowledge points in this field. For example, when building an encyclopedia, you can obtain all entries in this field that ...

Embodiment 2

[0048] This embodiment provides a method for acquiring relevant knowledge points, the steps of which are the same as those in Embodiment 1. This embodiment provides a specific method for calculating the semantic vector of each candidate knowledge point in the above process, and the specific process is as follows :

[0049] The first step is to determine the number of occurrences of each candidate knowledge point in the candidate document, so that the text of each candidate knowledge point and its occurrence times is obtained. The candidate text is the text obtained after word segmentation from the selected digital resources, and the candidate knowledge point is the word obtained after the word segmentation in the candidate text except common words. This part is the same as that in Embodiment 1, and will not be repeated here.

[0050] The second step is to calculate the binary tree with the minimum weighted path length according to each candidate knowledge point and the number ...

Embodiment 3

[0066] Field encyclopedias are an important digital publishing resource. Domain encyclopedias usually organize domain information in the form of entries. The domain encyclopedia needs to contain important entries in the domain. However, building a domain encyclopedia requires a lot of human input. This embodiment provides a method for obtaining related knowledge points, where the domain knowledge points are entries in the domain encyclopedia. In this embodiment, the domain e-book text and newspaper text are used to calculate the semantic vector of the candidate entry through the skip-gram model. The semantic similarity between the constructed domain entries and the obtained candidate entries is calculated through semantic vectors. By using the semantic similarity of the entries, other field entries that are semantically related to the field encyclopedia entries and that have been missed are found, so as to reduce the possibility of some field entries being missed. Specific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a related knowledge point acquisition method. The method comprises: firstly, acquiring domain knowledge points; then carrying out word segmentation on a text in a domain according to the domain knowledge points; obtaining candidate knowledge points after removing common words; obtaining semantic vectors of the candidate knowledge points; and obtaining candidate knowledge points, related to each domain knowledge point, as target knowledge points by calculating similarity between the domain knowledge points and the candidate knowledge points. Thus, a plurality of target knowledge points related to each domain knowledge point can be obtained. When constructing an encyclopedia directory entry, it may be determined, through searching, whether each domain knowledge point has a related knowledge point, and if not, a related knowledge point needs to be added. In this way, checking and construction of encyclopedia entries are completed, so that a manual workload is significantly reduced; time costs and labor costs are reduced; inaccuracy caused by subjectivity and non-uniform standards of manual checking is avoided; and efficiency and accuracy are greatly improved.

Description

technical field [0001] The invention relates to the field of electrical digital data processing, in particular to a method and system for acquiring relevant knowledge points. Background technique [0002] Digital publishing resources have become one of the main ways of information provision. People have shifted from paper reading to electronic reading in large numbers. Digital publishing resources include e-books, digital encyclopedias, digital periodicals, digital newspapers, etc. The information provided by digital publishing resources is usually more authoritative and accurate than that of the Internet. Therefore, how to improve people's learning or reading experience according to the characteristics of digital publishing resources has become particularly important. [0003] Encyclopedia (Encyclopedia) is a reference book that introduces all human knowledge or a certain type of knowledge. They are often arranged in the form of dictionaries (with entries as the basic u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06F17/27
Inventor 叶茂徐剑波汤帜杨亮卢菁
Owner PEKING UNIV FOUNDER GRP CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products