Generic similarity calculation method and system based on heterogeneous information network
A heterogeneous information network and similarity calculation technology, which is applied in the fields of information technology and the Internet, can solve the problems of lack of calculation methods, low accuracy of results, and low calculation efficiency, so as to achieve high freedom of user choice, solve information overload, and improve Effect of Computational Accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] Example 1, such as figure 1 As shown, a general similarity calculation method based on a heterogeneous information network in the embodiment of the present invention includes:
[0051] Step 1, preprocessing the input data set to ensure the validity of the input data;
[0052] The following is an example of the movie recommendation dataset MovieLens100k provided by the University of Minnesota. The specific implementation is as follows: In the case of only using movie recommendations, a large amount of user data is redundant and needs to be removed. In this data set, there is a lack of information such as movie actors (Actor), director (Director), and the data set provides a link from the movie to the Internet movie data set IMDb. Combine the data set MovieLens 100K and the data set IMDb to obtain effective data.
[0053] Step 2, perform metadata extraction, extract the description information of the input data, and store the description information in the metadata datab...
Embodiment 2
[0076] Example 2, such as Figure 4 As shown, the present invention also provides a general-purpose similarity calculation system based on heterogeneous information networks, including:
[0077] The processing module preprocesses the input data set to ensure the validity of the input data;
[0078] The extraction module extracts metadata, extracts the description information of the input data, and stores the description information in the metadata database, where the description information includes the global information of the overall situation of the input data set, the local information of each record, and the attributes of the data Conversion and corresponding information between identifiers and internal representations;
[0079] In the modeling module, the user selects the entities and data attributes involved in the similarity calculation, queries the corresponding metadata, displays the data type and value range of each metadata, and prompts the user to select the met...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com