Image semantic disambiguation method and device based on image and text semantic similarity
A technology of semantic similarity and similarity, applied in semantic analysis, character and pattern recognition, special data processing applications, etc., can solve problems such as image ambiguity, and achieve the effect of improving accuracy and reducing error rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0053] The processing flow of an image semantic disambiguation method based on image and text semantic similarity provided by an embodiment of the present invention is as follows: figure 1 shown, including the following steps:
[0054] Step 1: Use the image saliency label to mark the image to be processed, obtain the label of the image to be processed, and mark the image content of the image to be processed.
[0055] Use a large number of known images to form training sample images, use the image visual saliency analysis method to perform saliency analysis on each training sample image, use NeuralTalk of convolutional neural network CNN, long short-term memory LSTM and / or recurrent neural network RNN The algorithm generates natural language descriptions for training sample images and obtains image saliency labels.
[0056] Collect a large number of images with polysemy ambiguity, such as images with apples, divided into Apple computers, mobile phones or edible apples, and put...
Embodiment 2
[0076] The structure of an image semantic disambiguation device based on image and text semantic similarity provided by this embodiment is as follows: image 3 As shown, the following modules are included:
[0077] The semantic processing module 31 is used to represent a meaning of a polysemy with a mean vector, and store all mean vectors and the meaning association of the polysemy corresponding to each mean vector in the mean vector database;
[0078] The image processing module 32 is used to mark the image to be processed using the image saliency label, obtain the label of the image to be processed, and mark the image content of the image to be processed, and convert the label and image content of the image to be processed into a vector form, to obtain the fusion vector of the image to be processed;
[0079] The image word sense disambiguation processing module 33 is used to use the cosine similarity to calculate the similarity between the fusion vector of the image to be p...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com