Fine-grained visual question-answering method combined with multi-view attention mechanism
An attention, fine-grained technology, applied in neural learning methods, computer components, biological neural network models, etc., to achieve high efficiency, improve accuracy and comprehensiveness, and high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0045] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.
[0046] In order to solve the shortcomings of the prior art, the present invention provides a fine-grained visual question answering method combined with a multi-view attention mechanism. Visual question answering can be viewed as a multi-task classification problem, where each answer can be viewed as a classification category. In a general visual question answering system, the One-Hot method is used to encode the answers to obtain the One-Hot vectors corresponding to each answer to form an answer vector table. One-Hot encoding is the representation of categorical variables as binary vectors. This first requires mapping categorical values to integer values, and then each integer value is represented as a binary vector, which is zero-valued except for the index of the integer, which is labeled as 1.
[0047] like figure 1 As shown, the fine...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com