Method for increasing credibility of question prerequisite in visual question answering scene

A technology of credibility and questions, applied in the field of visual question answering, can solve the problem of low credibility of the premise

Inactive Publication Date: 2017-09-15
SHENZHEN WEITESHI TECH
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at solving the problem of low credibility of question premise in the field of visual question answering, the purpose of the present invention is to provide a method for improving the credibility of question premise in the visual question answering scene, and propose a combination of one-hot encoding and deep learning encoding new frame

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for increasing credibility of question prerequisite in visual question answering scene
  • Method for increasing credibility of question prerequisite in visual question answering scene
  • Method for increasing credibility of question prerequisite in visual question answering scene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described in detail below in conjunction with the drawings and specific embodiments.

[0034] figure 1 It is a system flowchart of a method for improving the credibility of question premise in a visual question-and-answer scene of the present invention. It mainly includes premise information extraction; question relevance prediction database; question relevance detection; data expansion of visual question answering.

[0035] Among them, the premise information extraction uses the semantic meta-ancestor picture title evaluation standard to extract the premise information in the question, specifically:

[0036] (1) The evaluation criterion converts a question sentence into a scene representation;

[0037] (2) Disable pronoun resolution and verb reduction during conversio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for increasing credibility of question prerequisite in a visual question answering scene. The method comprises following steps: prerequisite information extraction, question correlation prediction database, question correlation detection, visual question answering data expansion. First, the prerequisite information in the problem is extracted; the problem correlation prediction and explanation database are constructed; and binary classification of problem image pairs (Ii, Qi) is performed; whether the image Ii has the prerequisite information in question Qi is identified; then on the basis of one-hot coding, the image Ii and the problem Qi are encoded using the VGG network and the short and long term memory network, respectively, and input the data to the multi-layer sensor for prediction. The method of the invention can handle a plurality of target objects and their relations in different scenes, provide an encoding method to calculate the image matching distance, and improve the credibility of the question prerequisite information.

Description

technical field [0001] The invention relates to the field of visual question answering, in particular to a method for improving the credibility of question premise in a visual question answering scene. Background technique [0002] In recent years, it has attracted much attention to attach image labels or subject texts to image content independently. Especially in today’s generation of massive images, it is impossible to distinguish and classify image content completely according to human eyes. Therefore, how to use prior Knowledge of the key to hashtag specific image content and accurately answer the question in Visual Q&A is something to consider. If the content of the image can be successfully answered without the labor of the human eye, it will bring extremely high significance and economic value to engineering and the visual industry, especially in places with a wide background and sparse objects such as the deep ocean. Real-time navigation information, verification of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/5866G06F16/90332G06F16/904
Inventor 夏春秋
Owner SHENZHEN WEITESHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products