Bidirectional reconstruction network video description method based on hierarchical attention mechanism
The invention relates to network video description and attention techniques in the computer field. It addresses problems such as irrelevant background information interfering with the text description and low semantic similarity, and achieves high semantic similarity, minimized reconstruction loss, and reduced interference.
Embodiment Construction
[0038] The present invention is further described below with reference to the accompanying drawings.
[0039] The bidirectional reconstruction network video description method based on a hierarchical attention mechanism extracts multi-scale video features to fully represent the temporal and spatial structure of the video, and uses the hierarchical attention mechanism so that the constructed bidirectional reconstruction network model attends to the video features most relevant to the description sentence being generated. The main idea is as follows: a convolutional neural network serves as the encoder and extracts multi-scale regional features of the video frames, and the hierarchical attention mechanism processes these features to obtain a dynamic representation of the video; a long short-term memory (LSTM) neural network serves as the decoder, and minimizing the cross-entropy loss function yields the probability distribution over vocabulary words, from which sentences are generated; the reconst...
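The two-level attention and the cross-entropy objective described above can be illustrated with a minimal sketch. This is not the claimed implementation: the dot-product scoring function, the use of a single query vector (standing in for a decoder hidden state), and all function names (`attend`, `hierarchical_attention`, `cross_entropy`) are assumptions made only for illustration. Region features would in practice come from a convolutional encoder; here they are small hand-written vectors.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend(query, vectors):
    """Soft attention: weight each vector by softmax(dot(query, vector)).

    Dot-product scoring is an assumption; the source does not state
    which scoring function is used.
    """
    weights = softmax([dot(query, v) for v in vectors])
    dim = len(vectors[0])
    context = [sum(w * v[i] for w, v in zip(weights, vectors))
               for i in range(dim)]
    return context, weights

def hierarchical_attention(query, video):
    """Two-level attention over a video.

    video: list of frames, each frame a list of region feature vectors.
    Level 1 pools the regions inside each frame; level 2 pools the
    resulting per-frame vectors across time.
    """
    frame_contexts = []
    for regions in video:
        ctx, _ = attend(query, regions)      # level 1: regions in a frame
        frame_contexts.append(ctx)
    return attend(query, frame_contexts)     # level 2: across frames

def cross_entropy(probs, target_index):
    """Cross-entropy loss for one predicted word distribution."""
    return -math.log(probs[target_index])

# Hypothetical toy input: 2 frames, 2 regions per frame, 2-dim features.
video = [[[1.0, 0.0], [0.0, 1.0]],
         [[0.5, 0.5], [0.2, 0.8]]]
query = [1.0, 0.0]  # stands in for a decoder hidden state (assumption)
context, frame_weights = hierarchical_attention(query, video)
```

The returned `context` is the dynamic video representation for one decoding step, and `frame_weights` shows how much each frame contributed. In a full model the query would change at every step, so the representation is recomputed for each generated word.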