Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

183results about How to "Enhance feature expression" patented technology

Video super-resolution reconstruction method based on multi-memory and mixed loss

The invention discloses a video super-resolution reconstruction method based on multi-memory and mixed loss, which comprises two parts of an optical flow network and an image reconstruction network. In the optical flow network, the optical flow between the current frame and the reference frame is calculated for the input multiple frames, and the optical flow is used as motion compensation to compensate the current frame to be similar to the reference frame as much as possible. In the image reconstruction network, the compensated multi-frames are sequentially input into the network, and the network adopts multi-memory residual blocks to extract image features, so that the following input frames can receive the feature map information of the previous frames. Finally, the output low-resolution feature image is amplified by sub-pixel, and added with the image amplified by bi-cubic interpolation to obtain the final high-resolution video frame. A hybrid loss function is used to train the optical flow network and the image reconstruction network simultaneously in the training process. The invention greatly enhances the feature expression ability of the inter-frame information fusion, andcan reconstruct the high-resolution video with rich details.
Owner:WUHAN UNIV

Medical image segmentation method and system based on generative adversarial network, and electronic equipment

ActiveCN110503654AEnhance feature expressionImproving the Efficiency of Adversarial TrainingImage enhancementImage analysisDiscriminatorPattern recognition
The invention relates to a medical image segmentation method and system based on a generative adversarial network, and electronic equipment. The method comprises the following steps: firstly, researching how to extract pixel-level features of different types of high-quality images by a generator, and carrying out structured feature representation by utilizing a capsule model so as to generate a pixel-level labeled sample; secondly, constructing a proper discriminator for discriminating the authenticity of a generated pixel-level labeled sample, and designing a proper error optimization function; feeding back a discriminating result to models of a generator and a discriminator respectively, improving the sample generating capacity and the discriminating capacity of the generator and the discriminator respectively through continuous adversarial training, finally, adopting a trained generator to generate a pixel-level labeled sample, and achieving pixel-level segmentation of an image-level labeled medical image. According to the invention, the dependence of the segmentation model on pixel-level annotation data is effectively reduced, the adversarial training efficiency of the generated sample and the real sample can be improved, and high-precision pixel-level image segmentation can be effectively realized.
Owner:SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI

Film comment-oriented naming entity recognition method

InactiveCN108874997ASolve the problem of incompatible namesAvoid mistakesSpecial data processing applicationsFeature vectorData set
The invention discloses a film comment-oriented naming entity recognition method, and belongs to the technical field of entity recognition. The method comprises the following steps of: defining a naming entity type of a name in a film comment; constructing a training data set TrainSet; segmenting a non-marked data set by taking characters as units so as to construct a character matrix, and constructing a mapping dictionary of characters and word vectors by utilizing a skip-gram model in word2vec; generating a naming feature dictionary corresponding to each character; combining word vectors andnaming features of the characters to generate a feature vector matrix for training a name recognition model; constructing a mixed model by utilizing Bi-LSTM and CRF so as to train the name recognition model; and carrying out name recognition on the feature vector matrix through the trained name recognition model, so as to realized film comment name recognition task-oriented name definition, carryout name labeling on character levels during name recognition, obtain name features and combine word vectors obtained via word2vec training with the naming features.
Owner:GUANGDONG UNIVERSITY OF FOREIGN STUDIES

Unsupervised cross-domain pedestrian re-identification method and system

The invention discloses an unsupervised cross-domain pedestrian re-identification method and an unsupervised cross-domain pedestrian re-identification system. The method comprises the following steps:constructing a source domain training set and a target domain training set; converting the training images in the source domain training set into a target domain, and generating image data related tothe target domain; training an initial pedestrian re-identification model by utilizing the generated image data; extracting local features of each training image in the target domain training set based on the trained pedestrian re-identification model; performing clustering analysis on training image data in a target domain training set by using the extracted local features; determining an optimal training sample in the target domain training set based on a clustering analysis result; utilizing the generated image data and the determined optimal training sample to retrain the pedestrian re-identification model, and repeating in sequence until an iteration stop condition is reached to obtain a final pedestrian re-identification model; and obtaining to-be-identified image data in the targetdomain, and identifying the to-be-identified image data by using the finally obtained pedestrian re-identification model.
Owner:中科人工智能创新技术研究院(青岛)有限公司

Image semantic segmentation model generation method and device, equipment and storage medium

The invention discloses an image semantic segmentation model generation method and device, equipment and a storage medium. The method comprises the steps of obtaining an image sample set; using the image sample set to train an image semantic segmentation model; wherein the image semantic segmentation model comprises a feature map extraction part and a feature map analysis part, the feature map extraction part comprises a plurality of cascaded hole convolution processing residual error modules, and the feature map analysis part is constructed based on an attention mechanism, a pixel correlationmechanism and multi-scale information. According to the technical scheme provided by the embodiment of the invention, the attention mechanism is effectively utilized to learn the dependency relationship in spatial position and channel dimension, the feature expression capability is enhanced, the pixel correlation mechanism is utilized to make the segmentation result more accurate, and meanwhile,the multi-scale feature information is also utilized to learn a global scene to improve the accuracy of pixel point classification.
Owner:BOE TECH GRP CO LTD

A remote sensing image super-resolution reconstruction method based on a convolutional neural network of channel attention

The invention discloses a remote sensing image super-resolution reconstruction method based on a convolutional neural network of channel attention. According to the method, a channel attention mechanism is introduced into the proposed convolutional neural network, and the characteristic expression capability of the network can be improved by utilizing the mutual dependency relationship among the characteristic channels. The method mainly comprises the following steps: designing and building a convolutional neural network model based on channel attention; Constructing a training sample by usinga smaller remote sensing database, and pre-training the deep convolutional neural network model constructed in the step 1; 2, when model training in the step 2 reaches convergence, a larger remote sensing database is added for fine adjustment, and the performance of the network is further improved; In the remote sensing image reconstruction stage, a low-resolution image is used as input, and themodel trained in the step 3 is used for reconstructing a final high-resolution image. According to the method, a better reconstruction effect can be obtained, and the method is an effective remote sensing image super-resolution reconstruction method.
Owner:SICHUAN UNIV

Remote sensing image classification method based on deep fusion convolutional neural network

The invention discloses a remote sensing image classification method based on a deep fusion convolutional neural network, and the method comprises the steps: constructing an original remote sensing image into a data set, carrying out the preprocessing of the original remote sensing image, dividing the preprocessed image into a training set, a test set and a verification set, and carrying out the data augmentation of the training set; constructing a deep fusion convolutional neural network; training to obtain an optimal network model; and classifying the actually measured remote sensing imagesby using the optimal network model. The invention provides a new classification method. A new deep fusion convolutional neural network is constructed; an improved encoder-decoder model is combined with a VGG16 model to obtain a VGG16 model; the model fuses the deep features and the middle-layer features of the remote sensing image, so that the defect of low classification precision caused by single or redundant feature extraction of the remote sensing image in the prior art is effectively overcome, the advanced feature expression capability of the target is obtained by establishing the novel network model, and the classification accuracy of the remote sensing image is improved.
Owner:CHENGDU UNIVERSITY OF TECHNOLOGY

Rotating machine fault diagnosis method based on an optimized structure convolution neural network

The invention discloses a rotating machine fault diagnosis method based on an optimized structure convolution neural network. Firstly, working signals are collected in the normal state and the fault state of a rotating machine, then are converted into gray scale images, and the gray scale images and the corresponding fault labels are used as training samples to train the constructed convolution neural network; and in the working process of the rotating machine, a working signal is collected and is converted into a gray scale image, and the trained convolution neural network is input for faultdiagnosis. The invention adopts the method of converting the collected working signals of the rotating machine into gray-scale images, and completes the task of multi-classification of faults of the rotating machine through a convolution neural network.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Target tracking method based on internal clipping and multi-layer feature information fusion

The invention belongs to the technical field of computer vision, and discloses a target tracking method based on internal clipping and multi-layer feature information fusion, which comprises the following steps: acquiring a video sequence data set, and forming a training set according to the video sequence data set; constructing a twin network, wherein a basic backbone of the twin network adopts acombination of a ResNet18 feature extraction network improved by an internal clipping unit and an RPN network; based on the training set, training a twin network, and obtaining a training convergenttwin network model; and performing online tracking by using the twin network model. The method solves the problems that a deep learning tracking method in the prior art is poor in feature expression ability and model generalization ability, and a tracker cannot effectively cope with severe space-time change of target appearance.
Owner:WUHAN UNIV

Solar cell panel defect detection method based on multi-scale joint convolutional neural network

The invention discloses a solar cell panel defect detection method based on a multi-scale joint convolutional neural network. General image data processing of a neural network is involved. Firstly, asolar cell panel image sample is collected and data is processed; balancing the sample and then carrying out pretreatment operation; dividing the processed various types of images into a training setand a test set; building an experiment software environment and an experiment hardware environment after the data set is obtained; constructing a multi-scale joint convolutional neural network model;inputting the marked training set into the model for learning; and the test set is used for testing to complete the detection of the defects of the solar cell panel, so that the defects of single function, weaker generalization ability, incapability of ensuring the accuracy in an actual detection task, difficulty in universality of a set of parameters, fewer detectable categories, larger calculated amount and lower efficiency in the prior art of the defect recognition or detection method of the solar cell panel are overcome.
Owner:HEBEI UNIV OF TECH

Long-time sheltering robust tracking method based on convolutional features and global search detection

The invention relates to a long-time sheltering robust tracking method based on convolutional features and global search detection. Through adopting a convolutional feature and multi-scale relative filtering method in a tracking module, the feature representation capacity of a tracking object appearance model is enhanced, so that a tracking result is very robust to factors such as illumination variation, object scale variation and object rotation. In addition, through the introduced global search detection mechanism, a detection module can detect the object again when the tracking fails because the object is sheltered for a long time, thus the tracking module can recover from error, and the object can be continuously tracked for a long time even in the case of object appearance changes.
Owner:NORTHWESTERN POLYTECHNICAL UNIV

Graph-based direct-push type semi-supervised pedestrian re-identification method

The invention discloses a graph-based direct-push type semi-supervised pedestrian re-identification method, and belongs to the technical field of computer vision pedestrian re-identification. The method comprises the steps of firstly, using labeled pedestrian data for training a double-channel model; after obtaining the base model, carrying out feature extraction on the label-free pedestrian data,establishing a graph model for the extracted label-free pedestrian data features, giving a pseudo label to the label-free pedestrian data according to the graph model, and constructing a positive andnegative sample pair by using the labeled pedestrian data and the label-free pedestrian data with the pseudo label; assigning confidence coefficients to positive and negative sample pairs by using the graph model, and jointly finely adjusting the base model; gradually increasing the difficulty and confidence of positive and negative sample pairs, training the base model to complete convergence byusing a course learning method, performing feature extraction and feature matching on verification set data after a final model is obtained, and completing pedestrian re-identification according to amatching result. According to the method, the negative influence caused by wrong pseudo tags is reduced, the robustness of the model is improved, and the pedestrian re-identification precision is further improved.
Owner:西安宏规电子科技有限公司

New crown diagnosis system based on deep convolutional neural network and multi-instance learning

The invention provides a new crown diagnosis system based on a deep convolutional neural network and multi-instance learning, and the system comprises a feature extraction module which packages a CT sequence of a patient, carries out time domain convolution, screens infected slice instances through weak supervised learning, and obtains an infected segment; The system comprises a multi-branch network system configured to input the feature sequence extracted from the CT sequence of the patient into a plurality of parallel branch networks, wherein class activation sequences output by different parallel branch networks are different, so that different infection fragments are positioned, and the integrity of specific case features of the patient is modeled, an attention divergence induced by the weak supervised learning is confronted to enhance the accuracy and robustness of the infected fragment; a multi-instance learning module configured to perform feature fusion of time domain convolution on multi-instance bagging so as to enhance specific case feature expression of the patient; and a gating attention mechanism module configured to perform adaptive instance feature weighted fusion to avoid gradient disappearance in multi-instance learning.
Owner:帝工(杭州)科技产业有限公司

Advertisement click rate prediction framework and algorithm based on user behaviours

InactiveCN108830416APredict interest drift in real timePredicting the probability of interest drift in real timeForecastingMarketingFeature extractionHigh dimensional
The invention discloses an advertisement click rate prediction framework and algorithm based on user behaviours. ID characteristics and other characteristics are co-converted in different levels intomeaningful numerical characteristics; due to the characteristics, the characteristic sparsity and redundancy can be reduced; the characteristic expressiveness can be improved; simultaneously, to further improve the characteristic expressiveness, characteristic selection and characteristic combination are carried out by utilization of a GBDT model in the invention; high-dimensional characteristicsare processed by utilization of an LR model; finally, to solve a class imbalance problem, a down-sampling algorithm based on a K_Means model is provided in the invention; in an experimental process, characteristic extraction on original characteristics is carried out at first; then, characteristic classification is carried out by adoption of heuristic thinking; characteristic combination is carried out by inputting perceptual characteristics into the GBDT model; finally, rational characteristics and combination characteristics are input into the LR model with a certain weight, so that advertisement click rate prediction is carried out; and an experimental result shows that the algorithm in the invention is improved both on RMSE and R2 indexes.
Owner:SICHUAN UNIV

Military target identification method based on deep learning

The invention discloses a military target identification method based on deep learning, and belongs to the field of automatic target identification based on images. The military target identificationmethod based on deep learning disclosed by the invention can be used in the fields, such as military target identification, large-scale military target retrieving, weapon equipment intellectualizationand situation estimation. According to the method disclosed by the invention, for the problems that each layer is in one-way connection and the feature expression capability is insufficient in the traditional target identification network based on deep learning, an algorithm model of a dense connected convolutional layer is re-designed; a dense connection mode is used; the algorithm model multiplexes features of each layer; therefore, the average target identification accuracy rate of the algorithm model is increased; the algorithm model obtained by training in this way is relatively small; and simultaneously, the algorithm model solves the problems of gradient diffusion and gradient expansion.
Owner:HANGZHOU DIANZI UNIV

Brain image segmentation method and device and storage medium

The embodiment of the invention discloses a brain image segmentation method and device and a storage medium, Specifically, the method comprises the steps of obtaining a to-be-segmented image group; onone hand, carrying out skull stripping according to various modal images in the to-be-segmented image group; on the other hand, carrying out feature extraction and fusion on the multiple modal imagesrespectively, then segmenting intracranial tissue according to the fused features, then fusing an initial segmentation result obtained through segmentation with the mask obtained before, and acquiring a final segmentation result; According to the scheme, the feature expression capability and the segmentation accuracy can be improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Depth pedestrian re-identification method based on positive sample balance constraint

The invention provides a depth pedestrian re-identification method based on positive sample balance constraint. A residual error network employed in the method is simple in structure and can be widely used, and the network structure which is deep sufficiently improves the feature representation capability. Moreover, there is no need to specially design the network structure. A residual error classifier is used for feature extraction of an image, so the accuracy of pedestrian re-identification can be greater than the accuracy of most of well-designed methods. Compared with two-tuple loss and three-tuple loss methods, the method does not need to intentionally generate an effective sample for improving the structural loss and can achieve the similar effect. Moreover, the method enables the learned gradient direction to be more robust and effective through the overall distribution information. On the basis of improving the structural loss, the method improves the positive sample balance constraint, can control the distance of a positive sample pair, also can balance the gradients of the distance of the positive sample pair and the distance of a positive sample pair, enables the algorithm to be easier to train, and improves the performances of an algorithm.
Owner:SUN YAT SEN UNIV

Accompaniment and human voice extraction method and device and word-by-word lyric generation method and device

The embodiment of the invention provides an accompaniment and human voice extraction method and device, a word-by-word lyric generation method and device, a storage medium and electronic equipment, and relates to the technical field of audio processing. The accompaniment and human voice extraction method comprises the following steps: acquiring mixed audio data in a time domain; encoding the mixedaudio data through an encoding module in an accompaniment voice extraction model to obtain encoded audio features; performing first convolution processing on the encoded audio features through a first decoding module in the accompaniment human voice extraction model to obtain target human voice audio data; enabling the encoding module and the first decoding module to form a first U-net network; performing second convolution processing on the encoded audio features through a second decoding module in the accompaniment voice extraction model to obtain target accompaniment audio data; and enabling the encoding module and the second decoding module to form a second U-net network. According to the invention, the accompaniment and human voice extraction effect can be improved.
Owner:HANGZHOU NETEASE CLOUD MUSIC TECH CO LTD

Face super-resolution method and system based on fusion attention mechanism

The invention discloses a face super-resolution method and system based on a fusion attention mechanism, and belongs to the field of face image super-resolution. The method comprises the steps of down-sampling a high-resolution face image into a target low-resolution face image, carrying out partitioning operation, separating out image blocks which are overlapped with each other, and extracting superficial features by using a superficial feature extractor; fusing features of pixel, channel and space triple attention modules, and enhancing reconstructed face structure details; constructing a fusion attention network as a deep feature extractor, inputting the superficial facial features into the fusion attention network to obtain deep features, wherein the fusion attention network includes a plurality of fusion attention groups, each fusion attention group includes a plurality of fusion attention blocks; and performing up-sampling on the deep feature map, and reconstructing the face feature map subjected to up-sampling into a high-resolution face image of the target. The method is superior to other latest face image super-resolution algorithms, and face high-resolution images with higher quality can be generated.
Owner:WUHAN INSTITUTE OF TECHNOLOGY +1

Defect detection network construction method, anomaly detection method and system, and storage medium

PendingCN112001903AEnrich and improve performanceImplement cross-domain migration trainingImage enhancementImage analysisFeature vectorOpen data
The invention relates to a defect detection network construction method, an anomaly detection method and system, and a storage medium. The construction method comprises: obtaining a sample image in apreset open data set and a reference image of a standard product of a to-be-detected object; configuring a convolutional neural network module or a plurality of convolutional neural network modules with different scales and forming a defect detection network; training a main feature extraction model by using the sample image to obtain corresponding network parameters, and inputting the reference image into the main feature extraction model and the slave feature extraction model to obtain a corresponding first feature vector and a corresponding second feature vector respectively; and constructing a loss function of the slave feature extraction model according to the first feature vector and the second feature vector, and training and learning to obtain network parameters of the slave feature extraction model, thereby configuring and forming a defect detection network. The defect detection network can complete automatic extraction of features according to input image information, effectively reduces dependence on experience of workers in the process of product defect detection, and has practical value.
Owner:SHENZHEN HUAHAN WEIYE TECH

Model training method, business processing method and device, terminal and storage medium

The embodiment of the invention discloses a model training method, a business processing method and device, a terminal and a medium, the model training method comprises the following steps: obtaininga sample set, the sample set comprising a plurality of sample images and an image relationship, and the image relationship comprising an association relationship between at least two sample images; constructing a data structure based on the image relationship and the plurality of sample images; traversing each element of the data structure and clustering the sample images stored by each element toobtain a plurality of clustering sets and category information of the sample images in each clustering set; carrying out model training on an image processing model by adopting the plurality of sample images and the category information of each sample image; the embodiment of the invention can improve the model training efficiency and reduce the labor cost.
Owner:SHENZHEN TENCENT COMP SYST CO LTD

Image privacy perception method based on deep learning

An image privacy perception method based on deep learning, including the following steps: S1, building a privacy classification data set with labeled categories, and training a privacy perception network with a transfer learning method; S2, recognizing a privacy image using a deep convolutional neural network oriented to privacy perception; and S3, extracting an attention profile according to deep convolutional features of the neural network, and locating an attention focusing region to complete the perception of an image privacy region. The method has the following beneficial effects: by completing end-to-end training and testing based on the deep neural network, the privacy image can be accurately distinguished with the privacy region located, facilitating the selective protection of the privacy information in the image.
Owner:HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL

Text classification method, apparatus, computer device, and storage medium

The embodiment of the invention discloses a text classification method, a device, a computer device and a storage medium. The method comprises the following steps: the input text is processed by wordsegmentation and part-of-speech tagging to obtain a word segmentation list, wherein the word segmentation list comprises a word obtained by word segmentation of the input text and a part-of-speech ofthe word; Obtaining a word vector of each word and a word vector of a part of speech of each word in the word segmentation list; Obtaining a word vector matrix composed of a splicing word vector of each word in the word segmentation list, wherein, the splicing word vector is obtained by splicing a word vector of a word and a word vector of a part of speech of the word; Inputting the word vector matrix to Bi-LSTM to obtain a text feature vector of each word in the word segmentation list; Obtaining a text classification result of the input text according to a text feature vector of each word inthe word segmentation list. By implementing the method of the embodiment of the invention, the accuracy of the text classification can be improved.
Owner:深圳市牛鼎丰科技有限公司

Cross-model face feature vector conversion system and method

The invention discloses a cross-model face feature vector conversion system and method. Through a network training module, input samples in a training data set are input into a network model for training to obtain the trained network model. The network model is composed of a convolution network part and a deformation attention mechanism network part. A source feature vector is input into the convolution network part, and the depth convolution feature is obtained; a face image is input into the deformation attention mechanism network part and the depth feature of the face image is extracted; and then, the depth convolution features are spliced and fused with the depth features of the face image, and a feature map with stronger expressive ability is obtained and classified and converted. According to the method, the conversion relationship between the source feature vector and the target feature vector is learned by establishing the double-branch deep neural network, and meanwhile, the human face image is used as assistance, so that the problem of information loss in the main task training process is improved, the generalization of the network model is improved, and the conversion success rate is effectively improved.
Owner:成都东方天呈智能科技有限公司

Resume information extraction method and system

The invention relates to a resume information extraction method and system. The resume information extraction method comprises the steps: A, obtaining resume data; B, converting the resume data into resume texts by utilizing a BERT Chinese pre-training model and a data augmentation technology, and classifying the resume texts according to sentence characteristics of the resume texts; C, conductingnamed entity recognition on the classified resume text sentences through a BERT + BiGRU + CNN + CRF model, and then extracting needed information elements; and D, storing the extracted information elements in a database, and outputting corresponding information in a structured manner. The resume information extraction system is mainly composed of a resume acquisition module, an input module, a classification module, an information element extraction module, a storage module and an output module. The resume information extraction method uses an incremental learning method, and uses a clause mode in data preprocessing of the classification model, so that the language model can adjust parameters by incrementally inputting new training data on the basis of inheriting past parameters, and theresume information extraction method has better continuity and generalization ability.
Owner:DONGGUAN UNIV OF TECH +1

Storage file and network data flow encryption communication detection method and system

The invention discloses a storage file and network data flow encryption communication detection method and system, and the method comprises the steps: carrying out the byte conversion of to-be-detected data, and obtaining a two-dimensional gray image corresponding to the to-be-detected data, wherein the to-be-detected data is storage file data or network data stream data; inputting the two-dimensional grayscale picture into a pre-trained encrypted communication detection model, and outputting a result of whether the to-be-detected data is encrypted or not, wherein the encrypted communication detection model comprises a feature extraction module and a feature mapping module, the feature extraction module is used for carrying out feature extraction on a two-dimensional grayscale image to obtain an information correlation feature map of the two-dimensional grayscale image, and the feature mapping module is used for carrying out feature mapping on the information correlation feature map toobtain a result of whether the to-be-detected data is encrypted or not. According to the method, the network input problem can be solved, automatic feature extraction is realized, whether the data isencrypted or not is judged, the classification precision is improved while complex feature extraction is avoided, and the method and system are suitable for various types of data.
Owner:NANHAI RES STATION OF INST OF ACOUSTICS CHINESE ACADEMY OF SCI

SAR image ship target detection method and system in passive interference scene

The invention discloses an SAR image ship target detection method and system under a passive interference scene. The method comprises the steps: dividing an SAR image ship target detection data set under a passive interference condition into a training sample and a test sample; constructing a Faster R-CNN detection model with attention enhancement; constructing an overall loss function of a region-of-interest classification task, a region-of-interest position regression task, a target classification task, a target position regression task and a target region saliency detection task of the attention-enhanced Faster R-CNN detection model; training the attention-enhanced Faster R-CNN detection model added with the overall loss function L by using the training sample; inputting the test sample into the attention-enhanced Faster R-CNN detection model containing the weight parameters after training, predicting and outputting a target bounding box position, a target category and a confidence score of the target in the test sample, and completing SAR image ship target detection. According to the method, the SAR image ship target detection precision in the passive interference scene is improved.
Owner:XIDIAN UNIV

Spectral super-resolution adaptive weighted attention mechanism deep network data processing method

The invention belongs to the technical field of hyperspectral image processing, and discloses a spectral super-resolution adaptive weighted attention mechanism deep network data processing method, which comprises the steps of selecting two spectral reconstruction challenge data sets for verification; evaluation index selection: using a root mean square error RMSE and an average relative absolute value error MRAE as evaluation indexes, and then calculating the MRAE and the RMSE; constructing an adaptive weighted channel attention mechanism module; constructing a second-order non-local module based on partitioning; and constructing an adaptive weighted attention mechanism network and training. According to the invention, a brand-new adaptive weighted attention mechanism network is designed for spectral super-resolution, and a backbone network is formed by stacking a plurality of double-residual modules which realize double-residual learning through long and short connection. And a self-adaptive weighted channel attention module is embedded in the double-residual module, so that the channel characteristic response is recalibrated, and the characteristic expression capability of the network is enhanced.
Owner:XIDIAN UNIV

Rapid multi-scale estimation target tracking method related to re-detection

The invention provides a rapid multi-scale estimation target tracking algorithm related to depth characteristics and re-detection. The characteristics of the target are represented through a deep learning method, and the characteristic expression capability of the target is improved. In the tracking stage, when characteristics of image blocks with different scales are extracted, through PCA dimension reduction, the calculated amount can be reduced, and the overall calculation speed is increased. On the basis of two discrimination indexes, namely a peak sidelobe ratio (PSR) and a confidence coefficient smooth constraint (SCCM), a new detection index is provided, so that the tracking reliability of the current frame can be more accurately measured. If the reliability of the current frame isrelatively low, a series of target candidate boxes are generated through an Edgeboxes method so as to carry out re-detection.
Owner:NANJING UNIV OF INFORMATION SCI & TECH

Multi-task learning deep network model training and target detection method and device

The invention provides a multi-task learning deep network model training and target detection method and device, and relates to the technical field of computer vision and deep learning. The specific implementation scheme is as follows: obtaining training data of multi-task learning in a target detection scene; inputting the image sample into a backbone network in a multi-task learning deep network model based on Anchor-Free to obtain a feature map output by the backbone network; inputting the feature map into a feature pyramid network in a multi-task learning deep network model to obtain a multi-scale feature map output by the feature pyramid network; inputting the multi-scale feature map into a Head network in a multi-task learning deep network model to learn each task, and obtaining a prediction result corresponding to each task output by the Head network; and training a multi-task learning deep network model according to the prediction result corresponding to each task output by the Head network and the label on the image sample corresponding to each task. According to the invention, the representation capability of the network can be improved.
Owner:BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products