Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

1691results about "Still image data querying" patented technology

Directional impression analysis using deep learning

Systems and techniques are provided for detecting gaze direction of subjects in an area of real space. The system receives a plurality of sequences of frames of corresponding fields of view in the real space. The system uses sequences of frames in a plurality of sequences of frames to identify locations of an identified subject and gaze directions of the subject in the area of real space over time. The system includes logic having access to a database identifying locations of items in the area of real space. The system identifies items in the area of real space matching the identified gaze directions of the identified subject.
Owner:STANDARD COGNITION CORP

Wearable apparatus and methods for processing image data

A wearable apparatus and method are provided for processing images including product descriptors. In one implementation, a wearable apparatus for processing images including a product descriptor is provided. The wearable apparatus includes a wearable image sensor configured to capture a plurality of images from an environment of a user of the wearable apparatus. The wearable apparatus also includes at least one processing device programmed to analyze the plurality of images to identify one or more of the plurality of images that include an occurrence of the product descriptor. Based on analysis of the one or more identified images, the at least one processing device is also programmed to determine information related to the occurrence of the product descriptor. The at least one processing device is further configured to cause the information and an identifier of the product descriptor to be stored in a memory.
Owner:ORCAM TECH

Joint Embedding for Item Association

Methods and systems to associate semantically-related items of a plurality of item types using a joint embedding space are disclosed. The disclosed methods and systems are scalable to large, web-scale training data sets. According to an embodiment, a method for associating semantically-related items of a plurality of item types includes embedding training items of a plurality of item types in a joint embedding space configured in a memory coupled to at least one processor, learning one or more mappings into the joint embedding space for each of the item types to create a trained joint embedding space and one or more learned mappings, and associating one or more embedded training items with a first item based upon a distance in the trained joint embedding space from the first item to each said associated embedded training items. Exemplary item types that may be embedded in the joint embedding space include images, annotations, audio and video.
Owner:GOOGLE LLC

Image search method and image search device

The invention discloses an image search method and an image search device. The image search method comprises the steps of in allusion to images in an image search base, generating a label corresponding to each image according to corresponding description information of each image, saving the generated label and congruent relationships of the images, when image search is carried out, according to a received image search request, obtaining description information of a to-be-searched image which is included in the request, a label corresponding to the to-be-searched image is generated according to the description information, wherein the mode of generating the label corresponding to the to-be-searched image and the mode of generating the label corresponding to each image in the image search base are the same, further determining an image which corresponds to the label of the to-be-searched image according to the congruent relationships between the saved labels and the images, and sending the determined image to a sending end of the image search request. The image search technology based on the labels is capable of using an existing text search engine, and therefore use ratio of server resources is improved.
Owner:ALIBABA GRP HLDG LTD

Image retrieval method based on multi-task hash learning

The invention discloses an image retrieval method based on multi-task hash learning. Firstly, the deep convolutional neural network model is determined. Secondly, the loss function is designed by using multi-task learning mechanism. Then, the training method of convolutional neural network model is determined, in combination with the loss function, and back propagation method is used to optimize the model. Finally, the image is input to the convolutionalal neural network model, and the output of the model is transformed into hash code for image retrieval. The convolutional neural network modelis composed of a convolutional sub-network and a full connection layer. The convolutional subnetwork consists of a first convolutional layer, a maximum pooling layer, a second convolutional layer, anaverage pooling layer, a third volume base layer and a spatial pyramid pooling layer. The full connection layer is composed of a hidden layer, a hash layer and a classification layer. The training method of the model includes two training methods: a combined training method and a separated training method. The method of the invention can effectively retrieve single tag and multi-tag images, and the retrieval performance is better than other deep hashing methods.
Owner:CHANGSHA UNIVERSITY OF SCIENCE AND TECHNOLOGY

Method of matching image features with reference features and integrated circuit therefor

The invention is related to a method of matching image features with reference features, comprising the steps of providing a current image captured by a capturing device, providing reference features (r), wherein each of the reference features comprises at least one reference feature descriptor (d(r)), determining current features (c) in the current image and associating with each of the current features at least one respective current feature descriptor (d(c)), and matching the current features with at least some of the reference features by determining a respective similarity measure (D(c, r)) between each respective current feature descriptor (d(c)) and each respective reference feature descriptor (d(r)). According to the invention, the determination of the similarity measure is performed on an integrated circuit by hardwired logic or configurable logic which processes logical functions for determining the similarity measure. The invention is also concerned with an integrated circuit for matching of image features with reference features.
Owner:APPLE INC

System and methods for creating a collection of images

System and method for creating a collection of images are described, the method comprising: receiving images from at least one source of images; processing the images to produce an output collection of images, the processing comprising grouping the images to clusters of related images and selecting the preferred images in the clusters; and outputting the output collection of images, the output collection of images comprising the clusters of related images and indication of the preferred images in the clusters. The system for creating a collection of images comprising: a storage medium to receive images from at least one source of images; a processor to produce an output collection of images by grouping the images to clusters of related images and selecting the preferred images in the clusters; and a collection output medium for outputting the output collection of images.
Owner:SHUTTERFLY LLC

A method and system for assisting a teacher to understand a student's learning situation

The invention provides a method and a system for assisting a teacher to understand the learning situation of a student. The method comprises the following steps: timing is started when a student starts to answer a question in each answer area, the timing is stopped until the answer is stopped in each answer area; Shooting and acquiring an answer image corresponding to each answer area after stopping answering in each answer area; Identifying the answer image, comparing the recognized answer content with the preset answer to obtain the corresponding correction result; According to each answer area corresponding to the correction results and answer time, obtaining the students' learning situation. The invention not only detaches from electronic equipment such as mobile phone and tablet computer, but also avoids students playing mobile phone or tablet computer on the grounds of learning, thus delaying learning. The invention is more in line with the student 's learning environment, assists in correcting students' homework and improves the efficiency of correcting homework. The accurate grasp of each student 's learning situation is conducive to the follow-up of each student' s accurate differentiation training, and to improving students' performance.
Owner:GUANGDONG XIAOTIANCAI TECH CO LTD

Facial recognition technology-based walkman analysis method and facial recognition technology-based walkman analysis system

The invention belongs to the technical field of security and protection and discloses a facial recognition technology-based partner analysis method. The method comprises the following steps of carrying out intelligent analysis on captured image data through an artificial intelligence deep learning algorithm; carrying out face capture on personnel passing through a set monitoring area; abstractinga face image, extracting face feature code information, comparing the extracted face feature code information with face feature codes stored in a database; wherein the feature codes with the similarity greater than a certain threshold are classified into feature codes of the same person; taking a face feature code with the highest similarity as an identification mark of a target person; based on the identification mark and the time period, retrieving the image of the personnel to acquire namely acquiring data. The method is efficient and rapid. Through the facial recognition and big data technology, the analysis accuracy is improved, and labor intensity and the labor cost are reduced.
Owner:SHENZHEN INFINOVA

Systems and methods for remembering held items and finding lost items using wearable camera systems

Apparatuses and methods are provided for storing information related to objects associated with a hand of a user via a wearable camera system. In one implementation, a wearable apparatus for storing the information is provided comprising a wearable image sensor configured to capture a plurality of images from the environment of the user, and at least one processing device programmed to process the images. The processing device may detect the hand of the user, and an object associated with the user's hand. The processing device may proceed to store information related to the object. Consistent with disclosed embodiments, the stored information may be used for various purposes, such as warning the user of dangers, catering advertising to the user, and helping the user find objects when they are lost.
Owner:ORCAM TECH

Audio processing method and device based on artificial intelligence

The invention discloses an audio processing method and device based on artificial intelligence. One concrete implement mode of the method includes the steps that an audio file to be processed is converted into an image to be processed; the content feature of the image to be processed is extracted; according to the style feature and the content feature of the image to be processed, a target image is determined, and the style feature is obtained from a template image converted from a template audio file; the target image is converted into the processed audio file. By means of the implement mode, the processed audio file has the template audio style without changing the content of the audio file to be processed, and audio processing efficiency and flexibility are improved.
Owner:BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Electronic filing system searchable by a handwritten search query

A method of providing an electronic filing system which is searchable using a handwritten search query, the method including the steps of: obtaining the handwritten search query using an input device; performing a search of at least one database based on a comparison between the handwritten search query and handwritten annotations made on interactive pages stored in the at least one database; and, providing the results of the search to a user and facilitating access to at least one interactive page identified in the results of the search. An apparatus is also disclosed. Preferably, the interactive page is provided to the user in the form of printed paper, and the handwritten annotations are user handwriting, symbols, drawings, indicia or the like.
Owner:SILVERBROOK RES PTY LTD

Methods, systems, and computer program products for resource-to-resource metadata association

The subject matter described herein includes methods, systems, and computer program products for resource-to-resource metadata association. According to one method, a source resource and a destination resource are identified. A type of at least one of the source and destination resources is identified and used to select a transform for mapping data values associated with the resource to the destination resource as metadata. Based on the transform, at least one of the data values associated with the source resource is associated with a destination resource as metadata.
Owner:SCENERA TECH

Method of matching image features with reference features and integrated circuit therefor

The invention is related to a method of matching image features with reference features, comprising the steps of providing a current image captured by a capturing device, providing reference features (r), wherein each of the reference features comprises at least one reference feature descriptor (d(r)), determining current features (c) in the current image and associating with each of the current features at least one respective current feature descriptor (d(c)), and matching the current features with at least some of the reference features by determining a respective similarity measure (D(c, r)) between each respective current feature descriptor (d(c)) and each respective reference feature descriptor (d(r)). According to the invention, the determination of the similarity measure is performed on an integrated circuit by hardwired logic or configurable logic which processes logical functions for determining the similarity measure. The invention is also concerned with an integrated circuit for matching of image features with reference features.
Owner:APPLE INC

Image-text retrieval system and method based on multi-angle self-attention mechanism

The invention belongs to the technical field of cross-modal retrieval, and particularly relates to an image-text retrieval system and method based on a multi-angle self-attention mechanism. The systemcomprises a deep convolutional network, a bidirectional recurrent neural network, an image, a text self-attention network, a multi-modal space mapping network and a multi-stage training module. The deep convolutional network is used for acquiring an embedding vector of an image region feature in an image embedding space. The bidirectional recurrent neural network is used for acquiring an embedding vector of a word feature in a text space, and the two vectors are respectively input to the image and the text self-attention network. The image and text self-attention network is used for acquiringan embedded representation of an image key area and an embedded representation of key words in sentences. The multi-modal space mapping network is used for acquiring the embedded representation of the image text in the multi-modal space. The multi-stage training module is used for learning parameters in the network. A good result is obtained on a common data set Flickr30k and an MSCOCO, and the performance is greatly improved.
Owner:FUDAN UNIV

A multi-scale Hash retrieval method based on deep learning

Image pairing information and image classification information are optimized and a Hash code quantization process is used to realize a simple and easy end-to-end deep multi-scale supervision Hash method, and meanwhile design a brand new pyramid connected convolutional neural network structure, and the convolutional neural network structure takes paired images as training input and enables the output of each image to be approximate to a discrete Hash code. In addition, the feature map of each convolution layer is trained, feature fusion is carried out in the training process, and the performance of deep features is effectively improved. A neural network is constrained through a new binary constraint loss function based on end-to-end learning, and a Hash code with high feature representationcapability is obtained. High-quality multi-scale Hash codes are dynamically and directly learned through an end-to-end network, and the representation capability of the Hash codes in large-scale image retrieval is improved. Compared with an existing Hash method, the method has higher retrieval accuracy. Meanwhile, the network model is simple and flexible, can generate characteristics with strongrepresentation ability, and can be widely applied to other computer vision fields.
Owner:SHANDONG UNIV

Zero sample sketch retrieval method based on semantic adversarial network

InactiveCN110175251AReduce intra-class varianceGuaranteed discriminabilityStill image data queryingNeural architecturesRgb imageMedical diagnosis
The invention provides a zero sample sketch retrieval method based on a semantic adversarial network, which mainly solves the problems that in the prior art, the sketch intra-class variance is larger,and the visual knowledge is difficult to migrate from a known class to a non-seen class under the zero sample setting. The method comprises the steps of obtaining a training sample set, constructinga semantic adversarial network, and extracting the RGB image features through a VGG16 network, constructing a generation network to generate the RGB image features with discriminability, inputting theto-be-retrieved sketch into a semantic confrontation network to generate the semantic features, inputting the semantic features and the random Gaussian noise into the generation network to generate the RGB image features, and searching the first 200 images most similar to the RGB image features in an image retrieval library to obtain a retrieval result. According to the method, the intra-class variance of the sketch image features is reduced, the RGB image features generated according to the sketch image in each class can be ensured, the retrieval performance of zero sample sketch retrieval is improved, and the method can be used for the electronic commerce, medical diagnosis and remote sensing imaging.
Owner:XIDIAN UNIV

Image data processing method, apparatus, computer device, and storage medium

The invention discloses an image data processing method, an apparatus, a computer device and a storage medium, which are applied in the technical field of image recognition. The method comprises the following steps: crawling an original image by a crawler tool, wherein each original image corresponds to an image type; at least one text line region is obtain by using a text position algorithm to locate that original image, and each text line region is screened to obtain a region block image; based on the image type and the position information of the region block image, the target OCR recognition model is obtained. A target OCR recognition model is adopted to recognize a region block image, and a target recognition result is obtained, the target recognition result comprises at least two recognized characters and a corresponding recognition probability; a target text is obtained based on at least two recognize characters and corresponding recognition probability, that target text is determined as a labeled text, and a target image sample is obtain based on a region block image and a labeled text. The method can effectively improve the acquisition efficiency of target image samples and reduce the acquisition cost.
Owner:PING AN TECH (SHENZHEN) CO LTD

File creation method, device, device and storage medium

The invention provides a file creation method, a device, a device and a storage medium, belonging to the technical field of image processing. The file building method comprises the following steps: clustering the captured portrait to obtain at least one clustering result; Determining a standard portrait matching the clustering result from the archival database; filing the clustering result with the standard portrait. The invention aggregates the portraits of the same person at different times or at different positions in a clustering result by performing clustering treatment on the portraits captured, a standard portrait matching the clustering result is determined from the file database; The clustering result is filed in the personal file corresponding to the standard portrait, The invention realizes the file creation of the captured portrait, saves manpower to a great extent, effectively improves the accuracy of the archives establishment, solves the work of analyzing and archiving the number of the portrait at the tens of billions level, ensures the accuracy of the archives establishment, and can be competent for large-scale file management.
Owner:BEIJING KUANGSHI TECH

Image retrieval method, device and equipment and readable storage medium

The invention discloses an image retrieval method, and the method comprises the following steps: obtaining a to-be-retrieved target image, and inputting the target image into a target deep learning model; utilizing the target deep learning model to perform feature extraction on the target image to obtain image features of the target image; wherein the image features comprise global features, localfeatures and multi-scale global features, and the multi-scale global features are features obtained by performing weighted calculation on a plurality of intermediate-stage features generated in the global feature extraction process; respectively calculating similar distances between the target image and the images in the image library by utilizing the image features according to a distance calculation rule; and determining and outputting a similar image of the target image by using the similar distance. According to the method, the image retrieval accuracy can be improved. The invention further discloses an image retrieval device and equipment and a readable storage medium which have corresponding technical effects.
Owner:SUZHOU KEDA TECH

An expression image recommendation method and device based on voice emotion recognition

The invention relates to an expression image recommendation method and device based on voice emotion recognition, an electronic device and a storage medium. The method comprises the following steps ofacquiring a plurality of latest voice messages in a current interaction window of the instant messaging software, and extracting audio feature vectors of the voice messages; matching the audio feature vector of the voice information with a plurality of emotion feature models, wherein the plurality of emotion feature models respectively correspond to one of a plurality of emotion classifications;taking the emotion classification corresponding to the matched emotion feature model as the emotion classification of the voice information; and determining one or more target expression images basedon the emotion classification of the voice information, and recommending the target expression images to the current user, so that the efficiency of selecting the expression image is improved.
Owner:刘伯涵

System and method for content-based querying using video compression format

ActiveUS6906719B2Increase the compression ratioImage can be predictedCharacter and pattern recognitionStill image data queryingDirac (video compression format)Video sequence
A visual query system, and associated method and computer program product enhance and accelerate content-based querying, and present a new image similarity measure using known or available software applications and hardware components of video compression systems. The present system encodes images as consecutive frames in a video sequence and uses the ratio between the file length of the compressed sequence and the original file length as a distance measure. The system considers the compression ratio to be an estimate of the entropy of the combined images, which can be used to estimate the amount of new information introduced from one image to the other.
Owner:BEIJING PIANRUOJINGHONG TECH CO LTD

Image-text matching method and device, storage medium and equipment

The embodiment of the invention discloses an image-text matching method and device, a storage medium and equipment, and belongs to the technical field of computers. The image-text matching method comprises the steps of obtaining an image and a text to be matched; generating a candidate instance feature set according to the image; aggregating candidate instance features in the candidate instance feature set by using a self-attention mechanism to obtain an instance feature set, each instance feature in the instance feature set corresponding to one object in the image; encoding the text to obtaina text vector; and calculating a matching degree between the image and the text according to the instance feature set and the text vector. According to the embodiment of the invention, the realization difficulty of image-text matching can be simplified, and the accuracy of image-text matching can be improved.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Wearable apparatus and method for capturing image data using multiple image sensors

A wearable apparatus and method are provided for capturing image data. In one implementation, a wearable apparatus for capturing image data is provided. The wearable apparatus includes a plurality of image sensors for capturing image data of an environment of a user. Each of the image sensors is associated with a different field of view. The wearable apparatus also includes a processing device programmed to process image data captured by at least two of the image sensors to identify an object in the environment. The processing device is also programmed to identify a first image sensor, which has a first optical axis closer to the object than a second optical axis of a second image sensor. After identifying the first image sensor, the processing device is also programmed to process image data from the first image sensor using a first processing scheme, and process image data from the second image sensor using a second processing scheme.
Owner:ORCAM TECH

Method and device for searching for first image through second image

The invention relates to the field of image processing technology, in particular to a method and a device for searching for a first image through a second image. The method and the device are used forsolving the problem that a search result error is large when images are searched for through images in the prior art. According to the method for searching for the first image through the second image in the embodiment, image features of a local region image in the first image and image features of the second image can be extracted based on a full-convolution twin neural network model; a score chart is generated based on the extracted image features; region blocks, associated with the local region image, in the score chart are determined according to a sum of scores of points in a designatedquantity in the score chart; and when it is judged that the similarity between the image features, corresponding to the determined region blocks, in the second image and the image features of the local region image is greater than a set similarity threshold value, it is determined that the second image is an image similar to the first image.
Owner:ZHEJIANG DAHUA TECH CO LTD

Small sample learning algorithm based on covariance measurement

The invention discloses a small sample learning algorithm based on covariance measurement, and belongs to the field of computer vision. The algorithm comprises the steps: (1) introducing an interpolation training mechanism to learn migratable knowledge; (2) designing a local covariance representation, and then embedding the local covariance representation into a deep network to learn and express each concept; (3) constructing a covariance measurement layer to measure the distribution consistency between a query sample and the concepts based on the local covariance representation. According tothe method, a novel and concise end-to-end covariance measurement network CovaMNet is provided, a second-order local covariance representation is designed to replace traditional first-order concept representation, and a new covariance measurement function is provided. A comparison experiment result on a plurality of reference data sets is analyzed to obtain that the CovaMNet framework provided bythe invention shows a competitive effect on a general small sample classification task and a fine-grained small sample classification task.
Owner:NANJING UNIV +1

Video image data retrieval method and device, apparatus and storage medium

The invention relates to a video image data retrieval method and device, an apparatus and a storage medium. The method comprises the steps of obtaining a picture retrieval database and a training database; performing clustering training on the feature data in the training database to generate a preset number of data buckets, and determining a clustering center of each data bucket; calculating thedistance between each piece of feature data in the picture retrieval database and each clustering center, and adding each piece of feature data in the picture retrieval database into the correspondingdata bucket according to a first distance rule to determine an inverted index table; calculating the distance between the feature matrix of the to-be-retrieved picture and each clustering center, anddetermining a target data bucket according to a second distance rule; and based on the inverted index table, calculating the distance between the feature vector matrix of the to-be-retrieved pictureand the clustering center of the target data bucket, and determining a picture similar to the to-be-retrieved picture as a retrieval result according to a retrieval rule, so that the performance and the efficiency of the video image data retrieval are improved.
Owner:四川东方网力科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products