Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

61 results about "Multimedia information retrieval" patented technology

Multimedia information retrieval (MMIR or MIR) is a research discipline of computer science that aims at extracting semantic information from multimedia data sources. Data sources include directly perceivable media such as audio, image and video, indirectly perceivable sources such as text, semantic descriptions, biosignals as well as not perceivable sources such as bioinformation, stock prices, etc. The methodology of MMIR can be organized in three groups...

Spatial browsing approach to multimedia information retrieval

A three dimensional user interface allows browsing of a database displayed as a three dimensional information space. The data is organized along three axes. A current plane or layer of data is summarized on an information landscape, with different planes being selectable using a tower that is located at the intersection of the three axes. A control wall with incorporated tools is used to formulate database queries. A preview wall previews selected data. The preview wall also provides transition between previewing the searches in the 3D space and actual viewing of the programming in full screen 2D display. Application to TV programming data is shown.
Owner:US PHILIPS CORP

Text-based query expansion and sort method in image retrieval

InactiveCN101901249AGuaranteed a high degree of commonalityImprove accuracySpecial data processing applicationsData setImage retrieval
The invention belongs to the field of multimedia information retrieval and relates to a method for realizing thesaurus-based query expansion and sort in image retrieval. The method comprises a WordNet-based English word semantic similarity metric algorithm, a HowNet-based Chinese word semantic similarity metric algorithm, an expansion rule-based query expansion word selection and optimization algorithm and a retrieval result evaluation and optimization algorithm. In the method, an image search engine is improved by the relevant text processing method and the relevant semantic network dictionary; and the retrieval result is sorted through semantic expansion, user interaction and improved similarity measurement. Compared with the traditional method, the method has the advantages of high accuracy rate, high integrality and low space-time cost. The method has very important significance for performing high-efficiency image retrieval according to image high-layer semantic information and on the basis of a large-scale image data set, and has wide application value in the field of cross-linguistic and cross-media retrieval.
Owner:FUDAN UNIV

Meta-descriptor for multimedia information

Multimedia information retrieval is performed using meta-descriptors in addition to descriptors. A “descriptor” is a representation of a feature, a “feature” being a distinctive characteristic of multimedia information, while a “meta-descriptor” is information about the descriptor. Meta-descriptors are generated for multimedia information in a repository (10, 12, 14, 16, 18, 20, 22, 24) by extracting the descriptors from the multimedia information (111), clustering the multimedia information based on the descriptors (112), assigning meta-descriptors to each cluster (113), and attaching the meta-descriptors to the multimedia information in the repository (114). The multimedia repository is queried by formulating a query using query-by-example (131), acquiring the descriptor / s and meta-descriptor / s for a repository multimedia item (132), generating a query descriptor / s if none of the same type has been previously generated (133, 134), comparing the descriptors of the repository multimedia item and the query multimedia item (135), and ranking and displaying the results (136, 137).
Owner:KONINKLIJKE PHILIPS ELECTRONICS NV

Deep learning-based freehand sketch image retrieval method

The invention belongs to the technical field of multimedia information retrieval, and specifically discloses a deep learning-based freehand sketch image retrieval method. According to the method, and edge contour detection technology and a non-maximum value suppression technology are utilized to realize the conversion from colored images to similar sketch images, a deep learning technology is utilized to construct distinguishing feature expressions for querying deep features of sketches and similar sketches, the deep features fuse the high-level semantic features and low-level visual features of images, and the deep features are more distinguishing in sketch retrieval. Through deeply mining visual information of a first retrieval result, uncorrelated images placed at the front in the retrieval result are rejected and a more correlated result is returned to the users. The method is high in correctness and strong in adaptability. On the basis of large-scale image data, the method is significant in carrying out efficient image retrieval which considers semantic information of sketches, so that the influences of fuzziness of freehand sketches can be decreased, the retrieval correlation can be improved and the user experience can be enhanced; and the method has an extensive application value in the field of multimedia image retrieval.
Owner:FUDAN UNIV

Commodity image category forecasting method based on online shopping platform

InactiveCN103345645ASimplify the online shopping processImprove experienceCharacter and pattern recognitionFeature extractionEngineering
The invention belongs to the technical field of multimedia information searching, and particularly relates to a commodity image category forecasting method based on an online shopping platform. The commodity image category forecasting method mainly involves six modules and comprises corresponding algorithms, namely training image obtaining, image characteristic extracting, irrelevant image filtering, image characteristic training, multilevel image classifying and relevant image selecting. According to the commodity image category forecasting method, based on real data obtained from the online shopping platform, commodity category information in images can be automatically analyzed through large-scale data training, shopping guide can be provided for a user, and therefore online shopping procedures can be simplified for the user, user experience is enhanced, and the commodity image category forecasting method has broad application value in the field of image searching.
Owner:SHANGHAI JILIAN NETWORK TECH CO LTD

Cross-media similarity measures through trans-media pseudo-relevance feedback and document reranking

A multimedia information retrieval system includes a storage and an electronic processing device. The latter is configured to perform a process including: computing values of a pairwise similarity measure quantifying pairwise similarity of documents of a multimedia reference repository; storing the computed values in the storage; performing an initial information retrieval process respective to the multimedia reference repository to return a set of initial repository documents; and identifying a set of top ranked documents of the multimedia reference repository based at least on the stored computed values pertaining to the set of initial repository documents.
Owner:XEROX CORP

System and method for multimedia information retrieval

A system and method for information retrieval are disclosed. The method includes querying a multimedia collection with a first component of a multimedia query (e.g., a text-based part of the query) to generate first comparison measures between the first component of the query and respective objects in the collection for a first media type (e.g., text). The multimedia collection is queried with a second component of the multimedia query (e.g., an image-based part of the query) to generate second comparison measures between the second component of the query and respective objects in the collection for a second media type (e.g., visual). An aggregated score for each of a set of objects in the collection is computed, based on the first comparison measure and the second comparison measure for the object. This includes applying an aggregating function to the first and second comparison measures in which a first confidence weighting is applied to the first comparison measure and a second confidence weighting is applied to the second comparison measure. The first confidence weighting is independent of the second comparison measure. The second confidence weighting is dependent on the first comparison measure. Information based on the aggregated scores is output.
Owner:XEROX CORP

Trademark identification searching method for multiple combined contents

The invention belongs to the field of multimedia information searching, and particularly discloses a trademark identification searching method for multiple combined contents. The trademark identification searching method for the multiple combined contents aims to overcome the defect that identification result errors are large in the prior art. According to the trademark identification searching method for the multiple combined contents, first, each trademark picture in a model data base is cut to obtain a character part and a figure part, and then characteristic information of two parts are respectively extracted, and characteristic data of all pictures and characters are respectively merged to generate a picture characteristic data base and a character characteristic data base; secondly, trademark image characteristics match with characteristics in characteristic data bases according to a similarity measurement mechanism of images of the multiple combined contents, the similarity distance between a target picture and each trademark model is worked out to obtain a primary identification and searching result, and the result is input to a user; afterwards, second processing is conducted on the primary identification result through a user feedback mechanism, and a final identification and searching result is obtained.
Owner:XIAN TECH UNIV

Image segmentation method based on annotated image learning

The invention provides an image segmentation method based on an annotated image learn. The method comprises two processes of: 1, learning an annotated training sample, namely segmenting the training image, performing scene classification on the training image, and establishing connection between the annotated words and the segmentation region on a special scene; and 2, determining the annotated words of the region to be segmented according to a model parameter acquired by learning in the process 1, performing information fusion according to the annotated information of the region and finishing segmentation. According to the method, the image segmentation and the identification process are fused by learning the annotated image; the annotated words serve as connecting link of the image segmentation and object identification; connection is established between low-grade visual stimulation and the annotated words representing high-grade semantic information to guide the image segmentation process, so that the cognitive ability of the image segmentation result is improved. The method can be directly applied to the actual application fields such as automatic image annotation, computer-aided diagnosis of a medical image, segmentation and classification of remote sensing images, multimedia information retrieval and the like.
Owner:三亚哈尔滨工程大学南海创新发展基地

Video information retrieval

A video information retrieval system comprising a client system having a request issuer for issuing a search request in respect of desired video material; and a video accessor for accessing video material on the basis of a uniform resource locator (URL) and a SMPTE unique material identifier (UMID). The retrieval system also comprising a server system having access to one or more databases containing metadata information relating to a plurality of video material items, a UMID associated with each video material item and at least one URL associated with each UMID. A receiver is provided for receiving a search request from the client system and detecting one or more video material items for which metadata information stored in at least one of the database(s) substantially corresponds to the search request. An information supplier supplies the metadata information, the URL and the UMID relating to the one or more detected video material items to the client system. The server system has at least one video repository having: a video storage arrangement storing video material and associated UMID data. The metadata, the URL and the UMID are communicated between the server and the client using a markup language having descriptors for data content.
Owner:SONY UK LTD

Cross-medial personage news searching method and system capable of fusing multi-mode information

The invention belongs to the technical field of multi-media information searching and news searching, and particularly relates to a cross-medial personage news searching method and system capable of fusing multi-mode information. The searching method includes the steps of obtaining multi-mode news information on the internet, extracting names of news figures to obtain textural features of news, extracting facial images of the news figures to obtain image features of the news, conducting network information supplement on the rarely-seen news figures, conducting name-image alignment cluster learning on the news figures, and achieving searching for faces of the figures and searching for the names of the figures. The searching system comprises six modules corresponding to the steps of the searching method respectively. The cross-medial personage news searching method and system capable of fusing the multi-mode information can well solve the problem of network news name-face alignment and solve the problem of searching for personage news accordingly. The two problems have significance in the multi-medium information searching field and the news searching field, and the cross-medial personage news searching method and system capable of fusing the multi-mode information have wide application value.
Owner:FUDAN UNIV

Binocular stereoscopic vision image feature extraction method combining shape and color

A binocular stereoscopic vision image feature extraction method combining shape and color belongs to the field of data processing of multimedia information retrieval, intelligent information processing, data mining and the like, and overcomes the defects of binocular stereoscopic vision image feature extraction is not high in precision and high in complexity. The method includes the steps of: performing depth map extraction on a binocular stereoscopic vision image through stereo matching; performing contour shape feature extraction on a binocular stereoscopic vision depth image based on a window of a selected size and performing dimensionality reduction; using a sliding window detection method to perform feature extraction on a complete depth image and performing dimensionality reduction; performing color feature extraction on a binocular stereoscopic vision left image to from histogram features; and performing Gaussian normalization on contour shape features and color features, thereby realizing multi-feature fusion binocular stereoscopic vision image feature extraction. The binocular stereoscopic vision image feature extraction method combining shape and color can obtain accurate and low-dimensional binocular stereoscopic vision image features, and can be well applied to content-based indexing and retrieval of related resources.
Owner:COMMUNICATION UNIVERSITY OF CHINA +1

Relevant information retrieval in record management systems

A record management system retrieves relevance information through an information retrieval model that models relevance between users, queries, and records based on user interaction data with records. Relevance information between different elements of the record management system are determined through a set of learned transformations in the information retrieval model. The record management system can quickly retrieve relevance information between different elements of the record management system given the set of learned transformations in the information retrieval model, without the need to construct separate systems for different types of relevance information. Moreover, even without access to contents of records, the record management system can determine relevant records for a given query based on user interaction data and the determined relationships between users, queries, and records learned through the information retrieval model.
Owner:SALESFORCE COM INC

Method for fuzzy logic rule based multimedia information retrival with text and perceptual features

A search system (200) for a database (224) including records having a multiple disparate types of media is provided. The search system supports queries, that include different types of search criteria, including content based retrieval search criteria. A fuzzy logic method (400) is provided for effectively combining the results of different types of search criteria. The fuzzy logic method also allows confidence levels entered by the user for search criteria to be considered in combining results. Retrieval relevance values for documents for at least some search criteria are used in the fuzzy logic method. For content based image retrieval searches, the retrieval relevance values are computed by mapping a distance between quantitative characterizations of a search basis image, and other images into a finite range.
Owner:GOOGLE TECH HLDG LLC

Hypervideo: information retrieval using time-related multimedia:

Disclosed is a method and device for selecting documents, such as Web pages or sites, for presentation to a user, in response to a user expression of interest, during the course of presentation to the user of a document, such as a video or audio selection, whose content varies with time. The method takes advantage of information retrieval techniques to select documents related to the portion of the temporal document in which the user has expressed interest. The method generates the search query to use to select documents by reference to text associated with the portion of the temporal document in which the user has expressed interest, as by using the closed caption test associated with the video, or by using speech recognition techniques. The method further uses a weighting function to weigh the terms used in the search query, depending on their temporal relationship to the user expression of interest.
Owner:VERIZON LAB

Multimedia information retrieval method, program, record medium and system

InactiveUS7167823B2Highly accurate and efficient retrieval of necessary informationEasy to masterData processing applicationsDigital data processing detailsVirtual spaceMultimedia information retrieval
Paired image information and text information correlated to each other are retrieved as information sets. Frequency information on words used in text is extracted from text information in a group of information sets, and text information features are extracted based on frequency information. Text features are used to lay out information sets in a virtual space such that similar pieces of text are located close to each other, and images are displayed at those positions. Further, important words are extracted from those words extracted from text information in a group of information sets, and those words are laid out in the virtual space in the same manner as with information sets and displayed as labels.
Owner:FUJITSU LTD

Zero-sample sketch image retrieval method and system based on graph convolutional neural network

The invention belongs to the technical field of multimedia information retrieval, and particularly relates to a zero-sample sketch image retrieval method and system based on a graph convolutional neural network. The zero-sample sketch image retrieval system architecture provided by the invention comprises three important components: a feature coding network, a semantic maintenance network and a semantic reconstruction network. The method comprises the steps ofextracing sketches and image visual features through a feature extraction network; processing the visual information of the sketch and the image and the label semantic information of the sketch and the image at the same time through a graph convolution network; establishing a relationship between unseen categories and seen categories;enhancing the generalization ability of the model through a semantic reconstruction network;and finally, the model taking the sketch of which the category is not seen as an input and performing retrieval to find an image similar to the sketch. According to the invention, the variational auto-encoder is adopted to generate semantic information from visual information, so that the generalization ability of the model is further enhanced.
Owner:FUDAN UNIV

Multimedia information searching method and system

The invention discloses a multimedia information searching method and system. The method includes the steps of extracting feature data of current multimedia information, obtaining a feature bit vector of the current multimedia information according to the extracted feature data, segmenting the feature bit vector of the current multimedia information to obtain k sub-vectors of the current multimedia information, determining a candidate set corresponding to each sub-vector according to each sub-vector of the current multimedia information, finding out a feature bit vector, corresponding to each vector identification in each obtained candidate set, in a multimedia feature database, calculating hamming distances between the feature bit vector of the current multimedia information and the found feature bit vectors, and outputting the multimedia information, corresponding to the feature bit vectors with the hamming distances conforming to a preset condition, as a search result. Through the method and due to the fact that a segmentation index structure is set up to search the feature bit vectors, the searching speed of the multimedia information and the searching efficiency of the multimedia information can be greatly improved.
Owner:新浪技术(中国)有限公司

Method and system for editing a multimedia message

The present invention provides a method for creating a multimedia information file. At first, obtaining an initial file that is based on the markup language and includes at least two objects, then, receiving a user's choice of one of said at least two objects, and fmally, marking the selected object as one being recommended, making it be preferentially recommended when editing the created file in future. This invention also provides a method for editing a multimedia information file that includes objects with recommended editing marks. This invention can help a common user of mobile phone easily find a object he wants to be modified from many objects in a multimedia information file according to recommended objects. Therefore, this invention facilitates greatly common users of mobile phone to handle multimedia information.
Owner:KONINKLIJKE PHILIPS ELECTRONICS NV

Method and system for adsorbing multimedia information through track points

The invention relates to a method and system for adsorbing multimedia information through track points, and belongs to the technical field of computer geographical information systems. In an existing geographical information system, generally speaking, track point information can only show the movement attribute of a target. The method includes the following steps: 1, responding to a map operation to obtain query conditions; 2, querying all track points of the target according to the query conditions, and drawing a track on a map; 3, querying all the related multimedia information according to the coordinates and geographical radii of the track points; 4, classifying and aggregating the related multimedia information; 5, hooking the multimedia event information to a time axis according to the sequence of occurrence time. By adopting the method and system, the related information such as videos, voices, pictures and texts around the track points of the target can be conveniently and rapidly shown on the electronic map, and then information of the track points of the target can be closely and sequentially related to the multimedia information of events occurring around the track points of the target.
Owner:方正国际软件(北京)有限公司 +1

Deep learning-based sketch retrieval method

The invention discloses a deep learning-based sketch retrieval method, and relates to the technical field of multimedia information retrieval. The method comprises the steps of firstly calculating a conventional picture edge probability graph and obtaining a feature descriptor of the edge probability graph to realize conversion from a color conventional graph to a similar freehand image; establishing a feature library required for freehand image retrieval through a convolutional neural network; and extracting depth features of edge graphs of different levels of the similar freehand image and adrawn sketch by utilizing a deep learning technology to perform similarity matching. By providing new feature extraction and matching methods, the sketch drawn by a user can be understood more accurately; the method has high accuracy and high adaptability; the influence of fuzziness of the freehand sketch can be reduced; the retrieval correlation is improved; the user experience is enhanced; andthe method has wide application values in the field of multimedia image retrieval.
Owner:GUANGDONG SANWEIJIA INFORMATION TECH CO LTD

System and method for multimedia information retrieval

A system and method for information retrieval are disclosed. The method includes querying a multimedia collection with a first component of a multimedia query (e.g., a text-based part of the query) to generate first comparison measures between the first component of the query and respective objects in the collection for a first media type (e.g., text). The multimedia collection is queried with a second component of the multimedia query (e.g., an image-based part of the query) to generate second comparison measures between the second component of the query and respective objects in the collection for a second media type (e.g., visual). An aggregated score for each of a set of objects in the collection is computed, based on the first comparison measure and the second comparison measure for the object. This includes applying an aggregating function to the first and second comparison measures in which a first confidence weighting is applied to the first comparison measure and a second confidence weighting is applied to the second comparison measure. The first confidence weighting is independent of the second comparison measure. The second confidence weighting is dependent on the first comparison measure. Information based on the aggregated scores is output.
Owner:XEROX CORP

Recommendation method and device for multimedia information

The invention discloses a recommendation method and device for multimedia information and belongs to the technical field of Internet. The recommendation method comprises: determining first multimedia classes interested by a first user according to a historical click action of the multimedia information; determining second multimedia classes which are potentially interested and are not historically clicked by the first user according to each first multimedia class of the first user and a plurality of historical click actions on the multimedia information by the second user; determining a target multimedia class according to the first multimedia classes interested by the first user and the second multimedia classes which are potentially interested by the first user; and recommending the multimedia information of the target multimedia class. When the information is recommended, the determined target multimedia class is more accurate by realizing the multimedia classes of the multimedia information historically clicked by the first user and realizing the multimedia classes which are potentially interested by the first user, so that the clicking rate of the recommended multimedia information can be improved and resources and financial resources can be saved.
Owner:SHENZHEN TENCENT COMP SYST CO LTD

Network for describing multimedia information

A method and apparatus for encoding knowledge using a multimedia network. A multimedia network represents semantic concepts and their relations using multimedia content. A multimedia network associates words and multimedia content with the semantic concepts in order to illustrate and exemplify the semantic concepts as well as describe lexical, semantic, and perceptual relations. The multimedia network can be searched, browsed, or summarized for purposes of information discovery. The multimedia network can also be used for personalizing multimedia content or for querying a multimedia information repository by expanding a query to include related concepts encoded in a Multimedia network.
Owner:IBM CORP

Multimedia classification recommendation method, apparatus and system

InactiveCN105843922ARecommend humanizationFit behaviorSpecial data processing applicationsComputer scienceMood state
The present invention discloses a multimedia classification recommendation method, apparatus and system. The method comprises: receiving state information uploaded by a client; matching the state information with a preset state tag, and determining one or more state tags matched with the state information; performing a multimedia information retrieval according to the one or more state tags, so as to obtain multimedia information corresponding to the one or more state tags; and pushing the multimedia information corresponding to the one or more state tags to the client. A current behavior state and a mood state such as a metal condition of a user are analyzed by means of the state information, and a video that is suitable for being watched under the current state is pushed, so as to make video recommendation more personalized and more coincide with the behavior and internal feelings of the user.
Owner:LETV HLDG BEIJING CO LTD +1

Interactive type image retrieval method of combining user evaluation and labels

InactiveCN103164539AImprove retrieval accuracyAvoid the time-consuming and labor-intensive problems of manual labelingSpecial data processing applicationsImage retrievalSelection system
The invention discloses an interactive type image retrieval method of combining user evaluation and labels, and belongs to the field of multimedia information retrieval. The method utilizes a comprehensive retrieval method based on the combination of physical characteristics of images and text. In the process of retrieval, a user is allowed to carry out text information description on query images or select keywords provided by a system. By carrying out relevant evaluation of 'satisfied' or 'unsatisfied' on retrieval results, an image retrieval system automatically carry out text marks on relevant satisfied images which are marked by the user to form high-level semantic information. Along with constant use of the user, the system can generate a rich semantic information database. Difference of different users to the text marks of the same image and difference of the same user to the text marks of the same image in different times are considered, and reliability of the users is combined in the process of generating the semantic information database. In retrieval, the comprehensive retrieval method based on the combination of the characteristics and the texts is utilized to carry out retrieval on the query images with semantic information, so that accuracy of the retrieval results is improved. The interactive type image retrieval method of combining the user evaluation and the labels has the advantages of being high in efficiency, high in accuracy and friendly in interactive mode.
Owner:COMMUNICATION UNIVERSITY OF CHINA

Image retrieval method

The invention discloses an image retrieval method, and belongs to the field of intelligent information processing such as multimedia information retrieval, mode identification and the like. A correctly-matched correlated image is obtained by using geometric verification after initial retrieval, weight adjustment is performed on document vectors of the correlated image and an inquiry image to construct a new inquiry vector so as to obtain extended inquiry, and new retrieval is performed to obtain a retrieval result. According to the method, weights of implicit visual words existing in the correlated image are added in the inquiry vector, so that the weights of the same visual words in the inquiry image and the correlated matched image are increased, and the retrieval efficiency is increased greatly.
Owner:COMMUNICATION UNIVERSITY OF CHINA +1

Method for reordering image or video search

The invention discloses a method for reordering image or video search, which relates to the field of multimedia-oriented information retrieval. The method comprises the following steps of grading an image sample set into a grade A, a grade B and a grade C according to the degrees of inquiry topic relevance; constructing a relevance diagram, an irrelevance diagram and a global diagram; acquiring a relevance divergence, an irrelevance divergence and a global divergence; constructing a target function according to the relevance divergence, the irrelevance divergence and the global divergence, and acquiring a novel characteristic vector of an image sample; inputting the novel characteristic vector of a marked image sample serving as a training set into a training model to obtain a trained ordering model; and ordering the image sample through the trained ordering model and outputting an ordering result. The invention discloses a dimensionality reduction method which belongs to the field relevant to multimedia retrieval and ordering. According to the method, specific properties of data are utilized fully on the premise of limited monitoring information, the ordering performance can be improved by effectively utilizing a small number of marks, and searching precision is increased.
Owner:深圳市点维文化传播有限公司

Image-text cross-modal retrieval method, system and device and storage medium

The invention discloses an image-text cross-modal retrieval method, system and device and a storage medium, and the method comprises the following steps: obtaining to-be-retrieved character / picture data, and obtaining similarity information by combining the character / picture data and a retrieval model trained by adopting a combined loss function; and obtaining corresponding picture / text data according to the similarity information. According to the invention, the to-be-retrieved character / picture data is input into the retrieval model to calculate the similarity matrix, corresponding picture / text data is obtained according to the similarity matrix; due to the fact that the retrieval model is trained through the combined loss function, the closer distance between the relevant pictures and the relevant texts can be kept, the retrieval model is far away from irrelevant data, the retrieval accuracy between the pictures and the relevant texts is greatly improved, and the method can be widely applied to the technical field of multimedia information retrieval.
Owner:SOUTH CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products