Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

845 results about "Inverted index" patented technology

In computer science, an inverted index (also referred to as a postings file or inverted file) is a database index storing a mapping from content, such as words or numbers, to its locations in a table, or in a document or a set of documents (named in contrast to a forward index, which maps from documents to content). The purpose of an inverted index is to allow fast full-text searches, at a cost of increased processing when a document is added to the database. The inverted file may be the database file itself, rather than its index. It is the most popular data structure used in document retrieval systems, used on a large scale for example in search engines. Additionally, several significant general-purpose mainframe-based database management systems have used inverted list architectures, including ADABAS, DATACOM/DB, and Model 204.

Method and apparatus for identifying, extracting, capturing, and leveraging expertise and knowledge

The invention comprises a set of complementary techniques that dramatically improve enterprise search and navigation results. The core of the invention is an expertise or knowledge index, called UseRank that tracks the behavior of website visitors. The expertise-index is designed to focus on the four key discoveries of enterprise attributes: Subject Authority, Work Patterns, Content Freshness, and Group Know-how. The invention produces useful, timely, cross-application, expertise-based search and navigation results. In contrast, traditional Information Retrieval technologies such as inverted index, NLP, or taxonomy tackle the same problem with an opposite set of attributes than what the enterprise needs: Content Population, Word Patterns, Content Existence, and Statistical Trends. Overall, the invention emcompasses Baynote Search—a enhancement over existing IR searches, Baynote Guide—a set of community-driven navigations, and Baynote Insights—aggregated views of visitor interests and trends and content gaps.
Owner:MONETATE INC

File system with access and retrieval of XML documents

An XML-aware file system exploits attributes encoded in an XML document. The file system presents a dynamic directory structure to the user, and breaks the conventional tight linkage between sets of files and the physical directory structure, thus allowing different users to see files organized in a different fashion. The dynamic structure is based upon content, which is extracted using an inverted index according to attributes and values defined by the XML structure.
Owner:IBM CORP

Uniform search system and method for selectively sharing distributed access-controlled documents

A privacy-preserving index system addresses the problem of providing a privacy-preserving search over distributed access-controlled content. Indexed documents can be readily reconstructed from inverted indexes used in the search. The privacy-preserving index system builds a centralized privacy-preserving index in conjunction with a distributed access-control enforcing search protocol. The privacy-preserving index utilizes a randomized algorithm for constructing a privacy-preserving index. The privacy-preserving index is strongly resilient to privacy breaches. The privacy-preserving index system allows content providers to maintain complete control in defining access groups and ensuring its compliance, and further allows system implementors to retain tunable knobs to balance privacy and efficiency concerns for their particular domains.
Owner:IBM CORP

Serving advertisements using a search of advertiser Web information

Advertisers are permitted to put targeted ads on, or to serve ads in association with, various content such as search results pages, Web pages, e-mail, etc., without requiring the advertiser to enter and / or maintain certain targeting information, such as keyword targeting. This may be accomplished by using a searchable data structure, such as an inverted index for example, of available advertiser Web information. The advertiser Web information may include terms and / or phrase extracted from the advertiser's Website. In particular, a search query may be used to search for matching advertisers, and therefore matching ads. For example, the search query can be used to search an inverted index including words and / or phrases extracted from advertiser Websites. The advertiser Web page, or some other identifier, can be used as a key to search for an associated ad.
Owner:GOOGLE LLC

Tool for automatically mapping multimedia annotations to ontologies

A tool for learning to relate annotations and transcript of a multimedia sequence to nodes in a formally or semi-formally represented ontology covering a broad range of possible multimedia documents. The device includes learning data preparation that involves certain special techniques for deriving data from the past mappings of annotations to nodes in an ontology, building inverted indices maintaining certain special statistics and a retriever that exploits these special statistics to rank the relevance of the nodes in an ontology for a given a set of new annotations.
Owner:KNUMI

Method and apparatus for suggesting/disambiguation query terms based upon usage patterns observed

The invention comprises a set of complementary techniques that dramatically improve enterprise search and navigation results. The core of the invention is an expertise or knowledge index, called UseRank that tracks the behavior of website visitors. The expertise-index is designed to focus on the four key discoveries of enterprise attributes: Subject Authority, Work Patterns, Content Freshness, and Group Know-how. The invention produces useful, timely, cross-application, expertise-based search and navigation results. In contrast, traditional Information Retrieval technologies such as inverted index, NLP, or taxonomy tackle the same problem with an opposite set of attributes than what the enterprise needs: Content Population, Word Patterns, Content Existence, and Statistical Trends. Overall, the invention emcompasses Baynote Search—a enhancement over existing IR searches, Baynote Guide —a set of community-driven navigations, and Baynote Insights—aggregated views of visitor interests and trends and content gaps.
Owner:MONETATE INC

File system with access and retrieval of XML documents

A XML-aware file system exploits attributes encoded in a XML document. The file system presents a dynamic directory structure to the user, and breaks the conventional tight linkage between sets of files and the physical directory structure, thus allowing different users to see files organized in a different fashion. The dynamic structure is based upon content, which is extracted using an inverted index according to attributes and values defined by the XML structure. In one application, a dynamically changing federated repository is searchable using a system of local and merged master indices, wherein query results are presented as virtual directory paths that are semantically organized.
Owner:IBM CORP

Method and apparatus for determining peer groups based upon observed usage patterns

The invention comprises a set of complementary techniques that dramatically improve enterprise search and navigation results. The core of the invention is an expertise or knowledge index, called UseRank that tracks the behavior of website visitors. The expertise-index is designed to focus on the four key discoveries of enterprise attributes: Subject Authority, Work Patterns, Content Freshness, and Group Know-how. The invention produces useful, timely, cross-application, expertise-based search and navigation results. In contrast, traditional Information Retrieval technologies such as inverted index, NLP, or taxonomy tackle the same problem with an opposite set of attributes than what the enterprise needs: Content Population, Word Patterns, Content Existence, and Statistical Trends. Overall, the invention emcompasses Baynote Search—a enhancement over existing IR searches, Baynote Guide—a set of community-driven navigations, and Baynote Insights—aggregated views of visitor interests and trends and content gaps.
Owner:MONETATE INC

Supporting sub-document updates and queries in an inverted index

A system, method, and computer program product for updating a partitioned index of a dataset. A document is indexed by separating it into indexable sections, such that different ones of the indexable sections may be contained in different partitions of the partitioned index. The partitioned index is updated using an updated version of the document by updating only those sections of the index corresponding to sections of the document that have been updated in the updated version.
Owner:IBM CORP

Method for performing transactions on data and a transactional database

The present invention provides a method for performing transactions on data entities in a database and a transactional database. The database comprises an ordered set of data stores with at least one static data store, wherein said static data store uses an index structure based on a non-updatable representation of an ordered set of integers according to the principle of compressed inverted indices. The method allows to generate a modifiable data store when the performed transaction comprises an insert, update or delete operation, to execute operations of the transaction on the ordered set being present at the time when the transaction has been started and, if present, on the modifiable data store and to convert data stores to a new static data store, The insert, update or delete operation are executed on the modifiable data store which is the only data store modifiable for the transaction.
Owner:OPEN TEXT SA ULC

Method and apparatus for predicting destinations in a navigation context based upon observed usage patterns

The invention comprises a set of complementary techniques that dramatically improve enterprise search and navigation results. The core of the invention is an expertise or knowledge index, called UseRank that tracks the behavior of website visitors. The expertise-index is designed to focus on the four key discoveries of enterprise attributes: Subject Authority, Work Patterns, Content Freshness, and Group Know-how. The invention produces useful, timely, cross-application, expertise-based search and navigation results. In contrast, traditional Information Retrieval technologies such as inverted index, NLP, or taxonomy tackle the same problem with an opposite set of attributes than what the enterprise needs: Content Population, Word Patterns, Content Existence, and Statistical Trends. Overall, the invention emcompasses Baynote Search—a enhancement over existing IR searches, Baynote Guide—a set of community-driven navigations, and Baynote Insights—aggregated views of visitor interests and trends and content gaps.
Owner:MONETATE INC

Method for extracting text-oriented field term and term relationship

The invention discloses a method for extracting a text-oriented field term and term relationship. The method is characterized by comprising the following steps of: firstly, preprocessing original linguistic data to obtain a candidate word set including clauses, participles and part of speech tagging, and filtering noise words; secondly, extracting term characteristics from the original linguisticdata and the Internet, and separating terms from candidate words by combining with a dual-model structure algorithm; thirdly, constructing a term dictionary by adopting an inverted index method, and tagging the terms in a text to be identified by using a longest match algorithm; and finally, carrying out multilevel sign sequence tagging through a conditional random field model according to a multi-dimensional node signing rule to obtain a relationship among the terms in the text to be identified.
Owner:XI AN JIAOTONG UNIV

Method for segmenting and indexing scenes by combining captions and video image information

The invention relates to a method for segmenting and indexing scenes by combining captions and video image information. The method is characterized in that: in the duration of each piece of caption, a video frame collection is used as a minimum unit of a scene cluster. The method comprises the steps of: after obtaining the minimum unit of the scene cluster, and extracting at least three or more discontinuous video frames to form a video key frame collection of the piece of caption; comparing the similarities of the key frames of a plurality of adjacent minimum units by using a bidirectional SIFT key point matching method and establishing an initial attribution relationship between the captions and the scenes by combining a caption related transition diagram; for the continuous minimum cluster units judged to be dissimilar, further judging whether the minimum cluster units can be merged by the relationship of the minimum cluster units and the corresponding captions; and according to the determined attribution relationships of the captions and the scenes, extracting the video scenes. For the segments of the extracted video scenes, the forward and reverse indexes, generated by the caption texts contained in the segments, are used as a foundation of indexing the video segments.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI

System and method for providing service for searching web site addresses

A method for searching for a partially specified Uniform Resource Locator (URL) addresses includes receiving a user request, from a user, including a partially specified URL address. A URL search request handler is invoked to search for the partially specified URL address within an inverted index of web site URLs. A web search request handler is invoked to rank the search results of the search for the partially specified URL address based on one or more keywords specified in the user request, a list of recently accessed URLs, and a user profile. Search results are returned to the user comprising a list of URL addresses based on the search for the partially specified URL and ranked based on the user search data.
Owner:GOOGLE LLC

Electronic map interest point data redundant detecting method and system

An embodiment of the invention discloses a redundancy detecting method and a system thereof. The redundancy detecting method comprises performing binary chopping on titles of POI data, creating an inverted index for chopped binary words, and searching similar POI data for the POI data according to the word with the lowest occurrence frequency in the inverted index, further, calculating similarity of titles and addresses for searched POI data which has title similar relations, obtaining a total similarity according to the title similarity and the address similarity of the POI data, comparing the total similarity with a preset threshold, and if the similarity is larger than the preset threshold, defining the corresponding POI data as redundant data. By utilizing the invention, redundancy detection in POI data of electronic maps can be realized.
Owner:BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

Indexing and searching entity-relationship data

Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.
Owner:IBM CORP

Indexing and querying semi-structured data

Generating an inverted index is disclosed. Semi-structured data from a plurality of sources is parsed to extract structure from at least a portion of the semi-structured data. The inverted index is generated using the extracted structure. The inverted index includes a location identifier and a data type identifier for one or more entries of the inverted index.
Owner:VMWARE INC

Intelligent response method, electronic device and storage medium

The invention provides an intelligent response method, which comprises the following steps that: after a consultation question is preprocessed, constructing an inverted index for a question and answerknowledge base; through an inverted index query way, inquiring a candidate question set related to the consultation question from the question and answer knowledge base; aiming at each candidate question in the candidate question set; independently calculating a question similarity between the consultation question and the candidate question, wherein the question similarity is obtained through the linear weighting of a text similarity, a semantic similarity, a theme similarity and a syntax similarity between the consultation question and the corresponding candidate question; and finally, selecting a candidate question corresponding to the highest question similarity obtained by calculation, and inquiring the associated answer of the selected candidate question in the question and answer knowledge base as a target answer to be output. The invention also provides an electronic device and a storage medium. By use of the intelligent response method, the accuracy and the response efficiency of intelligent response can be improved, and service quality is improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

International information search and delivery system providing search results personalized to a particular natural language

Documents containing information about product offerings in various natural languages are passed through transitional translation layers which convert the data to a single computer language using a universal character set encompassing the character sets used in all supported natural languages. The documents are stored in their original natural languages and in English with documents segmented into components which components are identified by search terms arranged in a taxonomy tree based on product types. The names of the products in the national languages are added to the English language documents enabling quick keyword searches when the product name or number is known. A bi-directional inverted index is provided for access by the keyword search terms so that keywords with the same meaning in different languages are accessible together when the keyword in one of the languages is queried.
Owner:IBM CORP

Method and apparatus for determining expertise based upon observed usage patterns

The invention comprises a set of complementary techniques that dramatically improve enterprise search and navigation results. The core of the invention is an expertise or knowledge index, called UseRank that tracks the behavior of website visitors. The expertise-index is designed to focus on the four key discoveries of enterprise attributes: Subject Authority, Work Patterns, Content Freshness, and Group Know-how. The invention produces useful, timely, cross-application, expertise-based search and navigation results. In contrast, traditional Information Retrieval technologies such as inverted index, NLP, or taxonomy tackle the same problem with an opposite set of attributes than what the enterprise needs: Content Population, Word Patterns, Content Existence, and Statistical Trends. Overall, the invention emcompasses Baynote Search—a enhancement over existing IR searches, Baynote Guide—a set of community-driven navigations, and Baynote Insights—aggregated views of visitor interests and trends and content gaps.
Owner:MONETATE INC

Large-scale human face image searching method

The invention discloses a large-scale human face image searching method. The method comprises the following steps of preprocessing human face images; extracting local characteristics from the human face images; extracting overall geometrical characteristics from the human face images; quantifying the local characteristics; quantifying the overall geometrical characteristics; establishing a reverse index; searching a candidate human face image set; and re-arranging the candidate human face image set. By the method, an index for a large-scale human face image database can be established, quick human face research is realized, and the research efficiency is realized. In addition, the accuracy of human face research is improved by embedding an auxiliary information characteristic quantifying and candidate human face image set re-arranging algorithm. Effective and accurate large-scale human face image search is realized by the method, so that the method has higher use value.
Owner:NANJING UNIV

Music retrieval system based on audio fingerprint features

The invention belongs to the technical field of information retrieval, and particularly relates to a music retrieval system based on audio fingerprint features. The system is composed of a preprocessing module, a feature extraction module, a reverse index module and a fine matching module. The preprocessing module mainly carries out audio signal conversion, resampling and filtering; the feature extraction module is used for representing audio files, wherein the audio fingerprint features are adopted to select the most stable point from a frequency spectrum as the feature point through twice screening based on dynamic threshold values, and each feature is represented by a dot pair; according to the reverse index module, the features are used as key words, reverse indexes are built according to the features of a song library, and the index result is returned according to the number of the same key words; according to the fine matching module, the sequential relationship of the audio features is combined, an improved editing distance is adopted as the similarity of two feature sequences, and therefore the index result is optimized. The music retrieval system based on the audio fingerprint features is suitable for the retrieval of a large number of songs, and can particularly conduct effective retrieval on record inquiry segments.
Owner:FUDAN UNIV

System and Method for Providing a Trustworthy Inverted Index to Enable Searching of Records

A trustworthy inverted index system processes records to identify features for indexing, generates posting lists corresponding to features in a dictionary, maintains in a storage cache a tail of at least one of the posting lists to minimize random I / Os to the index, determines a desired number of the posting lists based on a desired level of insertion performance, a query performance, or a size of the storage cache, and reads a posting list corresponding to a search feature in a query to identify records that comprise the search feature. The system maps the features in the dictionary to the desired number of posting lists. The system uses a jump pointer to point from one entry to the next in the posting lists based on increasing values of entries in the posting lists.
Owner:LINKEDIN

Navigational learning in a structured transaction processing system

A method performed in connection with an arrangement of nodes representable as a graph, and an inverted index containing a correlation among keywords and nodes such that at least some nodes containing a given keyword are indexed to that given keyword, involves receiving a word, searching the inverted index to determine whether the word is a keyword and, if the word is a keyword, jumping to a node identified in the inverted index as correlated to that keyword, otherwise, learning a meaning for the word based upon reaching a result node and applying at least one specified rule such that a new input containing the word can be received thereafter and the word from the new input will be treated according to the learned meaning.
Owner:GUADA TECH LLC

Hash-Based Face Recognition System

In a face recognition system, overlapping patches are defined on a canonical face. Random clusters of pixel pairs are defined within each patch, and binary features are determined for each pixel pair by comparing their respective feature values. An inverted index hash table is constructed of the binary features. Similar binary features are then determined on a library of registrable samples of identified faces. A log probability of each registrable sample generating a binary feature from a corresponding cluster of pixel pairs at each specific patch location is determined and stored in the hash table. In a search phase, similar binary features are determined, and a hash key is determined for each binary feature. The log probabilities for each identity found in the hash table are summed for all clusters of pixel pairs and locations and sorted to find the high probability match.
Owner:SEIKO EPSON CORP

Multilayer quotation recommendation method based on literature content mapping knowledge domain

ActiveCN105653706AImprove the efficiency of obtaining citationsExpress research topicsSpecial data processing applicationsInformation processingData set
The invention discloses a multilayer quotation recommendation method based on a literature content mapping knowledge domain, and belongs to the field of information recommendation and intelligent information processing. The method comprises the following steps: firstly, obtaining the query requirement of a user, wherein the query requirement consists of the key words of the title and the digest of a thesis which needs to recommend a quotation thesis or quotation literature; then, on the basis of the literature content mapping knowledge domain, expanding and querying a retrieval word, wherein the mapping knowledge domain consists of the research object word and the research behavior word node of the literature, and edges which express various semantic relations including synonymy, synonym, an up and down position, part-whole, juxtaposition and the like; and finally, constructing the inverted index of the literature in a data set, selecting a candidate quotation, calculating the similarity between the candidate quotation and query, and adopting a gradient progressive regression tree to carry out quotation recommendation. The method carries out multilayer quotation recommendation on the basis of the literature content mapping knowledge domain, enlarges the range of the candidate quotation, accurately expresses the research object and contents of the thesis, improves efficiency for users to obtain a relevant literature and has a wide application prospect.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Index generation method and index generation device based on MapReduce programming architecture

The invention relates to an index generation method and an index generation device based on a MapReduce programming architecture. The index generation method comprises the following steps of: acquiring data, preparing the data into a unified format and storing the prepared data in a record set formula; carrying out head encapsulation on each data record in the record set; inserting the data records subjected to data encapsulation into an HBase cluster in batch; calling a MapReduce service and an HBase service in an Hadoop cluster and connecting an Solr cluster; carrying out MapReduce operation and submitting an operation index parallel generating task to form a reverse index intermediate file; carrying out Reduce operation to generate a reverse index file; and starting a new Map task for carrying out slit operation on the reverse index file to generate a final index. According to the index generation method and the index generation device, disclosed by the invention, the storage of high-efficiency distributed mass data and the establishment of the index can be realized; and in addition, the index generation method and the index generation device have the advantages of extensibility, high fault tolerance, high performance and the like.
Owner:XIAMEN MEIYA PICO INFORMATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products