Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

477results about How to "Improve query performance" patented technology

Network address cache apparatus and method

Cached information that includes multiple network addresses is delivered in a manner that changes the order of the network addresses each time the cached information is delivered to a client, thereby achieving load balancing in a manner that reduces network traffic and improves system performance. In the preferred embodiments, the user defines a minimum time to live (TTL) and one or more order change criteria. When a query is made to a DNS, the DNS returns an answer that includes a time to live (TTL). This answer may be stored in a cache, and may include multiple network addresses. If the TTL for the answer received from a DNS is less that the minimum TTL, the TTL for the answer is set to the minimum TTL before storing the answer in the cache. When a query may be satisfied by a cached answer that includes multiple network addresses, a cache mechanism delivers the multiple cached network addresses in an order determined by the one or more order change criteria if the TTL for the cached results is a positive number. In this manner load balancing can be achieved while still benefitting from the performance enhancements of caching DNS query results.
Owner:IBM CORP

System and method for executing compute-intensive database user-defined programs on an attached high-performance parallel computer

The invention pertains to a system and method for dispatching and executing the compute-intensive parts of the workflow for database queries on an attached high-performance, parallel computing platform. The performance overhead for moving the required data and results between the database platform and the high-performance computing platform where the workload is executed is amortized in several ways, for example,
    • by exploiting the fine-grained parallelism and superior hardware performance on the parallel computing platform for speeding up compute-intensive calculations,
    • by using in-memory data structures on the parallel computing platform to cache data sets between a sequence of time-lagged queries on the same data, so that these queries can be processed without further data transfer overheads,
    • by replicating data within the parallel computing platform so that multiple independent queries on the same target data set can be simultaneously processed using independent parallel partitions of the high-performance computing platform.
A specific embodiment of this invention was used for deploying a bio-informatics application involving gene and protein sequence matching using the Smith-Waterman algorithm on a database system connected via an Ethernet local area network to a parallel supercomputer.
Owner:IBM CORP

Method and device for realizing column storage based relational database

The invention relates to a method and a device for realizing a column storage based relational database. The method comprises: establishing data files, and serially numbering the data blocks which form the data files; defining a table segment; inserting records into the table segment; generating the only record identification number in the table segment for the record inserted in the table segment, and separating the records by columns; for each column in the records, executing the following operation: storing column values and the record identification numbers as value data in the data blocks and sorting according to the size of the column values; storing the record identification numbers and the serial numbers of the data blocks for storing the value data which are taken as connecting data in new data blocks, and sorting according to the size of the record identification numbers; and establishing indexes for the data blocks for storing the value data and the data blocks for storing the connecting data, and generating index data blocks. The embodiment of the invention improves the query performance of the database.
Owner:BEIJING CLOUDWAVE TIMES TECH CO LTD

Systems and methods for information retrieval

An information retrieval system implements a search language, through which a querying entity (e.g., a user, a program or process, or the like) formulates a search query. Preferably, a search query is composed of an ordered set of clause definitions, and each clause can have set membership operations applied to it. Each clause includes a clause pipeline, and a time constraint. A clause pipeline includes an ordered set of clause specifications separated by a pipeline operator. A clause specification can be either an expansion operation or a filtering operation. Preferably, a first clause specification in a pipeline operates on an initial universe of all objects, and each subsequent clause specification operates on a set of objects produced from the previous clause specification. The search language is exposed to users (typically, IT administrators), and one or more builder programs within the system (each referred to as a “model builder”) are used internally to present data models to the search language. A model builder extracts data from a given type of data source (including, without limitation, a relational database system, an application programming interface (API), or the like), and enables that data to be presented to one or more constructs of the search language according to a single unified data model.
Owner:STRATACLOUD

Extendible repeated data detection method

The invention discloses an extendible repeated data detection method, belongs to the technical field of computer storage, and solves the problem that in the existing repeated data detecting method, the storage capacity cannot be efficiently extended, so as to meet the requirements of the current situation that the storage demand increases and repeatedly deleted systems need upgrading and updating. The extendible repeated data detection method comprises the following steps: partitioning processing, fingerprint extraction, retrieving of Bloom filters, retrieving of fingerprint subset table, judgment of unfulfilled Bloom filters, new fingerprint marking, judgment of Bloom filter quantity, and extending of Bloom filter array. In the invention, the Bloom filter array is used to retrieve the fingerprint data, so as to quickly locate the retrieval range, improve the retrieval efficiency and realize detection on the repeated data; the extendible repeated data detection method is high in expansibility and querying performance, can support element location and control the misjudgment rate, and further can effectively reduce the memory overhead. The Bloom filter array is composed of a series of isomorphic Bloom filters, so that once the misjudgment rate epsilon' and the pre-established retrieving fingerprint total quantity nmax are provided, the quantity of the required Bloom filters and the number of the hush functions can be worked out.
Owner:HUAZHONG UNIV OF SCI & TECH

Method and device for inquiring data

The invention discloses a method and device for inquiring data. The method comprises the steps that an inquiry statement is decomposed into one or more inquiry clauses; traversal is carried out on the inquiry clauses, the inquiry clauses are matched with metadata, and the inquiry clauses are replaced with execution results stored in the metadata; otherwise, the inquiry clauses are executed, and the metadata are updated; the final execution result is used for main inquiry to obtain an inquiry result. The executed inquiry clauses and the execution results of the executed inquiry clauses are used as the metadata to be managed, the identical inquiry clauses in the current session or different sessions are directly replaced with the execution results stored in the metadata, the inquiry clauses are prevented from being executed repeatedly, and therefore the inquiry performance of the whole system is improved to a greater extent. Meanwhile, compared with optimization of the existing inquiry clause level, the inquiry optimization granularity is thinner, the method and device are more suitable for inquiring large data in data warehouse application, and therefore the inquiry performance of the whole system is further improved.
Owner:ZHEJIANG TMALL TECH CO LTD

Mobile object continuous k-nearest neighbor (CKNN) query method based on road based road networks tree (RRN-Tree) in road network

The invention provides a mobile object continuous k-nearest neighbor (CKNN) query method based on a road based road networks tree (RRN-Tree) in a road network and relates to a data query method. The mobile object CKNN query method aims to solve the problem that in the prior art, an index structure is utilized to index the road network, the road network is modeled to a directed / undirected graph, and a nearest neighbor query request is processed based on a memory data structure; however, when the road network has a large data size and multiple rod segments, the query efficiency is decreased rapidly; furthermore, the modeling mode based on the graph cannot reflect the steering relation of a mobile object at the crossing, and the nearest neighbor query of the complex road network with crossing steering and U-shaped turning constraints cannot achieved. According to the mobile object CKNN query method, an RRN-Tree index structure is provided to index the road network and interesting point objects, an adjacency linked list is established for crossed points on paths in the index structure, the connection relations between road segments at the crossing are stored, and therefore, the CKNN query of the road network under complex constrain conditions is completed. The mobile object CKNN query method is used for inquiring CKNN query of the road network.
Owner:NORTHEAST FORESTRY UNIVERSITY

Image retrieval method integrating classification with hash partitioning and image retrieval system utilizing same

The invention discloses an image retrieval system integrating classification with hash partitioning, which comprises a downloading module, a classification module training module, an image classification module, a characteristic extraction module, a recording table building module, a partitioning module, a request processing module, a retrieval module, a similarity acquiring module and a result returning module. The downloading module is used for downloading images so as to build an image library, the classification model training module classifies the images in the image library according tothe shapes and selects representative sample images from the image library to form a sample library firstly, then extracts characteristic descriptors of the classification bottom layer of all images in the sample library and performs trainings on the characteristic descriptors of the classification bottom layer by a support vector machine so as to obtain discriminant of each classification. Classification models are formed according to the discriminants of all the classifications. Precision ratio of the image retrieval system is increased, the problem of low recall ratio during classificationmistakes is overcome, and the retrieval speed of the image retrieval system is increased integrally.
Owner:HUAZHONG UNIV OF SCI & TECH

Parallel query method for distributed database

The invention discloses a parallel query method for a distributed database. In the method, due to the introduction of the design of proxy and qcs modules, a query system is able to support a high-concurrency distributed database and meets requirements of a large-sized application system. Meanwhile, based on a concept of 'dividing and ruling' query tasks, the qcs module initiates query concurrently, granularity data which can be queried quickly are fed back first, and thus, the overall query performance of the system is improved obviously and the needs for quick query of a majority of application systems are met.
Owner:南京中新赛克科技有限责任公司

Distributed type reverse index organization method based on user log analysis

The invention discloses a distributed type reverse index organization method based on user log analysis. The distributed type reverse index organization method comprises the following steps: 1) analyzing query logs of the user, extracting high-frequency words and non-high-frequency words, establishing a relativity matrix of the high-frequency words, and establishing a high-frequency word relation graph according to the relativity of the high-frequency words; 2) calculating the load of each high-frequency word, and clustering the high-frequency words according to the high-frequency word relation graph and the loads of the high-frequency words; 3) distributing the clusters to nodes, establishing a high-frequency word index, hashing non-high-frequency words to the nodes, and establishing a non-high-frequency word index; 4) establishing a global index table according to the high-frequency word index and the non-high-frequency word index, and inquiring routes according to the global index table. The distributed type reverse index organization method disclosed by the invention has the advantages of small query cost, high query efficiency, and favorable query performance, and also has the advantages that the distributed type reverse index organization method can realize the balance of the throughput of the entire system and the query response speed of each time, and less nodes is referred during the query of a plurality of words.
Owner:ZHEJIANG UNIV

Method and system for extracting knowledge graph from software project data and answering knowledge graph

ActiveCN108959433AFriendly and easy-to-use automatic question and answer supportGood query effectReverse engineeringSpecial data processing applicationsNatural languageKnowledge graph
The invention discloses a method and system for extracting a knowledge graph from software project data and answering the knowledge graph. The method comprises the following steps that for each type of software project data in a software project database, entities and an incidence relation between entities are extracted from the type of the software project data to be stored in a corresponding graph database; based on a traceability correlation technology of software data, correlation processing is carried out on the data in each graph database, and the correlation between entities of different types of software project data is obtained; and a corresponding edge is added into each graph database according to the incidence relation between entities of different types of software project data, and the entities with different sources are connected to generate the knowledge graph of the software project data; for the input natural language query statement, a matched communication sub-graphis inquired from the knowledge graph to serve as an answer. By virtue of the method and system, the problems of data correlation deletion, serious information isolation and difficulty in connection query and analysis of the software project are solved.
Owner:PEKING UNIV

Distributed device log collection method

The invention discloses a distributed device log collection method. According to the method, an integrated data intermediate layer is built through a distributed log processing framework in a mediator mode, integrated data intermediate management service is formed, the data intermediate management service collects device logs, the logs are stored on each distributed storage point in a distributed way, in addition, data collection is carried out, and if the distributed storage points need to be increased, the goal is achieved by adopting a distributed storage point dynamic expansion mechanism. The distributed device log collection method has the advantages that the integrated data intermediate layer is built by adopting the mediator mode, the logs are subjected to collection and formatting processing in a unified way, and the distributed storage points are uniformly managed and dispatched in a centralized way; the sub table structure is adopted, the multithreading processing advantages are better realized, a particular sub table indexing mechanism is built, and forms a super-volume data grading indexing system together with MariaDb database indexes of each data storage point, meanwhile, the performance advantage of a distributed server is utilized, and the storage and query performance of log data is greatly improved.
Owner:GUIZHOU POWER GRID INFORMATION & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products