Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

40 results about "Schema matching" patented technology

The terms schema matching and mapping are often used interchangeably for a database process. For this article, we differentiate the two as follows: Schema matching is the process of identifying that two objects are semantically related (scope of this article) while mapping refers to the transformations between the objects. For example, in the two schemas DB1.Student (Name, SSN, Level, Major, Marks) and DB2.Grad-Student (Name, ID, Major, Grades); possible matches would be: DB1.Student ≈ DB2.Grad-Student; DB1.SSN = DB2.ID etc. and possible transformations or mappings would be: DB1.Marks to DB2.Grades (100-90 A; 90-80 B: etc.).

Method and system for partial-order analysis of multi-dimensional data

A method and system for performing data analysis where multiple, categorical dimensions and statistical measures are used. The method includes preparing multi-dimensional data into nodes and arranging them logically in the form of a partial-order database. The database may then be traversed by a data-mining tool, or queried in an OLAP manner. Queries are answered through a process of associative partial-order search, which includes optional support for “children-of” requests, and optional support for fuzzy pattern-matching. By aggregating measures on-the-fly, this method is able to preform well without requiring a large number of summary nodes, when taken as a percentage of the total node count.
Owner:GULA CONSULTING LLC

Selective schema matching

A system that automatically matches schema elements is provided. In one aspect, given a selected element of one schema, the system can calculate the best matching candidate elements of another schema. The calculation can be based on a heuristic combination of factors, such as element names, element types, schema structure, existing matches, and the history of actions taken by the user. Accordingly, the best candidate (according to the calculation) can be emphasized and / or highlighted. The tool can auto-scroll to the best choice. Similarly, the user can request the calculation and display to best candidates by pressing a keyboard key or hot key. As well, the user can prompt display of the best candidates by using the mouse (e.g., moving the mouse over the element E or clicking on E), or both (e.g., mouse over with hot key depressed).
Owner:MICROSOFT TECH LICENSING LLC

Tools and methods for semi-automatic schema matching

Tools and methods for schema matching that generate schema graphs, populate match matrices and display the schema graphs and the match matrices. These tools and methods characterize potential matches between disparate schemata in terms of both a strength of evidence indicating the potential match and an amount of evidence indicating the potential match. A number of match voters generate a set of match scores for each potential match, and these match scores are combined by a vote merger to form a single confidence value for each potential match. A number of filters display the confidence value for each potential match as a link on a graphical user interface. Machine-learning techniques may be employed to adaptively determine confidence values based on previously established matches.
Owner:MITRE SPORTS INT LTD

Method, system, and computer program product for comparing two computer files

A method and system for comparing two documents, such as XML (Extensible Markup Language) files, where each document is capable of being parsed into a DOM (Document Object Model) trees. Each tree structure is converted into an array of leaf paths containing nodes. These arrays are then compared to identify corresponding matched nodes—either exactly matched nodes or schema matched nodes. In reporting the results of the comparison, unmatched nodes of the source document are reported as “deleted nodes”, that is, existing in the source but not in the target. Similarly all unmatched nodes of the target document are reported as “added nodes”, that is, existing in the target but not in the source. In addition, schema matched nodes are reported as “modified nodes”between source and target documents.
Owner:IBM CORP

Structured method for schema matching using multiple levels of ontologies

A structured method of matching schemas that uses multiple levels of ontologies is disclosed. The method maps functions of a target system to a process ontology and maps functions of a source system to the process ontology to produce a first mapping of target functions and source functions using the process ontology. The method identifies target function parameters upon which the target functions operate and identifies source function parameters upon which the source functions operate. Then, the method maps the target function parameters to a concept ontology and maps the source function parameters to the concept ontology to produce a second mapping of the target function parameters and the source function parameters using the concept ontology. This second mapping is enhanced by mapping the target function parameters to a data-type ontology and mapping the source function parameters to the data-type ontology. This produces an enhanced second mapping of the target function parameters and the source function parameters using the data-type ontology. This enhanced second mapping can be the resultant output to be used in subsequent processing.
Owner:IBM CORP

System and method for schema matching

A system and method for matching one or more source schemas with one or more target schemas is provided. The matching between source and target schemas is performed by gathering inputs pertaining to the source and target schemas, wherein the inputs comprises a set of details in a predefined format. Thereafter, the gathered inputs are processed by comparing the source schemas with the target schemas. The processing is performed to identify a set of matches between the source and target schemas based on the linguistic similarity, structural similarity and functional similarity and relationship between the source and target schemas. Subsequently, the identified matches are stored.
Owner:INFOSYS LTD

Methods and apparatus for contextual schema mapping of source documents to target documents

Methods and apparatus are provided for improved schema mapping of source documents to target documents. A list of matches are generated between at least one source table and at least one target table. One or more of the matches are annotated with a logical condition providing a context in which the match applies. Matches can be annotated with a logical condition, for example, by generating a set of candidate view conditions, C, to be applied to the one or more source tables. A schema match algorithm can generate the list of matches. Candidate logical conditions can be identified, for example, by (i) creating a set of views for categorical attributes in the tables and adding a view for each partitioning of the attribute values; (ii) using a classifier built on target attribute values; or (iii) evaluating internal features of a source table.
Owner:LUCENT TECH INC

Automated database schema matching

Described are system, method, and computer-program product embodiments for automatically ingesting data from disparate data sources into a target database having a target data schema. In some embodiments, the data is received in a data file including data columns formatted according to a data schema, and a data dictionary describing the source data columns. Count data is generated for each cell selected from a data column, each count datum including counts of occurrences of a detected characteristic in each cell. One or more target data columns from the target data schema can be selected and displayed to a user as being semantically related to the data column based on the count data for each cell, a column header of the data column, and the data dictionary. Based on input received from the user, a data table is generated to store the source columns and loaded into the target database.
Owner:PRICEWATERHOUSECOOPERS

Method for implementing automatic database schema matching

The invention discloses a method for realizing automatic matching of database schemas, which comprises the four steps of schema information analysis, schema information classification, schema information integration and calculation of the similarity of schema elements, wherein the step of schema information analysis is used for realizing the analysis of element information of formatted source schemas and target schemas; the step of schema information classification is used for classifying schema element information analyzed in the schema information analysis stage and classifying all the elements according to the name, the description and the type respectively; the step of schema information integration is used for integrating classified results of schema information classification and element structure information and establishing weighted fuzzy concept lattices; and the step of calculation of the similarity of the schema elements is used for calculating the matching degree of the schema elements according to the weighted fuzzy concept lattices and setting a threshold to determine the matching relation between the elements. The method for realizing automatic matching of the database schemas solves the problems of schema matching in actual application, improves the efficiency of computers when solving the problem of schema matching, and reduces the waste of CPU resources.
Owner:SOUTHEAST UNIV

Online recognition and filling method for hourly energy consumption abnormality data of office building

The invention discloses an online recognition and filling method for the hourly energy consumption abnormality data of an office building, and the method comprises the following steps: obtaining historical energy consumption data; clearing the historical energy consumption data, and eliminating the abnormality data; building an energy consumption mode feature set according to the outdoor temperature, moment and date attributes of the hourly energy consumption data, and calculating a feature value; collecting the energy consumption data hour by hour, carrying out the energy consumption mode matching, and finding a corresponding feature value; carrying out the online recognition of abnormal data; carrying out the cyclic interpolation of the abnormality data, and carrying out the correction of the interpolation value through a formula; storing the data without abnormality into a database after interpolation; carrying out the rolling correction of the feature values of the energy consumption mode regularly, and then carrying out the online interpolation of the hourly energy consumption data. The method can solve a problem of the recognition and interpolation of the usual abnormality data in the hourly energy consumption of the office building, improves the energy consumption data quality, provides effective data for the further data mining, and promotes the development of the energy saving operation of the building.
Owner:SOUTH CHINA UNIV OF TECH

Method, system and computer program for discovering inventory information with dynamic selection of available providers

A solution (200) for discovering inventory information in a data processing system is proposed. For this purpose, a corresponding discovery request is submitted (A1) to an inventory tool (200); the discovery request specifies a selected query pattern for the desired inventory information (for example, all the files included in a specific directory). Multiple providers (210)—such as of the interactive, cached or monitor type—are available for executing the required discovery operation. A predictive model (220) is associated with each provider; the models are used to estimate (A2,A3a,A3b-A4) the expected performance of the different providers for discovering the inventory information (according to the selected query pattern). In this way, it is possible to select (A5-A9) the provider, among the available ones, which is best suited for this purpose. The selected provider is then invoked (A10-A12) to discover the inventory information matching the selected query pattern. At the same time, the result of the discovery is used to refine (A13-A14) the models of the providers, so as to improve their accuracy.
Owner:IBM CORP

Systems and methods for performing schema matching with data dictionaries

Systems and methods for performing schema matching of multiple data dictionaries. Lexical graphs are generated for two or more data dictionaries. The lexical graphs include terms with definitions and identified links between terms. A data processing device determines if matches occur between the generated lexical graphs and outputs the result of the determination.
Owner:THE BOEING CO

Web services database cluster structure and method thereof

There is disclosed configuring of clustered web services nodes accessing a common database, including implementing a data virtualization layer at each node to abstract an instance of the database from a web service application. In one embodiment, at each node is performed creating a first, data virtualization entity bean having all read and write operations of an application-developed (master) entity bean, creating a second entity bean that carries only the read operations of the master entity bean and addresses the replica instance, receiving an operation request at the first entity bean, and routing a request to either the master entity bean or the second entity bean depending upon the requested operation to access the respective database instance. In another embodiment, at each node is performed implementing an empty database instance having a schema matching the common database, identifying a relevant partitioning in a query utilizing the empty database, and routing the query to a respective partitioned database instance.
Owner:IBM CORP

Multi-source data deep fusion method based on deep learning

The embodiment of the invention provides a multi-source data deep fusion method based on deep learning. The method comprises the steps of obtaining a to-be-fused relational data table; constructing adeep learning model, importing training data to carry out word vectorization processing on contents in the to-be-fused relational data table, and carrying out mode matching on the processed data; performing stratified sampling on the data in the to-be-fused relational data table based on the similarity between entities corresponding to the data, importing the sampled data into a preset structure model to perform integration processing based on word vectors to obtain a trained data bucket model, and performing entity-based data bucket processing based on the data bucket model; and judging whether the data in each bucket refers to the same entity, and performing data fusion on the data referring to the same entity to obtain a data table formed by the fused data. According to the invention, character string data is modeled in a word vector mode, and the method can be used for modeling texts and semantics of the character string at the same time, so that the tolerance to dirty data is improved.
Owner:STATE GRID ZHEJIANG ELECTRIC POWER CO MARKETING SERVICE CENT +2

PMU-SCADA data time synchronization and fusion method based on Lagrange interpolation and cubic exponential smoothing

InactiveCN111625769ASolve the problem of time frequency inconsistencySolving Difficult to Calibrate FiducialsMachine learningComplex mathematical operationsAlgorithmNoise reduction
The invention relates to a PMU-SCADA data time synchronization and fusion method based on Lagrange interpolation and cubic exponential smoothing. The method mainly comprises the following steps: (1) an SCADA three-phase voltage and current amplitude time series data filling method based on Lagrange interpolation; (2) a PMU three-phase voltage and current amplitude time series data noise reductionmethod based on cubic exponential smoothing; and (3) a SCADA filling time series data and PMU noise reduction time series data time synchronization and fusion method based on Euclidean distance time series mode matching.
Owner:HUNAN UNIV +1

Spreadsheet program-based data classification for source target mapping

The subject disclosure relates to third party cloud schema matching to a user's data. The user can select the type of data to be looked up from an external source in order for the user's data to be augmented with data from the external source. The user can be presented with a list of service providers that support the selected type of data. Columns from the user's data are associated with supported lookup information types either automatically or manually. Based on the association, data from the external data source is automatically mapped to the user's data and updates are selectively applied to the user's data.
Owner:MICROSOFT TECH LICENSING LLC

Visual urban big data presentation platform and method

The invention discloses a visual urban big data presentation platform and method. The platform comprises a standard data adapter interface, a visual big data processing engine, a visual big data tool set, an artificial intelligence knowledge base, a multi-screen interactive operating system and video cloud. In the platform provided by the embodiment of the invention, the visual big data processing engine separately invokes a visual processing tool in the visual big data tool set and a big data interpretation mode set in the artificial intelligence knowledge base, visual processing and interpretation mode matching processing are correspondingly carried out on the format standardized big data obtained after the processing of the standard data adapter interface to obtain dynamic easy-readable data stream, distributed presentation processing of the dynamic easy-readable data stream is realized by the multi-screen interactive operating system to improve the readability of the big data of an urban operating sign, and it is conducive to improving the study and judge speed of an urban management decision maker on the big data of the urban operating sign.
Owner:云神科技投资股份有限公司

Geological ontology-based geological report text information extraction method

The invention provides a geological ontology-based geological report text information extraction method, which comprises the steps of S1, performing preprocessing operation on a geological report document, converting a file type into a data source format, and performing sentence segmentation, word segmentation, stop word removal and part-of-speech tagging; S2, constructing a place name dictionarylibrary and a geological entity dictionary library by utilizing the structured information, expanding on the basis of the existing geological domain ontology, and forming a place name ontology and a geological time ontology; and S3, extracting geological entity information, space-time relationship information and attribute information from the geological report text through a mode matching methodand a rule matching method. The method has the advantages that a large number of training data sets do not need to be marked manually, and manpower and material resources are saved; an existing information extraction model is abstracted and improved, and a certain thought is provided for information extraction in other fields.
Owner:CHINA UNIV OF GEOSCIENCES (WUHAN)

Mapping questions to complex database lookups using synthetic events

An approach is provided in which a knowledge manager identifies training entities in a training question answer pair and identifies database paths a database that link the training entities. The knowledge manager then generates a generalized query based upon the identified database paths. In addition, the knowledge manager generates a generalized syntactic pattern based upon syntactic relationships between the training entities in the training question answer pair. When the knowledge manager receives a question and matches the syntactic pattern of the received question to the generalized syntactic pattern, the knowledge manager generates a custom query from the generalized query and queries the database using the custom query.
Owner:IBM CORP

Service providing method based on data classification

InactiveCN112417492AEfficient classification and gradingGood effectDigital data protectionText database queryingData warehouseEngineering
The invention discloses a service providing method based on data classification and grading. The service providing method comprises the steps that 1, data classification and grading standards being formulated, comprising the following contents of service subdivision, data classification, data classification feature description and level judgment; 2, making a data desensitization strategy. 3, matching data classification and grading rules, comprising the following steps of metadata blood relationship analysis marking, data mode matching marking and manual judgment; and 4, developing a data service, comprising the following steps of basic information configuration, access logic development, service testing, data service auditing and data service application. The technical problem to be solved by the invention is to carry out efficient data classification and grading in a large data warehouse environment and provide a safe data interface method on the basis of classification and grading.Through combination of classification and grading and data service, a method support is provided for realizing security data service, and a good effect is achieved in practical application.
Owner:中科大数据研究院

UEFI firmware vulnerability static detection method and device based on taint analysis

PendingCN114462044ASufficient and effective anchoring standardsEffective anchoring criteriaPlatform integrity maintainanceData stream analysisUnified Extensible Firmware Interface
The invention discloses a UEFI (Unified Extensible Firmware Interface) firmware vulnerability static detection method and device based on taint analysis, relates to the field of UEFI firmware static analysis, and aims at solving the problem that effective and automatic static analysis is not carried out on UEFI firmware vulnerability detection at present by utilizing technologies such as UEFI firmware mode matching and customized data flow analysis. By analyzing UEFI security standard specifications, UEFI firmware implementation examples and actual commercial equipment firmware, extracting UEFI firmware, recovering semantic information of the UEFI firmware, combining with code performance characteristics of vulnerabilities, tracking a propagation process of taint data with the vulnerabilities by adopting methods such as data stream tracking and mode matching, large-scale UEFI firmware vulnerability static scanning and detection are realized, and the vulnerability detection efficiency is improved. And identifying, positioning and discovering potential security vulnerability threat risks.
Owner:INST OF INFORMATION ENG CAS

Enterprise subject matching method based on part-of-speech tagging of government affair text data

The invention discloses an enterprise subject matching method based on part-of-speech tagging of government affair text data. The method comprises the steps: extracting enterprise names in the government affair text data, extracting enterprise naming modes according to known enterprise naming rules, matching the enterprise naming modes, According to the matching result of the enterprise naming modes, carrying out part-of-speech tagging of the government affair text data, carrying out part-of-speech tagging of the government affair text data. according to the enterprise main body matching method, when the to-be-matched government affair data is processed, matching of the enterprise entities can be directly achieved from the government affair text data, the matching efficiency is greatly improved, and intelligent intercommunication of the government affair big data is achieved.
Owner:浙江非线数联科技股份有限公司

Method for searching for business process change domain based on merged model Digest

InactiveCN103870968AFix compatibility issuesHuge amount of work to solveCommerceModel extractionEngineering
The invention discloses a method for searching for a business process change domain based on merged model Digest. The method is to solve the problem of determining the change domain of process models in a large-scale complex electronic business system and is also suitable for change domain processing of a large-scale complex service system. According to the method, real cases in electronic business are analyzed firstly, the process models are established through a Petri network, and complex matching relations among the process models are obtained through semantic matching and mode matching. Then the models are standardized, the problems of merging and compatibility judging on the models in the complex matching relations are solved through model standardization, the standardized models are merged according to the matching relations, the merged model Digest is extracted to obtain a public part of model interaction, finally, the change domain in the Digest is found through analyzing the difference between behavior outline relations of the Digest and the standardized models, the change domain in the Digest is respectively mapped into the original models by the utilization of a mapping function, and therefore the change domain of the original models is obtained.
Owner:ANHUI UNIV OF SCI & TECH

Reusable and extensible machine learning method based on plug-in model

The invention discloses a reusable and extensible machine learning method based on a plug-in model, which takes a lightweight feature learning model as a basic unit to construct a reusable and plug-in model to realize target part positioning and attribute analysis; semantic content necessary for a generative semantic description framework is provided for describing component features and component composition modes; time and space probability distribution of sample data is formed and serves as an important basis for target mode matching, so that intelligent analysis, learning and accurate and rapid identification tasks of a target are completed. According to the method, under the condition of limited samples and computing resources, the complexity of the model can be simplified, and the utilization efficiency of data samples is greatly improved.
Owner:北京理工雷科电子信息技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products