Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

4821 results about "Data extraction" patented technology

Data extraction is the act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration). The import into the intermediate extracting system is thus usually followed by data transformation and possibly the addition of metadata prior to export to another stage in the data workflow.

Information Infrastructure Management Tools with Extractor, Secure Storage, Content Analysis and Classification and Method Therefor

The present invention is a method of organizing and processing data in a distributed computing system. The invention is also implemented as a computer program on a computer medium and as a distributed computer system. Software modules can be configured as hardware. The method and system organizes select content which is important to an enterprise operating said distributed computing system. The select content is represented by one or more predetermined words, characters, images, data elements or data objects. The computing system has a plurality of select content data stores for respective ones of a plurality of enterprise designated categorical filters which include content-based filters, contextual filters and taxonomic classification filters, all operatively coupled over a communications network. A data input is processed through at least one activated categorical filter to obtain select content, and contextually associated select content and taxonomically associated select content as aggregated select content. The aggregated select content is stored in the corresponding select content data store. A data process from the group of data processes including a copy process, a data extract process, a data archive process, a data distribution process and a data destruction process is associated with the activated categorical filter and the method and system applies the associated data process to a further data input based upon a result of that further data being processed by the activated categorical filter utilizing the aggregated select content data.
Owner:DIGITAL DOORS

Document management system with enhanced intelligent document recognition capabilities

InactiveUS20050289182A1Enhances document management qualityImprove efficiencyCharacter and pattern recognitionOffice automationXMLData extraction
An intelligent document recognition-based document management system includes modules for image capture, image enhancement, image identification, optical character recognition, data extraction and quality assurance. The system captures data from electronic documents as diverse as facsimile images, scanned images and images from document management systems. It processes these images and presents the data in, for example, a standard XML format. The document management system processes both structured document images (ones which have a standard format) and unstructured document images (ones which do not have a standard format). The system can extract images directly from a facsimile machine, a scanner or a document management system for processing.
Owner:SAND HILL SYST

Method and system for extracting and classifying geolocation information utilizing electronic social media

Methods, systems and processor-readable media for extracting and classifying location information utilizing social media messages and / or data thereof. The social media messages can be sampled from a social media database and the messages filtered based on a heuristic rule. A geolocation entity from the unstructured social media messages can be extracted utilizing a geolocation entity extracting module. The messages with the geoentities can be uploaded onto a crowd sourcing platform to manually annotate the messages with a label. A text classification model can be built and learned from the label utilizing a machine learning algorithm and the messages can be classified by a location classifier in order to extract the user location. The user location can then be transformed into a geocode so that a spatial search can be enabled and the distance between the locations can be easily calculated.
Owner:XEROX CORP

Data extraction for feed generation

A system (and a method) automatically generates a feed from structured or unstructured data. The system identifies a resource having two or more data elements. The resource is matched with a pre-defined template. The pre-defined template is structured for a feed and includes a plurality of fields. The system extracts data elements from the two or more data elements of the resources. Each extracted data element corresponds to a field or the plurality of fields in the pre-defined template. Each extracted data element is then merged into the corresponding field or the plurality of fields in the pre-defined template to generate the feed.
Owner:SIMPLEFEED

Systems for mobile image capture and processing of documents

The present invention relates to automated document processing and more particularly, to methods and systems for document image capture and processing using mobile devices. In accordance with various embodiments, methods and systems for document image capture on a mobile communication device are provided such that the image is optimized and enhanced for data extraction from the document as depicted. These methods and systems may comprise capturing an image of a document using a mobile communication device; transmitting the image to a server; and processing the image to create a bi-tonal image of the document for data extraction. Additionally, these methods and systems may comprise capturing a first image of a document using the mobile communication device; automatically detecting the document within the image; geometrically correcting the image; binarizing the image; correcting the orientation of the image; correcting the size of the image; and outputting the resulting image of the document.
Owner:MITEK SYST

Method and apparatus for seismic data acquisition

A marine seismic exploration method and system comprised of continuous recording, self-contained ocean bottom pods characterized by low profile casings. An external bumper is provided to promote ocean bottom coupling and prevent fishing net entrapment. Pods are tethered together with flexible, non-rigid, non-conducting cable used to control pod deployment. Pods are deployed and retrieved from a boat deck configured to have a storage system and a handling system to attach pods to cable on-the-fly. The storage system is a juke box configuration of slots wherein individual pods are randomly stored in the slots to permit data extraction, charging, testing and synchronizing without opening the pods. A pod may include an inertial navigation system to determine ocean floor location and a rubidium clock for timing. The system includes mathematical gimballing. The cable may include shear couplings designed to automatically shear apart if a certain level of cable tension is reached.
Owner:MAGSEIS FF LLC

Methods for mobile image capture and processing of checks

The present invention relates to automated document processing and more particularly, to methods and systems for document image capture and processing using mobile devices. In accordance with various embodiments, methods and systems for document image capture on a mobile communication device are provided such that the image is optimized and enhanced for data extraction from the document as depicted. These methods and systems may comprise capturing an image of a document using a mobile communication device; transmitting the image to a server; and processing the image to create a bi-tonal image of the document for data extraction. Additionally, these methods and systems may comprise capturing a first image of a document using the mobile communication device; automatically detecting the document within the image; geometrically correcting the image; binarizing the image; correcting the orientation of the image; correcting the size of the image; and outputting the resulting image of the document.
Owner:MITEK SYST

Method and system for preventing data leakage from a computer facilty

In embodiments of the present invention improved capabilities are described for the steps of identifying, through a monitoring module of a security software component, a data extraction behavior of a software application attempting to extract data from an endpoint computing facility; and in response to a finding that the data extraction behavior is related to extracting sensitive information and that the behavior is a suspicious behavior, causing the endpoint to perform a remedial action. The security software component may be a computer security software program, a sensitive information compliance software program, and the like.
Owner:SOPHOS

Method and system for secure cashless gaming

A secure cashless gaming system comprises a plurality of gaming devices which may or may not be connected to a central host network. Each gaming device includes an intelligent data device reader which is uniquely associated with a security module interposed between the intelligent data device reader and the gaming device processor. A portable data device bearing credits is used to allow players to play the various gaming devices. When a portable data device is presented to the gaming device, it is authenticated before a gaming session is allowed to begin. The intelligent data device reader in each gaming device monitors gaming transactions and stores the results for later readout in a secure format by a portable data extraction unit, or else for transfer to a central host network. Gaming transaction data may be aggregated by the portable data extraction unit from a number of different gaming devices, and may be transferred to a central accounting and processing system for tracking the number of remaining gaming credits for each portable data unit and / or player. Individual player habits can be monitored and tracked using the aggregated data. The intelligent data device reader may be programmed to automatically transfer gaming credits from a portable data device the gaming device, and continually refresh the credits each time they drop below a certain minimum level, thus alleviating the need for the player to manually enter an amount of gaming credits to transfer to the gaming device.
Owner:SMART CARD INTEGRATORS

Systems for mobile image capture and processing of checks

The present invention relates to automated document processing and more particularly, to methods and systems for document image capture and processing using mobile devices. In accordance with various embodiments, methods and systems for document image capture on a mobile communication device are provided such that the image is optimized and enhanced for data extraction from the document as depicted. These methods and systems may comprise capturing an image of a document using a mobile communication device; submitting the image to a server; and processing the image to create a bi-tonal image of the document for data extraction. Additionally, these methods and systems may comprise capturing a first image of a document using the mobile communication device; automatically detecting the document within the image; geometrically correcting the image; binarizing the image; correcting the orientation of the image; correcting the size of the image; and outputting the resulting image of the document.
Owner:MITEK SYST

Transcription data extraction

A computer program product, for performing data determination from medical record transcriptions, resides on a computer-readable medium and includes computer-readable instructions for causing a computer to obtain a medical transcription of a dictation, the dictation being from medical personnel and concerning a patient, analyze the transcription for an indicating phrase associated with a type of data desired to be determined from the transcription, the type of desired data being relevant to medical records, determine whether data indicated by text disposed proximately to the indicating phrase is of the desired type, and store an indication of the data if the data is of the desired type.
Owner:DELIVERHEALTH SOLUTIONS LLC +1

Systems and methods for automatically reducing data search space and improving data extraction accuracy using known constraints in a layout of extracted data elements

InactiveUS20110258195A1Reducing data search spaceImproving data extraction accuracyDigital data processing detailsSpecial data processing applicationsElectronic documentData ingestion
A method of automatically narrowing data search space and improving accuracy of data extraction using known constraints in a layout of extracted data elements for classified documented is provided. The method includes: analyzing each document to classify it within a document category, each category having a corresponding set of expected layouts; analyzing each electronic document to automatically extract images and text features; automatically constructing a data structure including a layout of the extracted features and layout relationships amongst the extracted features, wherein each of the extracted features in the layout maintains a reference to neighboring features and wherein closely related features are merged to form a combined feature; automatically narrowing data search space by detecting and removing parts of the layout that are not associated with any data elements using the data structure; and automatically detecting data using the extracted feature layout and the layout relationships amongst the extracted features.
Owner:GRUNTWORX

Scalable data extraction techniques for transforming electronic documents into queriable archives

A method for extracting an attribute occurrence from template generated semi-structured document comprising multi-attribute data records comprises identifying a first set of attribute occurrences in the template generated semi-structured document using an ontology. The method further comprises determining a boundary of each multi-attribute data record in the template generated semi-structured document, learning a pattern for an attribute corresponding to an identified attribute occurrence of the first set in the template generated semi-structured document, and applying the pattern within the boundary of each multi-attribute data record in the template generated semi-structured document to extract a second set of attribute occurrences.
Owner:THE RES FOUND OF STATE UNIV OF NEW YORK

Massive multi-source heterogeneous data ETL method and system supporting interface adaptation

The invention discloses a massive multi-source heterogeneous data ETL method and system supporting interface adaptation. The method comprises a data extraction step of setting basic information of data sources and a target database, adaptively matching corresponding ETL tools for different data sources and performing parameter setting on the ETL tools, a data conversion step of finishing ETL operation control execution and scheduling management, performing buffer storage and management on extracted data and finishing processing of data cleaning and conversion and the like, a data loading stepof carrying out quality inspection on converted data objects and updating and loading the data inspected to be correct into the target database according to table structure output defined by a data model, and a data monitoring step of performing monitoring management on an ETL operation execution process, an operation resource usage condition and a system operation condition. The proper ETL tool is adaptively matched; the extraction and conversion of massive data are achieved; and efficient execution and orderly management of ETL operation are realized.
Owner:DAREWAY SOFTWARE

System and method for data extraction and management in multi-relational ontology creation

InactiveUS20060053174A1Easy to controlEfficient and precise derivation and loading of relevant informationMetadata text retrievalComputer security arrangementsData ingestionKnowledge Field
The invention relates to a system and method for data extraction and management in multi-relational ontology creation. The system of the invention includes selecting a corpus of documents containing information relevant to a targeted knowledge domain, extracting assertions and their constituent concepts and relationships from the corpus, and storing the assertions, wherein the extraction processes may rules and utilize natural language processing.
Owner:BIOWISDOM

Systems and methods for trend extraction and analysis of dynamic data

The invention is directed generally to providing methods and systems for trend extraction and analysis. Embodiments include methods and systems for trend extraction and analysis of information extracted from dynamically changing data included in computer systems and / or networks. Various exemplary embodiments are provided that may generate characteristic indicators for trend(s) and / or distribution(s) for one or more data sources by use of, for example, temporal indicators derived through analysis of the difference in contribution separate portions of the data to the whole data set being considered, contribution of individual sources, and / or the interaction of the separate portions of the data with one another. Some exemplary approaches may include the use of singular value decomposition (SVD) and higher-order singular value decomposition (HOSVD) data extraction and analysis techniques. One use of these techniques is in the analysis of the dynamic data contained in Weblogs and the blogosphere.
Owner:NEC LAB AMERICA

Patient Information Documentation And Management System

A method and system for documenting and managing patient information in real time for accurate medical examination and treatment of a patient is provided. An information carrier device that stores patient information is provided to the patient. A data extraction unit extracts the stored patient information from the information carrier device and transmits the extracted patient information to a patient information processing system. A recording unit records observational data comprising medical examination information and information on treatment prescribed for the patient. The patient information processing system processes and analyzes the patient information received from the information carrier device and / or the data extraction unit and the observational data extracted from the recording unit, generates a medical examination report in one or more of multiple formats, and transmits the medical examination report in one or more formats to the information carrier device for updating the patient information in the information carrier device.
Owner:LI CREATIVE TECH

Regional information retrieving method and regional information retrieval apparatus

A Web regional information retrieval apparatus 10 retrieves a Web page which is information in the Worldwide Web (WWW), and includes a collecting unit 11, a regional meta-data extraction unit 12, and a region information retrieval unit 14. The collecting unit 11 collects a Web page from the WWW. The regional meta-data extraction unit 12 extracts regional meta-data indicating the region relating to the collected Web page, and assigns the data to the Web page. The region information retrieval unit 14 receives location information indicating the location of the terminal 20 from the terminal 20, retrieves a Web page assigned the regional meta-data related to the location information from the WWW, and transmits the retrieval result to the terminal 20.
Owner:FUJITSU LTD

Web Browser Device for Structured Data Extraction and Sharing via a Social Network

A method and system for implementing a browser based information extraction and transmission method. A method and system for identifying, extracting, and transmitting predefined structured information from web pages browser interface. The extracted information is then added to a user profile on a social network and a database. The information is shared with other users who can comment, copy, vote on, or go to the original information source. The information can be combined with other extracted information to form collections for the purposes of voting on one or more items in the collection, combining multiple items to form a useful kit, saving information for later use, adding addition information such as dates and purchase location for personal inventory purposes, and for saving bookmarks to structured data.
Owner:DATA RECORD SCI INC

Data transfer

Circuitry for transferring multiple digital data streams, e.g. digital audio data, over a single communications link such as a single wire. A pulse-length-modulator is responsive to a plurality of data streams to generate a series of data pulses with a single data pulse having a rising and falling edge in each of a plurality of transfer periods defined by a first clock signal. The timing of the rising and falling edge of each data pulse is dependent on a combination of the then current data samples from the plurality of data streams. The duration and position of the data pulse in the transfer window in effect defines a data symbol encoding the data. An interface receives the stream of data pulses, and data extraction circuitry samples the data pulse to determine which of the possible data symbols the pulse represents and determines a data value for at least one received data stream.
Owner:CIRRUS LOGIC INC

Method for abstracting network data and web reptile system

A web crawler system used for picking up webpage data is prepared as providing data pick-up task to the second component and receiving execution result of data pick-up task from the second component by the first component, communicating with webpage server to obtain webpage data and operating DOM model to pick up data as well as describing picked up data then sending picked up data and its description to the first component by the second one.
Owner:李沫南

Machine learning of document templates for data extraction

InactiveUS7561734B1Speed up template developmentAssures template qualityNatural language data processingSpecial data processing applicationsData ingestionGraphics
The present system can perform machine learning of prototypical descriptions of data elements for extraction from machine-readable documents. Document templates are created from sets of training documents that can be used to extract data from form documents, such as: fill-in forms used for taxes; flex-form documents having many variants, such as bills of lading or insurance notifications; and some context-form documents having a description or graphic indicator in proximity to a data element. In response to training documents, the system performs an inductive reasoning process to generalize a document template so that the location of data elements can be predicted for the training examples. The automatically generated document template can then be used to extract data elements from a wide variety of form documents.
Owner:LEIDOS

Method and system of data automatic conversion and storage

The invention discloses a system of data automatic conversion and storage. The system comprises a data extractor, a data converter, a data register and a data storage unit. The data extractor is used for extracting original data from different data sources and transmitting the original data to the data converter; the data converter is used for converting the original data from different data sources to specific data formats; the data register is connected with the data converter and is used for organizing the obtained data results from the data converter to uniform data structures; and the data storage unit is used for storing the data from the data register to a target database. According to a method and the system of the data automatic conversion and storage, different system information configuration is opened, conducting of system customization according to field needs is supported, the purpose of heterogeneous data integration of existing different business systems of enterprises is achieved, and data are integrated effectively.
Owner:INST OF AUTOMATION CHINESE ACAD OF SCI +1

Map update data supply device and map update data supply program

A map update data supply device includes: a request update data extraction unit that based on an update request extracts a request update section, and a latest version of an overwrite update data file for overwrite updating; and a safeguard update data extraction unit that extracts a safeguard update section that safeguards a network between adjacent sections, and up to an update safeguard version of a difference update data file, wherein the extracted data files are supplied to a navigation device.
Owner:AISIN AW CO LTD +2

Remote data collection systems and methods using read only data extraction and dynamic data handling

Remote data collection systems and methods retrieve data including financial, sales, marketing, operational and the like data from a plurality of databases and database types remotely over a network in an automated, platform-agnostic manner. A remote data collection system includes a network interface, a connection to a data source, a processor communicatively coupled to the network interface and the connection, and memory storing instructions for remote data collection that, when executed, cause the processor to: receive a request to extract data from the data source; extract the data in a non-intrusive manner from the data source using a two phase process comprising a reconciliation phase and a collection phase; and transmit one of an entire set and a subset of the extracted data based on the request.
Owner:ZEEWISE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products