Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

100 results about "Document representation" patented technology

Publishing layout wizard

InactiveUS6931591B1Expensive to maintainExpensive to updateCathode-ray tube indicatorsNatural language data processingGraphicsWeb browser
The present invention facilitates the specification and distribution of templated content materials by a content provider over an information exchange network such as the Internet. The present invention incorporates a system for managing inventories of graphical elements and their relationships to pre-defined page templates. A database capable of keeping track of users and their corresponding access privileges within the system is employed to monitor user activity. Ultimately, through the use of a software component delivered over the Internet for use within standard web browsers, end-users are able to populate templates under the constraints imposed by the rules of the manufacturers at the time of template design. These population elements which “fill in the blanks” of the pre-defined templates may be either of type IMAGE or TEXT. Image regions are populated by choosing from a subset of the entire image inventory, while TEXT types can be completely free form, with specific rules guiding justification, point size, font, and leading, or “fill in the blank” form with the same constraint rules as free form. Once the end user has met all of the criteria for a fully populated template, the system provides sophisticated means for downloading a high resolution file (such as a print-ready file or other file representation of the composed publication) which encapsulates all resources needed (layout, images, fonts, and constraint geometries) to fulfill the requirements of the publication. The downloaded file may be printed or published by electronic transfer, e.g., to a publisher for printing of the actual publication.
Owner:SAEPIO TECH

Using a metadata image of a file system and archive instance to restore data objects in the file system

Provided are a computer program product, system, and method for using a metadata image of a file system and archive instance to restore files in the file system. A metadata image of the file system for a point-in-time backup as of a point-in-time includes information on files and directories in the file system as of the point-in-time and an archive instance including a copy of database records in the backup database for the files in the point-in-time backup. A restore request is received. A file representation is created of each file to restore in the directory structure of the file system from the metadata image, wherein at least one of the created file representations indicates that the file is stored off-line and has an external identifier used to access information on the file in the database records in the archive instance for the point-in-time backup.
Owner:IBM CORP

Code, system and method for representing a natural-language text in a form suitable for text manipulation

A computer method, system and code, for representing a natural-language document in a vector form suitable for text manipulation operations are disclosed. The method involves determining (a) for each of a plurality of terms selected from one of (i) non-generic words in the document, (ii) proximately arranged word groups in the document, and (iii) a combination of (i) and (ii), a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term.
Owner:WORD DATA

System to catalog and search point-in-time instances of a file system

A system to catalog and search point-in-tine instances of a file system is disclosed. A catalog engine takes backups of file data generated by a storage system and catalogs the backups of file data into a searchable catalog of independent metadata records. The metadata is represented by baseline structure and delta files.
Owner:VERITAS TECH

A title generation method based on a variational neural network topic model

The invention discloses a title generation method based on a variational neural network subject model, belonging to the technical field of natural language processing. This method automatically learnsthe document topic hidden distribution vector by variational self-encoder, and combines the document topic hidden distribution vector and the document representation vector learned by multi-layer neural network with attention mechanism, so as to express the comprehensive and deep semantics of the document on the topic and global level, and to construct a high-quality title generation model. Thismethod uses the multi-layer encoder to learn the more comprehensive information of the document, and improves the effect of summarizing the main idea of the full text of the title generation model; the topic implicit distribution vector of VAE learning is utilized, and the document content is represented in the abstract level of topic. The topic implicit distribution vector and the document information learned by the multi-layer encoder are combined with the deep semantic representation and context information to construct a high quality title generation model by using the attention mechanism.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Crf-based span prediction for fine machine learning comprehension

A method for determining, from a document, an answer to a query using a query answering system, comprising: (i) encoding, using an encoder, one or more documents; (ii) encoding a received query; (iii) generating, using an attention mechanism, a query-aware document representation comprising alignment between one or more words in one of the plurality of documents and one or more words in the query; (iv) generating, using a hierarchical self-attention mechanism, a word-to-sentence alignment of the query-aware document representation; (v) labeling, using a conditional random field classifier, each of a plurality of words in the word-to-sentence alignment with one of a one of a plurality of different sequence identifiers, resulting in possible labeled answering spans; and (vi) generating, from the one or more possible labeled answering spans, a response to the query.
Owner:KONINKLJIJKE PHILIPS NV

A text representation method and device based on a hierarchical neural network

The invention discloses a text representation method and device based on a hierarchical neural network. The method comprises: converting each word forming a sentence into a vector; Inputting vectors corresponding to all words in the sentence into a neural network for aggregation, and outputting sentence representation corresponding to the sentence; Inputting all the sentence representations into aneural network to be aggregated, and generating document representations corresponding to all the sentence representations; And converting the document representation into a document classification vector through a full connection network, and obtaining prediction probability distribution of document classification based on the document classification vector. According to the method and a device,A hierarchical mechanism is introduced into a neural network model to solve a document representation problem for text classification; Interoperability of different tasks is better improved, a hierarchical neural system structure is fused into a neural network method, a new neural network model based on layering is caused, accuracy, performance and the like are obviously superior to those of an existing neural network model, and consumption is lower.
Owner:NAT UNIV OF DEFENSE TECH

Cross-domain emotion classification system based on attention mechanism fusion

The invention relates to a cross-domain emotion classification system based on attention mechanism fusion. The system comprises a comment text preprocessing module used for obtaining vector forms of texts in a source domain and a target domain; a text semantic learning module which is used for learning a semantic dependency relationship between words; an attention mechanism fusion module which isused for fusing different attention modes to obtain comprehensive weights of words for text classification; a hierarchical attention module which is used for calculating attention weights of the textfrom a word level and a sentence level respectively and judging weights of words for sentence representation and sentences for document representation; and an emotion category output module which is used for obtaining a final emotion classification result by utilizing the classification function. According to the method, the potential universal features of the target domain and the source domain can be automatically extracted, the features are abstracted and combined, and finally the emotion category of the text of the target domain is recognized.
Owner:FUZHOU UNIV

Judicial document paragraph classification method and device, computer equipment and storage medium

The invention relates to a judicial document paragraph classification method and device, computer equipment and a storage medium. The method comprises the steps of obtaining judicial documents; performing character segmentation on the judicial document to obtain a character matrix; carrying out vector extraction according to the character matrix to obtain sentence representation vectors; splicingthe sentence representation vectors to obtain a document representation vector; inputting the document representation vectors into a classification model for classification to obtain paragraph categories; feeding back the paragraph category to the terminal for the terminal to perform information extraction, wherein the classification model is obtained by training a model composed of a bidirectional recurrent neural network and a conditional random field by taking a document representation vector with a category label as sample data. According to the method, the sentence representation vectorsare classified through the classification model composed of the trained bidirectional recurrent neural network and the conditional random field to obtain the paragraph categories, judicial document paragraphs are automatically classified, the generalization ability is achieved, and the extraction accuracy and recall rate are high.
Owner:深圳市华云中盛科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products