Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

647 results about "Text retrieval" patented technology

Text retrieval is a branch of information retrieval where the information is stored primarily in the form of text.

Triggering applications based on a captured text in a mixed media environment

A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types (e.g., printed paper as a first medium and digital content and / or web link as a second medium). In one particular embodiment, the MMR system includes an action processor and method, and MMR documents with an associated action. The MMR document structure is particularly advantageous because the ability to specify different actions for different MMR documents, combined with the ability to create any number of MMR documents for a particular location on any media, allows the MMR architecture to serve as a universal trigger or initiator for additional processing. In other words, addition processing or actions can be triggered or initiated based on MMR recognition. The action processor receives the output of the MMR recognition process which yields an MMR document including at least one action. The action processor executes that action which includes various commands to the MMR system or other systems coupled to the MMR system. The MMR system architecture is advantageous because an action can be executed by pointing the capture device at a block of text, and the action is performed. Example actions include retrieving the text in electronic form to the capture device, retrieving the specification for the action, inserting data to a MMR document, transferring data between documents, purchasing items, authoring actions or reviewing historical information about actions. The MMR system includes a variety of user applications (one or more actions) initiated by the MMR recognition of a text patch such as information retrieval for a travel guide book, stock listings or advertisements; information capture such as recording content from a conference, recording and storing multimedia associated with the document, capturing information for a calendar and on the fly authoring; purchasing media files for storage on any part of an MMR document.
Owner:RICOH KK

Video content quick retrieving method based on object tag

The invention provides a video content quick retrieving method based on an object tag. The method comprises the following steps: extracting and analyzing the color feature, contour feature, scene feature and character feature of a moving object in each image frame of a video; processing a plurality of pictures of known types by using the feature extraction method, and training a contour classifier and a scene classifier by using the contour features and scene features of the pictures; processing a video to be retrieved by using the feature extraction and analysis method and the classifiers soas to generate type tags of objects in each image frame of the video, wherein the type tags are used for constructing an object tag database; and retrieving a response server to search the object tagdatabase to find videos related to a query request submitted by a user, and generating an ordered result for the user to browse and refer. The method provided by the invention can be used for retrieving the video content at a speed similar to that of the conventional text retrieval only by searching the object tag database and achieving the fine granularity retrieval of the video content, so thatthe method is more accurate than the conventional method.
Owner:SOUTH CHINA UNIV OF TECH

Full text retrieval system based on natural language

InactiveCN101246492AIntelligent Information ServiceConvenient information serviceNatural language data processingSpecial data processing applicationsNatural language understandingConcept search
The invention discloses a full text retrieval system based on natural language understanding, comprising: a database server, an information receiving judging module, a natural language processing module, a retrieving module, an indexing module, an index database and a result set processing module. The system of the invention provides two resolution strategies, that is, word classification static with semantic analysis associated with automatic segmentation and expanding inquired word static according to Hownet rule for low intelligence situation of current search engine. The deployed system converts information retrieval from current key word-based layer to knowledge (or concept)-based layer; the invention is capable of using techniques such as word classification, synonym, concept search, phrase identification, etc. with understanding and processing ability to knowledge. The search engine is provided with intelligence and humanization of information service. The user is allowed using natural language for information retrieval. The invention is capable of adding user selection behavior in interactive operation mode, so as to provide more convenient, more precise search service.
Owner:HUAZHONG UNIV OF SCI & TECH

Terminology translation for unaligned comparable corpora using category based translation probabilities

The invention relates to a method and apparatus for generating translations of natural language terms from a first language to a second language. A plurality of terms are extracted from unaligned comparable corpora of the first and second languages. Comparable corpora are sets of documents in different languages that come from the same domain and have similar genre and content. Unaligned documents are not translations of one another and are not linked in any other way. By accessing monolingual thesauri of the first and second languages, a category is assigned to each extracted term. Then, category-to-category translation probabilities are estimated, and using said category-to-category translation probabilities, term-to-term translation probabilities are estimated. The invention preferably exploits class-based normalization of probability estimates, bi-directionality, and relative frequency normalization. The most important applications are cross-language text retrieval, semi-automatic bilingual thesaurus enhancement, and machine-aided human translation.
Owner:XEROX CORP

Modified local sensitive hash vehicle retrieval method based on multitask deep learning

The invention discloses a modified local sensitive hash vehicle retrieval method based on multitask deep learning. A multitask end-to-end convolution neural network is used to identify a vehicle model, a vehicle system, a vehicle logo, a color and a license plate simultaneously in a subsection parallel mode. A network module for extracting vehicle image example features based on a characteristic pyramid and an algorithm by using a modified local sensitive hash sorting algorithm to sort the vehicle characteristics in a database, and a cross-modal text retrieval method when a retrieval vehicle image can not be acquired are included. The multitask end-to-end convolution neural network and the modified local sensitive hash vehicle retrieval method are provided, the automation level and the intelligence level of vehicle retrieval can be improved effectively, little storage space is used, and image retrieval requirements in a big data era are met by using a quicker retrieval speed.
Owner:ZHEJIANG UNIV OF TECH

Method for extracting and processing network information and its system

The invention relates to a network information extracting and processing method, adopting artificial intelligence and natural language processing technique, able to automatically download daily up-to-date news and information from named websites, making content extraction, classification, automatic abstracting and retrenching full text, then storing the full text, and then indexing the full text for making high-efficiency full text retrieval in future.
Owner:陈文中

System, method, and computer program product for anticipatory hypothesis-driven text retrieval and argumentation tools for strategic decision support

InactiveUS20070018953A1Effective and accurate decision makingImprove accuracyDigital data information retrievalCathode-ray tube indicatorsDomain modelStrategic decision support
Provided are systems, methods, and computer programs for facilitating strategic decision support that include providing a domain model, receiving a hypothesis or query, using the domain model and hypothesis or query with a related prediction, and searching for evidentiary results related to a prediction obtained from the hypothesis or from the query and domain model. A method may search and extract evidentiary results based on the hypothesis, query, or prediction. Evidentiary results may be associated with domain concepts and ranked according to relevancy to the associated domain concepts. And a user may select certain evidentiary results as being relevant, and these relevant evidentiary results may be used to create a report.
Owner:THE BOEING CO

Fast data and big data combined data processing method and system

InactiveCN103268336AReduce the problem of slow reading and writingLow costSpecial data processing applicationsStatistical analysisEngineering
The invention discloses a fast data and big data combined data processing method which includes steps: (1) data input of different data sources is received and is classified and transmitted according to fast data and big data; (2) fast data enter a real-time trading module which performs real-time calculation and inquiring on fast data by aid of a distributed memory; (3) a full-text retrieval module performs full-text retrieval according to the fast data result; (4) big data enter a volume historical data analysis module, are stored and are subjected to complete inquiring and statistic analysis; and (5) an application module receives data processed in the step (2), the step (3) and the step (4), and terminal display is carried out as required. The invention further provides a fast data and big data combined data processing system. The fast data and big data combined data processing method and system are low in cost and convenient to maintain, resources are distributed according to needs, and the performance is linearly expanded.
Owner:刘峰

Information retrieval apparatus, information retrieval method and computer product

An information retrieval apparatus includes contents, an index data generating unit, a character frequency management data generating unit, a compressing / encrypting unit, a retrieval initializing unit, a full text retrieving unit, and a retrieval result displaying unit. The character frequency management data generating unit generates character frequency management data based on the contents. The compressing / encrypting unit compresses the contents and encrypts the character frequency management data. The retrieval initializing unit decrypts encrypted character frequency management data. The full text retrieving unit executes full text retrieval for compressed contents using the character frequency management data and index data when receiving a retrieval keyword. The retrieval result displaying unit decompresses a retrieval candidate selected from retrieval candidates and displays as a retrieval result.
Owner:FUJITSU LTD

Text retrieval method and device

The embodiment of the invention provides a text retrieval method and device. The text retrieval method includes the steps that an original text input by a user is acquired; retrieval words are acquired from the original text; according to the retrieval requirement of the user, the retrieval words are filtered to acquire keywords; the keywords are combined, texts in a text database are retrieved according to the combined keywords, and at least one retrieval text is acquired; the retrieval texts are displayed in a relevancy inverted order mode, and the keywords are highlighted in the retrieval texts, wherein relevancy is used for representing the relevancy degree of the original text and the retrieval texts. Due to the fact that the keywords are acquired by filtering the retrieval words according to the retrieval requirement of the user, the probability that the keywords are invalid words is reduced, and the retrieval requirement is better met compared with the manner that the retrieval words are directly acquired from the original text, the retrieval texts acquired through retrieval by the application of the combined keywords can well meet the retrieval requirement, and therefore retrieval accuracy is improved.
Owner:STATE GRID CORP OF CHINA +3

Full text retrieval inquiry index method for extensible markup language document in relational database

The invention provides a full text retrieval inquiry index method for an extensible markup language document in a relational database. The method comprises the following four steps of: storing XML document data in the way of a mark sequence-based dimensional relation table; constructing a document structure basic information table; creating a word-based inverted index on a node text column of the document structure basic information table; and carrying out full text retrieval inquiry on the basis of the index. By the index method, the management efficiency of the extensible markup language document and the execution efficiency of the full text retrieval operation of the extensible markup language document can be effectively improved, and the inquiry execution time is shortened. The method has relatively high commonality and can be seamlessly fused with existing relation database in the way that the XML document data and the index data are stored in a using relation mode. At the same time, the method can be applied to inquiry of keyword research of the XML document data and then the execution efficiency of inquiry is improved.
Owner:NORTHEASTERN UNIV

Full-text retrieval system based on semantic analysis of relevant words

InactiveCN103838833AImprove recallAvoid the impact of loss of contributionSpecial data processing applicationsDocument preparationResult set
The invention belongs to the information retrieval technology and provides a full-text retrieval system based on semantic analysis of relevant words. The full-text retrieval system based on semantic analysis of the relevant words comprises an inquiry information receiving module, a concept semantic analysis module based on the relevant words, a semantic knowledge base module, a retrieval module, an index database, an index module, a theme semantic analysis module based on the relevant words, a result set processing module and a data server. The full-text retrieval system based on semantic analysis of the relevant words is based on the improvement on a traditional Internet search engine, and by the adoption of the system, concept semantic analysis based on the relevant words and theme semantic analysis based on the relevant words of a document can be achieved, and users can obtain search results which are more accurate, more comprehensive and more intelligent.
Owner:HUAZHONG NORMAL UNIV

Image retrieval method and image retrieval system

The invention discloses an image retrieval method and an image retrieval system. The method includes for given query texts and / or query images, acquiring multiple similarity ordered lists of in-base images according to text relevance and image content relevance, and then returning a comprehensive ordered list by combining the acquired ordered lists and comprehensively considering the text similarity and the image content similarity. Through the multi-mode mixed retrieval mechanism, shortcomings of conventional single-mode retrieval mechanisms are overcome, respective advantages of a text retrieval method and an image content retrieval method are developed, and accuracy of image retrieval is greatly improved. Since only ordering results of single retrieval models are fused, the single retrieval models can be increased, decreased and replaced conveniently, text and image content feature retrieval models are configured flexibly, and performance of the image retrieval system is improved.
Owner:GUANGDONG TUTUSOU NETWORK TECH

Spark SQL-based distributed full text retrieval system and method

The invention relates to a Spark SQL-based distributed full text retrieval system and method. The system comprises an SQL translation layer, a data source management layer, a parallel calculation layer and a distributed storage layer; an SQL-based full text retrieval method and translation processes, among modules of the SQL translation layer, of full text retrieval SQL statements are proposed; a full text retrieval process parallelization method is designed in a data source management module; and in a retrieval optimization module, two index storage models and corresponding primitive table data reduction strategies during query are designed, wherein a partition align connection algorithm which is used for reducing primitive table data during query and has a complexity of O (n) is designed for an index appointed column-based storage model. Under the two storage models, the index construction time is shortened to 0.6% / 0.5% of the traditional database, the query time is shortened to the 1% / 10% of the traditional database, and the index storage amount is decreased to 55.0% of the traditional database. According to the method, the Spark SQL data analysis function is strengthened, and the requirements for traditional business migration and full text retrieval carried out on mass data in the existing businesses can be satisfied.
Owner:INST OF SOFTWARE - CHINESE ACAD OF SCI

Information processing apparatus, full text retrieval method, and computer-readable encoding medium recorded with a computer program thereof

An information processing apparatus for creating a retrieval result displaying a list of retrieval documents is disclosed. Retrieval documents corresponding to a retrieval condition are classified into groups based on scores indicating degrees of relevance to the retrieval condition. A clustering process is conducted with respect to the retrieval documents in a group, for each of groups to which the retrieval documents belong.
Owner:RICOH KK

Cryptogram-based safe full-text indexing and retrieval system

The invention discloses a cryptogram-based safe full-text indexing and retrieval system. In the system, a cryptogram index library comprises a cryptogram entry reverse index and an internal document object set; a cryptogram document library is responsible for storing and managing an encrypted XML document; a word segmentation encryption server carries out Chinese word segmentation on a plaintext document and encrypts the plaintext document item by item; a cryptogram full-text indexing server standardizes an original plaintext document into an XML document, encrypts and stores the XML document in the cryptogram document library, creates a corresponding internal document object in the cryptogram index library by combining document metamessage, and creates a cryptogram reverse index for the XML document through the cryptogram entry; and a cryptogram full-text retrieval server retrieves the cryptogram index library to obtain the internal document object set through user authority information and the cryptogram entry, obtains a corresponding encrypted XML document result set from the cryptogram document library according to a pointer, decrypts the corresponding encrypted XML document result set, and returns the decrypted corresponding encrypted XML document result set to a user. The Chinese word segmentation method, the safe and high-efficiency indexing structure and the retrieval mechanism of the invention based on the special requirements of cryptogram full-text indexing can realize the cryptogram full-text indexing integrated with an access control strategy. The cryptogram-based safe full-text indexing and retrieval system has the advantages of a safe and high-efficiency indexing process, no decrypted docuterms in the indexing process, a high recall ratio and a high precision ratio in a cryptogram environment, and the like.
Owner:HUAZHONG UNIV OF SCI & TECH

Transmedia search method based on multi-mode information convergence analysis

The invention relates to a mediate-span search method based on multi-mode information fusion analysis, wherein the invention can fuse and analyze the multi-mode information to understand the multimedia semantic, to realize multimedia document search, image search, sound search and text search based on content; user can via provided search sample at any mode searches the media object or multimedia document at any mode; for example, for searching image, user can provide image as search sample to search, or provide sound or text or their combination as the search sample to search. Since the invention not only uses keyword, also fuses and analyzes all multimedia objects in the multimedia document, to synthesize the information carried by variable mode mediate to understand the mantic to obtain better search effect. Since the search sample and feedback result are in different modes, it has strong function and wide application.
Owner:ZHEJIANG UNIV

Searching Using Patterns of Usage

In various embodiments, the present invention relates disparate objects based on user behavior, thus enabling search engines to provide more comprehensive and accurate results. According to various embodiments of the present invention, multiple kinds of interactions by users with multiple classes of objects can be analyzed. The result is that disparate classes of objects can be related. Derived relations between text and objects can be used to implement search-like functionality or to extend a conventional text retrieval system.In one embodiment, the present invention is used to improve search results and / or recommendations by employing a filtered co-occurrence matrix that provides a representation as to which queries tend to co-occur with the originally submitted query. By supplementing or replacing the original query with co-occurring queries, the system of the present invention is able to generate results that are more likely to be of interest.
Owner:CORP ONE

OCR picture and text recognition and retrieval method and system through web mode

The invention discloses a method for retrieving and recognizing OCR picture and text in a web manner. The method comprises the following steps: acquiring text information and picture information from a picture and text to be recognized; storing the text information and the picture information into an OCR database; and retrieving the full text in the OCR database. The invention further discloses a system for retrieving and recognizing OCR picture and text, which comprises a picture and text information acquisition unit, an OCR database and a retrieval unit. By utilizing the OCR picture and text recognizing technique, the invention ensures that the recognition is more efficient, editable text formatting can be exported, and the required information resource can be retrieved conveniently and effectively by utilizing the full text retrieval technique and inputting characters embedded in the picture information.
Owner:JIANGYIN MINGLUN TECH

Image-text retrieval system and method based on multi-angle self-attention mechanism

The invention belongs to the technical field of cross-modal retrieval, and particularly relates to an image-text retrieval system and method based on a multi-angle self-attention mechanism. The systemcomprises a deep convolutional network, a bidirectional recurrent neural network, an image, a text self-attention network, a multi-modal space mapping network and a multi-stage training module. The deep convolutional network is used for acquiring an embedding vector of an image region feature in an image embedding space. The bidirectional recurrent neural network is used for acquiring an embedding vector of a word feature in a text space, and the two vectors are respectively input to the image and the text self-attention network. The image and text self-attention network is used for acquiringan embedded representation of an image key area and an embedded representation of key words in sentences. The multi-modal space mapping network is used for acquiring the embedded representation of the image text in the multi-modal space. The multi-stage training module is used for learning parameters in the network. A good result is obtained on a common data set Flickr30k and an MSCOCO, and the performance is greatly improved.
Owner:FUDAN UNIV

Text index online updating method in cloud environment

The invention relates to a text index online updating method in cloud environment, belonging to the technical field of computer information retrieval. After a user adds, deletes or updates a file in a text retrieval system, an index module creates incremental data of index sheets belonging to the file and merges multiple groups of incremental data of the same index sheet. A cluster main node selects first batch of nodes and second batch of nodes by sequencing child nodes according to load size and executes index updating by batch. After each batch of node receives an updating command, the retrieval service is firstly stopped, the read incremental data is merged to own index sheet, and the retrieval service is recovered. The cluster main node decides the time for starting using the retrieval service of first batch of nodes and updating the second batch of nodes according to the index service switching conditions set by the user. Finally, the cluster main node recovers the retrieval service of all nodes to complete updating. The method reduces the requirements of index updating on network bandwidth and computer resources and shortens index updating time.
Owner:TSINGHUA UNIV

Titan-based enterprise information analysis platform and construction method thereof

The invention discloses a Titan-based enterprise information analysis platform and a construction method thereof. The Titan-based enterprise information analysis platform comprises a web crawler, a Hadoop distributed system infrastructure, a Titan server, an Elasticsearch server, a Cassandra database and an application layer, wherein the Hadoop distributed system infrastructure is used for storing collected structured or non-structured original data; the Titan server stores an enterprise relationship atlas, and utilizes the Cassandra database as a data storage medium and the Elasticsearch server as a storage medium for full-text retrievals in the enterprise relationship atlas; the application layer is used for displaying enterprise data and relationships on a front-end page by establishing a foreground application frame. The Titan-based enterprise information analysis platform and the construction method thereof provided by the invention have the advantages that through the data visualization technology, interest relationships among enterprises, such as investment relationships, can be rapidly sorted out; long-time and real-time full-automatic collection, storage and analysis can be achieved.
Owner:山东合天智汇信息技术有限公司

Method and software for digitalizing full text of standard document

The invention discloses a method and software for digitalizing the full text of a standard document, belongs to the technical field of standard documents and information, solves the problems of the full text retrieval and detailed retrieval of the standard document and realizes standard information text mining. Set out from the application prospect of the standard document, processes including visualizing, characterizing and structuring are performed; the digitalization processing method is performed by a scanned image processing module, an OCR identifying and correcting module, a standard title recording module, a structured full text making module and the like; and a standard full text XML format recording and defining file and a standard full text XML file are defined. According to the standard full text XML format recording and defining file and the standard full text XML file, the method and the software define schema file development software, realize data processing of a standard title, a single-layer PDF file, a double-layer PDF file, the full text XML file, a table, an image and the like, and realize image and table retrieval and data deriving in determined ranges, such as a standard preface, a foreword, a range, referenced files, terms and the like.
Owner:广东省标准化研究院

Keyword cipher text retrieval method for cloud storage

The invention provides a keyword cipher text retrieval method for cloud storage. The method comprises a data owner, a cloud service sever and a data user. The data owner stores encrypted files, headlines, keywords and verify certifications of the data user and the like to the cloud service server. The cloud service server stores and builds up corresponding relationships of the encrypted files, headlines and keywords, and stores the verify certifications of the data user and the like. When the data user visits cipher texts inside the cloud service server, the data user needs the verify certification awarded by the data owner and provides the cloud service server with information like a keyword trap door and the like, the cloud service server sends the cipher texts the user needs to the data user according to the information like the keyword trap door and the like, and the data user utilizes a secret key to decipher the cipher texts. Compared with the prior art, the keyword cipher text retrieval method for the cloud storage is high in safety, and capable of lowering burdens of communication, storage, and calculation, and improving accuracy and efficiency of retrieval of cipher texts.
Owner:深圳爱来福云健康科技有限公司

New-generation industry knowledge full-text search method

A new-generation industry knowledge full-text search method includes: 1, segmentation dictionary setup, namely setting up a segmentation dictionary, and storing dictionary information into a database; 2, full index setup, namely reading, dividing words and analyzing existing full-text documents called 'knowledge point documents' to set up an index file; 3, increment index setup, namely processing newly added documents and updating an index file on a hard disk; 4, memory index setup and memory segmentation dictionary setup, namely reading segmentation dictionary data in a memory to set up a memory segmentation dictionary data structure; 5, full-text search, user question standardization, word dividing, semantic comprehension, semantic extension, candidate document acquisition and candidate document ordering, wherein segmentation dictionary setup is conducted during system initialization; full index setup includes reading all knowledge documents to fully set up a hard disk index file called 'index file' for short; increment index setup is conducted when full-text files are newly added; the three events are independent from a full-text retrieval module and in independent operation.
Owner:中科国力(镇江)智能技术有限公司

An ES-based electronic medical record retrieval method

The invention discloses an ES-based electronic medical record retrieval method, which relates to the technical field of medical data retrieval. This method introduces semantic analysis model into electronic medical record analysis, including the extraction of subject words and the calculation of semantic similarity, taking advantage of their advantages in text semantic mining, This paper providesthe algorithm support for the latent semantic mining of text information in electronic medical record retrieval by establishing the general medical semantic database (negative word, synonym, ambiguousword), It realizes the high accuracy and recall rate of information retrieval, better adapts to the medical terminology compared with the common natural language is often more complex and constantlychanging, and medical abbreviations, synonyms and polysemous words more characteristics. It meets the scientific research needs of multi-dimensional combined retrieval and the needs of full-text retrieval of related literatures based on latent semantic search. Realize intelligent full-text retrieval with semantic extension and semantic connotation extension in real sense.
Owner:弘扬软件股份有限公司

Method and system for managing die manufacturing information

The invention relates to a method and a system for managing die manufacturing information, which are applied to managing various categories of information data of die manufacturing, wherein a graded server is arranged in the system to form a management terminal and is connected with one or more client terminals through a router; the server and the client terminals also comprise human-computer interfaces connected with the server and the client terminals and are provided with a same storage cell for storing data information of various information categories; the storage cell comprises a plurality of data cells for storing data; a classifying label die can use the periodical maximum life value, initialize and store the numerical value of the maximum life value in a database, and modify and store the numerical value of the life cycle according to a diminishing law by recording the calling times of the die; the client terminals communicate with the server by connecting the router, add, modify, delete, search and perform full-text retrieval on the information data of the die manufacturing; and the human-computer interfaces display and input related die data information.
Owner:龙光电子集团有限公司

Retrieval method and device for policy text, storage medium and electronic device

The invention provides a policy text retrieval method and device, a storage medium and an electronic device. According to the method, policy texts are classified; the theme type of the policy text isused as a retrieval label; when the user performs retrieval, corresponding tags are matched in the policy text based on a complete matching algorithm and a non-complete matching algorithm through thetags selected by the user, and an evaluation value of each label is counted; the retrieval results are arranged and displayed on the basis of the sum of the evaluation values of each policy text for all the tags, so that the problem that the relevancy between the retrieval mode for the policy text and the search keywords is low in the prior art is solved, and the technical effect of improving therelevancy between the retrieval results and the search words is achieved.
Owner:PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products