Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

30 results about "Controlled vocabulary" patented technology

Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri, taxonomies and other knowledge organization systems. Controlled vocabulary schemes mandate the use of predefined, authorised terms that have been preselected by the designers of the schemes, in contrast to natural language vocabularies, which have no such restriction.

Speech to text system using controlled vocabulary indices

A synthesis of automated speech recognition (voice to text) technology and a knowledge-based analysis of the concepts and contexts of the free text therefrom enable a directed-vocabulary look up index to be used in conjunction with the speech recognition technology thus enabling medical dictation to be transcribed in real time without elaborate training of the dictator or the speech recognition technology. Thus, caregivers can create and review Computer-Based Patient Records in the necessary timeframe consistent with good patient care. The Computer-Based Patient Records can be linked to other applications such as prescription cross checking, lab test results, payer regulations, etc.
Owner:MEDCOM INFORMATION SYST

Speech to text system using controlled vocabulary indices

A synthesis of automated speech recognition (voice to text) technology and a knowledge-based analysis of the concepts and contexts of the free text therefrom enable a directed-vocabulary look up index to be used in conjunction with the speech recognition technology thus enabling medical dictation to be transcribed in real time without elaborate training of the dictator or the speech recognition technology. Thus, caregivers can create and review Computer-Based Patient Records in the necessary timeframe consistent with good patient care. The Computer-Based Patient Records can be linked to other applications such as prescription cross checking, lab test results, payer regulations, etc.
Owner:MEDCOM INFORMATION SYST

Field-based method and system for feeding back text error correction after speech recognition

The invention discloses a field-based method for feeding back text error correction after speech recognition, and belongs to the speech recognition field. Text sentences after speech recognition are analyzed based on errors of speed pauses of Chinese sentence structures. The method is characterized by detecting whether structures before and after a sentence separator meet the sentence pattern rules of Chinese language, finding pause errors, calculating and dividing sentences based on phoneme string similarity and converting into pinyin, converting pinyin into phoneme strings according to a phoneme table, finding sentences corresponding to the phoneme strings similar to the strings in a corpus, establishing a body based on a body controlled word query module through the controlled word table of the field, correcting the errors related to the field in the text after speech recognition through the body, outputting the matching result by a feedback module, and adding the correct identification result selected by a user and the original phoneme strings in the corpus. According to the method and system, the originally correct result of speech recognition may not be affected, and the speech recognition accuracy can be better determined through a body and feedback mechanism.
Owner:CHONGQING UNIV

Method, system and software arrangement for reconstructing formal descriptive models of processes from functional/modal data using suitable ontology

A method, system and software arrangement in accordance with an exemplary embodiment of the present invention are provided to extract descriptive narrative from numerical experimental data augmented with ontological controlled vocabulary. One exemplary application of such system, method and software arrangement is in organizing gene-expression time course data in terms of biological processes that may be activated and deactivated as the biological system responds to its normal or perturbed environment. The present invention may also have biological applications to drug-or-vaccine discovery, understanding behavior of a cell in an altered diseased state (e.g., cancer, neuro-degeneration or auto-immune disease, etc.), genetically modifying a natural wild-type organism, genetic-engineering, etc. Other exemplary applications may include understanding neural behavior, market behavior of a population of users interacting on the Internet, etc.
Owner:NEW YORK UNIV

Using a controlled vocabulary library to generate business data component names

Methods and apparatus, including computer program products, for generating a name for a business data component in an electronic business process use a received textual description of the business data component. One or more proposed names are generated in accordance with a predefined naming format. The proposed names are generated using a matching algorithm to select terms from a library of available terms based on the textual description. Each proposed name includes multiple terms, and each term in the library of available terms defines an object class, a property, a representation class, or a qualifier.
Owner:SAP AG

System and method for ontology and rules based segmentation engine for networked content delivery

There is provided a system and method for an ontological and rules based segmentation engine for networked content delivery. There is provided a segmentation engine for use by a network accessible computing device providing customized content for a user on the network, comprising a user context regarding the user, a content management system for storing content, a controlled vocabulary categorizing content, an ontology using the controlled vocabulary for referencing the content of the content management system, segment definitions grouping users into segments matching content types to controlled vocabulary elements, segment rules using user context to associate with segment definitions, and a segmentation processor. The segmentation processor can receive a content request from the user, and by using the elements of the segmentation engine, determine the segment definitions applicable to the user and provide customized content from the content management system. The segment definitions are readily modifiable without detailed low-level knowledge.
Owner:DISNEY ENTERPRISES INC

Computer-Program Products and Methods for Annotating Ambiguous Terms of Electronic Text Documents

ActiveUS20150135053A1Natural language translationWeb data indexingControl vocabularyDocumentation
Computer-program products and methods for automatically annotating terms, such as ambiguous terms, in an electronic text document are disclosed. In one embodiment, a method of annotating a text document includes determining, by a computing device, a term of interest within the text document. The method further includes searching a data structure including incongruous term pairs (tx, tt) determined from a controlled vocabulary for the term of interest appearing as a term tt, wherein the term tt is a linguistic head of a term tx of the incongruous term pairs (tx, tt). The method further includes annotating the term of interest with a meaning provided by the controlled vocabulary only if a term tx of the incongruous term pairs (tx, tt) associated with the term of interest in the data structure is not present within a predetermined textual distance of the term of interest in the text document.
Owner:ELSEVIER

Method, device and system for protein knowledge mining and discovery in Chinese bibliographic database

The invention discloses a method, a device and a system for protein knowledge mining and discovery in a Chinese bibliographic database and can achieve mining and discovery of protein knowledge in the Chinese life-science bibliographic database. The technical scheme includes that the method comprises constructing the Chinese bibliographic database and a scientific data type database, performing translation and compiling of a protein-relevant text mining tool dictionary on the basis of a standard control vocabulary of the scientific data type database and with protein nouns in the Chinese bibliographic database as mining and discovery objects; converting identification number into hyperlink information according to protocols provided by the scientific data type database, and generating the Chinese bibliographic database facing to themes and application; and removing false-positive protein mining results in data mining and information integration results and modifying Chinese bibliographic text mining results.
Owner:SHANGHAI INST OF BIOLOGICAL SCI CHINESE ACAD OF SCI +1

Method for the determination of supplementary content in an electronic device

The invention relates to a method for replacing subtitle text with occasional word or idiom translations. In the method an initial language skill is determined. A content object is selected for presentation. A content vocabulary associated with the content object is determined. The content vocabulary is reduced based on the initial language skill to produce a target vocabulary. The presentation of said content object is started in the electronic device. The presence of a word in the content object is detected. The translation of the word is displayed on a display of the electronic device, if the word belongs to the target vocabulary.
Owner:CORE WIRELESS LICENSING R L

System and method for ontology and rules based segmentation engine for networked content delivery

There is provided a system and method for an ontological and rules based segmentation engine for networked content delivery. There is provided a segmentation engine for use by a network accessible computing device providing customized content for a user on the network, comprising a user context regarding the user, a content management system for storing content, a controlled vocabulary categorizing content, an ontology using the controlled vocabulary for referencing the content of the content management system, segment definitions grouping users into segments matching content types to controlled vocabulary elements, segment rules using user context to associate with segment definitions, and a segmentation processor. The segmentation processor can receive a content request from the user, and by using the elements of the segmentation engine, determine the segment definitions applicable to the user and provide customized content from the content management system. The segment definitions are readily modifiable without detailed low-level knowledge.
Owner:DISNEY ENTERPRISES INC

Method and system for an online patient community based on "structured dialog"

A system for an online community based on Structured Dialogs offering organizations the opportunity to create online communities and have a dialog with users on their status, condition, and progress without the risk of adverse event reports by users. The system may include an interface that limits user communication to Structured Dialogs comprising controlled vocabulary elements of specific choices, including pop-ups, drop downs, and sliders. The system includes a personal health and wellness management tool coupled with the community to enable the user to manage his or her medication and condition using Structured Dialogs and thus making the interaction more interesting and effective for users. The system may also include an information repository “Info” Layer that can offer articles that are relevant to users. The “We”, “Me” and “Info” Layers can share statistics and Structured Dialogs data.
Owner:OPTIMIZERX

Method for automatically authenticating value of documentary archives

ActiveCN106776695ADetermine the archive categoryDetermine the storage periodSpecial data processing applicationsRetention periodWord list
The invention relates to a method for automatically authenticating the value of documentary archives. The method includes the steps that the keywords of the tile and the full text of a documentary archive are extracted respectively; keyword distribution is conducted according to a controlled word list to obtain a keyword set; the keyword distribution result is calculated and discriminated, the archiving class is determined through keyword weight word frequency calculation, the retention period is determined through automatic annotation classification calculation, and then a first conclusion and a second conclusion which include the archiving class and the retention period respectively are obtained; the archiving class and the retention period are comprehensively recommended. A keyword base with the documentary archive retention value as the theme is built, the keywords in the tile and the document are extracted according to a related file, the archiving class of the documentary archive is determined, a means is provided for automatically authenticating the value of a large batch of documentary archives, the concurrent operation of authenticating the retention value of multiple documentary archives can be achieved, and the efficiency of authenticating the value of the documentary archives is improved.
Owner:SHANGHAI ZHONGXIN INFORMATION DEV

A control method and a control device for incrementally updating vocabulary data

The invention provides a control method for incrementally updating vocabulary data, which is used for incrementally updating an input method vocabulary. The method comprises the following steps: a. Determining whether incremental updating is required based on the current version of a user's core vocabulary and the latest version of a server's core vocabulary; if so, entering into step b; B. Determining incremental information based on the current version of the user's core thesaurus and one or more versions of the server's core thesaurus; C. Updating the incremental information to the user core thesaurus of the current version, by determining whether an incremental update is required, For the current version of the user's core thesaurus that requires incremental updates, determining the incremental information, and then adding the incremental information to the current version of the user core vocabulary, thereby realizing the update of the user core vocabulary, and the operation is simple. The invention provides a control method and a control device for incrementally updating thesaurus data with shorter updating steps and shorter updating period, which has extremely high commercial value.
Owner:SHANGHAI 2345 NETWORK TECH

New keyword extraction technology

The invention provides a new keyword extraction technology. According to the technology, vocabulary position weights and word class weights are determined according to a Chinese word segmentation preprocessing process, the relevancy between two vocabularies is calculated with reference to a core vocabulary with the highest text vocabulary contribution, a multi-subject network model is constructed, an objective function is constructed to extract connection words, a cross function is utilized to fuse the connection words into the multi-subject network model, a new model graph is obtained, and then an anteposition vocabulary, namely a text keyword is extracted. The technology is high in accuracy and has higher application value, the contributions of different vocabularies to text ideology can be precisely calculated, multi-subject performance is considered, different characteristics are distinguished, and a good theoretical basis is provided for subsequent text similarity analysis and text clustering.
Owner:SICHUAN YONGLIAN INFORMATION TECH CO LTD

Method, system and software arrangement for reconstructing formal descriptive models of processes from functional/modal data using suitable ontology

A method, system and software arrangement in accordance with an exemplary embodiment of the present invention are provided to extract descriptive narrative from numerical experimental data augmented with ontological controlled vocabulary. One exemplary application of such system, method and software arrangement is in organizing gene-expression time course data in terms of biological processes that may be activated and deactivated as the biological system responds to its normal or perturbed environment. The present invention may also have biological applications to drug-or-vaccine discovery, understanding behavior of a cell in an altered diseased state (e.g., cancer, neuro-degeneration or auto-immune disease, etc.), genetically modifying a natural wild-type organism, genetic-engineering, etc. Other exemplary applications may include understanding neural behavior, market behavior of a population of users interacting on the Internet, etc.
Owner:NEW YORK UNIVERSITY

Corpus generation method, device, electronic equipment and readable storage medium

The invention discloses a corpus generation method, a corpus generation device, electronic equipment and a readable storage medium. The method comprises the steps: obtaining a first vocabulary in eachfirst vocabulary classification set from a control vocabulary library according to the identification information of each first vocabulary classification set corresponding to a sentence pattern structure, the sentence pattern structure being preset; and combining the acquired first vocabularies according to the first position information of the vocabularies in the first vocabulary classificationset in the sentence pattern structure to generate a first corpus conforming to the sentence pattern structure. According to the invention, the generated first corpus is composed of the first vocabularies belonging to the first vocabulary classification set. The first vocabularies are combined according to the first position of the first vocabulary classification set in the sentence pattern structure, so that the corpus does not need to be manually annotated. The manual annotation cost is saved. The error rate is reduced, and the training accuracy of the control model is improved.
Owner:GREE ELECTRIC APPLIANCES INC +1

Text subject indexing method and device, electronic device and computer storage medium

The embodiment of the invention relates to the technical field of text processing, and discloses a text subject indexing method and device, an electronic device and a computer storage medium, and thetext subject indexing method comprises the steps: determining a text word list of a to-be-indexed text; determining a text representation vector of the to-be-indexed text based on a predetermined wordvector library according to the text word list; then, based on a mapping table between subject words and common words, which is pre-established according to the controlled word table, determining thesubject words of which the association strength with any text word is greater than a first preset threshold value as the subject words of any text word to obtain the subject words corresponding to the text words respectively; determining a target subject word of the to-be-indexed text according to the text representation vector and the subject word corresponding to each text word, and performingsubject indexing on the to-be-indexed text through the target subject word. Therefore, the operand is greatly reduced, the comparison frequency is effectively reduced, and the text topic indexing efficiency is greatly improved.
Owner:INST OF SCI & TECHN INFORMATION OF CHINA

Computer-program products and methods for annotating ambiguous terms of electronic text documents

Computer-program products and methods for automatically annotating terms, such as ambiguous terms, in an electronic text document are disclosed. In one embodiment, a method of annotating a text document includes determining, by a computing device, a term of interest within the text document. The method further includes searching a data structure including incongruous term pairs (tx, tt) determined from a controlled vocabulary for the term of interest appearing as a term tt, wherein the term tt is a linguistic head of a term tx of the incongruous term pairs (tx, tt). The method further includes annotating the term of interest with a meaning provided by the controlled vocabulary only if a term tx of the incongruous term pairs (tx, tt) associated with the term of interest in the data structure is not present within a predetermined textual distance of the term of interest in the text document.
Owner:ELSEVIER

Method, system and software arrangement for reconstructing formal descriptive models of processes from functional/modal data using suitable ontology

A method, system and software arrangement in accordance with an exemplary embodiment of the present invention are provided to extract descriptive narrative from numerical experimental data augmented with ontological controlled vocabulary. One exemplary application of such system, method and software arrangement is in organizing gene-expression time course data in terms of biological processes that may be activated and deactivated as the biological system responds to its normal or perturbed environment. The present invention may also have biological applications to drug-or-vaccine discovery, understanding behavior of a cell in an altered diseased state (e.g., cancer, neuro-degeneration or auto-immune disease, etc.), genetically modifying a natural wild-type organism, genetic-engineering, etc. Other exemplary applications may include understanding neural behavior, market behavior of a population of users interacting on the Internet, etc.
Owner:NEW YORK UNIV

Core vocabulary special topic construction method and system based on big data analysis

The invention belongs to the technical field of computer software, and discloses a core vocabulary special topic construction method and system based on big data analysis. A user assigns initial keywords or keyword set of a special topic; related documents of the special topic are acquired; a candidate core vocabulary set and a relationship thereof can be automatically discovered from a related document set of the special topic to form a special topic candidate core vocabulary map; core vocabularies in a candidate special topic are manually intervened to form final special topic output. The core vocabulary special topic construction method and system based on the big data analysis has the advantages that the special-topic-level core vocabulary set can be rapidly formed, the time of an expert building the special topic can be greatly shortened, the coverage and timeliness of special topic construction are improved, and the rapid construction of resources and the promotion of the systemare facilitated.
Owner:GLOBAL TONE COMM TECH

Detection model compression method based on semantic segmentation

The invention discloses a detection model compression method based on semantic segmentation, and relates to the fields of artificial intelligence and computer vision. The method comprises the following steps: (1) pruning: 1) inputting a convolution kernel weight; 2) pruning is carried out on the trained network model to obtain a parameter space of sparse weights; (2) semantic segmentation: 1) performing semantic segmentation on the parameter space to obtain a hyper-parameter block and a central vocabulary, and calculating the central position of the hyper-parameter block; and 2) updating the original parameter space by using the central vocabulary. 3) judging whether the change of the current central vocabulary and the previous central vocabulary is smaller than a specified threshold value, if so, continuing to search parameters similar to the central vocabulary, and updating the central vocabulary and returning to the step 2); and ending the updating of the central vocabulary if the current vocabulary is smaller than the threshold. And (3) model storage: storing the hyper-parameter block boundary position, the parameter block center position and the center vocabulary value obtained by training. According to the method, hyper-parameters are used for describing the whole parameter space, overall compression of the parameter space is achieved, and the overall compression ratio ofthe model is increased to the maximum extent.
Owner:北京同方软件有限公司

The Method of Realizing the Automatic Appraisal of the Value of Documents and Archives

ActiveCN106776695BDetermine the archive categoryDetermine the storage periodText database queryingSpecial data processing applicationsRetention periodWord list
The invention relates to a method for automatically authenticating the value of documentary archives. The method includes the steps that the keywords of the tile and the full text of a documentary archive are extracted respectively; keyword distribution is conducted according to a controlled word list to obtain a keyword set; the keyword distribution result is calculated and discriminated, the archiving class is determined through keyword weight word frequency calculation, the retention period is determined through automatic annotation classification calculation, and then a first conclusion and a second conclusion which include the archiving class and the retention period respectively are obtained; the archiving class and the retention period are comprehensively recommended. A keyword base with the documentary archive retention value as the theme is built, the keywords in the tile and the document are extracted according to a related file, the archiving class of the documentary archive is determined, a means is provided for automatically authenticating the value of a large batch of documentary archives, the concurrent operation of authenticating the retention value of multiple documentary archives can be achieved, and the efficiency of authenticating the value of the documentary archives is improved.
Owner:SHANGHAI ZHONGXIN INFORMATION DEV

Method, device and system for protein knowledge mining and discovery in Chinese bibliographic database

The invention discloses a method, a device and a system for protein knowledge mining and discovery in a Chinese bibliographic database and can achieve mining and discovery of protein knowledge in the Chinese life-science bibliographic database. The technical scheme includes that the method comprises constructing the Chinese bibliographic database and a scientific data type database, performing translation and compiling of a protein-relevant text mining tool dictionary on the basis of a standard control vocabulary of the scientific data type database and with protein nouns in the Chinese bibliographic database as mining and discovery objects; converting identification number into hyperlink information according to protocols provided by the scientific data type database, and generating the Chinese bibliographic database facing to themes and application; and removing false-positive protein mining results in data mining and information integration results and modifying Chinese bibliographic text mining results.
Owner:SHANGHAI INST OF BIOLOGICAL SCI CHINESE ACAD OF SCI +1

A field-based text error correction method and system after speech recognition with feedback

The invention discloses a field-based method for feeding back text error correction after speech recognition, and belongs to the speech recognition field. Text sentences after speech recognition are analyzed based on errors of speed pauses of Chinese sentence structures. The method is characterized by detecting whether structures before and after a sentence separator meet the sentence pattern rules of Chinese language, finding pause errors, calculating and dividing sentences based on phoneme string similarity and converting into pinyin, converting pinyin into phoneme strings according to a phoneme table, finding sentences corresponding to the phoneme strings similar to the strings in a corpus, establishing a body based on a body controlled word query module through the controlled word table of the field, correcting the errors related to the field in the text after speech recognition through the body, outputting the matching result by a feedback module, and adding the correct identification result selected by a user and the original phoneme strings in the corpus. According to the method and system, the originally correct result of speech recognition may not be affected, and the speech recognition accuracy can be better determined through a body and feedback mechanism.
Owner:CHONGQING UNIV

Structured task naming

The concept of a specialized task identifier is disclosed to indicate the content of a file within a computer-implemented system for providing help content to a user. In one embodiment, the specialized task identifier includes at least one element selected from a controlled vocabulary. In another embodiment, the specialized task identifier is arranged in accordance with a predetermined structure of organizational elements. In yet another embodiment, the specialized task identifier is utilized as a basis to at least semi-automatically categorize within a taxonomic organization scheme.
Owner:MICROSOFT TECH LICENSING LLC

Dictionary-based sememe knowledge base construction method and device

The invention provides a dictionary-based sememe knowledge base construction method and device, and the method comprises the steps: constructing a sememe set according to a controlled word list of a target language dictionary; obtaining a paraphrase word set corresponding to the paraphrase of each semantic item according to the semantic item of each word in the target language dictionary; and according to the sememe set, performing sememe extraction on the paraphrase word set, and according to a sememe extraction result, constructing a sememe knowledge base corresponding to the target language dictionary. Through the dictionary of the target language and the controlled word list corresponding to the dictionary, the primitive knowledge base can be efficiently, economically and automatically constructed for the target language, the problem that time and labor are wasted when the primitive knowledge base is manually constructed is solved, and good practicability is achieved.
Owner:TSINGHUA UNIV

Hash processing based vocabulary management method and device

The invention discloses a hash processing based vocabulary management method and device. The method comprises the steps of maintaining a first hash vocabulary list and a second hash vocabulary list through a server, wherein the first hash vocabulary list is used for recording the correspondence relationship between the hash value and the non-conflict vocabulary, and the second hash vocabulary list is used for recording the correspondence relationship between the hash value and a plurality of conflicted vocabularies; performing hash processing for the vocabulary to be processed through the server to obtain the corresponding hash value; searching the first hash vocabulary list and the second vocabulary list according to the hash value; if the hash value is recorded in the first hash vocabulary list or the second hash vocabulary list and all vocabularies to be processed are corded in the hash value, determining that the vocabulary to be processed is the first type of vocabulary through the server. With the adoption of the method, the wrong conclusion caused by that a plurality of vocabularies cannot be recorded in the hash list can be avoided, and thus the vocabulary inspection accuracy can be improved.
Owner:阿里巴巴(中国)网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products