Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

1317 results about "Semantic similarity" patented technology

Semantic similarity is a metric defined over a set of documents or terms, where the idea of distance between them is based on the likeness of their meaning or semantic content as opposed to similarity which can be estimated regarding their syntactical representation (e.g. their string format). These are mathematical tools used to estimate the strength of the semantic relationship between units of language, concepts or instances, through a numerical description obtained according to the comparison of information supporting their meaning or describing their nature. The term semantic similarity is often confused with semantic relatedness. Semantic relatedness includes any relation between two terms, while semantic similarity only includes "is a" relations. For example, "car" is similar to "bus", but is also related to "road" and "driving".

Method For Information Retrieval

A method of retrieving documents using a search engine includes providing a reverse index including one or more keywords and a list of documents containing the one or more keywords, the reverse index further including a measure of confidence (MOC) value associated with the one or more keywords. One or more query terms are input into the search engine. The query terms are disambiguated and a MOC value is associated with each meaning of the disambiguated query term. A list of documents is retrieved containing the query terms wherein the documents are initially ranked based at least in part on the MOC values of the keywords and query terms. The list of documents may be re-ranked based at least in part on the semantic similarity of each document to the disambiguated query terms.
Owner:RGT UNIV OF CALIFORNIA

Text similarity, acceptation similarity calculating method and system and application system

The invention discloses a calculating method of text similarity degree and vocabulary meaning similarity degree and system and application system, which comprises the following steps: basing on vocabulary data bank; proceeding initialize; calculating; getting initial vocabulary meaning similarity degree among vocabulary in the vocabulary data bank; basing on the initial vocabulary meaning similarity degree; calculating initial semantic similarity degree among text; iterating semantic similarity degree among each text and vocabulary meaning similarity degree among vocabulary till constriction; constructuring final vocabulary meaning similar matrix with final vocabulary similarity degree; transforming the text vocabulary frequency vector of the initial text to the new text vocabulary text vocabulary frequency vector; calculating text similarity degree in the text collection. This invention can improve related property of current text especially about short text.
Owner:蒙圣光 +1

System and method for similarity search of images

A system and method for an efficient semantic similarity search of images with a classification structure are provided. The system and method provide for building a semantic classification-search tree for the plurality of images, the classification tree including at least two categories of images, each category of images representing a subset of the plurality of images, receiving a query image, classifying the query image to select one of the at least two categories of images, and restricting the search for the image of interest using the query image to the selected one of the at least two categories of images.
Owner:THOMSON LICENSING SA

Methods and systems for creating and using an adaptive thesaurus

Methods and systems are provided for creating an adaptive thesaurus. A term pair including an index term and an expansion term is received. A recall gain, an expansion independence, and a semantic similarity of the term pair are calculated using a processor. Whether to store the term pair is determined based on the recall gain, the expansion independence, and the semantic similarity. The term pair is stored based on the determination. Methods and systems are provided for searching using an adaptive thesaurus. A search query including a query term is received. An expansion term stored in association with an index term matching the query term in the adaptive thesaurus is retrieved. Using a processor, the search query is expanded using the expansion term based on a recall gain, an expansion independence, and a semantic similarity.
Owner:RELX INC

Method for detecting code similarity based on semantic analysis of program source code

The invention discloses a method for detecting code similarity based on semantic analysis of a program source code, which relates to computer program analyzing technology and a method for detecting complex codes of computer software. The method solves the prior problems of low similarity detection accuracy and high computing complexity on the codes of different syntactic representations and similar semantemes, and also solves the problem of incapability of realizing large-scale program code similarity detection. The method comprises the following steps: resolving two segments of source codes to be detected into two control dependence trees of a system dependence graph respectively and executing basic code standardization respectively; utilizing a measure method to extract candidate similar code control dependence trees of the control dependence trees which are subjected to the basic code standardization; executing an advanced code standardization operation on extracted candidate similar codes; and computing semantic similarity to obtain a similarity result so as to finish the code similarity detection. The method is applied to source code piracy detection, software component library query, software defect detection, program comprehension and the like.
Owner:HARBIN INST OF TECH

Automatic question-answer processing method and automatic question-answer system

The invention discloses an automatic question-answer processing method and an automatic question-answer system. The method includes: acquiring question text from question-answer data pairs collected in advance, performing word separation on the question text to obtain the corresponding key words of the question text, and building the index relation between the key words and the question text; whenoptional target question text is received, and performing word separation on the target question text to acquire target key words corresponding to the target text question text; according to the index relation of the key words and the question text, determining key words matched with the target key words, and acquiring the question text having index relation with the key words to serve as the candidate question text; calculating the semantic similarity of the candidate question text and the target question text; determining an answer corresponding to the target question text according to thesemantic similarity. The method has the advantages that the semantic similarity of the target question text and each question text is considered to determine the answer of the target question text, and the accuracy of automatic question-answer processing is increased.
Owner:TENCENT TECH (SHENZHEN) CO LTD

Intelligent response method, electronic device and storage medium

The invention provides an intelligent response method, which comprises the following steps that: after a consultation question is preprocessed, constructing an inverted index for a question and answerknowledge base; through an inverted index query way, inquiring a candidate question set related to the consultation question from the question and answer knowledge base; aiming at each candidate question in the candidate question set; independently calculating a question similarity between the consultation question and the candidate question, wherein the question similarity is obtained through the linear weighting of a text similarity, a semantic similarity, a theme similarity and a syntax similarity between the consultation question and the corresponding candidate question; and finally, selecting a candidate question corresponding to the highest question similarity obtained by calculation, and inquiring the associated answer of the selected candidate question in the question and answer knowledge base as a target answer to be output. The invention also provides an electronic device and a storage medium. By use of the intelligent response method, the accuracy and the response efficiency of intelligent response can be improved, and service quality is improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Chinese network review emotion classification method based on integrated study frame

The invention discloses a Chinese network review emotion classification method based on an integrated study frame. According to the method, a part-of-speech combination mode, an order-preserving sub-matrix mode and a frequent word sequence mode are adopted as input characteristics, in the level of characteristics, factors of the influence of Chinese word order information, interval phrase characteristics and the sentence length are considered, and the characteristic vector sparsity problem is solved through semantic similarities; the problem that many review text characteristics exist is solved, the inter-base-classifier independence is guaranteed, and the classification performance of base classifiers is improved as much as possible; a base classifier algorithm constructed based on product attributes is adopted to comprehensively review emotion information of each attribute in a text, and then the sentence-level emotional tendency of reviews is judged, so that a final classification result is more accurate. The Chinese network review emotion classification method based on the integrated study frame is applicable to e-commerce network review emotion classification in various fields, can make a potential consumer know evaluation information of a commodity before purchase and can also make a merchant better sufficiently know the consumer's opinion, and therefore the service quality is improved.
Owner:NANJING SILICON INTELLIGENCE TECH CO LTD

Text similarity measuring system based on multi-feature fusion

The invention provides a text similarity measuring system based on multi-feature fusion and relates to the field of intelligent information processing. According to the system, the text similarity is measured by fusing multiple features based on word frequencies, word vectors and Wikipedia labels. The invention aims to solve the problem of semantic loss caused by non-considering of contexts in a conventional text similarity measuring system and the problem of low similarity result accuracy caused by larger text length difference. The text similarity measuring system is implemented by the following steps: carrying out preprocessing such as word segmentation and stop word removal on a training text; training corpora of the processed training text as a word vector model; measuring the similarity based on the word frequencies, the similarity based on the word vectors and the similarity based on the Wikipedia labels between input text pairs to be computed, and carrying out weighted summation to obtain a final text semantic similarity measuring result. According to the system, the measurement accuracy of the text similarities can be improved, so that the requirement on intelligent information processing is met.
Owner:XINJIANG TECHN INST OF PHYSICS & CHEM CHINESE ACAD OF SCI

Statistics-based machine translation method and apparatus, and electronic device

The present invention discloses a statistics-based machine translation method and apparatus and an electronic device, a semantic similarity-degree calculation method and apparatus and an electronic device, and a word quantization method and apparatus and an electronic device. The statistics-based machine translation method comprises: according to a feature that affects a translation probability and that is of each candidate translation and a pre-generated translation probability prediction model generating a translation probability of a sentence to be translated into each candidate translation, wherein the feature that affects the translation probability at least comprises a semantic similarity-degree between the sentence to be translated and the candidate translation; and selecting a preset number of candidate translations whose translation probabilities rank top as a translation of the sentence to be translated. By adoption of the statistics-based machine translation method provided by the present application, the semantic level of the natural language can be reached deeply when the machine translation model is constructed, and the deviation of semantics between the translation and the source text is avoided, so as to achieve the effect of improving translation quality.
Owner:阿里巴巴(中国)网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products