Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

1109 results about "Cosine similarity" patented technology

Cosine similarity is a measure of similarity between two non-zero vectors of an inner product space that measures the cosine of the angle between them. The cosine of 0° is 1, and it is less than 1 for any angle in the interval (0,π] radians. It is thus a judgment of orientation and not magnitude: two vectors with the same orientation have a cosine similarity of 1, two vectors oriented at 90° relative to each other have a similarity of 0, and two vectors diametrically opposed have a similarity of -1, independent of their magnitude.

Text joins for data cleansing and integration in a relational database management system

An organization's data records are often noisy: because of transcription errors, incomplete information, and lack of standard formats for textual data. A fundamental task during data cleansing and integration is matching strings—perhaps across multiple relations—that refer to the same entity (e.g., organization name or address). Furthermore, it is desirable to perform this matching within an RDBMS, which is where the data is likely to reside. In this paper, We adapt the widely used and established cosine similarity metric from the information retrieval field to the relational database context in order to identify potential string matches across relations. We then use this similarity metric to characterize this key aspect of data cleansing and integration as a join between relations on textual attributes, where the similarity of matches exceeds a specified threshold. Computing an exact answer to the text join can be expensive. For query processing efficiency, we propose an approximate, sampling-based approach to the join problem that can be easily and efficiently executed in a standard, unmodified RDBMS. Therefore the present invention includes a system for string matching across multiple relations in a relational database management system comprising generating a set of strings from a set of characters, decomposing each string into a subset of tokens, establishing at least two relations within the strings, establishing a similarity threshold for the relations, sampling the at least two relations, correlating the relations for the similarity threshold and returning all of the tokens which meet the criteria of the similarity threshold.
Owner:AMERICAN TELEPHONE & TELEGRAPH CO +1

Short text similarity calculating method

The invention discloses a short text similarity calculating method. The calculating method comprises steps of acquiring corpus data, pre-processing the corpus data to achieve training corpus, achieving a key word extracting model based on the training corpus, segmenting the training corpus via a segmenting tool, and acquiring a word vector set via training of word2vec, acquiring a text input by a user and questions correctly answered by candidate answers to achieve a segmenting result and a key word extracting result, calculating the questions correctly answered by candidate answers and the word vector set of the text input by the user based on the segmenting result and the key word extracting result, acquiring a sentence vector based on the word vector set, calculating similarity of two sentence vectors, and conducting similarity correction for information contained in the user input text and the questions correctly answered by the candidate answers to achieve corrected similarity. Cosine similarity of sentence vectors of the user input text and the question sentences correctly answered by candidate answers is calculated and similarity is corrected via sentence patterns, named entity and pronouns.
Owner:EMOTIBOT TECH LTD

Method and apparatus for natural language call routing using confidence scores

Methods and apparatus are provided for classifying a spoken utterance into at least one of a plurality of categories. A spoken utterance is translated into text and a confidence score is provided for one or more terms in the translation. The spoken utterance is classified into at least one category, based upon (i) a closeness measure between terms in the translation of the spoken utterance and terms in the at least one category and (ii) the confidence score. The closeness measure may be, for example, a measure of a cosine similarity between a query vector representation of said spoken utterance and each of said plurality of categories. A score is optionally generated for each of the plurality of categories and the score is used to classify the spoken utterance into at least one category. The confidence score for a multi-word term can be computed, for example, as a geometric mean of the confidence score for each individual word in the multi-word term.
Owner:AVAYA INC

Cross-age face verify method based on characteristic learning

The invention discloses a cross-age face verify method based on characteristic learning; the method comprises the following steps: 1, obtaining to-be compared two face images; 2, using a face characteristic point positioning method to carry out align operation for the two face images; 3, respectively carrying out feature extraction for each image, wherein the extraction method includes the following steps: a, automatically extracting high-level meaning characteristics through a depth convolution nerve network; b, calculating LBP histogram characteristics of the image; c, fusing the characteristics obtained in the a and b steps, and expressing characteristic vectors; 4, using a cosine similarity method to calculate a distance between the characteristic vectors obtained by the step3, and determining whether the two images are from a same person or not. The method firstly uses the depth network to cross-age face verify, and creatively combines the handwork design LBP histogram characteristics with depth network autonomous learning characteristics, thus realizing complementation between high-rise meaning characteristic and lower characteristics, and providing better accuracy.
Owner:SUZHOU UNIV

Text information similarity matching method and device, computer device and storage medium

The invention provides a text information similarity matching method and device on the basis of TF-IDF. The method includes the steps of acquiring text information; performing word segmentation on thetext information to obtain word segments w1, w2, ..., wn-1 and wn; using a CBOW model for calculating word vectors V(w1), V(w2), ..., V(wn-1) and V(wn) of the word segments respectively; using the TF-IDF algorithm for calculating TF-IDF values k1, k2, ..., kn-1 and kn of the word segments respectively; according to products of the word vectors of the word segments and the corresponding TF-IDF values, obtaining a sentence vector V; calculating the cosine similarity between the sentence vector V and sentence vectors of pre-stored statements, and determining the pre-stored statement with the maximum cosine similarity. Through the process above, the pre-stored statement which is the most similar to the text information can be found, the accuracy of problem recognition can be improved in the aspects of robot dialogue, information classification and the like, and therefore the dialogue efficiency or the classification efficiency is improved. A computer device and a storage medium are also provided.
Owner:PING AN TECH (SHENZHEN) CO LTD

A face recognition convolutional neural network training method based on a novel loss function

The invention discloses a face recognition convolutional neural network training method based on a novel loss function. Put SoftMax loss, cosine similarity loss, Center Loss are combined used as the objective function in the training of a convolutional neural network for face recognition. The loss of cosine similarity is added to overcome the difference caused by different measurement methods of face feature alignment in training and testing while the class spacing is enlarged and the intra-class distance is reduced. The method includes such steps as 1, dividing face recognition data into training sample set and a test sample set; 2, carrying out data preprocessing on the face images in the training sample set; 3, constructing a new convolutional neural network structure for face recognition based on a new loss function; 4, inputting that training sample set into a face recognition convolution neural network for training; 5, saving that parameters of the face recognition model; 6, testing the face recognition model by using the test sample set after data preprocessing.
Owner:CHINA JILIANG UNIV

Document similarity calculating method and similar document whole-network retrieval tracking method

InactiveCN106095737AAccurate processing of similarityAccurate analysis and statisticsSemantic analysisText database indexingCosine similarityTwo-vector
The invention relates to a document similarity calculating method and a similar document whole-network retrieval tracking method. The technical scheme is characterized in that the document similarity calculating method includes: S01, performing word segmentation on an original document and a target document to acquire respective word segmentation sets; S02, performing preprocessing and feature weighting: utilizing TF-IDF technology to calculate weight of each word segmentation, extracting core key words, utilizing Word2vec to dig correlation degree among different word segmentation in the documents, and performing semantic analysis on each document; S03, adopting a vector space model and a cosine similarity algorithm: utilizing a cosine value of an included angle of two vectors in vector space to evaluate similarity of the documents, wherein the cosine value is between 0 and 1, and the greater the cosine value is, the higher the similarity of the documents is. The document similarity calculating method and the similar document whole-network retrieval tracking method are suitable for news information redistribution tracking and transmissibility statistics.
Owner:HANGZHOU FANEWS TECH

Automatically answering method, device, storage medium and electronic equipment

The invention provides an automatically answering method, device, storage medium and electronic equipment. The automatically answering method comprises the steps: features of a to-be-answered questionare extracted; candidate question-answer pairs are screened from a question-answer corpus through similarity calculation which is performed on textual features of the to-be-answered question and textual features of questions of question-answer pairs in the question-answer corpus; feature vectors of the to-be-answered question are calculated based on a neural network model which completes a question-answer training in advance, the answers of the candidate question-answer pairs are calculated based on the neural network model, and the feature vectors of the answers of the candidate question-answer pairs are acquired; an answer is screened from the candidate question-answer pairs through performing cosine similarity calculation on the feature vectors of the to-be-answered question and the feature vectors of the answers of the candidate question-answer pairs, so that generalization of semantic distance is avoided, the semantic distance of the to-be-answered question and the candidate question-answer pairs is accurately measured by utilizing the neural network model, and the accuracy rate of screened answer is improved.
Owner:CLOUDMINDS SHANGHAI ROBOTICS CO LTD

Graph model-based automatic abstracting method

ActiveCN105243152AMeasuring Semantic RelevanceAchieve complementary effectsSpecial data processing applicationsCosine similaritySubject matter
The invention relates to the field of automatic abstracting, and discloses a graph model-based automatic abstracting method. According to the technical scheme, an LDA probability topic model is applied to measurement of semantic correlation between sentences and improvement of the measurement effect of sentence correlation; and an idea of topic correlation and position sensitivity of the sentences is provided, so that abstract generation is relatively reasonable and effective. The method comprises the following steps: firstly, obtaining topic probability distribution of a document and word probability distribution of the topic through training the LDA topic model, determining the topic probability distribution of the sentences and effectively converting a semantic similarity measurement between the sentences into a similarity measurement problem of the topic probability distribution of the sentences; with the sentences as nodes, building edges by referring tothe cosine similarity and according to the semantic similarity between the sentences and generating a text graph representing the document; calculating the topic correlation between the sentences according to the topic probability distribution of the sentences and the topic probability distribution of the document; and calculating the position sensitivity and the like of the sentences according to the positions of the sentences in the document.
Owner:TONGJI UNIV

System and method for determining view invariant spatial-temporal descriptors for motion detection and analysis

A method and system for determining view invariant spatial-temporal descriptors encoding details of both motion dynamics and posture interactions that are highly representative and discriminative. The method and system describe determining posture interactions descriptors and motion dynamics descriptors by utilizing cosine similarity approach thereby rendering the descriptors to be view invariant.
Owner:BAE SYST INFORMATION & ELECTRONICS SYST INTERGRATION INC

Method for extracting novel field words

The invention discloses a method for extracting novel field words by the aid of combination of word2vec and Bootstrapping iteration. The method includes preprocessing field word corpus; segmenting preprocessed field texts by the aid of n-gram; counting word frequencies, left and right adjacent word numbers, left and right word entropy and mutual information six-dimensional statistics of segmented character strings; setting a group of parameters by the aid of kmeans; carrying out preliminary evaluation; carrying out filtering to obtain first round of results; respectively computing sums of cosine similarity of each candidate word and seed sets by the aid of word vector spaces and a group of field seed data; setting sum threshold values and carrying out secondary evaluation so as to extract the novel words of fields. The word vector spaces are obtained by means of word2vec training. The method has the advantages that the method is applicable to extracting the novel words from the large-scale field corpus and excellent in portability; the problem of difficulty in filtering non-field words with verb-object constructions, reduplication and the like can be fundamentally solved by the aid of the method.
Owner:EAST CHINA NORMAL UNIV

Countercheck method for automatically identifying speaker aiming to voice deception

The invention provides a countercheck method for automatically identifying a speaker aiming to voice deception, which is a voice anti-spoofing technology based on a method combining various features and a plurality of sub-systems. According to the invention, the serial features of the posterior probability of a phoneme in the phonological level and the MFCC features of voice level or MFDCC features of phase level are combined, thus the performance of the system is significantly enhanced. By combining the provided i-vector sub-system and OpenSMILE (open Speech and Music Interpretation by Large Space Extraction criterion containing voice and rhythmic information, the final presentation of the system is further enhanced. To a back-end model, the development datum are used; and under the situation of knowing deceptive attacks, a two-level support vector machine has better performance compared with one-level cosine similarity or PLDA evaluations, while the one-level evaluation approach has better robustness under the situation without seeing the test datum and knowing the deceptive conditions.
Owner:SUN YAT SEN UNIV +1

Face image feature extraction and comparison method based on deep learning

The invention discloses a face image feature extraction and comparison method based on deep learning. The method comprises the steps: performing local modularization on the images in a face library; constructing a convolution neural network, and inputting image modules into the convolution neural network to train; obtaining the image modules after performing local modularization on the images in a face library, using the trained convolution neural network model to extract the features of each image module, and obtaining the feature vector corresponding to each image; after performing local modularization on the face images to be compared, utilizing the trained convolution neural network model to extract the features of each image module, and obtaining the feature vectors corresponding to the face images to be compared; and using the cosine similarity to compare the feature vectors corresponding to the face images to be compared with the feature vector corresponding to each image in order. For the face image feature extraction and comparison method based on deep learning, face comparison in the OFD eastern face library ranks top 5, and the accuracy is 95%.
Owner:CHINACCS INFORMATION IND

Vehicle type recognition method based on CNN and domain adaptive learning

The invention relates to a vehicle type recognition method based on a CNN and domain adaptive learning. The method comprises steps of: establishing a CNN-based initial model by adding a rotation-invariant layer in an Alexnet network, distinguishing a discriminant layer and designing a new objective function; separately extracting the feature maps of different-domain sample convolution layers by using the established initial model, calculating the cosine similarity between the sample feature maps, determining the shared convolution kernel or the non-shared convolution kernel of the CNN, retaining the weight and the offset of the shared convolution kernel, and updating the weight and the offset of the non-shared convolution kernel; based on a target-domain training sample, calculating the cosine similarity between respective feature map layers and the average similarity of the entire target domain, and clustering each type of similar feature maps according to the average similarity; expanding source-domain samples with similar distribution characteristics in the target domain to new samples in the target domain, adjusting the entire CNN model by using the new samples in the target domain, and then using a softmax classifier to classify the vehicle types of the test samples in the target domain.
Owner:NANJING UNIV OF INFORMATION SCI & TECH

Short text similarity calculation method and system

The invention provides a short text similarity calculation method. The method comprises the steps as follows: S1, performing word segmentation on training corpus, obtaining the word vector of each word with a word2vec algorithm, and combining the word vectors into a word vector set; S2, performing word segmentation on to-be-calculated short texts respectively, finding the word vector of each wordof the to-be-calculated short texts from the word vector set, and combining the word vectors into a short text vector set; S3, calculating the cosine similarity between each word vector in the word vector set and each word vector in the short text vector set, obtaining the maximum similarity value of each word vector, and combining the maximum similarity values into short text sentence vectors; S4, calculating the similarity between two short text sentence vectors to calculate the similarity between two short texts. The invention also provides a short text similarity calculation system. According to the similarity algorithm, short text sentences are represented by sentence vectors, the semantic similarity between short text sentences is effectively depicted, and the accuracy is high.
Owner:NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT

Face-image comparison method, device, computer device and storage medium

The invention discloses a face image comparison method, a device, a terminal device and a storage medium. The method comprises the following steps: obtaining a face image to be compared and extractinga feature vector to obtain a feature vector to be compared; the feature vector to be compared is normalized to obtain a comparison normalized vector; obtaining a base library matrix, calculating thecosine similarity between the feature vectors to be compared and the N standard feature vectors of the base library by a parallel calculation method based on the comparison normalized vector and the base library matrix, and obtaining N cosine similarity values; the maximum cosine similarity value is retrieved from the N cosine similarity values, and the comparison result is obtained according to the maximum cosine similarity value and the preset similarity threshold. The face image comparison method calculates the similarity of the face images by using a matrix operation library and a parallelcalculation mode, thereby speeding up the running speed of the computer and improving the efficiency of the face image comparison.
Owner:PING AN TECH (SHENZHEN) CO LTD

Method and device for obtaining synonym

The invention relates to a method and a device for obtaining synonyms. The method comprises: obtaining a text set, performing word segmentation on the text set to generate a first term set; filtering null words of the first term set through a stop word list, to generate a second term set; performing distance editing processing on any two words in the second term set, to generate a first synonym set; establishing a vector space model on the words in the first term set; according to the model, obtaining space vectors of each pair of synonyms, calculating cosine similarity value of each pair of synonyms, performing cosine threshold filtration strategy identification on each pair of synonyms, to generate a second synonym set; performing part-of-speech tagging on the words in the second synonym set, to generate a third synonym set; and processing the words in the third synonym set by a unary model, to obtain synonyms. Thus, the method and the device realize that retrieved synonyms are more accurate, and ambiguous words and null words do not exist, so as to intelligently retrieve related webpage of synonyms.
Owner:ADVANCED NEW TECH CO LTD

Healthy diet knowledge network construction method based on neural network and graph structure

The invention discloses a healthy diet knowledge network construction method based on a neural network and a graph structure. The method comprises the steps that word vector modeling is performed on a text corpus, so that each non-stop word in the text corpus corresponds to one word vector with a fixed length; a cosine similarity between two word vectors is used to measure the relational degree between entities corresponding to the two word vectors; food material entity nodes and symptom entity nodes are extracted, the two types of entity nodes are regarded as entity nodes in a topological structure, edge relations between the entity nodes are constructed to form the graph structure, and all the edge relations between the entity nodes are described by one group of representative words; vector expressions corresponding to each representative word are arranged to obtain a representative matrix of the edge relations between the entity nodes; and a classification framework based on a deep neural network is designed, the representative matrix is input, and polarities of the edge relations between the entity nodes are classified. Through the method, the problems that a traditional healthy diet knowledge base is not high in automation degree and obvious in domain limitation are effectively solved.
Owner:SOUTH CHINA UNIV OF TECH

Systems, methods and computer products for recommending media suitable for a designated activity

Media content is recommended based on suitability for a designated activity. A vector engine is trained using a plurality of lists, each of the lists containing metadata associated with a plurality of media objects. The vector engine includes a neural network trained with corpus data including (i) the plurality of lists (ii) a plurality of titles, each one of the titles associated with one of the lists, and (iii) the metadata associated with the plurality of media objects. Training the vector engine involves initializing, using the vector engine, a plurality of feature vectors representing each of the lists, each of the media objects, and each of a plurality of words in the titles of the lists. The training then further involves nudging, using the vector engine, the feature vectors based on a plurality of co-occurrences of the lists, the media objects, the words in the titles of the lists, or a combination thereof. A feature vector corresponding to an activity is identified among the feature vectors. At least one of the media objects, (ii) at least one of lists or (iii) a combination thereof suitable for the activity is selected based on cosine similarities between the feature vector corresponding to the activity and others of the feature vectors.
Owner:SPOTIFY

LDA (latent dirichlet allocation) and VSM (vector space model) based similar Chinese herb literature recommendation method

ActiveCN103823848AFast and efficient similar recommendationRobustSpecial data processing applicationsLexical itemVector space model
The invention discloses an LDA (latent dirichlet allocation) and VSM (vector space model) based similar Chinese herb literature recommendation method. The method includes: adopting an IKAnalyzer to perform word segmentation on topics and summary information of literature on the basis of a terminological dictionary for Chinese herbs, constructing a vector space, performing dimensionality reduction on the vector space, constructing a semantic dictionary, numbering all lexical items in the dictionary in sequence, performing vectorization through each document on the basis of the semantic dictionary, constructing term vectors of each document, utilizing LDA and a Gibbs sampling algorithm to perform training to obtain probability distribution of each document on themes, then computing a value of similarity between every two documents by the aid of KL divergence, computing cosine similarity of the term vectors of each document on the basis of term frequency, performing joint weighting on the two kinds of similarities prior to performing similarity sorting, and then making recommendation. By the method, the literature, similar both in content and theme, in the Chinese herb literature can be recommended to users, and recommendation results are closer to user requirements.
Owner:ZHEJIANG UNIV

Video retrieval method based on multi-core canonical correlation analysis

Disclosed is a video retrieval method based on multi-core canonical correlation analysis. The method includes grasping text descriptions corresponding to the video on internet, and then operating on the video: firstly dividing the video according to whether a shot is mutated or not, extracting key frames, extracting vision features of the key frames and moving features of the shot to form video feature vectors, and extracting word-frequency features from the text descriptions of each video; then utilizing the method of the multi-core canonical correlation analysis to obtain mapping matrixes and low-dimensional representation of the video features and the word-frequency features, and allowing the mapping matrixes and the low-dimensional representation to have the maximum correlation in low-dimensional space; finally, when a user inputs key words to perform video retrieval, acquiring the low-dimensional representation of the word-frequency features of the key words according to the mapping matrixes of the word-frequency features, and returning video retrieval results sequentially from large to small of the degrees of cosine similarity with the low-dimensional representation of the video features. The method has the advantages that the correlation of video content and the retrieval key words is enhanced, and the accuracy of retrieval by the user is improved.
Owner:ZHEJIANG UNIV

Zero-sample classifying method based on class transfer

A zero-sample classifying method based on class transfer comprises the steps of acquiring a vision characteristic of C kinds of training samples, a class semantic characteristic of the training sampleand a true label matrix; calculating a semantic similarity matrix by means of cosine similarity or Gaussian similarity through the class semantic characteristic; calculating a diagonal matrix of a class semantic similarity matrix; calling a Sylvester equation in an MATLAB toolset for obtaining a mapping matrix; inputting the vision characteristic of the training sample, the corresponding class semantic characteristic and the true label matrix into a target function, continuously adjusting the value of a model regularization parameter, calculating the least value of the target function, and finishing model training; and in a testing period, inputting the vision characteristic of the testing sample and the corresponding semantic characteristic, calculating scores of the classes, and determining the class with highest score as the predicated class of the testing sample. The zero-sample classifying method based on class transfer has advantages of sufficiently digging the semantic relationbetween different classes, realizing knowledge transfer between a known class classifier and an unknown class classifier, and realizing high convenience in application in image classification.
Owner:TIANJIN UNIV

Indoor fusion positioning method based on multi-features

The invention discloses an indoor fusion positioning method based on multi-features. The whole process of the method is divided into three parts such as Wi-Fi fingerprint probability calculation, inertial navigation calculation and particle filtering fusion, wherein the Wi-Fi fingerprint probability calculation uses a cosine similarity matching algorithm, the inertial navigation calculation is divided into three parts such as step number calculation, step size calculation and walking direction estimation, the step number calculation uses methods such as walking detection, center moving equating, window wave peak detection, dynamic time regularization, derivative analysis filtering and wave peak resurrection, the step size calculation uses a step size and step frequency relationship model, the walking direction estimation uses the time and frequency zone analysis of a horizontal direction acceleration and strategy adjusting based on a gravity direction, and finally in the particle filtering fusion part, the hidden markov model constructed based on the inertial navigation and the Wi-Fi fingerprint probability uses the particle filtering method and combines with the map information to calculate the position probability so as to finally obtain the positioning result.
Owner:SUN YAT SEN UNIV +1

Cross-scene pedestrian searching method based on depth learning

InactiveCN105631413AStrong feature robustnessImprove feature performanceCharacter and pattern recognitionCosine similarityFeature vector
The present invention discloses a cross-scene pedestrian searching method based on depth learning. The method comprises a step of carrying out preprocessing on each image in a sample library, a step of constructing and training a convolutional neural network, a step of extracting an upper half body local feature vector set and a lower half body local feature vector set from two groups of preprocessed image sets, and then the two local feature vector sets are fused to obtain a global feature vector, a step of carrying out preprocessing on an image to be searched, extracting an upper half body local feature vector and a lower half body local feature vector and fusing the two vectors to obtain a global feature vector, a step of orderly comparing the global feature vector corresponding to the image to be searched and the global feature vectors corresponding to the sample library images through a cosine similarity, outputting a group of similarity values, and sorting the similarity values according to a sorting algorithm. The method has the advantages that with the pedestrian images obtained in a monitoring video as the sample library, the design of features is not needed, the robustness is high, and the accuracy rate of actual searching is high.
Owner:CHINACCS INFORMATION IND

Action evaluation method based on human body three-dimensional articulation point detection

ActiveCN111144217AAccurate evaluationComprehensive action standard evaluationCharacter and pattern recognitionHuman bodyPersonalization
The invention relates to an action evaluation method based on human body three-dimensional articulation point detection, which belongs to the field of computer vision and comprises the following steps: S1, performing human body three-dimensional articulation point detection on a single-frame picture after video framing; S2, extracting key frames of a specified number of frames of the video; S3, constructing motion vector features and joint kinetic energy features, and extracting feature values; and S4, constructing a key frame action similarity comparison model through multi-feature fusion: fusing the sub-features in the step S3, and constructing a personalized model for different types of actions; constructing a motion vector feature similarity function based on cosine similarity, and constructing a joint kinetic energy similarity function based on a weighting function; and obtaining a key frame action similarity comparison model based on the two similarity functions, comparing the action to be detected with the key frame set of the standard action, and finally obtaining the action similarity of the motion video. The method is more accurate and scientific, and can be used for physical fitness action correction and teaching.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Method for generating emotion abstract of dialogue text

A method for generating an emotion abstract of a dialogue text includes the steps of constructing feature vectors, comprising word characteristics and emotion characteristics, of various sentences in the dialogue text; calculating similarity of the vectors by the adoption of a revised cosine similarity algorithm and then calculating the transition probability of the sentences according to the similarity; calculating weight values of the sentences through a PageRank algorithm, carrying out sorting according to the weight values and selecting the sentences with large weight values and of the number according with a preset number to form the emotion abstract of the dialogue text at last. According to the method, the abstract does not need to be generated manually, manpower and material resources are saved, and meanwhile precision for generation of the emotion abstract of the dialogue text is improved.
Owner:ZHANGJIAGANG INST OF IND TECH SOOCHOW UNIV

Sign-in data based user behavior trajectory clustering method

The present invention discloses a sign-in data based user behavior trajectory clustering method. The method comprises: steps 1, acquiring user sign-in data; step 2, preprocessing the user sign-in data; step 3, on the basis of comprehensively considering the influence of a marginal effect of a user sign-in date and difference of times of sign-in, calculating a sign-in value, in a sign-in position, of a user; step 4, initializing a cluster center, and performing clustering by using a cosine similarity method; step 5, recalculating the cluster center, and performing re-clustering by using the cosine similarity method; and step 6, repeating the step 5 until the requirement of preset clustering precision is satisfied.
Owner:中图科信数智技术(北京)有限公司

Modeling method for parallel smart case recommendation model

The invention relates to a modeling method for a parallel smart case recommendation model. The method comprises the following steps of obtaining existing patient cases from an electronic case database, carrying out denoising, clustering and word segmentation on the patient cases, and establishing a patient case corpus database; defining that TFIDFi, j shows the importance degree of a word or an expression in a case of the patient case corpus database, establishing an LSI vector space model according to the TFIDFi, j, and moreover, establishing a BOW word bag model according to all words and expressions in the patient case corpus database; calculating history case vectors and to-be-processed case vectors in the patient case corpus database through utilization of the LSI vector space model and the BOW word bag model; calculating cosine similarity among the history patient cases and storing the cosine similarity; and calculating the cosine similarity between the to-be-processed case vectors and the history patient case vectors, and searching similar cases of to-be-processed cases according to the cosine similarity. The model established through adoption of the method provided by the invention is high in accuracy and low in error. A recommendation result is high in quality.
Owner:QINGDAO ACADEMY OF INTELLIGENT IND

Item-based explicit and implicit feedback mixing collaborative filtering recommendation algorithm

The invention discloses an item-based explicit and implicit feedback mixing collaborative filtering recommendation algorithm. The method comprises the following steps of obtaining the information of interest of users on every item and establishing the score matrix of every user on all the items; calculating the average score of every user, the quantity of the scoring users of every item and the average score of every item; calculating a common comment user quantity matrix; calculating the Pearson similarity and the modified cosine similarity of between any two items; calculating the similarity based on explicit feedback; calculating the cosine similarity based on implicit feedback; calculating a final similarity; obtaining the nearest neighbor set I of a current item; when providing a recommendation list to a target user u, according to the score matrix, obtaining the scored items and the unscored items of the target user u; calculating the prediction scores of the unscored items of the target user u and selecting N items with the highest scores inside the unscored items of the target user u to the user. The item-based explicit and implicit feedback mixing collaborative filtering recommendation algorithm can effectively improve the accuracy of prediction recommendation.
Owner:ZHEJIANG UNIV

Method, device and equipment for obtaining supervision recognition result in multiple modes and storage medium

The invention relates to the field of artificial intelligence, discloses a method, device and equipment for obtaining a supervision recognition result in multiple modes and a storage medium, and solves the problem of semantic similarity matching of current business supervision terms and business products. The method comprises the following steps: creating a knowledge graph; processing the knowledge graph according to a first preset rule, a second preset rule and the entity relationship file to obtain an entity and an entity relationship; updating the knowledge graph according to the entities and the entity relationship to obtain a target knowledge graph; analyzing the target knowledge graph and the training text through an encoder to obtain fused to-be-processed information; performing random masking processing on the fused to-be-processed information according to a preset strategy to obtain training data; performing word embedding vector processing and self-made force mechanism processing on the training data to obtain a target sentence vector and a target word vector; and calculating a weighted average value of the semantic cosine similarity and the character string similarity ofthe target sentence vector and the target word vector according to a preset weight ratio to obtain a supervision and identification result.
Owner:CHINA PING AN LIFE INSURANCE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products