Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

65 results about "Lexical set" patented technology

A lexical set is a group of words that all fall under a single category based on some shared phonological feature.

Comment analysis method based on word vectors and syntactic features and visual interactive interface

The invention provides a comment analysis method based on word vectors and syntactic characteristics in the field of data analysis. The comment analysis method comprises the steps of obtaining commodity page comment data of an e-commerce website; preprocessing the acquired target data set; extracting a appendix lexical set provided by Hownet and NTU to form a basic emotion dictionary; carrying outword vector training on the obtained preprocessed data set through a Word2Vec tool; establishing a probability transfer matrix by using the semantic similarity matrix; carrying out core sentence rule-based processing on the obtained commodity comment text; carrying out preprocessing on the obtained text without the redundancy; performing part-of-speech extraction (commodity attributes, negative words, degree words and sentiment words) evaluation matching on the obtained dependency relationship pairs; combining the evaluation matching pair with an emotion dictionary, subjecting evaluation objects to appraisal value calculation and quality sorting, and finally, realizing the evaluation objects through a visual interaction interface, so that accurate, real-time, automatic and convenient processing and analysis on commodity comment data are realized, and the method can be used in an e-commerce platform.
Owner:NANJING UNIV OF POSTS & TELECOMM

Limited domain-oriented knowledge graph updating method and system

The invention provides a limited domain-oriented knowledge graph updating method and system, and the method comprises the steps: inputting limited domain question and answer corpora, extracting candidate entities of sentences in the corpora through word segmentation, screening common functional words in a word segmentation result through a word frequency dictionary, and obtaining a candidate entity set; constructing an inverted index dictionary according to the limited domain knowledge graph to obtain a similar vocabulary set of each candidate entity; training the candidate entities and the corresponding similar vocabulary sets into word vectors, and calculating cosine similarity so as to judge the types of the candidate entities; obtaining the relationship between every two candidate entities in the candidate entity set by using the trained Bert text classification model; and updating the candidate entity type and the relationship between the candidate entities into the knowledge graph according to the judgment. The knowledge graph updating method provided by the invention is higher in efficiency, can recognize the newly appearing entity type according to the existing entities inthe graph, and effectively improves the knowledge graph updating speed and accuracy.
Owner:HUAZHONG NORMAL UNIV

Text keyword recognition method and device, computer equipment and readable storage medium

The invention relates to the technical field of intelligent decision making of artificial intelligence, and discloses a text keyword recognition method. The method comprises the steps of obtaining text information, and performing word segmentation on the text information to obtain a vocabulary set; calculating the word frequency of each vocabulary in the vocabulary set, splitting the vocabulary set to obtain a sub-vocabulary set and an association relationship among the vocabularies in the sub-vocabulary set, and obtaining a total vocabulary table with characteristic values according to the word frequency of each vocabulary in the sub-vocabulary set and the association relationship among the vocabularies; and arranging the vocabularies in the total vocabulary table according to the characteristic values, and setting the vocabularies of which the characteristic values exceed a preset characteristic threshold value as keywords. The invention also relates to a blockchain technology, and information can be stored in the blockchain node. The key degree of the vocabulary is evaluated from two dimensions of the word frequency of each vocabulary in the vocabulary set and the degree of dependence of any vocabulary in the vocabulary set by other vocabularies, so that the accuracy of obtaining the keyword capable of reflecting the core meaning of the text information is improved.
Owner:ONE CONNECT SMART TECH CO LTD SHENZHEN

Method and device for extracting data in medical information, equipment and storage medium

The embodiment of the invention provides a method and device for extracting data in medical information, equipment and a storage medium. The method comprises the steps of determining a positioning word of the medical information; generating an adjacent vocabulary set corresponding to the positioning word based on the position of the positioning word in the medical information, wherein the adjacent vocabulary set comprises vocabularies in the medical information, and the vocabularies and the positioning word are located within a set distance; extracting target data having a specified attachment relationship with the positioning word from the adjacent vocabulary set by using a data extraction rule pre-configured for the positioning word; and positioning and extracting target data by using the positioning word and the data extraction rule pre-configured by the positioning word so as to realize extraction of specified data in different medical data.
Owner:SHANGHAI TAIMEI DIGITAL TECH CO LTD

Construction method and device of user knowledge concept network and evaluation method of user knowledge

The invention discloses a construction method and device of a user knowledge concept network and an evaluation method of user knowledge.The construction method of the user knowledge concept network comprises the steps that firstly, each text contained in a text set containing m independent texts is preprocessed, and then each vocabulary of corpus serves as a concept subject term; all sentences and vocabularies are traversed, vocabularies appearing together with the concept subject terms in the same sentence are included into vocabulary sets corresponding to the concept subject terms, then vocabulary element screening is conducted on each vocabulary set, and a concept library is constructed; the field division is performed on concepts contained in the concept library by adopting a hierarchical clustering method; then, according to the matching condition of vocabularies contained in the user text data and a concept library, concepts contained in the user text data are obtained; and finally, a user knowledge concept network is constructed according to the concepts contained in the user text data and the divided concept fields. According to the method, the accuracy and objectivity of evaluation can be improved.
Owner:武汉渔见晚科技有限责任公司

Text information processing method and system

The invention provides a text information processing method and system, and the method comprises the steps: carrying out the word segmentation of a to-be-approved text, and obtaining a vocabulary setcomprising a plurality of vocabularies; extracting features of each vocabulary in the vocabulary set to obtain a vocabulary feature set; inputting the vocabulary feature set into a preset classification model for vocabulary classification, and determining whether the to-be-approved text contains sensitive words or not; if the sensitive words are contained, outputting text information used for indicating that the to-be-approved text does not pass the approval; and if the sensitive words are not included, outputting text information used for indicating that the to-be-approved text passes approval. According to the scheme, vocabulary classification is carried out on the to-be-approved text by utilizing the pre-trained classification model, and whether the to-be-approved text contains the sensitive words or not is determined. And outputting text information used for indicating whether the approval text passes approval or not according to the determination result without manual approval, sothat manpower and approval cost are saved, and approval speed and approval efficiency are improved.
Owner:BANK OF CHINA

Dictionary-based word vector generation method and system

PendingCN112163422AAdequate vocabularySufficient lexical relationsSemantic analysisParaphraseLexical set
The invention relates to a dictionary-based word vector generation method and system, and the method comprises the steps: enabling vocabularies contained in the dictionary to form a vocabulary set, carrying out the statistics of the occurrence frequency of each vocabulary in the vocabulary set in vocabulary paraphrases contained in the dictionary, carrying out the word segmentation of each vocabulary paraphrase according to the frequency, and obtaining a paraphrase vocabulary sequence; taking the vocabularies as nodes, connecting the nodes according to the corresponding relation between the vocabularies and the paraphrasing vocabulary sequences to form directed edges, and determining the weight of each directed edge to obtain a directed graph based on a dictionary; and calculating the directed graph based on a depth walk algorithm to obtain a word vector. According to the method, the vocabulary information provided by the dictionary is fused into the word vector, so that a high-qualitydata basis can be provided for word vector training, word meanings can be better mined, and a natural language processing task is supported.
Owner:WORKWAY SHENZHENINFORMATION TECH CO LTD

Element extraction method and device, electronic equipment and storage medium

The invention provides an element extraction method and device, electronic equipment and a storage medium. The method comprises the steps of obtaining a to-be-extracted text and a vocabulary set of the to-be-extracted text; based on a matching result between character strings corresponding to every two characters in the to-be-extracted text and the vocabulary set, the relevancy between every two characters is determined, and the character strings are obtained by being intercepted from the to-be-extracted text with the two corresponding characters as starting points and ending points; coding each character in the to-be-extracted text on the basis of the relevancy between every two characters to obtain an element boundary feature of each character; and determining an element extraction result of the to-be-extracted text based on the element boundary features of the characters. According to the element extraction method and device, the electronic equipment and the storage medium provided by the invention, the matched vocabularies and the original sentences do not need to be spliced, and the original input length is not changed, so that the coding efficiency is improved. In addition, compared with an existing vocabulary splicing method, the storage space is saved.
Owner:IFLYTEK (SUZHOU) TECH CO LTD

Voice tag judgment method and system, storage medium and electronic equipment

The invention relates to the field of audio recognition, and in particular, relates to a voice tag judgment method and system, a storage medium and electronic equipment. The method comprises the steps: acquiring open source vocabularies to form an open source vocabulary set; performing word segmentation processing on the text in the related scene to obtain a word segmentation set; obtaining an audio file, and processing the audio file to obtain a high-frequency vocabulary set; obtaining a preset list, and processing the preset list to obtain a related vocabulary set; performing union processing on the open source vocabulary set, the word segmentation set, the high-frequency vocabulary set and the related vocabulary set, and obtaining a vocabulary list; and performing tag processing on the voice content according to the vocabulary list. The method is high in operability and suitable for the cold start stage; and the ASR recognition accuracy in the content risk control field and the downstream nlp classification task and tag effect can be effectively improved, and the method can be quickly applied to related fields.
Owner:北京数美时代科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products