
36 results about "Function word" patented technology

In linguistics, function words (also called functors) are words that have little lexical meaning or have ambiguous meaning and express grammatical relationships among other words within a sentence, or specify the attitude or mood of the speaker. They signal the structural relationships that words have to one another and are the glue that holds sentences together. Thus they form important elements in the structures of sentences.

Hybrid adaptation of named entity recognition

A machine translation method includes receiving a source text string and identifying any named entities. The identified named entities may be processed to exclude common nouns and function words. Features are extracted from the source text string relating to the identified named entities. Based on the extracted features, a protocol is selected for translating the source text string. A first translation protocol includes forming a reduced source string from the source text string in which the named entity is replaced by a placeholder, translating the reduced source string by machine translation to generate a translated reduced target string, while processing the named entity separately to be incorporated into the translated reduced target string. A second translation protocol includes translating the source text string by machine translation, without replacing the named entity with the placeholder. The target text string produced by the selected protocol is output.
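The two translation protocols described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the placeholder token, the `toy_translate` word-for-word translator and the boolean `use_placeholder` flag (standing in for the feature-based protocol selection) are all assumptions made for the example.

```python
from typing import Callable

PLACEHOLDER = "__NE__"  # hypothetical placeholder token, not from the source


def translate_with_placeholder(source: str, entity: str,
                               translate: Callable[[str], str],
                               translate_entity: Callable[[str], str]) -> str:
    """First protocol: mask the entity, translate the rest, re-insert the entity."""
    reduced = source.replace(entity, PLACEHOLDER, 1)
    reduced_target = translate(reduced)
    return reduced_target.replace(PLACEHOLDER, translate_entity(entity), 1)


def translate_text(source: str, entity: str, use_placeholder: bool,
                   translate: Callable[[str], str],
                   translate_entity: Callable[[str], str]) -> str:
    """Pick a protocol; the real selection would be driven by extracted features."""
    if use_placeholder:
        return translate_with_placeholder(source, entity, translate, translate_entity)
    # Second protocol: translate the full string without a placeholder.
    return translate(source)


# Toy word-for-word "machine translation" used only to exercise the sketch.
TOY_LEXICON = {"she": "elle", "lives": "vit", "in": "à"}


def toy_translate(text: str) -> str:
    return " ".join(TOY_LEXICON.get(word, word) for word in text.split())


result = translate_text("she lives in New York", "New York", True,
                        toy_translate, lambda entity: entity)
```

Processing the named entity separately keeps it from being split or mistranslated by the general-purpose translator, which is the motivation the abstract gives for the first protocol.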
Owner:XEROX CORP

Systems and methods for collaborative note-taking

Techniques are provided for determining collaborative notes and automatically recognizing speech, handwriting and other types of information. Domain and optional actor/speaker information associated with the support information is determined. An initial automatic speech recognition model is determined based on the domain and/or actor information. The domain and/or actor/speaker language model is used to recognize text in the speech information associated with the support information. Presentation support information such as slides, speaker notes and the like is determined. The semantic overlap between the support information and the salient non-function words in the recognized text, together with collaborative user feedback information, is used to determine relevancy scores for the recognized text. Grammaticality, well-formedness, self-referential integrity and other features are used to determine correctness scores. Suggested collaborative notes are displayed in the user interface based on the salient non-function words. User actions in the user interface determine feedback signals. Recognition models such as automatic speech recognition and handwriting recognition are determined based on the feedback signals and the correctness and relevance scores.
Owner:FUJIFILM BUSINESS INNOVATION CORP

Method and VLSI circuits allowing to change dynamically the logical behavior

A method, named the product terms method, is presented that allows the logical behavior of any combinational or synchronous sequential circuit to be implemented and/or changed dynamically. For every product term of the logical equations, expressed as a sum of products, the method uses three memory words: a mask word, a product word and a function word. The words of all product terms are arranged in a table, which characterizes the logical behavior of the circuit. The invention provides the hardware structure of several new types of VLSI circuits having reconfigurable logic behavior. A first embodiment implements any type of multiple-output combinational circuit, a second embodiment implements any synchronous sequential circuit with only a clock input, and a third embodiment implements any synchronous sequential circuit with data inputs and a clock input. An expert system capable of generating the tables used by the product terms method, by interpreting and analysing logical equations either supplied by the user or found in a database, is also provided.
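One plausible reading of the three-word table can be sketched in software. The abstract does not define the word semantics, so the following is an assumption: the mask word selects the inputs a product term depends on, the product word gives the values those inputs must have, and the function word marks which outputs the term drives. The half-adder table and all names are illustrative.

```python
from typing import List, NamedTuple


class ProductTerm(NamedTuple):
    mask: int      # 1-bits mark the inputs this term looks at (assumed meaning)
    product: int   # required values of the masked inputs (assumed meaning)
    function: int  # 1-bits mark the outputs this term drives (assumed meaning)


def evaluate(table: List[ProductTerm], inputs: int) -> int:
    """OR together the function words of every product term that matches."""
    outputs = 0
    for term in table:
        if (inputs & term.mask) == term.product:
            outputs |= term.function
    return outputs


# Half adder: input bit 0 = a, bit 1 = b; output bit 0 = sum, bit 1 = carry.
HALF_ADDER = [
    ProductTerm(mask=0b11, product=0b01, function=0b01),  # a AND NOT b -> sum
    ProductTerm(mask=0b11, product=0b10, function=0b01),  # NOT a AND b -> sum
    ProductTerm(mask=0b11, product=0b11, function=0b10),  # a AND b     -> carry
]

# A term with a partial mask ignores the unmasked input ("don't care").
PASS_A = [ProductTerm(mask=0b01, product=0b01, function=0b01)]
```

Under this reading, swapping `HALF_ADDER` for another table changes the logic function without touching `evaluate`, which mirrors the idea of changing circuit behavior dynamically by rewriting memory words.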
Owner:IOAN DANCEA

Automatic extraction method for text labels in combination with theme model and semantic analyses

The invention relates to an automatic text label extraction method that combines a topic model with semantic analysis, pertaining to the technical field of computer applications. The method comprises preprocessing, LDA modeling, context analysis and label extraction. The preprocessing comprises the following steps: removing low-frequency words, removing stop words and removing label information, wherein stop words are auxiliary words that carry no information, words that indicate sentence grammar structure, all function words, and punctuation. The LDA modeling yields two matrices: an N*K document-topic matrix, in which each row corresponds to the hidden topic distribution of a document, and a K*M topic-word matrix, in which each row corresponds to the word distribution of a topic. Building on a conventional counting method, the method takes the correlations between words in documents into consideration and makes full use of context information, a key feature, so that the label information of documents is obtained.
Owner:DATAGRAND TECH INC

Clustering hypertext with applications to WEB searching

A method and structure for providing a database of documents comprising performing a search of the database using a query to produce query result documents, constructing a word dictionary of words within the query result documents, pruning function words from the word dictionary, forming first vectors for words remaining in the word dictionary, constructing an out-link dictionary of documents within the database that are pointed to by the query result documents, adding the query result documents to the out-link dictionary, pruning documents from the out-link dictionary that are pointed to by fewer than a first predetermined number of the query result documents, forming second vectors for documents remaining in the out-link dictionary, constructing an in-link dictionary of documents within the database that point to the query result documents, adding the query result documents to the in-link dictionary, pruning documents from the in-link dictionary that point to fewer than a second predetermined number of the query result documents, forming third vectors for documents remaining in the in-link dictionary, normalizing the first vectors, the second vectors, and the third vectors to create vector triplets for documents remaining in the in-link dictionary and the out-link dictionary, clustering the vector triplets using the toric k-means process, and annotating/summarizing the obtained clusters using nuggets of information, the nuggets including summary, breakthrough, review, keyword, citation, and reference.
Owner:INT BUSINESS MASCH CORP

System for automation of business knowledge in natural language using rete algorithm

The present invention is directed to a system for managing business knowledge expressed as statements, preferably sentences using a vocabulary, where such statements may be automated by the generation of programming language source code or computer program instructions. As such, the present invention also manages software design specifications that define, describe, or constrain the programming code it generates or programs with which it or the code it generates is to integrate. All information managed within the present invention is maintained within a relational database that is encapsulated within an object-oriented model. Each object in this model is subject to version control and administration using permissions. Each user of the system is an object and belongs to one or more groups. Users and groups may be granted privileges. Objects may be created, examined, used, modified, deleted, or otherwise operated upon only if corresponding permission or privilege has been granted. The vocabulary managed by the present invention consists of the function words commonly used in a language, such as the auxiliary verbs, prepositions, articles, conjunctions, and other essentially closed parts of speech in English, as well as open parts of speech, such as nouns, verbs, adjectives, and adverbs.
Owner:ORACLE INT CORP +1

Computer words input method and system and its word library maintenance method and device

The invention discloses a computer text input method and system, together with a maintenance method and device for its thesaurus. The method includes the following steps: pre-storing a thesaurus of function words; storing the text entered through the computer text input system in the user thesaurus and counting the input frequency of each word; searching the user thesaurus for words that also appear in the thesaurus of function words and deleting them from the user thesaurus; and analyzing the word frequencies of the user thesaurus and merging the text that meets the specified requirement of a matching frequency larger than one. By maintaining the user thesaurus, the invention can reduce the storage and computing resources it occupies and improve input efficiency and accuracy. The invention can also select candidate words from the maintained user thesaurus according to word frequency and offer them for input, further improving input efficiency and accuracy.
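The maintenance steps above can be sketched with a frequency counter. This is a simplified illustration under stated assumptions: the function word list, the prefix-based candidate lookup and all function names are hypothetical, and the merging step from the abstract is not covered.

```python
from collections import Counter
from typing import Iterable, List, Set

FUNCTION_WORDS: Set[str] = {"the", "of", "a", "to"}  # illustrative pre-stored list


def record_input(user_lexicon: Counter, words: Iterable[str]) -> None:
    """Store entered words in the user lexicon and count their input frequency."""
    user_lexicon.update(words)


def prune_function_words(user_lexicon: Counter, function_words: Set[str]) -> None:
    """Delete every entry that also appears in the function word list."""
    for word in [w for w in user_lexicon if w in function_words]:
        del user_lexicon[word]


def candidates(user_lexicon: Counter, prefix: str, limit: int = 3) -> List[str]:
    """Offer matching words, most frequent first (ties broken alphabetically)."""
    matches = [w for w in user_lexicon if w.startswith(prefix)]
    return sorted(matches, key=lambda w: (-user_lexicon[w], w))[:limit]


lexicon: Counter = Counter()
record_input(lexicon, ["the", "patent", "the", "pattern", "patent", "of"])
prune_function_words(lexicon, FUNCTION_WORDS)
```

After pruning, the lexicon holds only `patent` (count 2) and `pattern` (count 1), so high-frequency function words no longer occupy space or crowd out candidates.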
Owner:TENCENT TECH (SHENZHEN) CO LTD

Chinese author identification method based on double-layer classification model, and device for realizing Chinese author identification method

The invention relates to a Chinese author identification method based on a double-layer classification model and a device for realizing the method, belonging to the field of information security. To address the low identification accuracy caused by an excessive number of candidate authors, an author grouping layer is added to the author identification model: each author is represented as an author vector, and the authors are grouped by a clustering algorithm. The second layer is an author identification layer, in which dependency relations, function words, punctuation marks and part-of-speech tags are extracted as features, and author identification is carried out within the group. The method and device effectively solve the problem that identification accuracy drops when there are too many authors. In addition, a proposed feature dimensionality reduction and optimization method based on principal component analysis addresses the loss of accuracy caused by noise in high-dimensional feature vectors. The Chinese author identification method can be applied to authorship research in literary studies and also to information security, for example copyright protection.
Owner:HUNAN UNIV

English-Burmese bilingual parallel sentence pair extraction method and device based on BiLSTM-CNN

The invention relates to an English-Burmese bilingual parallel sentence pair extraction method and device based on BiLSTM-CNN, and belongs to the technical field of natural language processing. The method comprises the following steps: first, pre-training bilingual word vectors with the MUSE tool; second, function-tagging the sentence by exploiting the fact that Burmese function words and particles mark the subject, predicate and object, concatenating the syntactic structure information of each word onto its word vector, encoding the sentence with BiLSTM-CNN, and using the output probability as the criterion for deciding whether the sentences form a parallel sentence pair. Following these steps, the BiLSTM-CNN-based English-Burmese bilingual parallel sentence pair extraction device is built from functional modules. Compared with a traditional bilingual parallel sentence pair recognition system, the method and the device are simpler. Experimental results show that the method and the device outperform a baseline system in accuracy, recall and other metrics.
Owner:KUNMING UNIV OF SCI & TECH

Method of recognizing language information by applying language rule by machine

The invention relates to machine language information processing technology. So that a machine can imitate human logical thinking to understand language and master grammatical functions, the method covers sentence structure (subject, predicate, object, attribute, adverbial modifier and complement) as well as the theory and application of nouns, verbs, adjectives, quantifiers, adverbs and function words, and it demonstrates the analysis of the function of each part, so that it can serve as a language teaching demonstration and provide basic exercises for language learning. The method gives each language a grammar function for analyzing, judging and understanding language information. This grammar function is built on a common, shared platform, so that the machine can not only recognize language information but also apply it to translation and exchange between languages.
Owner:徐文和

System and method for setting number shortcut function keys

Inactive | CN102035922A | Improve efficiency in operating non-touchscreen handheld devices | Telephone sets with user guidance/features | Input/output processes for data processing | Key pressing | Function word
The invention discloses a system and method for setting number shortcut function keys. In the method, a block of function icons is identified to obtain the trigger-and-execute program corresponding to each function icon and the function words corresponding to each icon; key serial numbers are then preset for the trigger-and-execute programs and associated with number keys, so that the number key for each key serial number is linked to the corresponding program; finally, the function words corresponding to the key serial numbers are displayed as labels. Users can therefore set their own customized number shortcut function keys, which improves their operating efficiency on handheld devices without touchscreens.
Owner:INVENTEC CORP

System for registering key words of articles and its method

The system comprises a processor and a data storage device that includes a symbol base, a function word base and a keyword database. The processor compares an article with the symbol base and deletes from the article any symbols that appear in the symbol base. Function words in the article that appear in the function word base are likewise deleted. The number of occurrences of each remaining word in the article is then counted to obtain multiple candidate words together with their occurrence counts. Finally, based on preset conditions, multiple keywords are selected from the candidate words and registered in the keyword database.
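The pipeline described above (strip symbols, strip function words, count, select) can be sketched in a few lines. The symbol base, the function word base and the preset conditions (`min_count`, `top_n`) are placeholder assumptions, since the abstract does not specify them.

```python
from collections import Counter
from typing import List, Set, Tuple

SYMBOL_BASE: Set[str] = set(".,;:!?()\"'")             # illustrative symbol base
FUNCTION_WORD_BASE: Set[str] = {"the", "a", "is", "of"}  # illustrative word base


def extract_keywords(article: str, symbols: Set[str], function_words: Set[str],
                     min_count: int = 2, top_n: int = 5) -> List[Tuple[str, int]]:
    """Return (word, count) pairs that satisfy the assumed preset conditions."""
    # Step 1: delete symbols that appear in the symbol base.
    cleaned = "".join(" " if ch in symbols else ch for ch in article)
    # Step 2: delete function words that appear in the function word base.
    words = [w.lower() for w in cleaned.split()]
    candidates = Counter(w for w in words if w not in function_words)
    # Step 3: keep candidates meeting the assumed frequency threshold.
    selected = [(w, c) for w, c in candidates.most_common() if c >= min_count]
    return selected[:top_n]


keyword_database = extract_keywords(
    "The patent describes a patent search. The search is fast.",
    SYMBOL_BASE, FUNCTION_WORD_BASE)
```

Removing function words before counting matters because they would otherwise dominate the frequency ranking and push content-bearing candidates out of the top positions.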
Owner:VIA TECH INC

Clue management method and device, terminal and computer readable storage medium

The invention discloses a clue management method and device, a terminal and a computer-readable storage medium. The clue management method comprises the steps of: obtaining an original customer name corresponding to a target customer; identifying a location phrase in the original customer name; judging whether any continuous field in the original customer name is the same as a brand name in a brand lexicon; if so, taking that continuous field as the brand keyword; if no continuous field in the original customer name is the same as a brand name in the brand lexicon, taking the field between the location phrase and the enterprise function word group as the brand keyword; judging whether the name of a cooperative customer is consistent with the location phrase and the brand keyword; and if so, marking the target customer as a cooperative customer. According to this technical scheme, the clue library is analyzed so that target customers that coincide with existing cooperative customers can be marked as cooperative customers, which prevents business personnel from following up on the same customer repeatedly.
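The matching logic above can be sketched with plain substring operations. This is an illustrative approximation: the English company names, the lexicons and the substring-based notion of "consistent" are assumptions, and the abstract does not say how location phrases or function word groups are actually recognized.

```python
from typing import List, Tuple

LOCATIONS = ["Shenzhen", "Beijing"]            # illustrative location phrases
BRAND_LEXICON = ["Globex"]                     # illustrative brand lexicon
FUNCTION_GROUPS = ["Technology", "Co Ltd"]     # illustrative enterprise function words


def brand_keyword(name: str) -> Tuple[str, str]:
    """Return (location phrase, brand keyword) for a customer name."""
    location = next((loc for loc in LOCATIONS if loc in name), "")
    # Case 1: a continuous field matches a known brand name.
    for brand in BRAND_LEXICON:
        if brand in name:
            return location, brand
    # Case 2: take the field between the location and the first function word group.
    start = name.find(location) + len(location) if location else 0
    positions = [name.find(group, start) for group in FUNCTION_GROUPS]
    positions = [p for p in positions if p != -1]
    end = min(positions) if positions else len(name)
    return location, name[start:end].strip()


def is_cooperative(target_name: str, partner_names: List[str]) -> bool:
    """Mark the target as cooperative if a partner shares its location and brand."""
    location, brand = brand_keyword(target_name)
    if not brand:
        return False
    return any(location in partner and brand in partner for partner in partner_names)
```

For example, `brand_keyword("Shenzhen Acme Technology Co Ltd")` yields `("Shenzhen", "Acme")`, so a partner record such as "Acme (Shenzhen) Holdings" would cause the target to be marked as an existing cooperative customer.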
Owner:PINGAN CITY CONSTR TECH SHENZHEN CO LTD

User knowledge demand model establishing method based on Gaussian mixed model

The invention provides, for the first time, a method for establishing a user knowledge demand model by using a Gaussian mixture model. First, a word2vec skip-gram model is trained on a knowledge base to generate high-dimensional vectors of function words that capture their semantic information. The Gaussian mixture model is then trained on a selected knowledge corpus, multiple Gaussian distributions are used to describe the probability distribution of the user's function word knowledge demands, and the EM method is applied to optimize the parameters of the Gaussian mixture model. Finally, the mapping between words and entries is established to obtain the user's knowledge entry demand model, on the basis of which the knowledge entries in the knowledge base most likely to interest the user are calculated and pushed to the user. The established Gaussian mixture model fits the user's knowledge demands more closely, and the accuracy of knowledge pushing is improved.
Owner:BEIJING INSTITUTE OF TECHNOLOGY

Document classifying method based on network measure index

The invention relates to a document classification method based on network metrics. The method comprises a sample training phase and a document classification phase. The sample training phase comprises twelve steps: (1) collecting samples, (2) segmenting the text, (3) analyzing parts of speech, (4) removing function words and names, (5) counting word frequencies, (6) establishing the feature set Vd, (7) establishing the vertices of the feature network, (8) establishing the edges of the feature network, (9) calculating the average degree, (10) calculating the clustering coefficient, (11) calculating the characteristic path length and (12) obtaining the network metric intervals. The document classification phase comprises two steps: processing the document to be classified, and judging its class. The method classifies accurately and efficiently, solves the problem that existing classification methods cannot distinguish scientific and technical literature, novels and prose, and lays a scientific method and theoretical foundation for automatically distinguishing the three.
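The three network metrics named in steps 9 to 11 can be computed with standard graph definitions, sketched below without external libraries. How the patent builds edges is not stated in the abstract; linking feature words that are adjacent in the filtered token stream is an assumption made for this example.

```python
from collections import deque
from itertools import combinations
from typing import Dict, List, Set


def build_graph(tokens: List[str], features: Set[str]) -> Dict[str, Set[str]]:
    """Vertices are feature words; edges join adjacent feature words (assumed rule)."""
    graph: Dict[str, Set[str]] = {word: set() for word in features}
    kept = [t for t in tokens if t in features]
    for a, b in zip(kept, kept[1:]):
        if a != b:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def average_degree(graph: Dict[str, Set[str]]) -> float:
    return sum(len(nbrs) for nbrs in graph.values()) / len(graph)


def clustering_coefficient(graph: Dict[str, Set[str]]) -> float:
    """Mean local clustering coefficient; vertices with degree < 2 count as 0."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k >= 2:
            links = sum(1 for u, v in combinations(nbrs, 2) if v in graph[u])
            total += 2 * links / (k * (k - 1))
    return total / len(graph)


def characteristic_path_length(graph: Dict[str, Set[str]]) -> float:
    """Mean shortest-path length over reachable ordered pairs, via BFS."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0


demo = build_graph(["a", "b", "c", "a", "d"], {"a", "b", "c", "d"})
```

In the training phase, these three values would be computed per sample document and summarized into per-class intervals; a new document is then assigned to the class whose intervals contain its metrics.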
Owner:INFORMATION RES INST OF SHANDONG ACAD OF SCI

Dialogue generation method and device based on two-stage decoding, medium and computing equipment

The invention discloses a dialogue generation method and device based on two-stage decoding, a medium and computing equipment. The method divides the generation of a dialogue reply into two decoding stages: first, the dialogue context is input into a dialogue generation model and mapped into word embedding vectors; the word embedding vectors are input into a context self-attention encoder to obtain a feature vector of the dialogue context, which is input into a first-stage Transformer decoder and decoded to generate a notional word sequence; the notional word sequence is input into a notional word sequence encoder to obtain a feature vector of the notional word sequence; finally, the context and the feature vector of the notional word sequence are input into a second-stage Transformer decoder and decoded to generate the final reply. The two-stage decoding process prevents function words, which are frequent but carry little semantic information, from interfering with the notional words, thereby improving the relevance and informativeness of the reply.
Owner:SOUTH CHINA UNIV OF TECH

Information recommendation method and device and electronic equipment

The embodiment of the invention discloses an information recommendation method and device and electronic equipment. The method comprises the following steps: first, obtaining a keyword entered in a retrieval interface; extracting from the keyword the to-be-retrieved function words that can represent categories or features of POIs (points of interest); then obtaining the POIs corresponding to the to-be-retrieved function words according to the correspondence between function words and POIs. To recommend information accurately, the POIs obtained from the to-be-retrieved function words are set as candidate POIs; the degree of association between the to-be-retrieved function words and each candidate POI is then calculated; and finally candidate POIs are selected and recommended according to the degree of association. This solves the technical problem in the prior art that POI recommendations fit the user's retrieval intention poorly, and improves POI recommendation accuracy.
Owner:BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

A Document Classification Method Based on Network Metrics

The invention relates to a document classification method based on network metrics. The method comprises a sample training phase and a document classification phase. The sample training phase comprises twelve steps: (1) collecting samples, (2) segmenting the text, (3) analyzing parts of speech, (4) removing function words and names, (5) counting word frequencies, (6) establishing the feature set Vd, (7) establishing the vertices of the feature network, (8) establishing the edges of the feature network, (9) calculating the average degree, (10) calculating the clustering coefficient, (11) calculating the characteristic path length and (12) obtaining the network metric intervals. The document classification phase comprises two steps: processing the document to be classified, and judging its class. The method classifies accurately and efficiently, solves the problem that existing classification methods cannot distinguish scientific and technical literature, novels and prose, and lays a scientific method and theoretical foundation for automatically distinguishing the three.
Owner:INFORMATION RES INST OF SHANDONG ACAD OF SCI

Method for semantic recognition and graph recommendation

The invention discloses a method for semantic recognition and graph recommendation. The words and symbols entered by a user in an input method are first used to judge how complete the user's content is; then, according to the parts of speech of the entered words, the function words in the content are filtered out and the notional words are retained. At the same time, the retained notional words are sent to a cloud service according to the language entered by the user, and the cloud service recommends popular graphic documents recently selected by users according to the meanings of the notional words. The documents are presented in the input method interface for the user to choose from, and the chosen graphic document is automatically sent to the instant chat window. Operation is simple, execution is efficient, and the user can search for a graph without leaving the current application.
Owner:SHENZHEN AOE NETWORK TECH CO LTD

Man-machine interaction intention analysis method and device, computer equipment and storage medium

The embodiment of the invention discloses a man-machine interaction intention analysis method and device, computer equipment and a storage medium. The method comprises: picking up semantic interaction voice and converting it into a semantic text; performing syntactic dependency analysis to obtain an analysis result; judging whether punctuation marks exist in the analysis result; if a first punctuation mark exists, cutting the analysis result at the position of the first punctuation mark to obtain two clauses; determining a core relationship and a relationship between the advertent of the clause where the core relationship is located and the head word; determining whether the semantic text contains effective information; if not, judging whether the semantic text ends with a function word; if yes, deleting the function words at the end of the semantic text; if not, retrieving a core relationship; and judging whether the semantic text contains effective information by combining the subscript length of the core relationship. The method solves the problem that current semantic services cannot accurately judge the real intention of the user, so that semantic understanding is more accurate.
Owner:深圳科卫机器人科技有限公司

Method and device for enhancing grammar error correction data based on real error mode

The invention discloses a method and a device for augmenting grammatical error correction data based on real error patterns. The method comprises the following steps: acquiring a sentence to be noised and a set of noising strategies; determining the noising probability of each word in the sentence; randomly selecting a noising strategy from the set according to the noising probability and applying it to the word to be noised; and constructing parallel sentence pairs from the erroneous sentences produced by noising and the correct sentences before noising. The set of noising strategies comprises a replacement strategy based on real error patterns, a synonym replacement strategy, a function word replacement strategy, a similar-spelling replacement strategy and an inflection replacement strategy. By introducing real errors and simulating various real errors, the embodiment of the invention can generate high-quality artificial error data that is more realistic and closer to the errors learners actually make; the various types of noising schemes can produce many kinds of grammatical errors, and the method and the device can be widely applied in the technical field of data processing.
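The noising loop can be sketched as follows. This is a simplified illustration, not the patented procedure: it uses one uniform probability for every word rather than a per-word probability, implements only two of the five strategies, and the function word list and all names are assumptions.

```python
import random
from typing import Callable, List, Tuple

FUNCTION_WORDS = ["in", "on", "at", "of", "to"]  # illustrative confusion set

Strategy = Callable[[str, random.Random], str]


def replace_function_word(word: str, rng: random.Random) -> str:
    """Swap a function word for a different one; leave other words untouched."""
    if word not in FUNCTION_WORDS:
        return word
    return rng.choice([w for w in FUNCTION_WORDS if w != word])


def swap_adjacent_letters(word: str, rng: random.Random) -> str:
    """Crude stand-in for a similar-spelling error: transpose two neighbors."""
    if len(word) < 2:
        return word
    i = rng.randrange(len(word) - 1)
    return word[:i] + word[i + 1] + word[i] + word[i + 2:]


def add_noise(sentence: str, strategies: List[Strategy],
              prob: float, rng: random.Random) -> str:
    noisy = []
    for word in sentence.split():
        if rng.random() < prob:  # uniform probability (assumption)
            word = rng.choice(strategies)(word, rng)
        noisy.append(word)
    return " ".join(noisy)


def make_pair(sentence: str, strategies: List[Strategy],
              prob: float, seed: int = 0) -> Tuple[str, str]:
    """Return (erroneous sentence, correct sentence) as one parallel pair."""
    rng = random.Random(seed)
    return add_noise(sentence, strategies, prob, rng), sentence
```

Calling `make_pair("she sat on the mat", [replace_function_word, swap_adjacent_letters], 0.3)` produces one training pair; repeating this over a clean corpus with different seeds yields the synthetic parallel data a correction model is trained on.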
Owner:GUANGDONG UNIVERSITY OF FOREIGN STUDIES

Function word extraction method, model training method, electronic equipment and medium

The invention relates to a function word extraction method and device, a model training method and device, electronic equipment and a medium, and relates to the technical field of computers. The function word extraction method can comprise the steps of: obtaining target text information, and then performing function word extraction on the target text information through a function word extraction model to obtain a function word extraction result; and obtaining a standard efficacy word corresponding to the target text information, wherein the efficacy word extraction model is obtained by training on a plurality of text samples and the standard efficacy words corresponding to the text samples. With the efficacy word extraction method and device, the model training method and device, the electronic equipment and the medium, the efficacy word extraction time can be shortened and the efficacy word extraction accuracy can be improved.
Owner:企知道科技有限公司

An Automatic Text Label Extraction Method Combining Topic Model and Semantic Analysis

The invention relates to an automatic text label extraction method that combines a topic model with semantic analysis, pertaining to the technical field of computer applications. The method comprises preprocessing, LDA modeling, context analysis and label extraction. The preprocessing comprises the following steps: removing low-frequency words, removing stop words and removing label information, wherein stop words are auxiliary words that carry no information, words that indicate sentence grammar structure, all function words, and punctuation. The LDA modeling yields two matrices: an N*K document-topic matrix, in which each row corresponds to the hidden topic distribution of a document, and a K*M topic-word matrix, in which each row corresponds to the word distribution of a topic. Building on a conventional counting method, the method takes the correlations between words in documents into consideration and makes full use of context information, a key feature, so that the label information of documents is obtained.
Owner:DATAGRAND TECH INC

Pronunciation dictionary generation method and word speech recognition method and device

The embodiment of the invention provides a pronunciation dictionary generation method, a word speech recognition method, a word speech recognition device, electronic equipment and a storage medium. The pronunciation dictionary generation method comprises the steps of: acquiring a training corpus which comprises a first phoneme sequence corresponding to one or more notional words, and a pronunciation rule corresponding to the language to which the notional words belong; constructing one or more function words according to the pronunciation rule, wherein each function word has a corresponding second phoneme sequence; and generating a pronunciation dictionary from the notional words, the first phoneme sequences, the function words and the second phoneme sequences. The method ensures that the pronunciation dictionary is sufficiently large: for an unfamiliar minority language, a pronunciation dictionary with enough words can be generated from a smaller training corpus than is needed to train an ordinary pronunciation dictionary, so that the pronunciation of the words to be recognized can be accurately recognized from a small corpus.
Owner:BEIJING SINOVOICE TECH CO LTD