Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

50 results about "Tibetan language" patented technology

Standard Tibetan is the most widely spoken form of the Tibetic languages. It is based on the speech of Lhasa, an Ü-Tsang (Central Tibetan) dialect. For this reason, Standard Tibetan is often called Lhasa Tibetan. Tibetan is an official language of the Tibet Autonomous Region of the People's Republic of China.

Tibetan language entity knowledge information extraction method

The invention relates to a Tibetan language entity knowledge information extraction method, which comprises the following steps that: Tibetan and Chinese comparable language material information is extracted from Tibetan and Chinese text language material information; entity equivalence pairs are extracted from the Tibetan and Chinese comparable language material information; the Tibetan and Chinese cross-language entity relationship is extracted from the entity equivalence pairs; a Tibetan language "entity-attribute-value" triad is extracted from the Tibetan and Chinese cross-language entity relationship; and the triad is stored into a Tibetan language entity knowledge semantic resource library. The Tibetan language entity knowledge information extraction method solves the problem of Tibetan language training language material deficiency to a certain degree, promotes the knowledge sharing among different languages, and provides support for the study in the fields of Tibetan and Chinese cross-language knowledge questions, information retrieval, machine translation and the like.
Owner:MINZU UNIVERSITY OF CHINA

Method of Tibetan language webpage text classification based on semanteme

The invention relates to a method of Tibetan language webpage text classification based on semanteme. The method of the Tibetan language webpage text classification based on the semanteme comprises the following steps: firstly, extracting text messages which are used for charactering a webpage from the Tibetan language webpage, then carrying out word classification processing on the text messages, and expressing words obtained by means of the word classification processing as a word vector space, secondly, mapping the words expressed through the word vector space to a concept of a semanteme space according to a preset Tibetan language classification body to obtain a semanteme space of a text to be classified, and at last adopting a classification algorithm to classify the semanteme space of the text to be classified according to a preset semanteme space of a training sample set. The method of the Tibetan language webpage text classification based on the semanteme carries out preprocessing on the webpage and adopting a KNN classification algorithm of the text similarity of a weighting semantic net to achieve real-time and high-efficiency classification of the Tibetan language webpage.
Owner:MINZU UNIVERSITY OF CHINA

On-line identification method and recognition system for 'ding' of handwriting Tibet character

The invention discloses an on-line handwritten Tibetan language recognition method and a recognition system. The on-line handwritten Tibetan language recognition method comprises the following steps of: carrying out pretreatment on a handwritten character mode; extracting the direction features of handwriting points and the features of the space occupied by the characters, and carrying out dimensionality reduction and switching on a higher-dimension feature through a linear discriminating analysis method, thus obtaining the recognition features of the Tibetan language characters; improving the speed of a standard quadratic classifier and obtaining a high-speed quadratic classifier which is used for recognizing the Tibetan language characters; associating the syllables with the recognized characters; and finally inputting the standard Tibetan language syllables into the text. The invention finishes an integral on-line handwritten Tibetan language recognition system through the pretreatment on the Tibetan language characters, feature extraction, classifier designing and syllable association. The on-line handwritten Tibetan language recognition method and the recognition system can be used in the on-line handwritten Tibetan language recognition inputting in computers and mobile phones, and has the advantages of high recognition rate, high inputting speed, steady performance, and the like.
Owner:NORTHWEST UNIVERSITY FOR NATIONALITIES

Tibetan language-based multi-modal emotion calculation method and system

The embodiment of the invention provides a Tibetan language-based multi-modal emotion calculation method and system, and a server. The method comprises: firstly, obtaining Tibetan language data to beclassified; collecting video signals, voice signals and text information from the Tibetan language data; then, extracting high-level video features, high-level voice features and text features in a classification emotion corpus, respectively extracting the high-level video features, the high-level voice features and the text features, performing learning based on a deep learning model to obtain high-level fusion features, and finally, classifying the high-level fusion features in the classification emotion corpus based on SVM and storing the high-level fusion features in the classification emotion corpus. Therefore, the blank state of the Tibetan language in sentiment analysis can be filled. A basic corpus is provided for Tibetan multi-modal sentiment analysis. The Tibetan language data sentiment recognition method based on the three modes is beneficial to development of Tibetan language multi-mode sentiment analysis, the natural language processing capacity and the intelligent sentiment recognition capacity of the Tibetan language can be promoted, the artificial intelligence information processing capacity of the Tibetan language is improved, and in addition, the sentiment recognition rate of the Tibetan language data can be effectively increased under the condition of mutual fusion of the three modes.
Owner:QINGHAI UNIVERSITY

Syllable splitting method of Tibetan language of Anduo

The syllable division of Tibetan speech is an important part and essential link in Tibetan speech information processing, and lays the foundation for Tibetan speech recognition and speech database making; however, a software for the syllable division of Tibetan speech is still absent. Through extracting the time domain parameter short-time energy and short-time zero-crossing rate of Anduo Tibetan speech, the invention realizes Tibetan syllable division through a specific algorithm. The invention is used to carry out syllable division of 30 continuous Anduo Tibetan speech samples, and the division accuracy reaches to 30.6 percent.
Owner:NORTHWEST UNIVERSITY FOR NATIONALITIES

Tibetan language thesis copying detection method and Tibetan language thesis copying detection system based on Tibetan language sentence levels

The invention discloses a Tibetan language thesis copying detection method and a Tibetan language thesis copying detection system based on Tibetan language sentence levels. The Tibetan language thesis copying detection method includes: subjecting Tibetan language text characters to code conversion and noise removal preprocessing; segmenting a text into text blocks according to sentences through boundary identification of Tibetan language sentences and establishing a temporary table of segmented text blocks; extracting and computing text features from a sentence-document inverted index table and the temporary table according to the number of the sentences to obtain sentence similarity; establishing an adjacency list in accordance with the sentence similarity, computing text block similarity and detecting copying of two Tibetan language theses according to a text block similarity value. The Tibetan language thesis copying detection system comprises a Tibetan language thesis copying detection device and a database, wherein the Tibetan language thesis copying detection device is connected to a client terminal server through the Internet, and the database is connected to the server and used for storing Tibetan language theses. The Tibetan language thesis copying detection device comprises a preprocessing module for code conversion and noise removal of the text characters, a temporary table module for constructing the segmented text blocks, an extracting module for constructing sentence text features and a copying detection module for detecting whether the theses have similar copied data or not.
Owner:QINGHAI UNIV FOR NATITIES

Tibetan language recognition method and device and electronic device

ActiveCN110032938ASolve the defect of low recognition accuracyImprove accuracyCharacter and pattern recognitionText recognitionText string
The invention provides a Tibetan language recognition method, which belongs to the technical field of optical character recognition, and solves the problem of low Tibetan language recognition accuracyin the prior art. The method comprises the steps of determining the target image blocks which are sequentially distributed in a text line image to be identified and the overlapping information of thetarget image blocks; identifying each target image block through a preset first text string identification model, and determining a text identification result of each target image block; and integrating the text recognition results of the target image blocks according to the overlapping information of the target image blocks, and determining a text recognition result of the to-be-recognized textline image. According to the Tibetan language recognition method disclosed by the invention, the Tibetan language text line image does not need to be segmented into individual characters, but is recognized through the string recognition model in the form of image blocks, so that the recognition accuracy can be effectively improved.
Owner:BEIJING HANVON DIGITAL TECH CO LTD +1

Tibetan language encoding method and terminal

The invention discloses a Tibetan coding method and a terminal thereof, relating to the minority written language processing field and aiming at solving the coding problem of information exchanging in the Tibetan terminal. In the method, when an obtained Tibetan character is judged to be in accordance with the UNICODE standard, the UNICODE code of the Tibetan character is displayed as the display code of the character; when the obtained Tibetan character is judged to be not in accordance with the UNICODE standard, the display code of the Tibetan character obtained from the corresponding UNICODE code of the Tibetan character in an expanded zone is displayed. The terminal comprises a judgment unit, a coding unit, a first processing unit, the expanded zone and a second processing unit. Strictly conforming to the UNICODE standard, the method of the invention adapts to the information technical globalization. With the inevitable trend that the UNICODE standard is taken as the unified coding standard, the invention brings convenience and provides guarantee to the information exchange.
Owner:青海洛藏数码科技有限公司

Tibetan web page and its code identification method

The invention relates to a method for identifying Tibetan language webpage and its coded, including the steps of: giving a code of characteristic string, which is syllable node and / or selected high frequency syllable, in Tibetan language codefirstly; webpage character flow, the code of characteristic string as keyword, scanned and searched; calculating the frequency that accords with characteristic string coded character to appear by the couter; determining whether the webpage is Tibetan language webpage and the Tibetan language code is used according to the result of counter. The invention makes the best of the syllable structural feature of Tibetan language language and the statistics characteristic of Tibetan language word, and respectively applys the identification criteria for different code, accordingly Tibetan language webpage and non Tibetan language webpageu can efficiently be distinguished correctly, and Tibetan language coding used by the webpage is also able to be identified.
Owner:INST OF SOFTWARE - CHINESE ACAD OF SCI

Tibetan character recognition device and method

The invention provides a Tibetan character recognition device and method, and relates to the field of character recognition. According to the invention, by receiving a photographed Tibetan character document image sent by a high-speed photographic apparatus, and according to the size of the Tibetan character document image, the line spacing of each line region, each of the suspected Tibetan characters in the coordinates of a pre-established coordinate system and the area of the minimum rectangular frame of each suspected Tibetan character, an adaptive threshold range associated with features of the Tibetan characters is generated in real time; and if an attribute of the minimum rectangular frame outside each Tibetan character is not within the adaptive threshold range generated in real time, the Tibetan characters in the minimum rectangle frame are segmented or merged, so that the cases of erroneous recognition of the Tibetan characters are eliminated according to the special structure of the Tibetan language and the special byte shape, and the accuracy rate of Tibetan character recognition is raised.
Owner:赣州朗顺数字科技有限公司

Method for realizing tibetan language input, display and short-message reception and transmission by hand-held electronic terminal

The method creates a components including: building code table of Tibetan keywords and combined words of Tibetan keywords; building key assignments for key-presses of electronic terminal; building coincidence relation between dot matrix and code table. The method includes following steps: (1) reading key values input; (2) searching dot matrix font library and displaying character; (3) selecting and determining character needed to input. The invention realizes functions for inputting and displaying Tibetan.
Owner:TIBET UNIV +1

Semantic ontology creation and vocabulary expansion method for Tibetan language

The invention relates to a method for processing Chinese minority scripts, in particular to a semantic ontology creation and vocabulary expansion method for a Tibetan language. The method comprises the following steps: (1) establishing an upper level ontology on the basis of the Chinese dictionary of the HowNet; (2) expanding conceptual synonyms appearing in the upper level ontology by using definitions in an electronic dictionary; (3) carrying out a conceptual hyponymy mode matching algorithm on the upper level ontology in a multi-language ontology library to expand the concept of the upper level ontology; (4) searching for conceptual synonyms in the expanded ontology; (5) sequencing from higher similarities to lower similarities on the basis of an ontology conceptual lexical semantic similarity algorithm; (6) modifying the sequencing results and editing the ontology. According to the method, the upper level ontology is established on the basis of the Chinese dictionary of the HowNet, levels of different concepts are defined according to a hyponymy in the ontology, and more new semantic words can be obtained on the basis of the hyponymy, so that the vocabulary of the existing Tibetan language ontology is expanded, and the Tibetan language information processing accuracy is increased greatly.
Owner:MINZU UNIVERSITY OF CHINA

Tibetan language input method of embedded device

The invention relates to a Tibetan language input method of an embedded device. The Tibetan language input method of the embedded device achieves Tibetan language input of the embedded device through a remote controller: (1) building a mapping file of a remote controller digital code and a Tibetan language character standard Unicode code; (2) storing the mapping file in an input method installation directory, and loading the mapping file in a memory when the input method is started to form a mapping table; (3) for a user input numeric key, finding a Unicode code corresponding to the numeric key in the mapping table according to the digital code formed by the numeric key; (4) finding a corresponding Tibetan language character from a word stock based on the Unicode according to the Unicode code, and displaying on an interface through a graphic program; and (5) inputting sound insulation characters, figures, half-figures and punctuation marks in the Tibetan language through the 0 key and the # key. The Tibetan language input method of the embedded device has very important significance in development of informatization of minority areas and improvement of social informatization level.
Owner:郑州威科姆科技股份有限公司

Intelligent rule multi-language type interpretation system and creation method for same

The invention discloses an intelligent rule multi-language type interpretation system and a creation method for the same, and is applied to the field of intelligent home. By the system, a foundation is laid for solving the problem of artificial intelligence (essentially a remote control system or local intelligence) in the field of intelligent home at present. The system is an integrated language interpretation environment formed by a rule editor, rule lexical checking, rule grammar checking, intermediate code generation, executable code generation, executable code file management, rule debugging, a virtual machine execution environment and the like. The system and the method are simple and easy to learn; language grammars are similar to those of natural languages of human beings, a user can understand and use the system and the method without any professional knowledge, the whole program can be described completely by pure Chinese (Tibetan language, Mongolian language and the like) besides operators, and is high in execution efficiency, and executable codes are very close to machine languages. By fusion of such a language system and an intelligent home cloud service system, the user can speak to the system to simply require the system to automatically finish a complex process, and according to the actual condition of the current market, such a technology has broad market prospect.
Owner:谢玮琦

A Tibetan emotion analysis method and system based on mixed depth learning

The invention relates to a Tibetan emotion analysis method and system based on mixed depth learning. The method comprises the following steps: S1, acquiring Tibetan sentences, and preprocessing the Tibetan sentences to obtain preprocessed Tibetan; S2, carrying out Tibetan word segmentation on that preprocessed Tibetan language to obtain the Tibetan language after word segmentation; S3, preliminarily processing the Tibetan language after word segmentation to obtain a word vector, and establishing a word vector model according to the word vector; S4, introducing that word vector model into a three-layer convolution neural network for iterative processing to extract the feature of the first word vector; S5, introducing that first word vector feature into a two-layer long-short-term memory network for iterative processing to extract the second word vector feature; S6, the second word vector feature is compiled and classified by a classification function. By using convolution layer and two-layer long-term and short-term memory network to analyze Tibetan language features, the global measure of text can be preserved and deeper semantic relations can be mined, and better classification results can be achieved.
Owner:QINGHAI UNIVERSITY

Tibetan language input system and method

The invention relates to a Tibetan input system and a method thereof. The system comprises an input device which comprises ten input keys at least and is used for inputting an instruction of a target input key and a keyboard definition module which comprises at least ten storage units. Each storage unit is corresponding to an input key, is stored with a set of following Tibetan list separator or letter (see a lower right chart) processor and is used for taking out a Tibetan letter corresponding to the target input key in the keyboard definition module according to the input instruction of the target input key; the target Tibetan is input into a target program after target Tibetan is searched from a Tibetan word stock according to the taken-out Tibetan letter. The system also comprises the Tibetan word stock which is used for storing the Tibetan. The system and the method realize the function of inputting the Tibetan by a small keyboard.
Owner:GUANGDONG GUOBI TECH

Method and system for constructing Tibetan emotional dictionary based on Tibetan language features

InactiveCN107122465ASentiment Accurate AnalysisEmotionally accurate determinationNatural language data processingSpecial data processing applicationsPattern recognitionMicroblogging
The present invention discloses a method and a system for constructing a Tibetan emotional dictionary based on Tibetan language features. The method comprises: matching the Chinese vocabulary ontology with emotion classification with a Chinese-Tibetan dictionary to obtain a Tibetan basic emotional dictionary; carrying out corpus training on the preliminarily collected Tibetan microblogging information by using the Word2vec tool to obtain a synonym set of the corpus training vocabulary, and taking the synonym set as an extended candidate word set; calculating the weight variance of each extended candidate word; and screening the extended candidate words according to the weight variance to obtain emotional extension words. According to the method for constructing the Tibetan emotional dictionary based on Tibetan language features disclosed by the present invention, the Chinese vocabulary ontology is matched with a Chinese-Tibetan dictionary to obtain the Tibetan basic emotional dictionary, corpus training and screening is carried out on the Tibetan microblogging information by using the Word2vec tool, and extension is carried out based on the Tibetan basic emotional dictionary, so that more Tibetan emotional vocabulary is provided, and emotion of the current Tibetan microblogging information expression is accurately analyzed.
Owner:MINZU UNIVERSITY OF CHINA

Tibetan word vector representation method fusing components and character information

The invention belongs to the technical field of Tibetan language information processing, and discloses a Tibetan language word vector representation method fusing components and character information,and the Tibetan language word vector representation method fusing the components and the character information comprises the steps: directly fusing the components and the character information into amodel TCCWEI represented by the Tibetan language word vector; fusing the component information into the vector representation of the character, and then fusing the component information and the information of the character into a model TCCWEII represented by the Tibetan word vector; and fusing components and Tibetan word vectors of character position information into the TCCWEII model to represent a model TCCWEII + P. According to the Tibetan language word vector representation model TCCWE provided by the invention, compared with the current optimal word vector representation, the Tibetan language word vector representation model TCCWE is improved by 8% on the Tibetan language similarity evaluation set TWordSim215. Compared with the current optimal word vector representation, the Tibetanlanguage correlation evaluation set TWordRel 215 is improved by 7%, and the semantic and correlation of the found similarity / correlation words are very high.
Owner:QINGHAI NORMAL UNIV

Tibetan language microblog emotional tendency analysis method and system

InactiveCN107102984AAccurate analysisAnalytical methods for sentiment analysis accuratelySemantic analysisSpecial data processing applicationsSequence databaseMicroblogging
The invention discloses a Tibetan language microblog emotional tendency analysis method and system. By means of a collection module and a preprocessing module, collected Tibetan language microblog text information is preprocessed, and a Tibetan language text sequence database is obtained; emotional analysis processing is conducted on the Tibetan language text sequence database through an analysis module to obtain emotion feature parameters; a space model module is built according to the emotion feature parameters, and an emotion vector space model is constructed; an emotion classification module carries out emotion classification on current microblog text information according to the emotion vector space model, and emotion information of the current microblog text information is obtained. By adopting the Tibetan language emotion analysis method and the method for constructing the space model, emotional tendency analysis of Tibetan language microblogs is achieved.
Owner:MINZU UNIVERSITY OF CHINA

Tibetan language input method applied to mobile telephone

The invention provides a Tibetan language input method applied to a mobile telephone, relating to a character input method and concretely designing a mobile telephone input method applied to Tibetan language. In the invention, number keys and the combination keys of the number keys and direction keys are utilized to correspondingly display a group of Tibetan language letters, every Tibetan language letter group corresponds to one number key or one functional key, and then the corresponding number keys or the corresponding functional keys are selected to affirm the Tibetan language letters required to be input, and are combined into required Tibetan language characters and sentences according to the Tibetan language letters; and the application of the corresponding letters is simple and fast. The invention has the advantages of simple input and comprehensive input of the Tibetan language, is convenient for Tibetan to use a mobile telephone to send out Tibetan language short messages, enriches the daily life of the Tibetan and is convenient for the contact among the Tibetan.
Owner:王白

Method for converting gestures into Chinese-Tibetan bilingual speech

InactiveCN108665898AFacilitate daily communication activitiesTroubleshoot voice output issuesSpeech recognitionSpeech synthesisComputer-aidedBroadcasting
The present invention provides a method of converting gestures into a Chinese-Tibetan bilingual speech. The method comprises the steps that sample data is used for carrying out gesture recognition ongestures to be recognized to obtain meanings of the gestures; the meanings of the gestures are expressed in a Chinese-Tibetan bilingual form to obtain a semantic definition of the gestures, and context-related annotations of the gestures are generated according to the semantic definition of the gestures; a speaker-related Tibetan model or a Mandarin model is obtained through self-adaptive trainingof the speaker and using the training corpus of a specific speaker in Mandarin or Tibetan, and the Tibetan speech or Mandarin speech are synthesized by using a speaker-related Tibetan model or a Mandarin model and the context-sensitive annotations of the gestures. The method can convert the input static gestures and dynamic gestures into Mandarin or Tibetan language, can promote the daily communication activities of speech impaired persons and normal persons, solves the speech output problem in the communication between disabled persons and the normal persons, and can further be applied to the aspects such as computer assisted instruction for deaf and dumb, bilingual broadcasting of TV programs.
Owner:NORTHWEST NORMAL UNIVERSITY

Tibetan language encoding method and device and Tibetan language code decoding method and device

The invention discloses a Tibetan language encoding and decoding method and device, and belongs to the field of communication. The encoding method includes the steps that a Tibetan language syllable is obtained; characters at all the positions are sequentially judged from left to right; if the current position includes one character, a preset code table is searched to obtain a code corresponding to the character; if the current position includes multiple characters arrayed vertically, the code table is searched for all the characters arrayed vertically sequentially in the first consonant and later vowel sequence and the top-to-bottom consonant sequence; in the encoding process, a character space mark is added between every two adjacent codes, and the Tibetan language syllable codes are obtained after compiling of the characters at all the positions in the Tibetan language syllable is completed. The scheme of achieving code encoding and decoding based on the minority language is given, which greatly facilitates Tibetan language communication, and the Tibetan language encoding and decoding method plays a great role in emergency communication and security and secrecy communication.
Owner:MINZU UNIVERSITY OF CHINA

Service system for automatic generation of Tibetan reading questions in primary schools

The invention relates to a service system for automatic generation of Tibetan reading questions in primary schools. The system comprises a Tibetan reading corpus construction model and a Tibetan reading text question generation model. According to the Tibetan reading corpus construction model, the Tibetan reading corpus is obtained by extracting feature data of primary school Tibetan articles and designing a mixed multi-strategy text screening model; The Tibetan reading text problem generation model comprises a coding end and a decoding end, and the coding end uses a bidirectional RNN network and an attention mechanism; The decoding end uses a one-way RNN network, an attention mechanism and a replication mechanism. Through the designed mixed multi-strategy text screening model, Tibetan Chinese articles suitable for being read in primary schools can be screened out from large-scale encyclopedia Tibetan Chinese texts. And an end-to-end automatic problem generation model is designed, so that the problems that the Tibetan language reading teaching materials in primary schools are few, the updating speed is low, the manual question setting amount is small and the like are solved, and the development of Tibetan language teaching in national regions is promoted.
Owner:MINZU UNIVERSITY OF CHINA

Tibetan language paper plagiarism detection method and system

The invention provides a Tibetan language paper plagiarism detection method and system, and relates to the technical field of modern education. Aiming at three different plagiarism phenomena of continuous text plagiarism, semantic rewriting plagiarism and translation plagiarism, the invention provides a longest common subsequence algorithm and an improved twin long-short-term memory network methodrespectively. Academic paper pre-detection based on abstract document vectors and a weight distribution strategy based on chapter positions are adopted to improve retrieval efficiency.
Owner:MINZU UNIVERSITY OF CHINA +1

Tibetan language speech recognition method based on HMM and DNN

The invention relates to the field of artificial intelligence, the invention provides a Tibetan language speech recognition system based on an HMM-DNN (hidden Markov model-deep neural network). According to the Tibetan language speech recognition method based on the HMM and the DNN, a deep learning training model is combined with Tibetan language which is a low-resource corpus, a Tibetan language-based establishment model is trained, Tibetan language speech is recognized, and the human-computer interaction efficiency of Tibetan people is improved, and the Tibetan language speech recognition method based on the HMM and the DNN comprises the following steps of 1, recording Tibetan language speech data; 2, carrying out data preparation; 3, constructing a language model and a pronunciation dictionary; 4, training a single-phoneme model; 5, training a three-tone sub-model; 6, performing linear discriminant analysis and maximum likelihood linear transformation, and performing decoding and alignment; 7, carrying out speaker adaptive training; and 8, carrying out model training. The method is mainly applied to Tibetan language speech automatic recognition occasions.
Owner:TIANJIN UNIV

Method and device for constructing Tibetan language question and answer corpus

The invention provides a method and device for constructing a Tibetan language question and answer corpus, and belongs to the field of big data processing. The method comprises the steps of selectinga Tibetan language triple entity as a central word entity, and obtaining all triples related to the central word entity; and mapping all entities in all the triples into a corresponding relationship between the entities and the tags, and constructing a Tibetan language question and answer corpus according to the corresponding relationship and the head word entity. According to the scheme, the Tibetan language question and answer corpus is constructed by finding all triples related to the Tibetan language triple entity and mapping the triples into the corresponding relation between the entity and the label, and the defects of time consumption and labor consumption of manual participation are overcome.
Owner:MINZU UNIVERSITY OF CHINA

Method and system for constructing HowNet dynamic role knowledge system based on Tibetan features

The invention provides a HowNet dynamic role knowledge system construction method and system based on Tibetan features, and relates to the technical field of data processing. According to syntactic-semantic features of Tibetan and a prototype theory of Dowty, semantic features and a limit range of a dynamic role of HowNet are determined; classifying the determined dynamic roles layer by layer according to semantic features and limited ranges of the dynamic roles in combination with syntactic priorities selected by arguments to obtain dynamic role hierarchies; and associating the dynamic role hierarchy with 802 HowNet basic event classes to form a HowNet dynamic role knowledge system. According to the method, the HowNet dynamic role is re-determined on the basis of syntactic-semantic features of the Tibetan language and a prototype theory, the semantic range and the semantic feature principle of the HowNet dynamic role are determined, the consistency of the HowNet during semantic annotation is ensured, the problem of disordered division of the original HowNet dynamic role is solved, and a structured HowNet dynamic role knowledge system with clear division is established.
Owner:MINZU UNIVERSITY OF CHINA

Method and system for predicting tone of Lhasa dialect of Tibetan language

The invention discloses a method and a system for predicting tone of Lhasa dialect of Tibetan language. The method includes acquiring continuous Tibetan language texts; segmenting the texts to obtain segmented word strings; determining syllable tone prediction characteristics of various syllables in segmented words; determining tonal patterns of the syllables according to the syllable tone prediction characteristics of the syllables by the aid of syllable tonal pattern prediction models obtained by means of training in advance. Each segmented word string comprises the single corresponding segmented word or the multiple corresponding segmented words. The method and the system have the advantage that the syllable tone prediction accuracy can be improved for the Lhasa dialect in the continuous Tibetan language texts.
Owner:IFLYTEK CO LTD

Translation management and evaluation method for Chinese and Tibetan language data under domestic operating system

The invention provides a Chinese and Tibetan language data translation management and evaluation method under a domestic operating system, which can perform overall management on tens of millions of Tibetan language data of a large amount of software in the domestic operating system based on Linux, so that the management cost of Chinese and Tibetan language maintenance of the domestic operating system is effectively reduced; comprising the following steps: analyzing all software source codes under a domestic operating system based on Linux to obtain source entries, and constructing a source entry data set; in response to update of the translation data of the source entry, constructing a translation mapping relationship between the newly added translation data and the source entry; according to the translation mapping relation, cross-software to-be-processed source entry checking is carried out in the domestic operating system, and translation data updating or newly adding is carried out on the software with the detected to-be-processed source entry; and performing translation quality evaluation on the newly added translation data, and outputting an evaluation result of translation correctness.
Owner:NAT UNIV OF DEFENSE TECH

Method for realizing Tibetan language input, display and short-message reception and transmission on hand-held electronic terminal

The method creates a components including: building code table of Tibetan keywords and combined words of Tibetan keywords; building key assignments for key-presses of electronic terminal; building coincidence relation between dot matrix and code table. The method includes following steps: (1) reading key values input; (2) searching dot matrix font library and displaying character; (3) selecting and determining character needed to input. The invention realizes functions for inputting and displaying Tibetan.
Owner:TIBET UNIV +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products