A Uighur language agricultural technology term recognition method

The method applies computer technology to the recognition of Uyghur agricultural technology terms. It addresses three problems: low recognition efficiency, poor automatic recognition of domain terms, and insufficient consideration of Uyghur morphological variation and linguistic-knowledge features.

Active Publication Date: 2020-03-17
张海军

AI Technical Summary

Problems solved by technology

[0002] At present, there is no automatic recognition method for Uyghur agricultural terms. Recognition methods for Uyghur terms in fields other than agriculture use rule-based methods, statistical methods, or a combination of the two. However, these methods do not sufficiently consider that Uyghur, as an agglutinative language, has linguistic-knowledge features that arise from its rich morphological variation. As a result, the recognition process requires a large labeled corpus, and the recognition quality depends too heavily on the size of that corpus and on the labeling results, which leads to poor automatic recognition of domain terms. In addition, existing Uyghur term recognition methods in other fields make insufficient use of domain features based on linguistic knowledge, so term extraction is poorly targeted to the field. There is also no unified framework that integrates the statistical features and the linguistic-knowledge features used for automatic term recognition, and the ad hoc use of various features leads to poor overall recognition. These methods are therefore not suitable for automatic recognition of Uyghur agricultural terms.

Embodiment Construction

[0063] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to explain the present invention, and are not intended to limit the present invention.

[0064] The present invention mainly targets the recognition of two types of domain terms: word-type (single-word) terms and multi-word terms. The technical solution applies a finite state automaton that integrates linguistic-knowledge features and statistical features, constructs a state transition matrix for term recognition, and carries out domain term recognition step by step under the control of a main program controller. In specific processing, for a specific word, a variety of features and rules are used s...
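The idea of driving term recognition from a state transition matrix can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the patent's actual design: the three states, the four input symbols, the part-of-speech tags `N` and `ADJ`, the function names, and the English placeholder tokens are all hypothetical. The source text does not list its states, features, or rules.

```python
from typing import List, Set, Tuple

# Hypothetical states of the automaton (not taken from the patent).
OUTSIDE, MODIFIER, TERM = 0, 1, 2
# Hypothetical input symbols derived from word features.
ANCHOR, NOUN, ADJ, OTHER = 0, 1, 2, 3

# State transition matrix: TRANSITIONS[state][symbol] -> next state.
TRANSITIONS = [
    # ANCHOR  NOUN      ADJ       OTHER
    [TERM,    MODIFIER, MODIFIER, OUTSIDE],  # from OUTSIDE
    [TERM,    MODIFIER, MODIFIER, OUTSIDE],  # from MODIFIER
    [TERM,    TERM,     MODIFIER, OUTSIDE],  # from TERM
]


def symbol(word: str, pos: str, anchors: Set[str]) -> int:
    """Map a word to an input symbol: statistical feature first, then POS."""
    if word in anchors:
        return ANCHOR
    if pos == "N":
        return NOUN
    if pos == "ADJ":
        return ADJ
    return OTHER


def recognize(tagged: List[Tuple[str, str]], anchors: Set[str]) -> List[str]:
    """Return spans that reach the TERM state, i.e. contain an anchor word."""
    terms: List[str] = []
    state, start = OUTSIDE, 0
    for i, (word, pos) in enumerate(tagged):
        nxt = TRANSITIONS[state][symbol(word, pos, anchors)]
        if state == TERM and nxt != TERM:
            # Leaving TERM closes the current span.
            terms.append(" ".join(w for w, _ in tagged[start:i]))
        if nxt != OUTSIDE and (state == OUTSIDE or (state == TERM and nxt == MODIFIER)):
            # A new candidate span begins at this token.
            start = i
        state = nxt
    if state == TERM:
        terms.append(" ".join(w for w, _ in tagged[start:]))
    return terms


# Placeholder English tokens stand in for POS-tagged Uyghur words.
sample = [("the", "DET"), ("drip", "N"), ("irrigation", "N"),
          ("system", "N"), ("is", "V"), ("new", "ADJ"), ("soil", "N")]
found = recognize(sample, anchors={"irrigation"})
```

In this sketch a span is emitted only if it passes through an anchor word, so "new soil" is discarded while the span around "irrigation" is kept. A fuller version would add symbols for stem and suffix features, which the text names but does not specify.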

Abstract

The invention discloses a method for recognizing agricultural technology terms in Uyghur text and relates to the technical field of computer application. The method proceeds as follows: the string frequency and the C_value of the words in a Uyghur corpus are counted; words whose C_value meets the C_value threshold are selected as anchor candidate terms; the statistical features of the anchor candidate terms are computed; part-of-speech tagging and stem-suffix splitting are performed on all words in the corpus to obtain linguistic features; a finite state automaton integrates the statistical features and the linguistic features to construct a state transition matrix; and agricultural technology terms are recognized automatically under the control of the finite state automaton. The method improves the accuracy of Uyghur agricultural technology term recognition by 4 percent and the recall rate by about 3 percent, and fills a gap in Uyghur agricultural technology term recognition.
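The first two steps (counting string frequencies and C_value, then thresholding to get anchor candidates) can be sketched as follows. The abstract does not give its C_value formula, so this sketch assumes the commonly cited C-value definition, in which a candidate's frequency is reduced by the average frequency of the longer candidates that contain it. The `log2(length + 1)` weight is a further assumption so that single-word candidates do not score zero. All function names, the threshold, and the placeholder English tokens are hypothetical.

```python
import math
from collections import Counter
from typing import Dict, Iterator, List, Tuple

Gram = Tuple[str, ...]


def ngrams(tokens: List[str], max_len: int) -> Iterator[Gram]:
    """Yield every contiguous word sequence of length 1..max_len."""
    for n in range(1, max_len + 1):
        for i in range(len(tokens) - n + 1):
            yield tuple(tokens[i:i + n])


def contains(longer: Gram, shorter: Gram) -> bool:
    """True if `shorter` occurs as a contiguous run inside `longer`."""
    n = len(shorter)
    return any(longer[i:i + n] == shorter for i in range(len(longer) - n + 1))


def c_values(sentences: List[List[str]], max_len: int = 3) -> Dict[Gram, float]:
    """Score each candidate with an assumed C-value variant."""
    freq = Counter(g for s in sentences for g in ngrams(s, max_len))
    scores: Dict[Gram, float] = {}
    for cand, f in freq.items():
        # Frequencies of longer candidates in which this one is nested.
        nested_in = [fb for b, fb in freq.items()
                     if len(b) > len(cand) and contains(b, cand)]
        weight = math.log2(len(cand) + 1)  # assumed smoothing, see lead-in
        if nested_in:
            scores[cand] = weight * (f - sum(nested_in) / len(nested_in))
        else:
            scores[cand] = weight * f
    return scores


def anchor_candidates(scores: Dict[Gram, float], threshold: float) -> List[Gram]:
    """Keep candidates whose score meets the threshold, best first."""
    kept = [c for c, s in scores.items() if s >= threshold]
    return sorted(kept, key=lambda c: scores[c], reverse=True)


# Placeholder English tokens stand in for a segmented Uyghur corpus.
corpus = [["drip", "irrigation"], ["drip", "irrigation"], ["irrigation"]]
scores = c_values(corpus, max_len=2)
anchors = anchor_candidates(scores, threshold=1.0)
```

Here "drip" scores zero because it never appears outside "drip irrigation", while "irrigation" keeps a positive score from its one independent occurrence. That is the behaviour that makes a nested-frequency measure useful for choosing anchor words.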

Description

Technical field

[0001] The invention relates to the field of computer application technology, and in particular to a method for recognizing Uyghur agricultural technical terms.

Background technique

[0002] At present, there is no automatic recognition method for Uyghur agricultural terms. Recognition methods for Uyghur terms in fields other than agriculture use rule-based methods, statistical methods, or a combination of the two. However, these methods do not sufficiently consider that Uyghur, as an agglutinative language, has linguistic-knowledge features that arise from its rich morphological variation. As a result, a large annotated corpus is needed in the recognition process, and the recognition quality depends too heavily on the scale of the annotated corpus and on the annotation results, which leads to poor automatic recognition of domain terms. At the same time, due to the insufficient application of domain features based on language knowledge in the existing Uyghur...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/263, G06F40/284, G06F40/289
CPC: G06F40/263, G06F40/284, G06F40/289
Inventor: 张海军
Owner: 张海军