Classification query method for global human mtDNA development tree based on ensemble learning

A query method and integrated learning technology, which is applied in the field of global human mtDNA development tree classification query, can solve the problems of low correct rate, insufficient practicability, and inability to update data, etc., and achieve the effect of improving the correct rate

Active Publication Date: 2021-07-13
YUNNAN UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] As far as the current global human mtDNA development tree classification query method is concerned, due to the problem of algorithm design, its correct rate is low, it is difficult to meet the actual needs, and the data cannot be updated, so the practicability is not enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification query method for global human mtDNA development tree based on ensemble learning
  • Classification query method for global human mtDNA development tree based on ensemble learning
  • Classification query method for global human mtDNA development tree based on ensemble learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0030] figure 1 It is a specific implementation flow chart of the global human mtDNA developmental tree classification query method based on integrated learning in the present invention. Such as figure 1 As shown, the global human mtDNA developmental tree classification query method of the present invention, its specific steps are as follows.

[0031] S101: Obtain training data:

[0032] First, a number of mtDNA data are manually measured, and each piece of data contains the mtDNA variation site sequence and the corresponding most likely classification. Each variation site sequence contains the specific information of several variation sites, and then obtains the global human mtDNA development Tree data, including all classifications and the sequence of mtDNA variation sites corresponding to each classification.

[0033] The mtDNA data used in this example is selected from the real mtDNA raw data of various human regions provided by the Southwest Biodiversity Laboratory of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a global human mtDNA developmental tree classification and query method based on integrated learning, using manually measured mtDNA data to train a neural network classifier, using mtDNA developmental tree data to calculate the parameters of the Naive Bayesian classifier, and classifying The queried mtDNA variation site sequence is input into the neural network classifier to obtain the first Q possible classifications, and the mtDNA variation site sequence to be classified and the corresponding variation site sequences of the first Q possible classifications are combined to obtain the simple shell The input variable site sequence of the Yassian classifier, and calculate the weight of each variable site in the input variable site sequence, obtain the first Q possible classifications through the Naive Bayesian classifier, and then classify the two groups of Q possible classifications The probability is weighted, and the top Q possible classifications are obtained as the final classification results. The invention comprehensively utilizes the advantages of the neural network classifier and the naive Bayesian classifier, and improves the accuracy rate of the global human mtDNA development tree classification query.

Description

technical field [0001] The invention belongs to the technical field of machine learning, and more specifically relates to a global human mtDNA development tree classification query method based on integrated learning. Background technique [0002] my country is a country with a large population. Based on such a large country base, and the continuous exchanges and integration of various regions in the long river of history, as well as the isolation and migration based on geographical relations, my country's multi-ethnic population characteristics have gradually formed, providing researchers with A rich and diverse genetic treasure trove. But how to use such a treasure trove of genes to study human origin, migration, development and genetic structure is a problem we face. The research on the origin, migration and evolution of various ethnic groups is not only a major scientific issue, but also involves the self-identification of each ethnic group, which has important humanisti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B20/20G16B20/30G16B30/10G16B40/00G06F16/28G06K9/62
CPCG06F16/285G16B20/00G16B30/00G06F18/24155
Inventor 周维彭旻晟贾俊燕王文智向文坤张亚平
Owner YUNNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products