Method for predicting mutagenicity of chemicals through machine learning algorithm
A technology of mutagenicity and machine learning, applied in the field of ecological risk assessment testing strategies, can solve the problems of chemical mutagenicity bias, experiments without coverage to detect mutagenic types, and one-sided prediction results, achieving low-cost and high-efficiency predictions , Clarify the effect of the scope of application
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] Given a compound dinitrosocaffeine (CAS number: 145438-97-7), to predict its mutagenicity, first calculate its molecular fingerprint according to the Smiles code of dinitrosocaffeine, using the RDkit software package, and then Calculate its similarity with each molecule in the training set. It is calculated that there are 5 molecules in the training set whose similarity is greater than 0.25, so it is in the application domain. Based on its molecular fingerprints, predictions were made using the GBDT model. The result was 1, indicating that the compound is mutagenic. The predicted results are the same as the experimental results.
Embodiment 2
[0040] Given a compound p-anisidine (CAS No.: 104-94-9), to predict its mutagenicity, first calculate its molecular fingerprint using the RDkit software package according to the Smiles code of p-anisidine, and then calculate its difference with the training set. According to the calculation of the similarity of molecules, there are 267 molecules in the training set whose similarity is greater than 0.25, so they are in the application domain. Based on its molecular fingerprints, predictions were made using the GBDT model. The result was 1, indicating that the compound is mutagenic. The predicted results are the same as the experimental results.
Embodiment 3
[0042] Given a compound 10,10-dimethylundecane-1-amine (CAS number: 68955-53-3), to predict its mutagenicity, first according to 10,10-dimethylundecane-1 - The Smiles code of the amine, use the RDkit software package to calculate its molecular fingerprint, and then calculate its similarity with each molecule in the training set. Calculated, there are 91 molecules in the training set with a similarity greater than 0.25, so it is in the application domain . Based on its molecular fingerprints, predictions were made using the GBDT model. The result is 0, indicating that the compound is not mutagenic. The predicted results are the same as the experimental results.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com