Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

792 results about "Cross-validation" patented technology

Cross-validation, sometimes called rotation estimation or out-of-sample testing, is any of various similar model validation techniques for assessing how the results of a statistical analysis will generalize to an independent data set. It is mainly used in settings where the goal is prediction, and one wants to estimate how accurately a predictive model will perform in practice. In a prediction problem, a model is usually given a dataset of known data on which training is run (training dataset), and a dataset of unknown data (or first seen data) against which the model is tested (called the validation dataset or testing set). The goal of cross-validation is to test the model's ability to predict new data that was not used in estimating it, in order to flag problems like overfitting or selection bias and to give an insight on how the model will generalize to an independent dataset (i.e., an unknown dataset, for instance from a real problem).

Surface roughness prediction method based on GA-GBRT and method for optimizing process parameters

The invention discloses a surface roughness prediction method based on GA-GBRT and a method for optimizing process parameters. The method comprises the steps of: collecting data to construct a data set, and dividing the data set to training set data and test set data, and employing the training set data to perform training of key parameters of a GBRT model; b, performing parameter coding and population initialization: randomly generating a chromosomal sequence for increasing the number of iterations, the maximum depth of the individual regression estimator and the learning rate; c, employing the k-folded cross-validation method to train the GBRT model, and employing the genetic algorithm to calculate the fit goodness fitting value of each individual; d, when the number of cycles does not reach the maximum number of iterations, allowing the population to be selected, crossed and mutated to produce a new generation of populations, and continuously performing training of the GBRT model; and e, repeatedly performing the steps c and d until the number of cycles reaches the maximum evolution algebra or exceeds the maximum number of iterations to obtain the optimal model parameters. The surface roughness prediction method based on GA-GBRT and the method for optimizing process parameters are high in test precision and superior in prediction performance and improves the surface processing precision of the workpiece.
Owner:GUIZHOU UNIV

Encrypted traffic analysis feature extraction method and system, storage medium and safety equipment

The invention belongs to the technical field of network security and communication, and discloses an encrypted traffic analysis feature extraction method and system, a storage medium and security equipment. The method comprises the steps: collecting original traffic data ; preprocessing the collected original data packet, and filtering traffic data of SSL / TLS encryption communication; performing deep packet analysis on the streaming data to generate a traffic analysis log; performing log aggregation according to the connection quadruple information and the index information in each log to forma flow feature call chain, and performing feature extraction according to the call chain to obtain an initial data set; and determining an optimal supervised learning classification algorithm in thecurrent environment, determining an optimal parameter by using a grid parameter optimization method, and evaluating the feature extraction accuracy by using a ten-fold cross validation method. According to the method, the classification effect is optimal by adopting the random forest algorithm, the obtained accuracy is as high as 99.96%, the result shows that SSL / TLS encryption features used by all malicious families are different, and the classification effect is remarkable.
Owner:XIDIAN UNIV

Data-based qualitative analysis method of influence factors of circuit breaker faults

The invention discloses a data-based qualitative analysis method of influence factors of circuit breaker faults. The method solves prior problems of insufficient utilization of circuit breaker fault data, too subjective positioning of fault causes, insufficient stability of a qualitative analysis model and the like. According to the method, the key influence factors of all types of the circuit breaker faults are found through mining and analyzing the circuit breaker fault data. The method includes the realization steps of defining an original fault data set; cleaning the fault data; transforming the fault data; reducing the fault data; constructing a qualitative analysis model of the influence factors of the faults, and carrying out ten times of ten-fold cross-validation; and obtaining association rules between the faults and attributes, and obtaining the influence factors of all the types of the circuit breaker faults by qualitative analysis. According to the method, a supervised learning algorithm of CMAR is utilized for modeling, and with data amount increasing, the accuracy of the model will be continuously improved. At the same time, the robustness of the qualitative analysis model is guaranteed by the ten times of the ten-fold cross-validation, and the influence factors of the circuit breaker faults can be effectively analyzed.
Owner:无锡启工数据科技有限公司

Treated sewage quality prediction method based on combination of support vector classification and GRU neural network

The invention discloses a treated sewage quality prediction method based on combination of support vector classification and a GRU neural network, and belongs to the technical field of sewage treatment. Missing value processing, abnormal value elimination and data standardization are carried out on the collected sewage historical data, a PCA principal component analysis method is adopted to carryout dimension reduction on the data, and the selected auxiliary variable is used as an input variable of a sewage quality prediction model; a sewage effluent key prediction model is established by adopting a GRU neural network suitable for processing time series data, a support vector machine model is firstly introduced to classify sewage quality data, and then the classified data is respectivelymodeled through the GRU neural network algorithm to predict effluent quality. When the SVM model is trained, a grid search method and a cross validation method are used for optimizing model parameters, the prediction precision of the obtained joint prediction model is more accurate, the model effect is better, the network performance can meet the actual application requirements, and accurate prediction of the effluent quality of the sewage treatment system can be realized.
Owner:HEFEI UNIV

Customs declaration data processing method and device, computer equipment and storage medium

The invention relates to the field of big data processing, in particular to a customs declaration data processing method and device, computer equipment and a storage medium. The method comprises the following steps: receiving customs declaration data sent by a declaration main body, wherein the customs declaration data carries an order identifier, a payment order identifier and a waybill identifier; obtaining encrypted order data, encrypted payment order data and encrypted waybill data from corresponding nodes of the block chain according to the order identifier, the payment order identifier and the waybill identifier; obtaining a directionally authorized decryption key corresponding to the encrypted order data, the encrypted payment bill data and the encrypted waybill data; decrypting thecorresponding data through the obtained decryption key of the directional authorization; performing cross validation on the decrypted order data, payment data and waybill data to obtain a cross validation result; and when the cross validation result shows that the order data, the payment order data and the waybill data are inconsistent, increasing the check rate of customs clearance articles corresponding to the customs clearance data. By adopting the method, the data security can be ensured.
Owner:ONE CONNECT SMART TECH CO LTD SHENZHEN

Analysis method and application of electroencephalogram (EEG) signals based on complex network

The invention discloses an analysis method and application of EEG signals based on a complex network. The analysis method of the EEG signals based on the complex network comprises the following steps: constructing multi-scale level limited penetrable visibility graph complex networks; calculating characteristic indexes of each multi-scale level limited penetrable visibility graph complex network; combining a support vector machine to classify the EEG signals, namely using a leave-one-out cross-validation and support vector machine classifier to classify all two-dimensional index vectors, and using a ten-fold cross-validation and support vector machine classifier to classify all the two-dimensional index vectors. According to the invention, multi-scale ideas and level limited penetrable visibility graph theories are combined to construct an EEG multi-scale level limited penetrable visibility graph complex network so as to extract complex network indexes, and the support vector machine classifier in machine learning is combined to realize high-accuracy classification for different EEG signals. The analysis method and application of the EEG signals based on the complex network can be applied to smart head-mounted wearable equipment, and sleep EEG signals are measured through analyzing the smart wearable equipment to monitor the brain state of a user, furthermore, necessary early warning can be provided.
Owner:钧晟(天津)科技发展有限公司

Abnormity detection method for enterprise industry classification

The invention discloses an abnormity detection method for enterprise industry classification, which comprises the following steps of: firstly, extracting to-be-mined text and non-text information in taxpayer industry information, and carrying out feature processing and coding processing; Secondly, constructing a deep network structure conforming to the industry classification abnormity detection problem, and determining the number of neurons of an input layer and an output layer of the network according to the characteristic dimension of the coded data; Thirdly, on the basis of the constructeddeep network structure, adopting different training strategies to train the industry large-class network and the industry detail network through cross validation; And finally, carrying out abnormitydetection on the industry large class by using dimension reduction characteristics of the industry large class network in combination with an SOS abnormity detection algorithm, and carrying out abnormity detection on industry details according to reconstruction characteristics of the industry detail network. According to the invention, the TADM model is utilized to carry out abnormal detection onthe original data, and macroscopic management work such as national statistics, tax collection and industrial and commercial management can be analyzed more reasonably and accurately.
Owner:XI AN JIAOTONG UNIV

Method for determination of PCTFE content in PBX explosive by near infrared spectrum

The invention discloses a method for determination of the polychlorotrifluoroethylene (PCTFE) content in a polymer-bonded explosive (PBX) by near infrared spectroscopy. The method includes the steps of: preparing and collecting 260 PBX samples, taking 180 samples of them as a calibration set for establishing a calibration model, taking the remaining 80 samples as a validation set for model validation, and acquiring the near infrared spectrum data of all the samples; using a standard method to determine the PCTFE content in the samples; subjecting the spectrum data of the validation set samples in the wave bands of 6102.0cm<-1>-5697.0cm<-1> and 4680.2cm<-1>-4242.9cm<-1> to a first order derivative treatment, correlating the treated spectrum data with the PCTFE content by a partial least squares method, and establishing the calibration model by cross validation; employing the calibration model to predetermine the PCTFE content of the validation set samples, and selecting an optimal model according to a minimum root mean squared error of prediction (RMSEP) of the validation set; and acquiring the near infrared spectrum data of the samples to be determined, and making use of the optimal model to obtain the PCTFE content directly. Being suitable for determination of the PCTFE content in a PBX explosive, the method has the characteristics of convenient operation, and rapid and accurate analysis.
Owner:XIAN MODERN CHEM RES INST

A method for carrying out transformer area user identification based on optimized supervised learning

InactiveCN109816033AReliable Identification ResultsReduce hardware costsCharacter and pattern recognitionTransformerEngineering
The invention relates to the field of data analysis, and discloses a method for carrying out transformer area user identification based on optimized supervised learning. The method comprises the following steps of determining a user with a known station user topological relation and a station area and a phase to which the user belongs, determining a corresponding tag of user data according to thestation area and the phase to which the user belongs, establishing a training set, a verification set and a test set, determining k parameters in a KNN model by adopting a cross verification mode, andcompleting model training; and identifying and classifying the voltage data to be identified by adopting the trained model and the determined k value, thereby realizing the identification of the users in the transformer area to be identified. According to the invention, conversion from unsupervised learning to supervised learning is realized; a training set, a verification set and a test set arereasonably set, and k parameters are determined by adopting a cross verification mode, so that the transformer area and the phase of a user are accurately and effectively identified, the problem of cross-transformer-area user ownership is thoroughly solved, and a foundation is laid for comprehensively guiding work in the fields of operation, maintenance, first-aid repair, technical improvement, planning and the like of a low-voltage transformer area.
Owner:SICHUAN ENERGY INTERNET RES INST TSINGHUA UNIV

Method for solving collaborative filtering recommendation data sparsity based on neural network

The invention provides a method for solving collaborative filtering recommendation data sparsity based on a neural network. The method for solving collaborative filtering recommendation data sparsity based on the neural network adopts generalized regression of neural network (GRNN) and conducts full filling on sparse data by a train network model and score prediction. The method for solving collaborative filtering recommendation data sparsity based on the neural network comprises the following steps: before conducting the GRNN training, conducting screening on input variables of the neural network by adopting mean impact value (MIV), choosing characteristic values having great impact on output as valid input variables; using the valid input variable to construct the input matrix of the GRNN; adopting Kfold cross validation circulation to find out an optimal spread value of the GRNN; using the optimal spread value and the corresponding input matrix and output matrix to conduct GRNN training; using the trained GRNN to conduct score prediction on a sparse score matrix; and replacing non-scored data of the sparse score matrix with predicted score values. The method for solving collaborative filtering recommendation data sparsity based on the neural network can conduct fully filling on sparse recommendation data, solve the data height sparsity problem most outstanding in existing collaborative technology, and enable recommendation result to be accurate.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Microgrid energy storage SOC estimation method and system based on least square support vector machine

The invention discloses a microgrid energy storage SOC estimation method and a system based on a least square support vector machine. Firstly, a microgrid experiment platform comprising an energy storage system is used, and through measurement, voltage, current, temperature and an SOC data sequence of the energy storage system during a charging and discharging process are obtained; according to the measurement data, training set and test set samples for the least square support vector machine are obtained; a cross validation method is then used for selecting the optimal training parameters for the training set of the least square support vector machine, and a training model is further obtained; and the testing set is then used for testing the training model, the result is evaluated, and finally, the training model can be used for SOC estimation determination on the corresponding type of energy storage system in the microgrid. Defects in the traditional SOC calculation method can be overcome, reference is provided for charging and discharging control on the energy storage system, deep charging and discharging of a battery can be prevented, and the service life of the battery and the operation safety of the microgrid can be improved.
Owner:NARI TECH CO LTD +5

Scanning signal feature extraction method based on independent component analysis and recognition method

The invention discloses a scanning signal feature extraction method based on independent component analysis. The method comprises the steps that six-lead scanning eye electric signals are collected and subjected to band-pass filtering treatment, an airspace filter set corresponding to different scanning task backgrounds is built through an ICA method for filtered data, linear protection is carried out, and airspace feature parameters of scanning signals are obtained. The invention further discloses a recognition method of the scanning signal feature extraction method based on independent component analysis. An ICA airspace filter set is built for each experiment sample in an eye movement database, feature parameters are extracted, cross validation is carried out through a support vector machine, and the optimal ICA filter set and SVM model parameters are determined; the optimal ICA airspace filter set is used for filtering, and then the result is fed into an SVM classifier to be recognized. The scanning signal feature extraction method based on independent component analysis and the recognition method have the advantages of being higher in recognition accuracy rate, higher in expansibility, good in application prospect and the like.
Owner:ANHUI UNIVERSITY

Efficient high-accuracy whole-genome selection method capable of performing parallel operation

The invention relates to the technical field of animal and plant breeding and human disease prediction, and provides an efficient high-accuracy whole-genome selection method capable of performing parallel operation. The method comprises the following steps: firstly, reading an original genotype file and a phenotype file, constructing a new genotype file and a new phenotype file, and calculating agenetic relationship matrix of all individuals; then, extracting all individuals in the new phenotypic file as a reference group, and extracting all individuals without phenotypic data in the originalgenotypic file as a prediction group; carrying out whole genome association analysis by utilizing the reference group data, and extracting result characteristics of the whole genome association analysis; constructing a model library with specific characters, sequentially optimizing an optimal fixed effect and an optimal random effect by adopting a cross validation strategy, and selecting an optimal prediction model from the model library; and finally, calculating genome estimated breeding values of the prediction group by utilizing the optimal prediction model. The method can quickly, accurately and stably predict individual genome breeding values, and thus the accuracy and efficiency of whole genome selection are improved.
Owner:武汉影子基因科技有限公司

Shopping mall building air conditioner cooling load prediction method based on GBDT, storage medium and equipment

PendingCN112001439AFlexible handlingSolve the problem of requiring a large amount of data trainingForecastingCharacter and pattern recognitionSimulationEngineering
The invention discloses a shopping mall building air conditioner cooling load prediction method based on GBDT, a storage medium and equipment, and the method comprises: collecting cooling load data, and carrying out the normalization processing to serve as the cooling load energy consumption prediction; establishing a load prediction model based on a gradient lifting decision tree algorithm; inputting the preprocessed data into a prediction model for training, selecting a grid search-cross validation mode, and optimizing the three hyper-parameters with the maximum influence on the performanceof the GBDT model; establishing a final cold load prediction model by completing parameter optimization of the prediction model, and obtaining a predicted cold load curve according to the parameters and the structure of the prediction model; and evaluating the prediction performance of the prediction model, adopting the prediction error for evaluation, enabling the deviation between the true valueand the prediction value to form the prediction error, and completing mall building air conditioner cooling load prediction. The method has good prediction precision, universality and applicability,and is especially suitable for large public buildings with periodically changing cold loads.
Owner:XI'AN UNIVERSITY OF ARCHITECTURE AND TECHNOLOGY

Cutter residual life prediction method based on machine learning regression algorithm

The invention relates to the field of machine tool cutter remaining life prediction, and discloses a cutter residual life prediction method based on a machine learning regression algorithm. The cutterresidual life prediction method comprises two parts including model training and online life prediction. The model training comprises the steps of collecting original data of a complete life cycle and establishing a corresponding relation with the actual life of a cutter, preprocessing signals, extracting signal features to form feature vectors, performing cross validation to obtain an optimal cutter life model, and performing hyper-parameter adjustment and optimization. The online life prediction comprises real-time data acquisition, signal preprocessing, signal feature extraction to form afeature vector, input of THE optimal cutter life model based on optimal hyper-parameters and output of the residual life of the cutter. The number of eigenvalues extracted from each channel during model training is large, so that the training precision is high, the residual life of the cutter is accurately predicted, a residual life intelligent prediction model of the cutter is established, different regression models can be intelligently selected according to different working condition environments, and the model is good in generalization performance and high in portability.
Owner:南京凯奥思数据技术有限公司

Protein lysine malonylation site prediction method based on deep learning

The invention discloses a protein lysine malonylation site prediction method based on deep learning, and relates to the technical field of biological information. The method comprises the steps: converting the character information of a protein sequence into a numerical vector by adopting an enhanced amino acid composition, a grouped enhanced amino acid composition, a dipeptide deviation expectedaverage value, a K neighbor score and a BLOSUM62 matrix feature extraction algorithm, performing fusion and obtaining a feature space, wherein the influence of each potential feature on a prediction result is fully considered; performing calculating by using a linear convolutional neural network to obtain malonyl site specificity characteristics; selecting related features and reducing feature dimensions through a maximum pooling layer, classifying malonylation sites and non-malonylation sites in combination with a multi-layer deep neural network, constructing a protein malonylation site prediction model DeepMal, and evaluating prediction performance by using 10-fold cross validation and an independent test data set. The model DeepMal is remarkably improved in evaluation indexes, and further promotion of application of deep learning in protein function prediction is facilitated.
Owner:QINGDAO UNIV OF SCI & TECH

Individual sub-health intervention method and system based on big data and artificial intelligence

InactiveCN109545328AIntervention effect is goodSub-health reversal is effectiveHealth-index calculationNutrition controlEpidemiologyPhysical exercise
The invention discloses an individual sub-health nutrition and health intervention method based on big data and a management system. The method comprises the steps of establishing a sub-health assessment initial model and a nutrition and health intervention scheme library on the basis of evidence based medicine, epidemiology, dietary nutrition survey and nutrition intervention data; revising sub-health assessment model parameters by food, exercise and sign information analytic statistics results of a large sample of people and carrying out cross validation; acquiring individual sub-health score, an interpreted report and a recommended nutrition and health intervention scheme from personal information; in a process of executing the nutrition and health intervention scheme, assessing individual sub-health at regular intervals and adjusting the individual sub-health to obtain an optimal intervention effect; and meanwhile, continuously calculating and optimizing the individual sub-health assessment model by cumulative data of the personal information. The management system which executes the method consists of four modules including a personal information acquiring module, a calculating module, a display module and a supervision module. By the method and the management system, a scientific, individualized, visual and traceable sub-health solution is provided for residents, and long-term health records are established.
Owner:极力健生物科技(广州)有限公司

Protein-protein interaction site prediction method based on deep map convolutional network

PendingCN113192559AInteraction site prediction problem solvingImprove the accuracy of interaction predictionNeural architecturesHybridisationBiologyNeutral network
The invention discloses a protein-protein interaction site prediction method based on a deep map convolutional network; the method comprises the following steps: extracting a node feature matrix and an adjacent matrix containing side information according to the sequence and structure information of protein, and forming protein map representation together; carrying out deep map convolution based on an initial residual error and identical mapping; inputting the output of the last layer of image convolution layer of the deep image convolution into a multi-layer perceptron, and completing construction of the deep image convolution neural network; extracting training data to obtain protein map representation, and training the deep map convolutional neural network by adopting a five-fold cross validation method; and extracting to-be-detected data to obtain protein map representation, and inputting the protein map representation into the trained deep map convolutional neural network to realize prediction of protein-protein interaction sites. According to the method, protein space structure information can be fully utilized, and the accuracy of protein-protein interaction site prediction is further improved.
Owner:SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products