Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

33 results about "Active learning (machine learning)" patented technology

Active learning is a special case of machine learning in which a learning algorithm is able to interactively query the user (or some other information source) to obtain the desired outputs at new data points. In statistics literature it is sometimes also called optimal experimental design.

Unbalanced data classification method based on active learning

The invention discloses an unbalanced data classification method based on active learning, and the method comprises the steps: randomly sampling and selecting a sample from original label-free data for marking, and taking the sample as initial training data; performing cost-sensitive learning training on the initial training data by adopting a universal machine learning model; predicting all samples which are not labeled in the original training data samples by utilizing a trained binary supervised classification model, and selecting N samples which are most uncertain according to uncertainty;respectively calculating the sum of Euclidean distances between the N samples and the central point of the trained data set, and selecting M samples from the N samples according to the descending order of the distances; marking the selected M samples, and adding the marked M samples into a training data set; performing cost-sensitive learning training on the initial training data set by using a universal machine learning model; and continuously repeating the process, iteratively circulating until the average uncertainty of the selected M samples is smaller than a set uncertainty threshold, and stopping training. According to the method, on the basis of keeping the performance of the unbalanced data classifier, the sample size of labeling can be effectively reduced, so that the labeling time and the labor cost are saved.
Owner:NANJING UNIV OF SCI & TECH

Medical image classification method and device, medium and electronic equipment

The invention relates to the field of machine learning, and discloses a medical image classification method and device, a medium and electronic equipment. The method comprises the following steps: selecting a target medical image sample from an unlabeled medical image sample set by utilizing an active learning framework, wherein a query strategy of the active learning framework is provided by a reinforcement learning model; inputting the target medical image sample labeled by the labeling expert into a medical image classification model, and training the medical image classification model; ifthe training does not meet the preset condition, obtaining a training result, training a reinforcement learning model based on the training result, updating a query strategy by utilizing the trained reinforcement learning model, and turning to a sample selection step until the training meets the preset condition; and inputting to-be-classified medical image data into the trained medical image classification model for classification. According to the method, a long-acting working mechanism for training the medical image classification model through man-machine cooperation is established, the labeling cost is reduced, and the labeling efficiency is improved.
Owner:PING AN TECH (SHENZHEN) CO LTD

Transfer learning algorithm based on active learning

The invention discloses a transfer learning algorithm based on active learning, and belongs to the field of machine learning. For a general unsupervised transfer learning algorithm, a large number ofresearches exist at present, but on this basis, the improvement of the algorithm performance in a target field can be obtained at a relatively low sample labeling cost. The active transfer learning algorithm accesses a batch of data based on an active sampling method to finely tune and update network parameters after an unsupervised domain self-adaption process is carried out, so that the extracted features have good migration capability and good discrimination capability. In the invention, an active sampling strategy is not only based on a traditional information entropy method, but also provides an evaluation index of one characteristic under a transfer learning background.
Owner:NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

Method for optimizing gradient titanium dioxide nanotube micro-patterns under assistance of machine learning

The invention discloses a method for optimizing a gradient titanium dioxide nanotube micro-pattern under the assistance of machine learning, and relates to preparation of the gradient titanium dioxide nanotube micro-pattern. The method comprises the following steps: 1) setting related experimental conditions to prepare a TiO2 nanotube micro-pattern and characterize the TiO2 nanotube micro-pattern to obtain experimental data; 2) preprocessing the obtained experimental data and performing machine learning modeling; 3) performing, by the machine learning model, prediction and recommending an optimization experiment scheme; and (4) verifying a prediction result through an experiment, supplementing data, and iterating the steps (1)-(4). By means of the technical scheme, sample data expansion, self-learning and automatic training of a model meeting preset precision can be automatically realized, and then an active learning framework for predicting the parameter structure property of the material is automatically constructed, and intelligent generation and reverse design of the material are realized. The TiO2 nanotube micro-pattern sample with the maximum gradient range, which is prepared in one step by utilizing a bipolar oxidation method under an ammonium fluoride/water/glycerol system, and the experimental conditions of the TiO2 nanotube micro-pattern sample can be found under fewer experimental conditions. Operation is simple and convenient, and operation time is short.
Owner:XIAMEN UNIV

Method, device and equipment for determining training sample

The invention relates to the technical field of machine learning, and discloses a method for determining a training sample, and the method comprises the steps: obtaining an unlabeled sample set and a plurality of alternative models, and distributing corresponding labeled sample sets for the alternative models, wherein the plurality of alternative models are active learning models with different sample selection strategies; training corresponding alternative models by using the labeled sample set to obtain evaluation models corresponding to the alternative models; evaluating the unlabeled sample set by using an evaluation model to obtain a first evaluation result; and determining a training sample according to the first evaluation result. According to the method, the active learning models with different sample selection strategies are trained through the labeled sample set to obtain the corresponding evaluation models, and the unlabeled sample set is evaluated by using the evaluation models to determine the training samples, so that the tendency of a single active learning algorithm is avoided, and the diversity of the training samples is improved. The invention also discloses a device and equipment for determining the training sample.
Owner:SHANGHAI MININGLAMP ARTIFICIAL INTELLIGENCE GRP CO LTD

Active learning for data matching

The inventive method comprises: a) training a machine learning model using a current set of tagged data points, each data point being a plurality of data records, where the tagging of the data points indicates a classification of the data points, the training resulting in a trained machine learning model configured to classify the data points as representing the same entity or different entities. B) a subset of unmarked data points can be selected from the current unmarked data point set using the classification results of the current unmarked data point set. C) a subset of unlabeled data points may be provided to a classifier and a label of the subset of unlabeled data points may be received in response to the providing. Steps a) to c) may be repeated using the current set of tagged data points plus the subset of tagged data points as the current set of tagged data points.
Owner:IBM CORP

Method and device for identifying life cycle operation and maintenance state of optical channel

The invention discloses a method and device for identifying the life cycle operation and maintenance state of an optical channel. The method includes: collecting historical data of a current network through a network management system, and defining the life cycle operation and maintenance state of the optical channel according to the historical data, wherein the historical data comprise topological structure data, historical alarm data and historical performance data; performing sample labeling on the collected historical data through active learning to obtain a labeled sample set containing a plurality of labeled data; performing feature engineering processing on the data in the labeled sample set, and calling a machine learning algorithm to train the processed data to obtain an optical channel life cycle operation and maintenance state recognition model; calling the optical channel life cycle operation and maintenance state identification model for the to-be-detected optical channel to obtain the optical channel life cycle operation and maintenance state of the to-be-detected optical channel, and positioning the position and reason of the hidden danger according to the corresponding characteristics. According to the scheme, the life cycle operation and maintenance state of the optical channel can be quickly and accurately identified, and faults can be predicted and fault causes can be positioned in advance.
Owner:FENGHUO COMM SCI & TECH CO LTD +1

Industrialized system for rice grain recognition and method thereof

Proposed is an industrialized system (1) and method for rice grain recognition. An optical image (123) is taken by a user (5) and transmitted to a digital platform (11), wherein the system (1) segments the optical image (123) and extracts and / or measures appropriate grain features (41) from the image (123) describing different aspects of the grain (4). The image (123) is processed by the system (1), comprising a selector (1122) selecting different machine learning structures (11211,...,1121i), applying the different machine learning structures (11211,...,1121i) to the extracted features (41) for rice grain (4) recognition, and selecting the best of the applied machine learning structures (11211,...,1121i) by a random sampling process. The selected best of the applied machine learning structures (11211,...,1121i) is further optimized by varying an appropriate threshold (11231) by a threshold trigger (1123) based on a confusion matrix comprising (11221). An active learning structure (113) based on the confusion matrix comprising (11221) the values of True Positive (TP), False Negative (FN), False Positive (FP) and True Negative (TN) for the classified rice grains, providing a feedback loop to the user or human expert (5), wherein the system (1) is retrained based on the feedback parameters of the feedback loop (1132).
Owner:BUEHLER AG

Active learning classification method based on uncertainty and similarity measurement

The invention discloses an active learning classification method based on uncertainty and similarity measurement. The method comprises the following steps: S1, carrying out preprocessing and vectorization on unlabeled classification data; S2, clustering, selecting most representative samples in each class, carrying out manual labeling, recording the samples as a data set L, and recording the rest samples as a set U; S3, calculating a similarity metric value of each sample in the U; S4, enabling the L to be used for training a plurality of different machine learning models, and obtaining the accuracy rate and the output value of each model; S5, determining a weight value and an uncertainty degree of each model so as to determine an uncertainty decision value; S6, determining a diversified training sample with the maximum value, labeling the diversified training sample, updating the labeled diversified training sample to the data set L, and removing the labeled diversified training sample from the set U to obtain an updated set U; and S7, repeating the steps S3-S6 until the accuracy of each model does not change any more, and obtaining a final marked data set L. According to the method, the information redundant sample size can be reduced, and the data labeling cost is reduced on the basis of ensuring the training effect.
Owner:SOUTHWEST PETROLEUM UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products