AI – machine learning algorithms applied to transformer diagnostics
Statistical learning has a different interpretation for each of the above-indicated algorithms.
byDr. Luiz Cheim
- INTRODUCTION
1.1 Dataset
The dataset employed to train the machine learning algorithms contained 24 typical transformer parameters such as nameplate data, DGA, oil quality, insulation power factor, etc. As illustrated in Table 1 and Table 2, it provides a general statistical description of each parameter for the whole dataset.
1.2 Machine learning training with 10-fold cross-validation
The training was achieved by first random partitioning the original dataset with 1,000 transformers into two subsets, in which one dataset contained data for 800 transformers (training dataset), and the remaining 200 transformers were used as validation or test dataset. The training process was a supervised learning based on a 10-fold cross-validation procedure with 3 repeats, yielding 30 output accuracies for each machine learning algorithm [2-5], with each accuracy corresponding to each fold in a given repeat process. The supervised learning was applied with the support of human experts who have analyzed the same 1,000 cases provided to the machine learning algorithms.
See the full article in PDF
Fitting of input data to desired output data using ML algorithms is called learning or training, and once the ML model is trained, it can be used for predicting output value for arbitrary inputs
Machine learning algorithms
The following 12 ML algorithms were trained and compared in the present work:
Linear algorithms
- General linear regression (logistic regression) – GLM
- Linear discriminant analysis – LDA
Non-linear algorithms
- Classification and regression trees (CART)
- C5.0 (a type of CART algorithm)
- Naïve Bayes algorithm (NB)
- K-nearest neighbor (KNN)
- Support vector machine (SVM)
Ensemble algorithms
- Random forest (stochastic assembly of a large number of CART algorithms)
- Tree bagging (tree bagging)
- Extreme gradient boosting machine (XGB tree)
- Extreme gradient boosting machine (XGB linear)
- Artificial neural networks (ANN – not deep learning yet)
The following section describes the meaning of statistical learning for each algorithm.
- Statistical learning process
Statistical learning has a different interpretation for each of the above-indicated algorithms. In linear regression, for example, the learning process is associated with the optimal search of the linear model coefficients that best correlate inputs to outputs in a given problem.
Log in or subscribe to read the whole article.