摘要
In order to improve the accuracy of unbalanced data classification, the random forest algorithm is used for data classification, and the whale optimization algorithm is adoped to optimize the key parameters of the random forest, thus the adaptability of the random forest algorithm to unbalanced data classification is enhanced. First, the unbalanced data classification model is developed based on the random forest. The classification difficulties caused by sample imbalance are effectively solved through multiple decision tree weak classifiers of the random forest. Second, the whale swarm optimization algorithm is deployed to optimize the weight of weak classifiers, and the average classification accuracy is taken as the fitness function of the whale swarm optimization. Thus the accuracy of the weak classifier weight voting on the final classification results. Finally, the random forest model optimized by the whale population is used to classify the unbalanced data. Experiments show that by reasonably setting the parameters of the whale swarm optimization algorithm, the weight of random forest weak classifiers with higher classification accuracy can be obtained. Compared with the unbalanced data classification algorithms, this algorithm can obtain better classification performance. ? 2022 Journal of Nanjing Institute of Posts and Telecommunications.
- 单位