DATASETS PREPARATION FEATURES FOR TRAINING ARTIFICIAL NEURAL NETWORKS
DOI:
https://doi.org/10.31471/1993-9981-2024-1(52)-115-120Keywords:
object recognition, neural networks, datasets, data processing, image annotation.Abstract
The article analyzes key aspects of the formation of training datasets, including images of objects, for training artificial neural networks, in particular on the YOLO platform, which is widely used for recognition tasks. The subject of the study is the process of identifying and classifying objects associated with track scenarios, such as track robots, barriers and track surfaces. The article presents an approach to creating two-dimensional images with the appropriate annotation of the classes "robots", "track" and "barriers". Particular attention is paid to data augmentation methods that allow increasing the variability and improving the quality of training samples, which is critically important for the effectiveness of training neural networks. The study describes the process of specializing a neural network in recognizing objects by specific characteristics or in conditions with limited resources. After training, the network is used to identify and segment certain objects in images. The collected recognition results are subject to aggregation, which allows integrating information from different sources, thereby increasing the overall accuracy of the system. The proposed methods can serve as the basis for developing and improving object recognition algorithms in various applied tasks, such as mobile robotics, monitoring and environmental analysis. The relevance of the work is due to the need to increase the stability and accuracy of object recognition systems in difficult conditions, in particular in the presence of limited and low-quality training data. The obtained results and approaches can be applied to solve practical problems and support further research in this area.
Downloads
References
1. Nanthini K., Sivabalaselvamani D. and oth. A Survey on Data Augmentation. 2023 7th International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 2023. pp. 913-920. DOI: 10.1109/ICCMC56507.2023.10084010.
2. Rogers A. W. and oth. A transfer learning approach for predictive modeling of bioprocesses using small data. Biotechnology and Bioengineering. 2021. Vol. 119, № 2. P. 411–422. DOI: 10.1002/bit.27980
3. Gürsakal N., Çelik S., Birişçi E. Synthetic Data Generation with Python. Synthetic Data for Deep Learning. Berkeley, CA, 2022. С. 159–214. DOI: 10.1007/978-1-4842-8587-9_5
4. Roboflow: Computer vision tools for developers and enterprises. Roboflow: Computer vision tools for developers and enterprises. URL: https://roboflow.com/
5. Huang L. Normalization Techniques in Deep Learning. Cham : Springer International Publishing, 2022. DOI: 10.1007/978-3-031-14595-7
6. A Review on Imbalanced Data Handling Using Undersampling and Oversampling Technique. International Journal of Recent Trends in Engineering and Research. 2017. Т. 3, № 4. С. 444–449. DOI: 10.23883/ijrter.2017.3168.0uwxm