References


Электронные библиотеки

Russian Digital Libraries Journal

ISSN 1562-5419

Kazan (Volga Region) Federal University

DOI: 10.26907/1562-5419-2024-27-5-718-729


Research Article

Articles

Automatic Annotation of Training Datasets in Computer Vision Using Machine Learning Methods

Журавлёв

Алексей Константинович

Zhuravlev

Aleksey Konstantinovich

UnMelow@yandex.ru

Григорян

Карен Альбертович

Grigorian

Karen Albertovich

karigri@yandex.ru

Kazan (Volga Region) Federal University

2024

28.05.2025

Vol. 27, No. 5. P. 718–729

2025

Журавлёв А.К., Григорян К.А.

Zhuravlev A.K., Grigorian K.A.

This work is licensed under a Creative Commons Attribution 4.0 License.

https://ellibs.elpub.ru/jour/article/view/568

This paper addresses automatic annotation of training datasets for computer vision using machine learning methods. Data annotation is a key stage in developing and training deep learning models, yet creating labeled data often requires significant time and labor. The paper proposes an automatic annotation mechanism based on convolutional neural networks (CNNs) and active learning methods. The methodology includes an analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed on publicly available datasets. The results show that the proposed method significantly reduces the time required for data annotation, although intervention by a human annotator is still required in every case. The literature review analyzes modern annotation methods and existing automatic systems, which clarifies the context and advantages of the proposed approach. The conclusion discusses achievements, limitations, and possible directions for future research in this field.

computer vision, machine learning, automatic data annotation, training datasets, image segmentation

References

Council J. Data challenges are halting AI projects, IBM executive says // The Wall Street Journal. 2019. Vol. 28.

LabelImg for Image Annotation. URL: https://viso.ai/computer-vision/labelimg-for-image-annotation/.

VGG Image Annotator. URL: https://www.robots.ox.ac.uk/~vgg/software/via/via_demo.html.

Everingham M. et al. The PASCAL visual object classes challenge: A retrospective // International Journal of Computer Vision. 2015. Vol. 111. P. 98–136.

Berg A. et al. Semi-automatic annotation of objects in visual-thermal video // Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops. 2019. https://doi.org/10.1109/ICCVW.2019.00277

Sager C., Janiesch C., Zschech P. A survey of image labelling for computer vision applications // Journal of Business Analytics. 2021. Vol. 4, No. 2. P. 91–110.

Cao J., Zhao A., Zhang Z. Automatic image annotation method based on a convolutional neural network with threshold optimization // PLoS ONE. 2020. Vol. 15, No. 9. e0238956. https://doi.org/10.1371/journal.pone.0238956

Vatani A., Ahvanooey M.T., Rahimi M. An effective automatic image annotation model via attention model and data equilibrium // arXiv preprint arXiv:2001.10590. 2020.

Gu Y. et al. Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs // Computers in Biology and Medicine. 2018. Vol. 103. P. 220–231.

Levine S. et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection // The International Journal of Robotics Research. 2018. Vol. 37, No. 4-5. P. 421–436.

Kirillov A. et al. Segment anything // Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023. P. 4015–4026.

Zou X. et al. Segment everything everywhere all at once // Advances in Neural Information Processing Systems. NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023. Article No. 868. P. 19769–19782. https://dl.acm.org/doi/10.5555/3666122.3666990

Ultralytics YOLOv8 Docs. URL: https://docs.ultralytics.com/ru.

COCO Dataset. URL: https://cocodataset.org/#home.

Cityscapes Dataset. URL: https://www.cityscapes-dataset.com/.

Auto-Label. URL: https://roboflow.com/auto-label.

The authors declare no conflicts of interest.