<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">ellibs</journal-id><journal-title-group><journal-title xml:lang="ru">Электронные библиотеки</journal-title><trans-title-group xml:lang="en"><trans-title>Russian Digital Libraries Journal</trans-title></trans-title-group></journal-title-group><issn pub-type="epub">1562-5419</issn><publisher><publisher-name>Казанский (Приволжский) федеральный университет</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.26907/1562-5419-2024-27-5-718-729</article-id><article-id custom-type="elpub" pub-id-type="custom">ellibs-568</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>Статьи</subject></subj-group></article-categories><title-group><article-title>Автоматическая разметка обучающих выборок в компьютерном зрении с использованием методов машинного обучения</article-title><trans-title-group xml:lang="en"><trans-title>Automatic Annotation of Training Datasets in Computer Vision using Machine Learning Methods</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Журавлёв</surname><given-names>Алексей Константинович</given-names></name><name name-style="western" xml:lang="en"><surname>Zhuravlev</surname><given-names>Aleksey Konstantinovich</given-names></name></name-alternatives><email xlink:type="simple">UnMelow@yandex.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Григорян</surname><given-names>Карен Альбертович</given-names></name><name name-style="western" xml:lang="en"><surname>Grigorian</surname><given-names>Karen Albertovich</given-names></name></name-alternatives><email xlink:type="simple">karigri@yandex.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Казанский (Приволжский) федеральный университет</institution></aff><aff xml:lang="en"><institution>Kazan (Volga region) Federal University</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2024</year></pub-date><pub-date pub-type="epub"><day>28</day><month>05</month><year>2025</year></pub-date><volume>27</volume><issue>5</issue><fpage>718</fpage><lpage>729</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Журавлёв А.К., Григорян К.А., 2025</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="ru">Журавлёв А.К., Григорян К.А.</copyright-holder><copyright-holder xml:lang="en">Zhuravlev A.K., Grigorian K.A.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://ellibs.elpub.ru/jour/article/view/568">https://ellibs.elpub.ru/jour/article/view/568</self-uri><abstract><p>Рассмотрена проблема автоматической разметки обучающих выборок в области компьютерного зрения с использованием методов машинного обучения.
&#13;

Разметка данных является ключевым этапом в разработке и обучении моделей глубокого обучения, однако процесс создания размеченных данных зачастую требует значительных временных и трудовых затрат. В статье предложен механизм автоматической разметки, основанный на использовании сверточных нейронных сетей и методов активного обучения.
&#13;

Предложенная методология включает анализ и оценку существующих подходов к автоматической разметке. Эффективность предложенных решений оценена на общедоступных наборах данных. Результаты показали, что предложенный метод в значительной мере сокращает время, необходимое для разметки данных, но в любом случае требует вмешательства оператора-разметчика.
&#13;

Обзор литературы включает анализ современных методов разметки и существующих автоматических систем, что позволяет лучше понять контекст и преимущества предлагаемого подхода. В заключении обсуждены достижения, ограничения и возможные направления для будущих исследований в данной области.
</p></abstract><trans-abstract xml:lang="en"><p>This paper addresses the issue of automatic annotation of training datasets in the field of computer vision using machine learning methods. Data annotation is a key stage in the development and training of deep learning models, yet the process of creating labeled data often requires significant time and labor. This paper proposes a mechanism for automatic annotation based on the use of convolutional neural networks (CNN) and active learning methods.
&#13;

The proposed methodology includes the analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed on publicly available datasets. The results demonstrate that the proposed method significantly reduces the time required for data annotation, although operator intervention is still necessary.
&#13;

The literature review includes an analysis of modern annotation methods and existing automatic systems, providing a better understanding of the context and advantages of the proposed approach. The conclusion discusses achievements, limitations, and possible directions for future research in this field.
</p></trans-abstract><kwd-group xml:lang="ru"><kwd>компьютерное зрение</kwd><kwd>машинное обучение</kwd><kwd>автоматическая разметка данных</kwd><kwd>обучающая выборка</kwd><kwd>сегментация изображений</kwd></kwd-group><kwd-group xml:lang="en"><kwd>computer vision</kwd><kwd>machine learning</kwd><kwd>automatic data annotation</kwd><kwd>training datasets</kwd><kwd>image segmentation</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Council J. Data challenges are halting AI projects, IBM executive says // The Wall Street Journal. 2019. Vol. 28.</mixed-citation><mixed-citation xml:lang="en">Council J. Data challenges are halting AI projects, IBM executive says // The Wall Street Journal. 2019. Vol. 28.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">LabelImg for Image Annotation. URL: https://viso.ai/computer-vision/labelimg-for-image-annotation/.</mixed-citation><mixed-citation xml:lang="en">LabelImg for Image Annotation. URL: https://viso.ai/computer-vision/labelimg-for-image-annotation/.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">VGG Image Annotator. URL: https://www.robots.ox.ac.uk/~vgg/software/via/via_demo.html.</mixed-citation><mixed-citation xml:lang="en">VGG Image Annotator. URL: https://www.robots.ox.ac.uk/~vgg/software/via/via_demo.html.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Everingham M. et al. The pascal visual object classes challenge: A retrospective // International Journal of Computer Vision. 2015. Vol. 111. P. 98–136.</mixed-citation><mixed-citation xml:lang="en">Everingham M. et al. The pascal visual object classes challenge: A retrospective // International Journal of Computer Vision. 2015. Vol. 111. P. 98–136.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Berg A. et al. Semi-automatic annotation of objects in visual-thermal video // Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2019. https://doi.org/10.1109/ICCVW.2019.00277</mixed-citation><mixed-citation xml:lang="en">Berg A. et al. Semi-automatic annotation of objects in visual-thermal video // Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2019. https://doi.org/10.1109/ICCVW.2019.00277</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Sager C., Janiesch C., Zschech P. A survey of image labelling for computer vision applications // Journal of Business Analytics. 2021. Vol. 4, No. 2. P. 91–110.</mixed-citation><mixed-citation xml:lang="en">Sager C., Janiesch C., Zschech P. A survey of image labelling for computer vision applications // Journal of Business Analytics. 2021. Vol. 4, No. 2. P. 91–110.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Cao J., Zhao A., Zhang Z. Automatic image annotation method based on a convolutional neural network with threshold optimization // Plos one. 2020. V. 15, No. 9. e0238956. https://doi.org/10.1371/journal.pone.0238956</mixed-citation><mixed-citation xml:lang="en">Cao J., Zhao A., Zhang Z. Automatic image annotation method based on a convolutional neural network with threshold optimization // Plos one. 2020. V. 15, No. 9. e0238956. https://doi.org/10.1371/journal.pone.0238956</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Vatani A., Ahvanooey M.T., Rahimi M. An effective automatic image annotation model via attention model and data equilibrium // arXiv preprint arXiv:2001.10590. 2020.</mixed-citation><mixed-citation xml:lang="en">Vatani A., Ahvanooey M.T., Rahimi M. An effective automatic image annotation model via attention model and data equilibrium // arXiv preprint arXiv:2001.10590. 2020.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Gu Y. et al. Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs // Computers in Biology and Medicine. 2018. Vol. 103. P. 220–231.</mixed-citation><mixed-citation xml:lang="en">Gu Y. et al. Automatic lung nodule detection using a 3D deep convolutional neural network combined with a multi-scale prediction strategy in chest CTs // Computers in Biology and Medicine. 2018. Vol. 103. P. 220–231.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Levine S. et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection // The International Journal of Robotics Research. 2018. Vpl. 37, No. 4-5. P. 421–436.</mixed-citation><mixed-citation xml:lang="en">Levine S. et al. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection // The International Journal of Robotics Research. 2018. Vpl. 37, No. 4-5. P. 421–436.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Kirillov A. et al. Segment anything // Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023. P. 4015–4026.</mixed-citation><mixed-citation xml:lang="en">Kirillov A. et al. Segment anything // Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023. P. 4015–4026.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Zou X. et al. Segment everything everywhere all at once // Advances in Neural Information Processing Systems. NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems. Article No. 868. P.19769–19782. https://dl.acm.org/doi/10.5555/3666122.3666990</mixed-citation><mixed-citation xml:lang="en">Zou X. et al. Segment everything everywhere all at once // Advances in Neural Information Processing Systems. NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems. Article No. 868. P.19769–19782. https://dl.acm.org/doi/10.5555/3666122.3666990</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Ultralytics YOLOv8 Docs. URL: https://docs.ultralytics.com/ru.</mixed-citation><mixed-citation xml:lang="en">Ultralytics YOLOv8 Docs. URL: https://docs.ultralytics.com/ru.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">COCO Dataset. URL: https://cocodataset.org/#home.</mixed-citation><mixed-citation xml:lang="en">COCO Dataset. URL: https://cocodataset.org/#home.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Cityscapes Dataset. URL: https://www.cityscapes-dataset.com/.</mixed-citation><mixed-citation xml:lang="en">Cityscapes Dataset. URL: https://www.cityscapes-dataset.com/.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Auto-Label. URL: https://roboflow.com/auto-label.</mixed-citation><mixed-citation xml:lang="en">Auto-Label. URL: https://roboflow.com/auto-label.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
