<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">ellibs</journal-id><journal-title-group><journal-title xml:lang="ru">Электронные библиотеки</journal-title><trans-title-group xml:lang="en"><trans-title>Russian Digital Libraries Journal</trans-title></trans-title-group></journal-title-group><issn pub-type="epub">1562-5419</issn><publisher><publisher-name>Казанский (Приволжский) федеральный университет</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.26907/1562-5419-2024-27-4-533-557</article-id><article-id custom-type="elpub" pub-id-type="custom">ellibs-561</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>Статьи</subject></subj-group></article-categories><title-group><article-title>Автоматизация сортировки материалов по тексту сценария для видеомонтажа</article-title><trans-title-group xml:lang="en"><trans-title>Automation of Footages Sorting by Screenplay Text for Video Editing</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Неманов</surname><given-names>Андрей Дмитриевич</given-names></name><name name-style="western" xml:lang="en"><surname>Nemanov</surname><given-names>Andrey Dmitrievich</given-names></name></name-alternatives><email xlink:type="simple">andrewoch@yandex.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Шахова</surname><given-names>Ирина Сергеевна</given-names></name><name name-style="western" xml:lang="en"><surname>Shakhova</surname><given-names>Irina Sergeevna</given-names></name></name-alternatives><email xlink:type="simple">is@it.kfu.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Казанский (Приволжский) федеральный университет</institution></aff><aff xml:lang="en"><institution>Kazan (Volga region) Federal University</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2024</year></pub-date><pub-date pub-type="epub"><day>28</day><month>05</month><year>2025</year></pub-date><volume>27</volume><issue>4</issue><fpage>533</fpage><lpage>557</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Неманов А.Д., Шахова И.С., 2025</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="ru">Неманов А.Д., Шахова И.С.</copyright-holder><copyright-holder xml:lang="en">Nemanov A.D., Shakhova I.S.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://ellibs.elpub.ru/jour/article/view/561">https://ellibs.elpub.ru/jour/article/view/561</self-uri><abstract><p>Процесс видеомонтажа включает множество трудоемких операций по сортировке и подготовке материалов, что требует значительных временных затрат. В статье описана разработка программного решения для автоматизации этих процессов с использованием технологии машинного обучения. Основное внимание уделено созданию системы, способной классифицировать и сортировать медиафайлы по тексту сценария, тем самым повышая эффективность подготовки материалов к монтажу. Система включает модули распознавания речи, классификации аудио и видео, а также алгоритмы определения соответствия сценарию. Тестирование показало, что предложенная система правильно классифицирует медиафайлы в большинстве случаев, что позволяет существенно сократить время на черновой монтаж.
</p></abstract><trans-abstract xml:lang="en"><p>The video editing process involves numerous labor-intensive operations for sorting and preparing footages, requiring significant time investment. This article describes the development of a software solution that uses machine learning technology to automate these processes.
&#13;

The primary focus is on creating a system capable of classifying and sorting media files according to the screenplay text, thereby increasing the efficiency of material preparation for editing. The system includes modules for speech recognition, audio and video classification, and algorithms for determining screenplay compliance.
&#13;

Testing showed that the proposed system correctly classifies media files in most cases, significantly reducing rough-cut editing time.
</p></trans-abstract><kwd-group xml:lang="ru"><kwd>видеомонтаж</kwd><kwd>автоматизация</kwd><kwd>машинное обучение</kwd><kwd>распознавание речи</kwd><kwd>классификация аудио</kwd><kwd>классификация видео</kwd><kwd>coreml</kwd><kwd>параллельные вычисления</kwd><kwd>сценарий</kwd><kwd>soundex</kwd><kwd>tf-idf</kwd><kwd>косинусное сходство</kwd><kwd>обработка естественного языка</kwd></kwd-group><kwd-group xml:lang="en"><kwd>video editing</kwd><kwd>automation</kwd><kwd>machine learning</kwd><kwd>speech recognition</kwd><kwd>audio classification</kwd><kwd>video classification</kwd><kwd>coreml</kwd><kwd>parallel computing</kwd><kwd>screenplay</kwd><kwd>soundex</kwd><kwd>tf-idf</kwd><kwd>cosine similarity</kwd><kwd>natural language processing</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Монтаж | Теория кино // Студия “Кинокафе”, 2019. URL: https://www.kinocafe.ru/theory/?tid=1221.</mixed-citation><mixed-citation xml:lang="en">Монтаж | Теория кино // Студия “Кинокафе”, 2019. URL: https://www.kinocafe.ru/theory/?tid=1221.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Сколько времени длится съемка сцены? // Celebrity.fm, 2020. URL: https://celebrity.fm/ru/how-long-does-filming-a-scene-take.</mixed-citation><mixed-citation xml:lang="en">Сколько времени длится съемка сцены? // Celebrity.fm, 2020. URL: https://celebrity.fm/ru/how-long-does-filming-a-scene-take.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Кинопродюсерство в кинематографии // Allbest, 2020. URL: https://revolution.allbest.ru/culture/01202211_2.html#text.</mixed-citation><mixed-citation xml:lang="en">Кинопродюсерство в кинематографии // Allbest, 2020. URL: https://revolution.allbest.ru/culture/01202211_2.html#text.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Создание документального фильма: как работают монтажеры – победители «Эмми» // Sayhi, 2019. URL: https://say-hi.me/24-kadra/sozdanie-dokumentalnogo-filma-kak-rabotayut-montazhyory-pobediteli-emmi.html.</mixed-citation><mixed-citation xml:lang="en">Создание документального фильма: как работают монтажеры – победители «Эмми» // Sayhi, 2019. URL: https://say-hi.me/24-kadra/sozdanie-dokumentalnogo-filma-kak-rabotayut-montazhyory-pobediteli-emmi.html.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Media Composer // AVID, 2023. URL: https://ch.avid.com/media-composer.</mixed-citation><mixed-citation xml:lang="en">Media Composer // AVID, 2023. URL: https://ch.avid.com/media-composer.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Как правильно оформить сценарий – правила с примерами // BandBand, 2023. URL: https://bandband.ru/blog/kak-oformit-scenarij.</mixed-citation><mixed-citation xml:lang="en">Как правильно оформить сценарий – правила с примерами // BandBand, 2023. URL: https://bandband.ru/blog/kak-oformit-scenarij.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Whisper // OpenAI, 2022. URL: https://openai.com/research/whisper.</mixed-citation><mixed-citation xml:lang="en">Whisper // OpenAI, 2022. URL: https://openai.com/research/whisper.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">OpenAI Whisper // GitHub, 2022. URL: https://github.com/openai/ whisper.</mixed-citation><mixed-citation xml:lang="en">OpenAI Whisper // GitHub, 2022. URL: https://github.com/openai/ whisper.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">SwiftWhisper // GitHub, 2023. URL: https://github.com/exPHAT/ SwiftWhisper.</mixed-citation><mixed-citation xml:lang="en">SwiftWhisper // GitHub, 2023. URL: https://github.com/exPHAT/ SwiftWhisper.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">WhisperKit // GitHub, 2024. URL: https://github.com/argmaxinc/ WhisperKit.</mixed-citation><mixed-citation xml:lang="en">WhisperKit // GitHub, 2024. URL: https://github.com/argmaxinc/ WhisperKit.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">An Effective Review of Phonetics Algorithms // ResearchGate, 2023. URL: https://www.researchgate.net/publication/375967141_An_Effective_Review_of_Phonetics_Algorithms.</mixed-citation><mixed-citation xml:lang="en">An Effective Review of Phonetics Algorithms // ResearchGate, 2023. URL: https://www.researchgate.net/publication/375967141_An_Effective_Review_of_Phonetics_Algorithms.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Russian Soundex // GitHub, 2022. URL: https://github.com/roddar92/ russian_soundex.</mixed-citation><mixed-citation xml:lang="en">Russian Soundex // GitHub, 2022. URL: https://github.com/roddar92/ russian_soundex.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Про многопоточность 2. GCD // Habr, 2021. URL: https://habr.com/ru/articles/578752/.</mixed-citation><mixed-citation xml:lang="en">Про многопоточность 2. GCD // Habr, 2021. URL: https://habr.com/ru/articles/578752/.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Combine // Apple Developer Documentation, 2024. URL: https://developer.apple.com/documentation/combine.</mixed-citation><mixed-citation xml:lang="en">Combine // Apple Developer Documentation, 2024. URL: https://developer.apple.com/documentation/combine.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Как правильно оформить сценарий – правила с примерами | Основные сценарные разделы // BandBand, 2023. URL: https://bandband.ru/blog/kak-oformit-scenarij#dirs.</mixed-citation><mixed-citation xml:lang="en">Как правильно оформить сценарий – правила с примерами | Основные сценарные разделы // BandBand, 2023. URL: https://bandband.ru/blog/kak-oformit-scenarij#dirs.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Sound Analysis // Apple Developer, 2022. URL: https://developer.apple.com/documentation/soundanalysis.</mixed-citation><mixed-citation xml:lang="en">Sound Analysis // Apple Developer, 2022. URL: https://developer.apple.com/documentation/soundanalysis.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">YOLOv8 // Ultralytics, 2023. URL: https://docs.ultralytics.com/ru/ models/yolov8.</mixed-citation><mixed-citation xml:lang="en">YOLOv8 // Ultralytics, 2023. URL: https://docs.ultralytics.com/ru/ models/yolov8.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">VisualActionKit // GitHub, 2020. URL: https://github.com/lukereichold/ VisualActionKit.</mixed-citation><mixed-citation xml:lang="en">VisualActionKit // GitHub, 2020. URL: https://github.com/lukereichold/ VisualActionKit.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Creating an Image Classifier Model // Apple Developer, 2021. URL: https://developer.apple.com/documentation/createml/creating-an-image-classifier-model.</mixed-citation><mixed-citation xml:lang="en">Creating an Image Classifier Model // Apple Developer, 2021. URL: https://developer.apple.com/documentation/createml/creating-an-image-classifier-model.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">keremberke/indoor-scene-classification // Hugging Face, 2022. URL: https://huggingface.co/datasets/keremberke/indoor-scene-classification.</mixed-citation><mixed-citation xml:lang="en">keremberke/indoor-scene-classification // Hugging Face, 2022. URL: https://huggingface.co/datasets/keremberke/indoor-scene-classification.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Sample usage for wordnet // NLTK, 2023. URL: https://www.nltk.org/howto/wordnet.html</mixed-citation><mixed-citation xml:lang="en">Sample usage for wordnet // NLTK, 2023. URL: https://www.nltk.org/howto/wordnet.html</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Извлечение признаков из текстовых данных с использованием TF-IDF // Habr, 2023. URL: https://github.com/roddar92/russian_soundex.</mixed-citation><mixed-citation xml:lang="en">Извлечение признаков из текстовых данных с использованием TF-IDF // Habr, 2023. URL: https://github.com/roddar92/russian_soundex.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Selecting the Optimal Probability Threshold for a Classification Model，ROC Curve Analysis and KS Score // DataScienceByExample, 2023. URL: https://www.datasciencebyexample.com/2023/04/19/choose-thresold-for-classification-model-with-ROC-analysis</mixed-citation><mixed-citation xml:lang="en">Selecting the Optimal Probability Threshold for a Classification Model，ROC Curve Analysis and KS Score // DataScienceByExample, 2023. URL: https://www.datasciencebyexample.com/2023/04/19/choose-thresold-for-classification-model-with-ROC-analysis</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
