Content-based video retrieval from natural language ; Recuperação de vídeos baseada em conteúdo a partir de linguagem natural

Item request has been placed!

Item request cannot be made.

Processing Request

اقرأ أكثر حفظ في قائمتي

المؤلفون: Jorge, Oliver Cabral
الموضوع:
Vídeos para Internet; Processamento de linguagem natural (Computação); Redes neurais (Computação); Visão por computador; Redes sociais on-line; Recuperação da informação; Aprendizado do computador; Internet videos; Natural language processing (Computer science); Neural networks (Computer science); Computer vision; Online social networks; Information retrieval; Machine learning; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO; Engenharia Elétrica
نوع التسجيلة:
master thesis
اللغة:
English

معلومة اضافية
- Contributors:
  Lopes, Heitor Silvério; orcid:0000-0003-3984-1432; http://lattes.cnpq.br/4045818083957064; Lazzaretti, André Eugênio; orcid:0000-0003-1861-3369; http://lattes.cnpq.br/7649611874688878; Gomes, David Menotti; orcid:0000-0003-2430-2030; http://lattes.cnpq.br/6692968437800167; Gomes Junior, Luiz Celso; orcid:0000-0002-1534-9032; http://lattes.cnpq.br/0370301102971417; Bugatti, Pedro Henrique; orcid:0000-0001-9421-9254; http://lattes.cnpq.br/2177467029991118
- بيانات النشر:
  Universidade Tecnológica Federal do Paraná
  Curitiba
  Brasil
  Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial
  UTFPR
- الموضوع:
  2022
- Collection:
  Universidade Tecnológica Federal do Paraná (UTFPR): Repositório Institucional (RIUT)
- نبذة مختصرة :
  More and more, videos are becoming the most common means of communication, leveraged by the popularization of affordable video recording devices and social networks such as TikTok, Instagram, and others. The most common ways of searching for videos on these social networks as well as on search portals are based on metadata linked to videos through keywords and previous classifications. However, keyword searches depend on exact knowledge of what you want and may not necessarily be efficient when trying to find a particular video from a description, superficial or not, of a particular scene, which may lead to frustrating results in the search. The objective of this work is to find a particular video within a list of available videos from a textual description in natural language based only on the content of its scenes, without relying on previously cataloged metadata. From a dataset containing videos with a defined number of descriptions of their scenes, a Siamese network with a triplet loss function was modeled to identify, in hyperspace, the similarities between two different modalities, one of them being the information extracted from a video, and the other information extracted from a text in natural language. The final architecture of the model, as well as the values of its parameters, was defined based on tests that followed the best results obtained. Because videos are not classified into groups or classes and considering that the triplet loss function is based on an anchor text and two video examples, one positive and one negative, a difficulty was identified in the selection of false examples needed for the model training. In this way, methods of choosing examples of negative videos for training were also tested using a random choice and a directed choice, based on the distances of the available descriptions of the videos in the training phase, being the first the most effective. At the end of the tests, a result was achieved with the exact presence of the searched video in 10.67% of the cases in the top ...
- File Description:
  application/pdf
- Relation:
  JORGE, Oliver Cabral. Content-based video retrieval from natural language. 2022. Dissertação (Mestrado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2022.; http://repositorio.utfpr.edu.br/jspui/handle/1/29964
- الدخول الالكتروني :
  http://repositorio.utfpr.edu.br/jspui/handle/1/29964
- Rights:
  openAccess ; http://creativecommons.org/licenses/by/4.0/
- الرقم المعرف:
  edsbas.3EF650CB

تعليقات

No Comments.

Content-based video retrieval from natural language ; Recuperação de vídeos baseada em conteúdo a partir de linguagem natural

اتصل بنا

اتبع