Дисертації з теми "YOLOX"
Оформте джерело за APA, MLA, Chicago, Harvard та іншими стилями
Ознайомтеся з топ-50 дисертацій для дослідження на тему "YOLOX".
Біля кожної праці в переліку літератури доступна кнопка «Додати до бібліографії». Скористайтеся нею – і ми автоматично оформимо бібліографічне посилання на обрану працю в потрібному вам стилі цитування: APA, MLA, «Гарвард», «Чикаго», «Ванкувер» тощо.
Також ви можете завантажити повний текст наукової публікації у форматі «.pdf» та прочитати онлайн анотацію до роботи, якщо відповідні параметри наявні в метаданих.
Переглядайте дисертації для різних дисциплін та оформлюйте правильно вашу бібліографію.
Nikue, Amassah Djahlin. "Analyse vidéo pour la détection, le suivi et la reconnaissance du comportement pour l'animal en situation d'élevage." Electronic Thesis or Diss., Orléans, 2024. http://www.theses.fr/2024ORLE1011.
Activity recognition, also known as action recognition, is a field of research in computer vision and machine learning, with a variety of applications. One of the most common applications is the identification and understanding of human activities from visual data, such as images or videos. Action recognition techniques can also be applied to livestock monitoring, where they can help improve animal welfare, productivity, and farm management practices. Thus, the work conducted in this document falls within the context of video analysis for the detection, monitoring, and recognition of animal behavior in livestock situations. This work is being achieved within ANIMOV "Animal Movements Observation from Videos", a multidisciplinary research project being implemented over the period 2019-2023 by a regional consortium in Centre-Val-de-Loire. This project concerns two main animal species: elephants and goats. In this thesis, our research focuses on activity analysis for goats. We have built an object detection and tracking system to implement our behavior analysis system. For detection, we tested and compared two popular methods from the literature: YOLOv4 and Faster R-CNN, on self-created datasets. Of the two detection methods, YOLOv4 performs better in average accuracy and is 2.5 times faster than Faster R-CNN. For goat tracking, we also tested and compared two popular methods from the literature: SORT and Deep SORT. Evaluation of both tracking methods on test videos shows a slight improvement of Deep SORT over SORT regarding data association. However, SORT is faster and better suited to a real-time system. The detection and tracking system we have set up enables us to analyze the general activity of the livestock in real-time, with indicators that are fairly close to reality. The main limitation of our system is the loss of detection on certain video images, which leads to tracking failures. So, to improve the performance, we proposed an approach that merges information from previous detections and the current image, in a new detection architecture (YOLOX), to better detect all objects without losing the old ones
Núñez-Melgar, Espinoza Erika Pamela, Oré Natali Leonor Reyes, Abad Jorge Raúl Salazar, and Vela Anderson Vásquez. "YOLO." Bachelor's thesis, Universidad Peruana de Ciencias Aplicadas (UPC), 2018. http://hdl.handle.net/10757/625370.
The following research work pretends to test out the viability of the project named YOLO. This project proposes to create a virtual setting to interrelate two segments of different interests or needs: a segment that wants to sell products such as technology, clothing or accessories and another segment that wish to get those items participating in a virtual raffle at affordable prices. In a virtual survey conducted to know the interest and value of the service in the market, the results were that 72% of the people would be willing to participate in virtual games of chance and 52% has sold some new or used product by the Internet. Afterward, the proposal was validated through the raffle of a product in which the interest and participation of the users were astounding because it surpassed the expectations. For this project, it is required an investment of approximately S/141,000.00 of which the 60% corresponds to the shareholders capital and the remaining 40% will be financed by a financial institution. Finally, we present for your revision, the assessment process of project YOLO, it contains the strategic plan, the marketing plan, the operations plan, human resources, and the economic-financial plan. For that reason, in this summary more reference has been made to the financial aspect basing us in the evaluation of cash flows that create the project and which indicate that 100% of the initial outlay it is regained during the first year, all the same, happens to the shareholders who have already free disposition of cash on the first year, which grows 23% every year. Additionally, Net Present Value (NPV), for both flows, is positive and the Internal Rate of Return (IRR), for both flows, generates higher rates than the shareholder's opportunity cost of capital It concludes that the business project generates value to the stockholders, it is viable and profitable.
Trabajo de investigación
Ferrer, Bustamante Claudia Mariela, Llanos Víctor Hugo Ibarra, and Flores Carlos Rafael Prialé. "Plataforma virtual de Rifa Yolo." Bachelor's thesis, Universidad Peruana de Ciencias Aplicadas (UPC), 2018. http://hdl.handle.net/10757/625450.
This business project has been developed with the purpose of meeting the needs of people who are users of electronic commerce by offering an innovative way to obtain products for a minimal cost. The objective of this plan is to bring products closer to consumers who have the desire to have them but for various reasons have not been able to get them. In this proposal developed to fulfill the desire of our chosen client, we have worked on identifying their main motivations when buying online such as: saving time, convenience and searching for the best price, which was contrasted with the main problems they face when making purchases in a physical space: prolonged waiting to be attended and paid, high prices of products and poor attention. In addition, it was possible to highlight their fears regarding the experience of shopping online. The business project raises the service of raffling products of brands recognized by our client, for which raffle tickets will be sold for a set time for each draw, this will be done through an agile and reliable virtual platform where the customer can buy as many options as he wants in order to win the raffle. Our value proposition proposes sending the product free of cost, live broadcasts and notifications of all the draws that will be duly validated, variety of products, guarantee, simple means of payment, bonuses.
Trabajo de investigación
Marmayohan, Nivethan, and Abdirahman Farah. "Scene analysis using Tensorflow & YOLO algorithms on Raspberry pi 4." Thesis, Högskolan i Halmstad, Akademin för informationsteknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-45540.
Object detection is one of the essential software components in the next generation of traffic monitoring. Real-time detection and recognition of objects are essential tasks for image processing. Therefore, deep learning algorithms for object detection such as YOLO (You Only Look Once) are increasingly used in image analysis, since they run in normal video frame rate (real-time) and are reasonably accurate. This study presents an embedded system and its results for detecting and recognizing objects in real-time. Results are based on a video stream originating from a traffic environment in the city of Halmstad (Sweden). The embedded system is implemented in Raspberry pi 4 using the software Tensorflow and different deep learning algorithms of the YOLO software package. Real-time analyses on frames per second, accuracy in mean average precision, CPU temperature, and CPU frequency are reported for experiments comprising transfer learning. A main conclusion is that Raspberry pi 4 can perform object classification and detection with high accuracy in certain scenarios for traffic monitoring with YOLO algorithms. For example, classifying objects with the speed of a pedestrian would be feasible with classifying and detecting with high accuracy. On the other hand, with high-speed objects such as cars and cyclists, it is a more challenging task for Raspberry pi 4 to detect and classify objects.
Al, Hakim Ezeddin. "3D YOLO: End-to-End 3D Object Detection Using Point Clouds." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2018. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-234242.
För att autonoma fordon ska ha en god uppfattning av sin omgivning används moderna sensorer som LiDAR och RADAR. Dessa genererar en stor mängd 3-dimensionella datapunkter som kallas point clouds. Inom utvecklingen av autonoma fordon finns det ett stort behov av att tolka LiDAR-data samt klassificera medtrafikanter. Ett stort antal studier har gjorts om 2D-objektdetektering som analyserar bilder för att upptäcka fordon, men vi är intresserade av 3D-objektdetektering med hjälp av endast LiDAR data. Därför introducerar vi modellen 3D YOLO, som bygger på YOLO (You Only Look Once), som är en av de snabbaste state-of-the-art modellerna inom 2D-objektdetektering för bilder. 3D YOLO tar in ett point cloud och producerar 3D lådor som markerar de olika objekten samt anger objektets kategori. Vi har tränat och evaluerat modellen med den publika träningsdatan KITTI. Våra resultat visar att 3D YOLO är snabbare än dagens state-of-the-art LiDAR-baserade modeller med en hög träffsäkerhet. Detta gör den till en god kandidat för kunna användas av autonoma fordon.
Yevsieiev, V., O. Tokarieva, and S. Starikova. "Research of Object Recognition in the Workspace of A Mobile Robot Based on the Yolo Method." Thesis, Кременчуцький національний університет імені Михайла Остроградського, 2022. https://openarchive.nure.ua/handle/document/20421.
Pini, Mattia. "Sviluppo di un Prototipo di Video Sorveglianza Indoor con Tecnologie YOLO ed OpenCV." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/18178/.
Sommer, Ted Robert. "The aquatic ecology of the Yolo Bypass floodplain : evaluation at the species and landscape scales /." For electronic version search Digital dissertations database. Restricted to UC campuses. Access is free to UC campus dissertations, 2002. http://uclibs.org/PID/11984.
Головатий, Ігор Богданович, та Ihor Holovatiy. "Комп'ютерна система на основі нейромережі для виявлення зіткнення автомобілів". Bachelor's thesis, Тернопільський національний технічний університет імені Івана Пулюя, 2021. http://elartu.tntu.edu.ua/handle/lib/35429.
The qualification work deals with the development of a system that allows you to identify serious car collisions on a video recorded by surveillance cameras. A review of existing road accident detection systems was conducted. The way of the decision of a problem by means of a neural network is offered, the description of algorithm of detection of car accidents is given, search and preparation of sampling is carried out. Object detection algorithms are analyzed. A YOLOv3 detector is selected to detect the car on video. The object detectors are compared to determine the impact on the final efficiency of the system as a whole. The algorithm of car collision detection in real time is implemented. The developed system was tested on real data to determine car collisions. The obtained practical results allow us to assert the effectiveness of the development.
Вступ. 1. Аналіз технічного завдання. 1.1 Огляд систем детектування ДТП. 1.2. Поняття ДТП. 2. Проектна частина. 2.1. Згорткові нейронні мережі. 2.2. Спосіб вирішення проблеми. за допомогою згорткових нейронних мереж. 2.3. Опис алгоритму детектування автокатастроф. 2.4. Пошук і підготовка вибірки. 2.5. Аналіз алгоритмів детектування об'єктів. 3. Практична частина. 3.1. Порівняння детекторів об'єктів для з'ясування впливу на кінцеву ефективність роботи системи в цілому. 3.2. Реалізація алгоритму детектування зіткнення автомобілів в режимі реального часу. 3.3. Отримані результати експериментів. 4. Безпека життєдіяльності, основи хорони праці. Висновки. Список використаних джерел
Güven, Jakup. "Investigating techniques for improving accuracy and limiting overfitting for YOLO and real-time object detection on iOS." Thesis, Malmö universitet, Fakulteten för teknik och samhälle (TS), 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:mau:diva-19999.
This paper features the creation of a real time object detection system for mobile iOS using YOLO, a state-of-the-art one stage object detector and convoluted neural network far surpassing other real time object detectors in speed and accuracy. In this process an object detecting model is trained to detect doors. The machine learning process is outlined and practices to combat overfitting and increasing accuracy and speed are discussed. A series of experiments are conducted, the results of which suggests that data augmentation, including negative data in a dataset, hyperparameter optimisation and transfer learning are viable techniques in improving the performance of an object detection model. The author is able to increase mAP, a measurement of accuracy for object detectors, from 63.76% to 86.73% based on the results of experiments. The tendency for overfitting is also explored and results suggest that training beyond 300 epochs is likely to produce an overfitted model.
Сторожук, Б. В. "Система розпізнавання об‘єктів на ділянках автошляху за допомогою оглядових камер відеоспостережень". Thesis, Чернігів, 2020. http://ir.stu.cn.ua/123456789/23420.
Об'єктом розробки була інтелектуальна система розпізнавання об‘єктів на ділянках автошляху за допомогою оглядових камер відеоспостережень. Метою кваліфікаційної роботи є створення системи, яка зможе прийняти, обробити та розпізнати об’єкти на зображенні. Під час розробки та проектування, були розглянуті ШНМтрументи для створення системи та їх недоліки. Результати даної роботи можуть бути використанні для отримання статистичних даних по кількості та типу транспорту на ділянці автошляху. Можливе подальше вдосконалення системи шляхом покращення методів розпізнавання об’єктів на зображеннях використовуючи нейронну мережу характерно новітньої архітектури, що дозволяє у свою чергу покращити метод класифікації та підвищити показник точності розпізнавання об‘єктів на ділянках автошляху
The object of the development was an intelligent system for recognition of objects on the road sections with the help of overview cameras of video surveillance. The purpose of qualifying work is to create a system that can receive, process and recognize objects in an image. During development and design, the tools for creating the system and their disadvantages were considered. The results of this work can be used for obtaining statistics on the amount and type of transport on the road section. Possible further Improvement of the system by improving the methods for recognizing objects to be saved using the neural venter is characteristic of the latest architecture, which allows it to improve its classification method in its queue and improve the accuracy of the object recognition on the road sections.
Alsalehy, Ahmad, and Ghada Alsayed. "Scenanalys av trafikmiljön." Thesis, Högskolan i Halmstad, Akademin för informationsteknologi, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-44936.
Стахив, Ю. Н., та М. Г. Заворотна. "Классификация объектов в режиме реального времени". Thesis, ХНУРЕ, 2019. http://openarchive.nure.ua/handle/document/8478.
Mehta, Rinav. "COMPARISON OF REFUSE DECOMPOSITION IN THE PRESENCE AND ABSENCE OF LEACHATE RECIRCULATION AT THE YOLO COUNTY, CALIFORNIA TEST CELLS." NCSU, 2000. http://www.lib.ncsu.edu/theses/available/etd-20001219-110441.
MEHTA, RINAV C. Comparison of Refuse Decomposition in the Presence and Absence of Leachate Recirculation at the Yolo County, California Test Cells. (Under the direction of Morton A. Barlaz.)A side by side comparison of two 8,000-metric ton test cells, one operated with (enhanced) and one without (control) leachate recirculation, was performed to evaluate the effects of leachate recirculation on refuse decomposition at Yolo County, CA. After about three years of operation, refuse was excavated in three borings of the enhanced cell (E1, E2 and E3) and two borings in the control cell (C1 and C2). The objective of this study was to present a comparison of test cell performance with respect to moisture content, settlement, methane production and solids decomposition. Refuse moisture content data show that leachate recirculation resulted in an increase in refuse moisture content, but also show that the refuse in the enhanced cell was not uniformly wet. The average moisture content in E1, E2 and E3 was 38.8, 31.7 and 34.8%, respectively, while the average moisture content in C1 and C2 was 14.6 and 19.2%, respectively. The extent of decomposition was determined by the biochemical methane potential (BMP) and the ratio of cellulose plus hemicellulose to lignin ((C+H)/L). BMP analysis showed the average methane potential in the enhanced and control cells to be 24.0 and 30.9 mL CH4/dry-gm, respectively, and the (C+H)/L of 1.09 and 1.44. These data correlates well with the measured methane production in the enhanced and control cell of 54 and 26 L CH4/wet-kg, respectively. Thus, laboratory and field data shows more decomposition in the enhanced cell relative to the control cell. While the overall averages may not appear significantly different, a closer look at the performance of E1 shows a difference in both moisture content and solids decomposition when compared to the control cell. Hence, the extent of decomposition varies within the enhanced cell. The sampling program conducted for the Yolo County test cells, in concert with data on settlement, methane production and the volume of liquid actually recycled, represents perhaps the most complete set of data available to date on a field-scale leachate recirculation landfill.
Sun, Ruiwen. "Detecting Faulty Tape-around Weatherproofing Cables by Computer Vision." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-272108.
Fler kablar kommer att installeras på grund av installation av fler radiotorn när det gäller 5G. En stor del av radioenheterna är dock konstruerade högt i det öppna utrymmet, vilket gör det svårt för mänskliga tekniker att underhålla systemen. Under dessa omständigheter är automatiska upptäckter av fel bland radioskåp avgörande. Kablar och kontakter täcks vanligtvis med väderbeständiga band, och ett av de vanligaste problemen är att banden inte är rundade på kablarna och kontakterna. Detta gör att tejpen går ur kabeln och ser ut som en viftande flagga, vilket allvarligt kan skada radiosystemen. Avhandlingen syftar till att upptäcka detta flaggband och ta itu med frågorna. Den här avhandlingen experimenterar två metoder för objektdetektering, det invändiga neurala nätverket såväl som OpenCV och bildbehandling. Den förstnämnda använder YOLO (You Only Look Once) nätverk för träning och testning, medan i den senare metoden används den anslutna komponentmetoden för detektering av stora föremål som kablarna och linjesegmentdetektorn är ansvarig för utvinning av bandbandgränsen. Flera parametrar, strukturellt och funktionellt unika, utvecklades för att hitta det mest lämpliga sättet att uppfylla kravet. Dessutom används precision och återkallande för att utvärdera prestandan för systemutgångskvaliteten, och för att förbättra kraven utfördes större experiment med olika parametrar. Resultaten visar att det bästa sättet att upptäcka felaktigt väderbeständighet är med bildbehandlingsmetoden genom vilken återkallelsen är 71% och precisionen når 60%. Denna metod visar bättre prestanda än YOLO som hanterar markering av flaggband. Metoden visar den stora potentialen för denna typ av objektdetektering, och en detaljerad diskussion om begränsningen presenteras också i avhandlingen.
Матлахов, В. І. "Інтелектуальна система розпізнавання образів у Web-контексті". Master's thesis, Сумський державний університет, 2020. https://essuir.sumdu.edu.ua/handle/123456789/82177.
Zeng, Xing. "One Stage Fine- Grained Classification." Thesis, KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-301055.
Finkornad visuell klassificering (FGVC) är ett snabbt växande fält inom bildklassificering. Det är dock en utmanande uppgift på grund av underkategorier som delar subtila skillnader. Befintliga tillvägagångssätt hanterar detta problem genom att först extrahera diskriminerande regioner med dellokalisering eller objektlokalisering eller Region Proposal Networks (RPN) och sedan tillämpa Convolutional Network eller SVM- klassificering på dessa regioner. I det här arbetet, med syftet att förenkla ovanstående komplicerade rörledning samtidigt som vi håller hög noggrannhet, blir vi inspirerade av enstegs objektdetekteringsmodellen YOLO och designar en enstegs end- to- end objektdetektormodell för FGVC. Specifikt tillämpar vi YOLOv5 som basmodell och ersätter dess Path Aggregation Network (PANet) struktur med en viktad dubbelriktad funktionspyramidnätverk (BiFPN) struktur för att effektivt smälta information från olika upplösningar. Vi utför experiment på olika klassificerings och lokaliseringsviktsförhållanden för att vägleda valet av förlustvikter i olika scenarier. Vi har bevisat livskraften hos enstegsdetektormodellen YOLO på FGVC, som har 87,1 % topp1noggrannhet i FGVC- dataset CUB2002011. Dessutom har vi utformat en mer exakt enstegsmodell som uppnår 88,1 % noggrannhet, vilket är den mest exakta metoden jämfört med befintliga lokaliseringsmodeller. Slutligen har vi visat att ju högre klassificeringsförlustvikten är, desto snabbare är konvergenshastigheten, medan en ökning av lokaliseringsförlustvikten ökar något kan bidra till en mer exakt klassificering men resulterar i långsammare konvergens.
Kohmann, Erich. "Tecniche di deep learning per l'object detection." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/19637/.
Norén, Aron. "Enhancing Simulated Sonar Images With CycleGAN for Deep Learning in Autonomous Underwater Vehicles." Thesis, KTH, Matematisk statistik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-301326.
Denna rapport ämnar undersöka problemet med gles data för djupinlärning i sonardomänen. Ett dataflöde för att generera och höja kvalitén hos simulerad sonardata sätts upp i syfte att skapa en stor uppsättning data för att träna ett neuralt nätverk. Möjligheterna och begränsningarna med att använda cycleGAN för att höja kvalitén hos simulerad sonardata studeras och diskuteras. Ett neuralt nätverk för att upptäcka och klassificera objekt i sonarbilder tränas i syfte att evaluera den förbättrade simulerade sonardatan.Denna rapport bygger vidare på tidigare metoder genom att generalisera dessa och visa att metoden har potential även för komplexa uppgifter baserad på icke trivial data.Genom att träna ett nätverk för klassificering och detektion på simulerade sonarbilder som använder cycleGAN för att höja kvalitén, ökade klassificeringsresultaten markant jämfört med att träna på enbart simulerade bilder.
Тарасов, О. Є. "Інтелектуальна система автоматичного керування автомобілем у віртуальній моделі навколишнього середовища". Thesis, Чернігів, 2021. http://ir.stu.cn.ua/123456789/25133.
Кваліфікаційна робота передбачає дослідження сучасного стану розвитку галузі безпілотних автомобілів, а також основних технологій, які в ній застосовуються; дослідження можливостей використання віртуальних середовищ для перевірки ефективності роботи систем автоматичного керування автомобілем; розробку системи автоматичного керування автомобілем та оцінку ефективності її роботи у віртуальній моделі навколишнього середовища. Віртуальне середовище повинно являти собою модель реального світу зі змінними довколишніми умовами: погодою, ландшафтом, часом доби. При виборі перевага має надаватися тим варіантам, в яких дорожня інфраструктура реалізована більш детально та реалістично. Розроблювана система має складатися з частин, що повинні виконувати наступні функції: - підсистема збору даних: отримання даних з віртуального середовища, їх зберігання у сховищі даних; - підсистема обробки даних: підготовка зібраних даних для реалізації системи автоматичного керування; реалізація обраних технологій на основі зібраних даних; - підсистема керування: інтеграція реалізованої системи автоматичного керування до віртуального середовища.
Qualification work involves the study of the current state of development of the unmanned vehicle industry, as well as the main technologies used in it; research of possibilities of use of virtual environments for check of efficiency of work of systems of automatic control of the car; development of an automatic car control system and evaluation of its efficiency in a virtual model of the environment. The virtual environment should be a model of the real world with changing environmental conditions: weather, landscape, time of day. When choosing, preference should be given to those options in which the road infrastructure is implemented in more detail and realistically. The developed system should consist of parts that must perform the following functions: - data collection subsystem: obtaining data from the virtual environment, storing them in the data warehouse; - data processing subsystem: preparation of collected data for the implementation of automatic control system; implementation of selected technologies based on collected data; - control subsystem: integration of the implemented automatic control system into the virtual environment.
Venkatesh, Anirudh. "Object Tracking in Games using Convolutional Neural Networks." DigitalCommons@CalPoly, 2018. https://digitalcommons.calpoly.edu/theses/1845.
Khlif, Wafa. "Multi-lingual scene text detection based on convolutional neural networks." Thesis, La Rochelle, 2022. http://www.theses.fr/2022LAROS022.
This dissertation explores text detection approaches via deep learning techniques towards achieving the goal of mining and retrieval of weakly structured contents in scene images. First, this dissertation presents a method for detecting text in scene images based on multi-level connected component (CC) analysis and learning text component features via convolutional neural networks (CNN), followed by a graph-based grouping of overlapping text boxes. The features of the resulting raw text/non-text components of different granularity levels are learned via a CNN. The second contribution is inspired from YOLO: Real-Time Object Detection system. Both methods perform text detection and script identification simultaneously. The system presents a joint text detection and script identification approach based on casting the multi-script text detection task as an object detection problem, where the object is the script of the text. Joint text detection and script identification strategy is realized in a holistic approach using a single convolutional neural network where the input data is the full image and the outputs are the text bounding boxes and their script. Textual feature extraction and script classification are performed jointly via a CNN. The experimental evaluation of these methods are performed on the Multi-Lingual Text MLT dataset. We contributed in building this new dataset. It is constituted of natural scene images with embedded text, such as street signs and advertisement boards, passing vehicles, user photos in microblog. This kind of images represents one of the mostly encountered image types on the internet which are the images with embedded text in social media
Barbazza, Sigfrido. "Deep-learning applicato all'identificazione automatica di frutta in immagini." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2016. http://amslaurea.unibo.it/11526/.
Acharya, Pradip. "DETECTION AND SEGMENTATION OF DEFECTS IN X-RAY COMPUTED TOMOGRAPHY IMAGE SLICES OF ADDITIVELY MANUFACTURED COMPONENT USING DEEP LEARNING." OpenSIUC, 2021. https://opensiuc.lib.siu.edu/theses/2834.
Jurečka, Tomáš. "Detekce a klasifikace létajících objektů." Master's thesis, Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií, 2021. http://www.nusl.cz/ntk/nusl-442512.
Valentini, Alice. "Evaluation of deep learning techniques for object detection on embedded systems." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2018. http://amslaurea.unibo.it/15478/.
Кушнір, Іван Ярославович. "Мультисервісна комп’ютерна мережа навчального курсового комбінату". Бакалаврська робота, Хмельницький національний університет, 2021. http://elar.khnu.km.ua/jspui/handle/123456789/10443.
Кучерук, Владислав Русланович. "Програмована схема керування світлофорами на базі мікро-ЕОМ для навчального макета, що моделює ситуації на перехресті вулиці". Бакалаврська робота, Хмельницький національний університет, 2021. http://elar.khnu.km.ua/jspui/handle/123456789/10423.
Taurone, Francesco. "3D Object Recognition from a Single Image via Patch Detection by a Deep CNN." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/18669/.
Rehnholm, Jonas. "Battery Pack Part Detection and Disassembly Verification Using Computer Vision." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-54852.
Donini, Massimo. "Algoritmi di stitching per il rilevamento dell'occupazione di aule in un contesto smart campus." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/19063/.
Scalamandrè, Davide. "Sistema di visione per le gestione automatica dei posti in un parcheggio." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2018.
Melcherson, Tim. "Image Augmentation to Create Lower Quality Images for Training a YOLOv4 Object Detection Model." Thesis, Uppsala universitet, Signaler och system, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-429146.
Giambi, Nico. "Sperimentazione di tecniche di Deep Learning per l'Object Detection." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2020. http://amslaurea.unibo.it/21557/.
Thaung, Ludwig. "Advanced Data Augmentation : With Generative Adversarial Networks and Computer-Aided Design." Thesis, Linköpings universitet, Datorseende, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-170886.
Gustafsson, Simon, and Andreas Persson. "Detecting small and fast objects using image processing techniques : A project study within sport analysis." Thesis, Jönköping University, JTH, Avdelningen för datateknik och informatik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:hj:diva-54343.
Lukáč, Jakub. "Sledování osob v záznamu z dronu." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2020. http://www.nusl.cz/ntk/nusl-417275.
Rexhaj, Kastriot. "Machine visual feedback through CNN detectors : Mobile object detection for industrial application." Thesis, Mittuniversitetet, Institutionen för elektronikkonstruktion, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-36467.
Den här rapporten behandlar objekt detektering som en möjlig lösning på Valmets efterfrågan av ett visuellt återkopplingssystem som kan hjälpa operatörer och annan personal att lättare interagera med maskiner och utrustning. Nya framsteg inom djupinlärning har dem senaste åren möjliggjort framtagande av neurala nätverksarkitekturer med detekteringsförmågor. Då industrisektorn svårare tar till sig högst specialiserade algoritmer och komplexa bildbehandlingsmetoder (som tidigare varit fallet med objekt detektering) så ger djupinlärningsmetoder istället upphov till att skapa självlärande system som är återanpassningsbara och närmast intuitiva i dem fall där sådan teknologi åberopas. Den här studien har därför valt att studera ett par sådana teknologier för att hitta möjliga implementeringar som kan realiseras på något så enkelt som en mobiltelefon. Urvalet har därför bestått i att hitta detekteringsmodeller som är hårdvarumässigt resurssnåla och implementera ett sådant system för att agera prototyp och underlag till Valmets vidare diskussioner kring objekt-detekteringsslösningar. Studien valde att implementera en SSD-MobileNetsV2 modellarkitektur då den uppvisade lovande egenskaper kring hårdvarukraven. Modellen implementerades och utvärderades på Valmets mest förekommande telefon Samsung Galaxy S8 och resultatet visade på en god förmåga för modellen att detektera objekt. Den valda modellen gav 60 % precision på utvärderingsbilderna och lyckades nå 4.7 FPS på den implementerade telefonen. TensorFlow användes för programmering och som stödjande mjukvaruverktyg för träning, utvärdering samt vidare implementering. Studien påpekar optimistiska förväntningar av denna typ av teknologi; kombinerat med bättre smarttelefoner i framtiden kan det leda till revolutionerande lösningar för både industri och konsumenter.
Lamberti, Lorenzo. "A deep learning solution for industrial OCR applications." Master's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/19777/.
Hamren, Rasmus. "APPLYING UAVS TO SUPPORT THE SAFETY IN AUTONOMOUS OPERATED OPEN SURFACE MINES." Thesis, Mälardalens högskola, Akademin för innovation, design och teknik, 2021. http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-53376.
Lukáč, Jakub. "Sledování osob ve videu z dronu." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2021. http://www.nusl.cz/ntk/nusl-445483.
Baroncini, Gian Marco. "analisi dei principali campi del deep learning e delle loro reti neurali." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2021. http://amslaurea.unibo.it/23368/.
Rispoli, Luca. "Un approccio deep learning-based per il conteggio di persone tramite videocamere low-cost in un contesto Smart Campus." Bachelor's thesis, Alma Mater Studiorum - Università di Bologna, 2019. http://amslaurea.unibo.it/19567/.
Nguyen, Van Dinh. "Exploitation de la détection de contours pour la compréhension de texte dans une scène visuelle." Electronic Thesis or Diss., Sorbonne université, 2018. http://www.theses.fr/2018SORUS473.
Scene texts have been attracting increasing interest in recent years as witnessed by a large number of applications such as car licence plate recognition systems, navigation systems, self-driving cars based on traffic sign, and so on. In this research, we tackle challenges of designing robust and reliable automatic scene text reading systems. Two major steps of the system as a scene text localization and a scene text recognition have been studied and novel algorithms have been developed to address them. Our works are based on the observation that providing primary scene text regions which have high probability of being texts is very important for localizing and recognizing texts in scenes. This factor can influence both accuracy and efficiency of detection and recognition systems. Inspired by successes of object proposal researches in general object detection and recognition, two state-of-the-art scene text proposal techniques have been proposed, namely Text-Edge-Box (TEB) and Max-Pooling Text Proposal (MPT). In the TEB, proposed bottom-up features, which are extracted from binary Canny edge maps, are used to group edge connected components into proposals and score them. In the MPT technique, a novel grouping solution is proposed as inspired by the max-pooling idea. Different from existing grouping techniques, it does not rely on any text specific heuristic rules and thresholds for providing grouping decisions. Based on our proposed scene text proposal techniques, we designed an end-to-end scene text reading system by integrating proposals with state-of-the-art scene text recognition models, where a false positive proposals suppression and a word recognition can be processed concurrently. Furthermore, we developed an assisted scene text searching system by building a web-page user interface on top of the proposed end-to-end system. The system can be accessed by any smart device at the link: dinh.ubismart.org:27790. Experiments on various public scene text datasets show that the proposed scene text proposal techniques outperform other state-of-the-art scene text proposals under different evaluation frameworks. The designed end-to-end systems also outperforms other scene-text-proposal based end-to-end systems and are competitive to other systems as presented in the robust reading competition community. It achieves the fifth position in the champion list (Dec-2017): http://rrc.cvc.uab.es/?ch=2&com =evaluation&task=4
Grahn, Fredrik, and Kristian Nilsson. "Object Detection in Domain Specific Stereo-Analysed Satellite Images." Thesis, Linköpings universitet, Datorseende, 2019. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-159917.
Charvát, Michal. "System for People Detection and Localization Using Thermal Imaging Cameras." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2020. http://www.nusl.cz/ntk/nusl-432478.
Oškera, Jan. "Detekce dopravních značek a semaforů." Master's thesis, Vysoké učení technické v Brně. Fakulta informačních technologií, 2020. http://www.nusl.cz/ntk/nusl-432850.
BATMUNKH, BAYARMAGNAI, and 白榮. "Real-Time Logo Detector via YOLO." Thesis, 2019. http://ndltd.ncl.edu.tw/handle/az7abw.
國立勤益科技大學
資訊工程系
107
Brand logo detecting is challenging task due to its diversity in size and shape. There are several researchers studied this field and achieved remarkable results using fast region-based convolutional networks and other proposed methods. Although, previous researches are focused on accuracy of the detection only, but not in fast detection field. In contrast, we concerned about real-time logo detection, which can lead to new application opportunities that can be explored by further exploration. In this paper, we extend the state-of-the-art real-time detection architecture YOLOv2 and YOLOv3, and used the FlickrLogos-47 dataset, which is new version of well-known logo dataset, for training to solve the problem. We trained one network for each method. Experiment results show promising results on chosen dataset in real-time detection. In addition, we report a comparison between the methods in mAP metric, show detailed result of each method’s performance on each logo class detection and highlight opportunities and improvements can be explored in future work.
林 and 林家平. "A YOLO-based Traffic Counting System." Thesis, 2018. http://ndltd.ncl.edu.tw/handle/2t8chr.
國立中央大學
資訊工程學系
106
Image recognition can be applied in many applications of Intelligent Transportation System (ITS). Through automated traffic flow counting, the traffic information can be presented effectively for a given area. After the existing image recognition model process the monitoring video, the coordinates of objects in each frame can be easily extracted. The extracted object coordinates are then filtered to obtain the required vehicle coordinates. To achieve the function of vehicle counting, it is necessary to identify the relationship of vehicles in different frames, i.e., whether or not they represent the same vehicle. Although the vehicle counting can be achieved by using the tracking algorithm, a short period of recognition failure may cause wrong tracking, which will lead to incorrect traffic counting. In this thesis, we propose a system that utilizes the YOLO framework for traffic flow counting. The system architecture consists of three blocks, including the Detector that generates the bounding box of vehicles, the Buffer which stores coordinates of vehicles, and the Counter which is responsible for vehicle counting. The proposed system requires only to utilize simple distance calculations to achieve the purpose of vehicle counting. In addition, by adding checkpoints, the system is able to alleviate the consequence of false detection. The videos from different locations and angles are used to verify and analyze the correctness and overall efficiency of the proposed system, and the results indicate that our system achieves high counting accuracy under the environment with sufficient ambient light.
Farinha, João Simões. "In-vehicle object detection with YOLO algorithm." Master's thesis, 2018. http://hdl.handle.net/1822/64273.
With the growing computational power that we have at our disposal and the ever-increasing amount of data available the field of machine learning has given rise to deep learning, a subset of machine learning algorithms that have shown extraordinary results in a variety of applications from natural language processing to computer vision. In the field of computer vision, these algorithms have greatly improved the state-of-the-art accuracy in tasks associated with object recognition such as detection. This thesis makes use of one of these algorithms, specifically the YOLO algorithm, as a basis in the development of a system capable of detecting objects laying inside a car cockpit. To this end a dataset is collected for the purpose of training the YOLO algorithm on this task. A comparative analysis of the detection performance of the YOLOv2 and YOLOv3 architectures is performed.Several experiments are performed by modifying the YOLOv3 architecture to attempt to improve its accuracy. Specifically tests are performed in regards to network size, and the multiple outputs present in this network. Explorative experiments are done in order to test the effect that parallel network might have on detection performance. Lastly tests are done to try to find an optimal learning rate and batch size for our dataset on the new architectures.
Com o crescente poder computacional que temos à nossa disposição e o aumento da quantidade dados a que temos acesso o campo de machine learning deu origem ao deep learning um subconjunto de algoritmos de machine learning que têm demonstrado resultados extraordinários numa variedade de aplicações desde processamento de linguagens naturais a visão por computador. No campo de visão por computador estes algoritmos têm levado a enormes progressos na correção de sistemas de deteção de objetos. Nesta tese usamos um destes algoritmos, especificament o YOLO, como base para desenvolver um sistema capaz de detetar objetos dentro de um carro. Dado isto um dataset é recolhido com o propósito de treinar o algoritmo YOLO nesta tarefa. Uma analise comparativa da correção dos algoritmos YOLOv2 e YOLOv3 ´e realizada. Várias técnicas relacionadas com a modificação da arquitetura YOLOv3 são exploradas para otimizar o sistema para o problema especifico de deteção a bordo de veículos. Especificamente testes são realizados no contexto de tamanho da rede e dos múltiplos outputs presentes nesta rede. Experiencias exploratórias são realizadas de forma a testar o efeito que redes parallelas podem ter na correção dos algoritmos. Por fim testes são feitos para tentar encontrar learning rates e batch sizes apropriados para o nosso dataset nas novas arquiteturas.