Deteksi Kerusakan Jalan Menggunakan YoloV7 dengan Cluster Weighted DIoU NMS

Rachman, Rudy (2024) Deteksi Kerusakan Jalan Menggunakan YoloV7 dengan Cluster Weighted DIoU NMS. Masters thesis, UIN Sunan Gunung Djati Bandung.

[thumbnail of 6025222002-Master-Thesis.pdf]

Text
6025222002-Master-Thesis.pdf - Accepted Version
Restricted to Repository staff only until 5 February 2027.
Download (4MB)

Official URL: https://join.if.uinsgd.ac.id

Abstract

Kerusakan jalan merupakan suatu keadaan yang sering dilihat di kehidupan sehari-hari. Salah satu jenis kerusakan umum terjadi pada jalan adalah lubang (pothole). Penelitian sebelumnya terdapat penggunaan metode Faster Regional Convolutional Neural Network (Faster R-CNN) untuk deteksi jalan berlubang. Faster R-CNN mampu menangani permasalahan metode pengolahan citra tersebut. Akan tetapi, Faster R-CNN tidak mampu mendeteksi dengan cepat karena Faster R-CNN merupakan metode deteksi dua tahap. Penelitian ini menggunakan metode You only look once versi tujuh (YoloV7) yang merupakan deteksi satu tahap. Metode YoloV7 mampu mendeteksi objek dengat cepat dan menghasilkan nilai mean Average Precision (mAP) paling baik dibandingkan metode sebelumnya. Metode post-processing yang digunakan pada YoloV7 adalah non-maximum suppression (NMS). NMS kurang mampu mendeteksi objek yang kecil atau saling berdekatan dengan baik, karena itu pada penelitian ini menggunakan Cluster Weighted Distance Intersection over Union NMS (DIoU-NMS) (CWD-NMS), yang mana menghasilkan nilai mAP yang lebih baik dibandingkan menggunakan metode NMS lainnya. Dataset yang digunakan untuk melatih metode YoloV7 dibagi tiga yaitu adalah dataset primer, sekunder, dan gabungan (primer dan sekunder). Dataset pengujian diambil dari dataset sekunder, agar bisa dibandingkan antara Faster R-CNN dari penelitian sebelumnya dengan YoloV7. Hasil dari YoloV7 mampu mencapai nilai mAP yang lebih baik pada saat melatih menggunakan data sekunder saja, semua metode YoloV7 dari NMS yang berbeda, metode scheduler berbeda, dan semua checkpoint mencapai nilai mAP >85,00%, rata-rata waktu inferensi NMS nya mencapai 8,00 milliseconds (ms). Metode YoloV7 juga mengalami kenaikan sebanyak 2,00% jika dilatih menggunakan dataset gabungan, tetapi untuk rata-rata waktu inferensi mengalami kenaikan 1ms. YoloV7 memiliki hasil yang lebih baik dan waktu inferensi yang lebih cepat, dibandingkan Faster R-CNN. Faster R-CNN, hanya mampu mencapai nilai mAP 79,70%, dengan waktu inferensi 52,00ms tiap citra, sedangkan untuk YoloV7 dengan CWD-NMS, checkpoint best, dan tanpa scheduler, mAP 89,2%, waktu inferensinya 22,10ms. YoloV7 dengan waktu inferensi tercepat menggunakan metode Cluster NMS, waktu inferensi 18,80ms, tetapi hasil mAP hanya 87,00%. YoloV7 yang digunakan pada penelitian ini, mendapatkan hasil FPS dengan rata-rata 20 FPS. FPS yang paling baik untuk melakukan deteksi secara langsung juga adalah C-NMS yaitu 20,65, diikuti CWD-NMS yang bernilai 20,47.
=================================================================
Road damage is a situation that is often seen in everyday life. One type of damage that commonly occurs on roads is potholes. Previous research used the Faster Regional Convolutional Neural Network (Faster R-CNN) method for pothole detection. Faster R-CNN can handle the problems of this image processing method. However, Faster R-CNN cannot detect quickly because Faster R-CNN is a two-stage detection. This research uses the You Only Look Once version seven (YoloV7) method, a one-stage detection. The YoloV7 method can detect objects quickly and produces the best mean Average Precision (mAP) value compared to the previous method. The post-processing method used in YoloV7 is non-maximum suppression (NMS). NMS is less able to detect objects that are small or close to each other, therefore in this study used Cluster Weighted Distance Intersection over Union NMS (DIoU-NMS) (CWD-NMS), which produces better mAP compared to other NMS methods. The dataset used to train the YoloV7 method is divided into primary, secondary, and combined datasets. The test dataset was taken from a secondary dataset so that it could be compared between Faster R-CNN and YoloV7. The results of YoloV7 can achieve better mAP values when training using secondary data only, all YoloV7 methods from different NMS, different scheduler methods, and all checkpoints achieve mAP values >85.00%, the average NMS inference time reaches 8.00 milliseconds (ms). The YoloV7 method also experienced an increase of 2.00% when trained using a combined dataset, but the average inference time increased by 1ms. YoloV7 has better results and faster inference time, compared to Faster R- CNN. Faster R-CNN was only able to achieve a mAP value of 79.70%, with an inference time of 52.00ms per image, while YoloV7 with CWD-NMS, checkpoint xii best, and without a scheduler, the mAP was 89.2%, the inference time was 22.10ms. YoloV7 with the fastest inference time uses the Cluster NMS method, inference time of 18.80ms, but mAP results are only 87.00%. The YoloV7 used in this research obtained FPS results with an average of 20 FPS. The best FPS for direct detection is C-NMS, namely 20.65, followed by CWD-NMS, which is 20.47

Item Type:	Thesis (Masters)
Subjects:	T Technology > T Technology (General) > T59.7 Human-machine systems.
Divisions:	Faculty of Intelligent Electrical and Informatics Technology (ELECTICS) > Informatics Engineering > 55101-(S2) Master Thesis
Depositing User:	Rudy Rachman
Date Deposited:	05 Feb 2025 12:51
Last Modified:	05 Feb 2025 13:03
URI:	http://repository.its.ac.id/id/eprint/118316

Actions (login required)

View Item