Klasifikasi Ayunan Pukulan Golf Menggunakan Vision Transformer

Rapsodi, Alif Hibatullah Rapsodi (2025) Klasifikasi Ayunan Pukulan Golf Menggunakan Vision Transformer. Other thesis, Institut Teknologi Sepuluh Nopember.

[thumbnail of 5002201154_Undergraduated_Thesis.pdf]

Text
5002201154_Undergraduated_Thesis.pdf - Accepted Version
Restricted to Repository staff only
Download (5MB) | Request a copy

Abstract

Olahraga Golf memiliki peminat yang banyak di seluruh dunia tetapi tidak semua orang bisa melakukannya dikarenakan memiliki teknik ayunan yang rumit dan sulit. Teknik ayunan sangat penting untuk mencapai hasil pukulan yang bagus dan efektif. Terdapat tujuh fase utama ayunan dalam golf yaitu: Address, Toe-up, Mid-backswing, Top, Impact, Mid-follow-through, dan finish. Analisis teknik ayunan ini diperlukan untuk mengetuhi keakuratan ayunan pukulan golf. Langkah-langkah dalam penerapan Vision Transformer pada penelitian ini dimulai dengan memecah citra menjadi patch kecil, kemudian setiap patch dikonversi menjadi vektor melalui embedding linier. Selanjutnya, informasi posisi ditambahkan menggunakan positional encoding sebelum diproses oleh lapisan encoder transformer yang mengandalkan mekanisme multi-head self-attention. Output dari token representasi citra keseluruhan kemudian digunakan untuk klasifikasi fase ayunan golf. Tujuan penelitian ini adalah mengklasifikasikan tujuh fase utama ayunan golf dengan menggunakan Vision Transformer. Penelitian ini menggunakan data citra yang berjumlah 2398 citra dan menggunakan tiga arsitektur ViT yaitu Tiny, Small, dan Base. Penelitian ini menghasilkan performa model ViT arsitektur ViT- Small/16 yang memiliki akurasi 81%, presisi 79.29%, recall 81%, dan F1-score 76%, hasil tersebut merupakan hasil terbaik dibandingkan dengan arsitektur ViT lainnya. Diharapkan dengan klasifikasi teknik ayunan golf menggunakan vision transformer dapat mengetahui keefektifan pukulan ayunan golf.
========================================================================================================================
Golf has many enthusiasts around the world but not all people can do it because it has a complicated and difficult swing technique. Swing technique is very important to achieve good and effective shots. There are seven main swing phases in golf namely: Address, Toe-up, Mid-backswing, Top, Impact, Mid-follow-through, and finish. Analysis of this swing technique is needed to determine the accuracy of the golf swing. The steps in the application of the Vision Transformer in this study begin with breaking the image into small patches, then each patch is converted into a vector through linear embedding. Next, position information is added using positional encoding before being processed by encoder transformer layer that relies on multi-head self-attention mechanism. The output of the tokenized whole image representation is then used for classification of golf swing phases. The purpose of this research is to classify the seven main phases of a golf swing using Vision Transformer. This research uses data images totaling 2398 images and uses three ViT architectures namely Tiny, Small, and Base. This research results in the performance of the ViT model architecture ViT-Small/16 which has 81% accuracy, 79.29% precision, 81% recall, and 76% F1-score, these results are the best results compared to other ViT architectures. It is expected that the classification of golf swing techniques using vision transformers can determine the effectiveness of golf swing shots

Item Type:	Thesis (Other)
Uncontrolled Keywords:	Golf, Citra , Klasifikasi, Vision Transformer Golf, Image, Classification, Vision Transformer
Subjects:	Q Science Q Science > QA Mathematics > QA278.55 Cluster analysis
Divisions:	Faculty of Mathematics, Computation, and Data Science > Mathematics > 44201-(S1) Undergraduate Thesis
Depositing User:	Alif Hibatullah Rapsodi
Date Deposited:	01 Aug 2025 09:12
Last Modified:	01 Aug 2025 09:12
URI:	http://repository.its.ac.id/id/eprint/126028

Actions (login required)

View Item