Pengukuran Degree-of-Concern Berdasarkan Hasil Klasifikasi Bahasan COVID-19 Pada Teks Twitter Menggunakan Pendekatan Deep Learning

Hasanah, Novrindah Alvi (2021) Pengukuran Degree-of-Concern Berdasarkan Hasil Klasifikasi Bahasan COVID-19 Pada Teks Twitter Menggunakan Pendekatan Deep Learning. Masters thesis, Institut Teknologi Sepuluh Nopember.

[thumbnail of 05111950010002-Master_Thesis.pdf]

Text
05111950010002-Master_Thesis.pdf - Accepted Version
Restricted to Repository staff only until 1 April 2023.
Download (2MB) | Request a copy

Abstract

Indonesia khususnya Kota Surabaya telah menjadi salah satu area terdampak COVID-19. Hal tersebut berdampak pada perubahan perilaku masyarakat secara sosial maupun individu. Oleh karena itu, dibutuhkan identifikasi perhatian publik. Penelitian ini mengidentifikasi perhatian publik menggunakan rumus Degree-Of-Concern. Sehingga informasi topik penting yang menjadi perhatian publik dapat dijadikan salah satu faktor pendukung pihak terkait untuk membuat suatu kebijakan dalam membangun strategi tanggap darurat yang tepat. Objek penelitian merupakan teks Twitter. Namun data Twitter yang bervariasi perlu diklasifikasikan. Kemudian, identifikasi perhatian publik dengan perhitungan Degree-of-Concern lebih spesifik terhadap topik bahasan COVID-19. Klasifikasi dilakukan melalui dua tahap. Klasifikasi tahap pertama adalah memisahkan data Twitter untuk COVID-19 dan non COVID-19. Tahap kedua adalah mengklasifikasikan data Twitter mengenai COVID-19 menjadi tujuh kelas: peringatan dan saran, pemberitahuan informasi, donasi, dukungan emosional, mencari bantuan, kritik, dan hoaks. Klasifikasi dilakukan dengan menerapkan variasi word embedding (Word2Vec dan fastText) dan variasi deep learning (CNN, RNN, dan LSTM) untuk mendapatkan hasil akurasi terbaik. Percobaan dilakukan dengan tiga skenario yang dibedakan oleh jumlah data train untuk mengetahui pengaruh jumlah data train pada klasifikasi. Hasil klasifikasi menunjukkan akurasi tertinggi dari semua skenario sebesar 97,86% dan 99,66% untuk klasifikasi tahap pertama dan kedua yang didapatkan kombinasi LSTM dan fastText. Semua akurasi terbaik dari setiap skenario pada klasifikasi tahap kedua digunakan untuk melakukan perhitungan Degree-of-Concern. Hasil penelitian menunjukkan bahwa klasifikasi topik COVID-19 dapat digunakan untuk mengukur Degree-of-Concern dengan baik. Hal tersebut karena pengukuran Degree-of-Concern teks Twitter berdasarkan hasil akurasi terbaik menunjukkan hasil yang sama dengan Degree-of-Concern GroundTruth. Sehingga, perhitungan yang diusulkan dapat digunakan untuk identifikasi perhatian publik dan dapat digunakan sebagai salah satu faktor pendukung bagi pihak-pihak terkait untuk membuat strategi tanggap darurat yang tepat.
=====================================================================================================
Indonesia, especially the city of Surabaya, has become one of the areas affected by COVID-19. This has an impact on changing people's behavior socially and individually. Therefore, identification of public attention is needed. This study identifies public attention using the Degree-Of-Concern formula. So that information on important topics of concern to the public can be used as a supporting factor for related parties to make a policy in building an appropriate emergency response strategy. The research object is Twitter text. But Twitter's varying data needs to be classified. Then, identify the public's attention with a more specific Degree-of-Concern calculation on the topic of COVID-19. Classification is carried out in two stages. The first stage of classification is to separate Twitter data for COVID-19 and non-COVID-19. The second stage is to classify Twitter data on COVID-19 into seven classes: warnings and suggestions, informational notifications, donations, emotional support, seeking help, criticism, and hoaxes. Classification is done by applying word embedding variations (Word2Vec and fastText) and deep learning variations (CNN, RNN, and LSTM) to get the best accuracy results. Experiments were carried out with three scenarios which differed by the number of train data to determine the effect of the number of train data on classification. The classification results show the highest accuracy of all scenarios of 97.86% and 99.66% for the first and second stage classifications obtained by the combination of LSTM and fastText. All the best accuracy of each scenario in the second stage classification is used to perform the Degree-of-Concern calculations. The results show that the classification of the COVID-19 topic can be used to measure the Degree-of-Concern well. This is because the Degree-of-Concern measurement of Twitter text based on the best accuracy results shows the same result as the Degree-of-Concern GroundTruth. Thus, the proposed calculation can be used to identify public concerns and can be used as a supporting factor for the parties involved in making an appropriate emergency response strategy.

Item Type:	Thesis (Masters)
Uncontrolled Keywords:	COVID-19, Degree-Of-Concern, Klasifikasi Teks Twitter, Word embedding, Deep Learning COVID-19, Degree-Of-Concern, Twitter Text Classification, Word embedding, Deep Learning
Subjects:	Q Science > Q Science (General) > Q325.5 Machine learning. T Technology > T Technology (General) > T57.5 Data Processing
Divisions:	Faculty of Intelligent Electrical and Informatics Technology (ELECTICS) > Informatics Engineering > 55101-(S2) Master Thesis
Depositing User:	NOVRINDAH ALVI HASANAH
Date Deposited:	28 Feb 2021 20:39
Last Modified:	28 Feb 2021 20:39
URI:	http://repository.its.ac.id/id/eprint/82961

Actions (login required)

View Item