Identifikasi Karakteristik Dataset untuk Federated SPARQL Query

Fadzilah, Lutfi Nur (2017) Identifikasi Karakteristik Dataset untuk Federated SPARQL Query. Undergraduate thesis, Institut Teknologi Sepuluh Nopember.

[img]
Preview
Text
5213100059-Undergraduate_Theses.pdf - Published Version

Download (16MB) | Preview

Abstract

Saat ini telah dikembangkan federated SPARQL query engine yang mempunyai kemampuan untuk melakukan query dari beberapa SPARQL endpoint yang terdistribusi, sehingga data yang berasal berbagai sumber memungkinkan untuk diperoleh. Ketika dijalankan untuk melakukan query, masing-masing query engine mempunyai kinerja yang berbeda-beda. Salah satu faktor yang berpengaruh terhadap kinerja dari query engine adalah karakteristik dari dataset RDF yang diakses, seperti jumlah triple, kelas, property, subjek, entity, objek, dan spreading factor dataset. Tugas Akhir ini dilakukan untuk mengidentifikasi karakteristik dataset RDF serta mengetahui karakteristik dataset yang berpengaruh terhadap kinerja dari query engine. Penelitian dilakukan dengan mengidentifikasi 10 dataset yang diambil dari jurnal penelitian lain. Sedangkan uji coba untuk mengetahui keterkaitan antara karakteristik dataset dengan kinerja dari query engine dilakukan menggunakan federated SPARQL query engine FedX. Dari hasil analisis, diketahui bahwa jumlah triple dan jumlah kelas yang terkait dengan query cenderung berpengaruh terhadap kinerja dari query engine. Sedangkan jumlah property yang terkait dengan dataset, spreading factor dataset, dan spreading factor dataset yang terkait dengan query cenderung tidak berpengaruh terhadap kinerja dari query engine. ======================================================================================================================== Federated SPARQL query engines that are able to query from multiple distributed SPARQL endpoints have been developed, so that data from multiple sources are possible to obtain. When it is used to execute a query, a query engine usually has different performance compared to the others. One of the factors that affect the performance of the query engine is the characteristic of the accessed RDF dataset, such as the number of triples, the number of classes, the number of properties, the number of subjects, the number of entities, the number of objects, and the spreading factor of dataset. This final project is done to identify the characteristic of RDF dataset and to know dataset characteristic which is able influence the performance of query engine. The study was conducted by identifying 10 datasets taken from other research journals. The test to determine the relationship between dataset characteristics and the performance of the query engine is done using federated SPARQL query engine FedX. From the analysis results, it is known that the number of triples and the number of classes associated with the query tend to affect the performance of the query engine. Meanwhile, the number of properties associated with the query, spreading factor of dataset, and spreading factor of dataset associated with the query tend not to have an effect on performance of query engine.

Item Type: Thesis (Undergraduate)
Uncontrolled Keywords: linked data, federated SPARQL query engine, dataset, RDF
Subjects: T Technology > T Technology (General)
Z Bibliography. Library Science. Information Resources > ZA Information resources > ZA4450 Databases
Divisions: Faculty of Information Technology > Information System > (S1) Undergraduate Theses
Depositing User: Fadzilah Lutfi Nur
Date Deposited: 24 Aug 2017 08:53
Last Modified: 05 Mar 2019 04:38
URI: http://repository.its.ac.id/id/eprint/42781

Actions (login required)

View Item View Item