Exploiting Linked Data in DBpedia to Reduce Prediction Error in Matrix Factorization Recommenders

DISCENTEVictor Martinez Vidal Pereira

DATA 21/06/2022
 

HORA: 9:00



TÍTULO:

Exploiting Linked Data in DBpedia to Reduce Prediction Error in Matrix Factorization Recommenders



RESUMO:

Recommender Systems provide suggestions for items that are most likely of interest to users. Providing personalized recommendations is a challenge that can be addressed by filtering algorithms among which Collaborative Filtering (CF) has demonstrated much progress in the last few years. By using Matrix Factorization (MF) techniques, CF methods reduce prediction error by using optimization algorithms. However, they usually face problems such as data sparsity and prediction error. Studies point to the use of data available in Semantic Web as a path to improve recommender systems and address the challenges related to CF techniques. Motivated by these premises, the present work developed a data pipeline along with an algorithm that processes the Ratings Matrix combining semantic similarities of Linked Open Data (LOD) and estimates missing rat- ings. The experiments take subsets of three different datasets (Movielens, LastFM and LibraryThing), two semantic similarity metrics, Linked Data Similarity Distance (LDSD) and Resource Similarity (RESIM), and three MF-based algorithms (SVD, SVD++ and NMF). Our experiments reduced sparsity by more than 75% in Movielens subset and 28% in LastFM. Prediction error is reduced in all subsets with statistical confidence using parametric test one-way ANOVA followed by Tukey’s multiple comparison test.
 

MEMBROS DA BANCA:

Membro Externo: Adriano César Machado Pereira (UFMG)

Membro Interno: Danilo Barbosa Coimbra (UFBA)

Membro Suplente Externo: Ramon Pereira Lopes (UFRB)
 

Membro Suplente Interno: Rodrigo Rocha Gomes e Souza (UFBA)

 

 

Data da Defesa: 
21/06/2022 - 09:00
Tipo de Defesa: 
Defesa de Mestrado