• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

Semantic Jaccard Coefficient for Data Sparsity in Collaborative Filtering


Soojung Lee, Journal of Internet Computing and Services, Vol. 26, No. 3, pp. 67-74, Jun. 2025
10.7472/jksii.2025.26.3.67, Full Text:  HTML
Keywords: Recommender System, Collaborative Filtering, Similarity Measure, data sparsity

Abstract

As the amount of information available on the web increases rapidly every day, the introduction of information filtering techniques for user search efficiency is essential. Collaborative filtering is one of the most useful methods for recommending products to e-commerce users. The most important component of this method is to find similarities between users using the user-item rating matrix and recommend products preferred by similar neighboring users. However, one of the key problems is that the system performance deteriorates due to the difficulty in measuring similarity when the user rating data is sparse. In this study, we propose a base similarity measure for sparse data environments such as Jaccard coefficient. The proposed method introduces a semantic approach to rated items to overcome the limitations of Jaccard coefficient by basing it on the number of common rated genres rather than the number of common rated items of two users. The performance evaluation was performed using two types of public datasets, and the results show that the proposed method outperformed the prediction performance and coverage performance of the Jaccard coefficient by up to 7.4% and 35.9%, respectively, and showed excellent prediction performance especially in sparse data environments. In addition, despite not utilizing user evaluations, the proposed method generally outperformed the performance of existing traditional similarity measures.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Lee, S. (2025). Semantic Jaccard Coefficient for Data Sparsity in Collaborative Filtering. Journal of Internet Computing and Services, 26(3), 67-74. DOI: 10.7472/jksii.2025.26.3.67.

[IEEE Style]
S. Lee, "Semantic Jaccard Coefficient for Data Sparsity in Collaborative Filtering," Journal of Internet Computing and Services, vol. 26, no. 3, pp. 67-74, 2025. DOI: 10.7472/jksii.2025.26.3.67.

[ACM Style]
Soojung Lee. 2025. Semantic Jaccard Coefficient for Data Sparsity in Collaborative Filtering. Journal of Internet Computing and Services, 26, 3, (2025), 67-74. DOI: 10.7472/jksii.2025.26.3.67.