User Similarity Measures in Online Social Networks
Files
Date
2025
Authors
Abedelaziz BELABAC
Khalil SAIDANI
Journal Title
Journal ISSN
Volume Title
Publisher
university of bordj bou arreridj
Abstract
With the growing use of Online Social Networks (OSNs), these platforms have become
central to engagement, participation, and community formation. As user bases expand,
understanding user similarity becomes increasingly important for applications such as friend
recommendation, community detection, and spam detection. Measuring user similarity includes
determining behavioral patterns in addition to profile similarities and behavioral differences. In
this research, we examine some classical similarity measures (Jaccard, Cosine, and Euclidean)
versus deep learning-based measures, including Node Embeddings, Graph Convolutional
Networks (GCN), Graph Attention Networks (GAT), and GraphSAGE with a synthetic Twitter
dataset. User interactions are modeled as graphs and transformed into low-dimensional
embeddings. Using the Deep Graph Infomax framework for unsupervised learning, clustering
is applied and assessed through internal evaluation metrics: Silhouette Score, Davies-Bouldin
Index, and Calinski-Harabasz Score. The objective is to identify the most effective method for
discovering meaningful user clusters in OSNs.
Description
Keywords
User Similarity, Graph Convolutional Networks (GCN), Graph Attention Networks (GAT), GraphSAGE, Graph Infomax (DGI), Cosine Similarity, Jaccard Similarity, Euclidean Distance