Comparison of Similarity Distance-Based Metrics for HODA and BANGLA Dataset for Enhanced Precision

Mgd Maaz Taha Yassin; Amirul Ramzani Radzid; Mohd Sanusi Azmi; Nur Atikah Arbain

doi:10.47772/IJRISS.2025.91200013

International Journal of Research and Innovation in Social Science (IJRISS)

Comparison of Similarity Distance-Based Metrics for HODA and BANGLA Dataset for Enhanced Precision

byMgd Maaz Taha Yassin; Amirul Ramzani Radzid; Mohd Sanusi Azmi; Nur Atikah Arbain

Published December 30, 2025 • Vol. 9, Issue 12, pp. 141–150Open Access
DOI: 10.47772/IJRISS.2025.91200013

Download PDF Browse this issue

Abstract

A similar metric is often used as a tool to measure the degree of similarity between two objects or pieces of data. It is essential in many areas of study including data analysis, machine learning and image processing, which provides a way to compare and evaluate the similarity of different entities. These metrics can be categorized into distance-based and similarity-based approaches, each with their strengths and applications. Therefore, this study is to do a comparison of various distance metrics on image classification performance using HODA and Bangla handwritten digit datasets. A comprehensive evaluation is conducted on eight different distance measures, namely Euclidean, Manhattan, Chebyshev, Canberra, Cosine, Minkowski, Jaccard, and Sorenson, within the Mean Average Precision (MAP) metric framework to evaluate their effectiveness in the context of handwritten digit recognition. Experimental results show that Chebyshev distance produces the highest classification accuracy of 71.6% on the HODA dataset, while Euclidean distance achieves the best performance on the Bangla dataset with 70.7% accuracy. In addition to quantitative analysis, a user study involving a structured questionnaire was conducted to qualitatively verify the MAP-based evaluation methodology. Results from user evaluations further reinforce the empirical findings. Therefore, the study underlines the importance of choosing an appropriate distance metric that is adapted to the specific properties of the dataset, highlighting its role in improving the performance of pattern recognition systems in computer vision applications.

Keywords: distance metric, image classification, HODA dataset, BANGLA dataset

Journal	International Journal of Research and Innovation in Social Science (IJRISS)
ISSN	2454-6186
Volume / Issue	Volume 9, Issue 12
Pages	141–150
Publication date	December 30, 2025
DOI	10.47772/IJRISS.2025.91200013
Publisher	RSIS International
License	Open Access

How to cite this article

Mgd Maaz Taha Yassin, Amirul Ramzani Radzid, Mohd Sanusi Azmi, & Nur Atikah Arbain (2025). Comparison of Similarity Distance-Based Metrics for HODA and BANGLA Dataset for Enhanced Precision. International Journal of Research and Innovation in Social Science (IJRISS), 9(12), 141-150. https://doi.org/10.47772/IJRISS.2025.91200013

BibTeX

@article{Mgd2025,
  title   = {Comparison of Similarity Distance-Based Metrics for HODA and BANGLA Dataset for Enhanced Precision},
  author  = {Mgd Maaz Taha Yassin and Amirul Ramzani Radzid and Mohd Sanusi Azmi and Nur Atikah Arbain},
  journal = {International Journal of Research and Innovation in Social Science (IJRISS)},
  volume  = {9},
  number  = {12},
  pages   = {141--150},
  year    = {2025},
  doi     = {10.47772/IJRISS.2025.91200013},
  publisher = {RSIS International}
}