Benchmarking Hallucination Mitigation Techniques in Large Language Models: A Comparative Study

Wu  Lingyi; Anwar Saif

doi:10.59088/gi.v4i2.28

Download

Published: May 4, 2026

DOI: https://doi.org/10.59088/gi.v4i2.28

Keywords:

Large Language Models (LLMs) Hallucination Mitigation Retrieval-Augmented Generation (RAG) Factual Consistency, Prompt Engineering

Wu Lingyi

Guangdong Technology College, China

Anwar Saif

Department of Information Systems, Sana’a University, Sana’a, Yemen

Abstract

Hallucinations defined as factually incorrect or fabricated outputs remain a critical limitation of Large Language Models (LLMs), significantly undermining their reliability in high-stakes applications. This paper presents a systematic and reproducible comparative evaluation of prominent hallucination mitigation strategies, including prompt engineering, retrieval-augmented generation (RAG), and self-consistency decoding. Using benchmark factual question-answering datasets, we assess these approaches across multiple evaluation dimensions, including factual accuracy, hallucination rate, and response consistency. Furthermore, we introduce a unified evaluation protocol and extend prior work by incorporating a hybrid evaluation perspective that examines trade-offs between grounding effectiveness and computational overhead. Experimental results indicate that retrieval-based methods substantially improve factual grounding at the cost of increased latency, whereas prompt-based techniques provide lightweight yet less robust improvements. We complement quantitative findings with qualitative error analysis and discuss practical implications for real-world deployment. This study contributes a standardized benchmarking framework and provides actionable insights into optimizing reliability–efficiency trade-offs in LLM-based systems..

Downloads

Download data is not yet available.

How to Cite

Lingyi, W., & Saif, A. (2026). Benchmarking Hallucination Mitigation Techniques in Large Language Models: A Comparative Study. Global Social Science and Humanities Journal, 4(2), 1–14. https://doi.org/10.59088/gi.v4i2.28

Issue

Vol. 4 No. 2 (2026)

Section

Articles

All articles published in Global Social Science and Humanities Journal (GSSHJ) are licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This permits use, sharing, adaptation, distribution, and reproduction in any medium or format, including for commercial purposes, provided the original work is properly cited, a link to the license is given, and any changes are indicated.

Benchmarking Hallucination Mitigation Techniques in Large Language Models: A Comparative Study

Abstract

Downloads

Most read articles by the same author(s)

Information

Guide Line

Contact

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Most read articles by the same author(s)