Abstract
The widespread adoption of Large Language Models (LLMs) raises critical concerns about the amplification of societal biases, especially in non-Western contexts where cultural and social nuances are often underrepresented. This study introduces a multi-agent bias detection framework for systematically evaluating GPT-4o, Claude 3.5 Sonnet, and Llama 3.3 across Indian social stigma categories, including caste, religion, gender, mental health, socio-economic status, appearance, language/region, and family dynamics. We present SocialStigmaQA, a benchmark dataset of 320 prompts validated through expert review and pilot testing, and use the Overall Bias Detection Factor (OBDF) to measure model performance. Findings reveal that Claude 3.5 Sonnet achieved the highest OBDF (98.75%), demonstrating superior bias detection across all categories, whereas GPT-4o showed moderate performance (72.8%) with notable gaps in the gender and socio-economic categories, and Llama 3.3 scored the lowest (71%). The multi-agent framework improved detection accuracy by 25–30% relative to single-agent baselines, particularly in categories involving subtle bias. These results underscore the need for culturally contextualized evaluation frameworks and suggest that OBDF-like metrics should be integrated into India's AI auditing processes to ensure fairness, inclusivity, and ethical deployment of AI systems in sensitive sectors such as hiring, education, and governance.