Enterprise AI Analysis
Urdu Toxicity Detection: A Multi-Stage and Multi-Label Classification Approach
Authors: Ayesha Rashid, Sajid Mahmood, Usman Inayat, Muhammad Fahad Zia
DOI: 10.3390/ai6080194
Social media platforms, while empowering freedom of expression, are frequently misused for abuse and hate. This paper addresses the critical need for toxicity detection in under-resourced languages like Urdu, a challenge exacerbated by its complex linguistic structure and limited NLP resources. We present the comprehensive Urdu Toxicity Corpus (UTC), a novel multi-label dataset, and introduce the Urdu Toxicity Detection Model (UTDM) framework. Our approach combines robust preprocessing, advanced feature engineering, and state-of-the-art machine learning and deep learning algorithms to effectively identify toxic content in Urdu Nastaliq script.
Executive Impact & AI Readiness
Integrating AI for Urdu toxicity detection offers significant benefits, safeguarding online communities and enhancing digital platform integrity for millions of users.
Deep Analysis & Enterprise Applications
Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.
Comprehensive Multi-Label Urdu Toxicity Corpus (UTC)
The research introduces the Urdu Toxicity Corpus (UTC), a novel multi-label dataset of 18,000 user-generated comments in Urdu Nastaliq script, collected from diverse social media platforms including Facebook, Twitter (X), YouTube, and Urdu newsgroups. The dataset is meticulously annotated with a hierarchical taxonomy, classifying comments into five distinct toxicity classes: 'positive', 'negative', 'negative and rude', 'negative, offensive, and abusive', and 'negative, offensive, and hate speech'. Manual annotation by three native Urdu-speaking psychology students, guided by a predefined toxicity hierarchy and assessed using Fleiss' kappa, ensures high label accuracy. This dataset addresses a critical gap in NLP resources for Urdu, providing a foundational resource for future toxicity detection research.
| Feature | Urdu Toxicity Corpus (UTC) | Prior Work ([2]) |
|---|---|---|
| Dataset Size | 18,000 comments | 12,428 tweets |
| Sources | Facebook, Twitter (X), YouTube, Newsgroup | Twitter (X) |
| Classification Type | Multilabel Multiclass | Multiclass |
| Labels (Key Examples) | Positive, Negative, Rude, Offensive, Abusive, Hate Speech | Insult, Offensive, Name-calling, Profane, Threat, Curse, None |
| Annotation Method | Manual, Hierarchical (3 stages) | Manual |
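The inter-annotator agreement mentioned above is measured with Fleiss' kappa, which compares observed agreement among the three annotators against agreement expected by chance. The sketch below implements the standard formula on a toy counts table; the table itself is illustrative, not the actual UTC annotation data:

```python
from typing import List

def fleiss_kappa(ratings: List[List[int]]) -> float:
    """Fleiss' kappa for a table of per-item category counts.

    ratings[i][j] = number of raters who assigned item i to category j.
    Every row must sum to the same number of raters n.
    """
    N = len(ratings)         # number of items
    n = sum(ratings[0])      # raters per item
    k = len(ratings[0])      # number of categories

    # Per-item agreement P_i and overall observed agreement P-bar
    P_i = [(sum(c * c for c in row) - n) / (n * (n - 1)) for row in ratings]
    P_bar = sum(P_i) / N

    # Chance agreement P_e from the marginal category proportions
    p_j = [sum(row[j] for row in ratings) / (N * n) for j in range(k)]
    P_e = sum(p * p for p in p_j)

    return (P_bar - P_e) / (1 - P_e)

# Toy example: 3 annotators, 5 comments, 5 toxicity classes
table = [
    [3, 0, 0, 0, 0],   # all three agree: positive
    [0, 3, 0, 0, 0],   # all agree: negative
    [0, 2, 1, 0, 0],   # two-versus-one split
    [0, 0, 0, 3, 0],
    [0, 0, 1, 0, 2],
]
print(round(fleiss_kappa(table), 3))  # → 0.655
```

Values above roughly 0.6 are conventionally read as substantial agreement, which is why the metric is a useful quality gate for manually labeled corpora.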
Robust Preprocessing and Feature Engineering Pipeline
The proposed Urdu Toxicity Detection Model (UTDM) framework employs a rigorous methodology starting with comprehensive data preprocessing. This involves removing non-Urdu content (English and Arabic words, numerical digits), punctuation marks, and repetitive characters, followed by tokenization, stemming, and data normalization. A custom Urdu stop-word list further refines the text. Feature engineering is then applied using Term Frequency-Inverse Document Frequency (TF-IDF), Bag-of-Words (BoW), and N-gram techniques to capture both term presence/frequency and contextual patterns, which is crucial for Urdu's complex morphology. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) is used to create synthetic examples for minority classes, ensuring a balanced data distribution for training. This multi-stage approach ensures data quality and prepares the text for effective classification by both machine learning and deep learning models.
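The cleaning and TF-IDF steps described above can be sketched in miniature. The sample comments, stop-word list, and character-range regex below are illustrative placeholders, not the paper's actual pipeline or lexicon:

```python
import math
import re
from collections import Counter

# Illustrative mini-corpus and stop-word list (not the actual UTC data)
docs = ["یہ فلم بہت اچھی ہے", "یہ شخص بہت برا ہے", "اچھی بات ہے"]
STOPWORDS = {"یہ", "ہے", "بہت"}

def preprocess(text: str) -> list:
    # Keep only Arabic-script (Urdu) characters and whitespace;
    # drop Latin letters, digits, and punctuation
    text = re.sub(r"[^\u0600-\u06FF\s]", " ", text)
    return [tok for tok in text.split() if tok not in STOPWORDS]

tokenized = [preprocess(d) for d in docs]

# TF-IDF: tf = raw count in the document, idf = log(N / document frequency)
N = len(tokenized)
df = Counter(term for doc in tokenized for term in set(doc))
tfidf = [
    {term: count * math.log(N / df[term]) for term, count in Counter(doc).items()}
    for doc in tokenized
]
```

A production pipeline would add stemming, normalization of Urdu character variants, and N-gram features on top of this, as the framework describes.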
UTDM Framework Overview
Random Forest Achieves Benchmark Performance
Experimental results demonstrate that the Random Forest (RF) classifier significantly outperforms all other evaluated Machine Learning (ML) and Deep Learning (DL) models in detecting Urdu toxicity. RF achieved a precision of 0.97, recall of 0.99, F1-score of 0.98, and overall accuracy of 0.99. This superior performance is attributed to RF's robustness on small-to-medium-sized datasets and its effectiveness in exploiting the features generated by TF-IDF, BoW, and N-grams. While the DL models (LSTM, BiLSTM, GRU) showed potential, with GRU in particular generalizing better on minority classes, they did not match RF's performance or computational efficiency. The 5-fold cross-validation further confirmed RF's stable and reliable generalization across all five toxicity classes.
| Model | Class 1 (Positive) | Class 2 (Negative Only) | Class 3 (Neg+Rude) | Class 4 (Neg+Off+Abusive) | Class 5 (Neg+Off+Hate) | Average F1-Score |
|---|---|---|---|---|---|---|
| Random Forest | 0.9841 | 1.0000 | 0.9660 | 0.9814 | 0.9937 | 0.9850 |
| Gradient Boosting | 0.6598 | 0.9744 | 0.6681 | 0.6630 | 0.7817 | 0.7494 |
| Support Vector Machine | 0.6012 | 0.5537 | 0.4421 | 0.6142 | 0.6907 | 0.5804 |
| Logistic Regression | 0.5670 | 0.5537 | 0.4265 | 0.5679 | 0.6025 | 0.5435 |
| Model | Class 1 (Positive) | Class 2 (Negative Only) | Class 3 (Neg+Rude) | Class 4 (Neg+Off+Abusive) | Class 5 (Neg+Off+Hate) | Average F1-Score |
|---|---|---|---|---|---|---|
| GRU | 0.6210 | 0.7237 | 0.5755 | 0.7596 | 0.5463 | 0.6452 |
| LSTM | 0.6300 | 0.7255 | 0.5592 | 0.7654 | 0.5391 | 0.6438 |
| BiLSTM | 0.6212 | 0.6942 | 0.5545 | 0.7533 | 0.5420 | 0.6330 |
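The stratified 5-fold cross-validation protocol used to validate the Random Forest results can be sketched as follows. The synthetic dataset here merely stands in for the real TF-IDF/BoW feature matrix, so the scores it produces say nothing about the paper's reported numbers:

```python
# Hedged sketch of the evaluation protocol; the data is synthetic, not the UTC.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic stand-in for the 5-class toxicity problem
X, y = make_classification(n_samples=500, n_features=50, n_informative=10,
                           n_classes=5, random_state=42)

clf = RandomForestClassifier(n_estimators=200, random_state=42)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)

# Macro-averaged F1 treats all five classes equally, matching the
# per-class F1 reporting style in the tables above
scores = cross_val_score(clf, X, y, cv=cv, scoring="f1_macro")
print(scores.mean())
```

Stratification keeps the class proportions identical across folds, which matters for imbalanced multi-class problems like this one.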
Advancing NLP for Low-Resource Languages
This study provides a robust foundation for advancing Urdu natural language processing by creating the first comprehensive multi-label, multi-class Urdu toxicity dataset and proposing the highly effective UTDM framework. While Random Forest demonstrated superior performance in this work due to dataset characteristics and feature engineering, the study acknowledges limitations such as potential annotation bias and the current scope of the dataset. Future research is vital, focusing on expanding the dataset with more diverse Urdu texts, exploring semi-supervised or unsupervised learning to reduce reliance on manual labeling, and developing advanced pre-trained transformer models specifically fine-tuned for Urdu. Furthermore, integrating human-in-the-loop moderation and explainable AI techniques (e.g., SHAP, attention maps) will be crucial to mitigate risks of censorship and bias, ensuring a safer and more inclusive digital environment for Urdu-speaking communities.
Future Research Directions for Urdu Toxicity Detection
Dataset Expansion & Diversity
Expand the Urdu Toxicity Corpus to include a larger and more diverse collection of Urdu texts from various domains, moving beyond social media comments to cover a broader spectrum of online discourse. This will improve model generalizability.
Advanced Learning Techniques
Investigate semi-supervised or unsupervised training methods to reduce the heavy dependence on human labeling, which can be time-consuming and prone to subjective biases. Also, develop deeper pre-trained transformer models, such as BERT or GPT, specifically fine-tuned for the unique linguistic characteristics of Urdu.
Ethical AI & Explainability
Integrate human-in-the-loop moderation and explainable AI techniques (e.g., SHAP, attention maps) to counter risks of censorship, prejudice, and bias. This will help models correctly handle satire, political dissent, and culturally rich expressions, ensuring transparency and fairness in moderation decisions.
Hybrid & Lightweight Architectures
Explore lightweight or distilled architectures and hybrid pipelines where rapid content classifiers can precede more detailed analysis by deeper models. This addresses computational efficiency challenges, especially for real-time deployment in resource-constrained environments.
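The hybrid pipeline idea above can be sketched as a two-stage gate: a cheap lexicon filter screens every comment, and only flagged comments reach the slower, more accurate model. The trigger words and the stub classifier below are hypothetical placeholders:

```python
# Hypothetical two-stage moderation pipeline (illustrative sketch only)
FAST_LEXICON = {"برا", "بدتمیز"}   # illustrative trigger words, not a real lexicon

def fast_filter(comment: str) -> bool:
    """Cheap first stage: flag the comment if any trigger word appears."""
    return any(word in FAST_LEXICON for word in comment.split())

def deep_model(comment: str) -> str:
    """Stub standing in for an expensive second stage
    (e.g. an LSTM or fine-tuned transformer)."""
    return "toxic"

def moderate(comment: str) -> str:
    if not fast_filter(comment):
        return "clean"        # early exit: most traffic never hits the deep model
    return deep_model(comment)
```

Because benign comments exit at the first stage, the expensive model only runs on a small fraction of traffic, which is what makes real-time deployment feasible in resource-constrained settings.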
Calculate Your Potential AI ROI
Estimate the financial and operational benefits of implementing advanced AI solutions for text analysis within your organization.
Your AI Implementation Roadmap
A typical phased approach to integrating advanced AI for toxicity detection, tailored for enterprise success.
Phase 01: Discovery & Strategy
Initial assessment of existing systems, data infrastructure, and specific toxicity detection needs. Define project scope, key performance indicators (KPIs), and a tailored AI strategy. This phase includes deep dives into linguistic nuances like those found in Urdu Nastaliq.
Phase 02: Data Engineering & Model Training
Collection, preprocessing, and annotation of relevant datasets (e.g., building a custom Urdu Toxicity Corpus). Feature engineering, model selection (ML/DL), and initial training with robust validation (e.g., 5-fold cross-validation) and imbalance handling techniques like SMOTE.
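The SMOTE-style imbalance handling mentioned in this phase boils down to interpolating new minority-class points between existing ones. The sketch below shows that core idea in miniature; it is not imblearn's implementation, and the 2-D points are toy data:

```python
import random

# Minimal SMOTE-style interpolation sketch (illustrative, not imblearn's code)
def smote_sample(minority, rng):
    a = rng.choice(minority)
    # Nearest neighbour among the other minority points (Euclidean, brute force)
    b = min((p for p in minority if p is not a),
            key=lambda p: sum((x - y) ** 2 for x, y in zip(a, p)))
    # New synthetic point at a random position on the segment between a and b
    t = rng.random()
    return tuple(x + t * (y - x) for x, y in zip(a, b))

rng = random.Random(0)
minority = [(1.0, 1.0), (1.2, 0.9), (2.0, 2.1)]
new_points = [smote_sample(minority, rng) for _ in range(5)]
```

Real SMOTE works in the high-dimensional feature space and samples among k nearest neighbours, but the interpolation step is exactly this one.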
Phase 03: Integration & Deployment
Seamless integration of the trained AI model into existing social media platforms, content moderation tools, or enterprise systems. Deployment in a production environment, ensuring scalability, low latency, and efficient real-time inference.
Phase 04: Monitoring & Refinement
Continuous monitoring of model performance, identifying drifts, and retraining as needed. Incorporate human-in-the-loop feedback for fine-tuning and addressing new patterns of toxic language, ensuring long-term effectiveness and ethical AI operation.
Ready to Transform Your Content Moderation?
Leverage cutting-edge AI to enhance safety and integrity on your platforms. Our experts are ready to guide you.