AI Research & Development

Benchmarking Psychological Lexicons and Large Language Models for Emotion Detection in Brazilian Portuguese

This study benchmarks the performance of a large language model (Mistral 24B), a language-specific transformer model (BERTimbau), and the lexicon-based EmoAtlas for emotion detection in Brazilian Portuguese text. Focusing on eight Plutchik emotions, the models were evaluated across four diverse corpora, including human-annotated and LLM-generated texts. BERTimbau showed the highest average scores (accuracy 0.876, precision 0.529, recall 0.423), closely followed by Mistral (accuracy 0.831, precision 0.522, recall 0.539). Both transformer models outperformed EmoAtlas (accuracy 0.797), but at significantly higher computational cost (up to 40 times more). The introduction of a novel “emotional fingerprinting” methodology using synthetic data revealed nuanced differences in emotional representations. The work provides a crucial quantitative benchmark and open resources for multilingual NLP research.

Schedule Your Strategy Session

Executive Impact & ROI

Emotion detection in Brazilian Portuguese is critical for enterprises operating in Latin American markets, enabling nuanced customer sentiment analysis, targeted marketing, and enhanced customer service interactions. Understanding the strengths and weaknesses of different AI models in this context can directly inform technology investment and deployment strategies, leading to more accurate insights and optimized operational efficiencies.

0.0 BERTimbau Avg. Accuracy

0.0 Mistral Avg. Accuracy

0 Computational Cost (EmoAtlas Baseline)

Discuss Your Implementation

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Model Performance Overview

Computational Efficiency

Interpretability & Explainability

Emotional Fingerprinting

This section provides a holistic view of how different AI models perform across various datasets. BERTimbau generally achieves the highest accuracy, precision, and recall, especially when fine-tuned on relevant datasets. Mistral, while also strong, shows more variability, particularly on human-annotated data, indicating potential sensitivity to data nuances. EmoAtlas, though computationally efficient, often performs below the transformer models, especially in recall, highlighting its limitations when dealing with complex or context-dependent emotional expressions.

A critical consideration for enterprise deployment is the computational cost. EmoAtlas stands out for its significantly lower resource requirements, being up to 40 times faster on a CPU compared to LLMs. This makes EmoAtlas a highly cost-effective solution for large-scale, real-time emotion detection where resources are constrained, or quick, interpretable results are prioritized. Transformer models, while powerful, demand substantial GPU resources and longer processing times, increasing operational expenses.

EmoAtlas offers superior interpretability. As a lexicon-based, rule-driven model, its decision-making process is transparent and verifiable, crucial for applications requiring auditability or explanation of model predictions. In contrast, LLMs like Mistral and BERTimbau are 'black-box' models. Their high performance comes with limited transparency regarding their internal decision logic, which can be a significant drawback in regulated industries or contexts where understanding 'why' an emotion was detected is as important as 'what' emotion was detected.

The novel “emotional fingerprinting” methodology involved using LLM-generated datasets to probe the internal representations of these models. This revealed that high performance on synthetically generated data may not reflect a superior understanding of genuine human emotion, but rather an alignment with the internal feature space of the generative model. This insight is vital for enterprises considering synthetic data augmentation, as it warns against potential biases or over-optimization towards model-specific 'languages' rather than real-world emotional nuances.

40x faster on CPU Computational Cost Reduction with EmoAtlas

Enterprise Process Flow

Input Brazilian Portuguese Text

→

Syntactic Parsing & Lexical Analysis (EmoAtlas)

→

Emotion Detection (LLMs / BERTimbau)

→

Emotional Profile & Framing Output

→

Enterprise Application (e.g., Customer Insights)

Feature	EmoAtlas (Lexicon-based)	Transformer Models (BERTimbau, Mistral)
Interpretability	Transparent, rule-driven decisions Explicit emotional lexicon	Black-box predictions Limited explainability
Computational Cost	Low CPU usage Fast processing (40x faster)	High GPU/CPU demand Slower processing
Data Requirements	No large annotated corpora for training Adaptable to new languages	Requires large annotated corpora for fine-tuning Language-specific pre-training needed
Performance (Avg. Accuracy)	0.797	BERTimbau: 0.876 Mistral: 0.831

Emotion Analysis of Political Discourse on X (formerly Twitter)

The study utilized EmoAtlas to analyze tweets about COVID-19 vaccines from Brazilian political elites. This revealed how specific terms like 'vacina' (vaccine) were emotionally framed by different political sentiments (positive vs. negative). For instance, positive tweets displayed less fear and anger, and more joy, demonstrating EmoAtlas's capability to discern nuanced emotional framing within specific semantic contexts. This capability is crucial for understanding public opinion and political sentiment shifts in real-time.

Advanced ROI Calculator

Estimate the potential savings and reclaimed hours your enterprise could achieve by implementing optimized AI solutions for text analysis.

Your Industry

Employees Involved in Text Analysis (e.g., customer support, marketing, legal)

Average Hours per Week per Employee on Manual Text Analysis

Average Hourly Cost per Employee (USD)

Estimated Annual Savings $0

Annual Hours Reclaimed 0

Implementation Roadmap

A strategic phased approach for integrating advanced emotion detection capabilities into your enterprise operations.

Phase 1: Needs Assessment & Data Collection

Identify specific business objectives for emotion detection in Brazilian Portuguese. Collect relevant text corpora (e.g., customer reviews, social media feeds, news articles) and define target emotions aligned with enterprise goals. Evaluate current data annotation capabilities.

Phase 2: Model Selection & Initial Integration

Based on a balance of performance, interpretability, and computational cost, select the optimal model(s) (e.g., EmoAtlas for efficiency, BERTimbau/Mistral for higher accuracy). Begin integrating the chosen model(s) into existing data pipelines and run initial pilot tests with a small dataset.

Phase 3: Customization & Refinement

For lexicon-based models like EmoAtlas, refine lexicons for domain-specific jargon. For transformer models, consider fine-tuning with additional domain-specific annotated data (if budget and resources allow) to enhance accuracy. Employ 'emotional fingerprinting' to validate model representations on synthetic data.

Phase 4: Scaled Deployment & Monitoring

Deploy the refined emotion detection system across enterprise-wide applications (e.g., customer support, marketing analytics, risk management). Establish continuous monitoring of model performance, data drift, and computational resource usage. Implement feedback loops for ongoing improvement.

Ready to Transform Your Enterprise with AI?

Unlock deeper insights from your Brazilian Portuguese text data. Schedule a free consultation with our AI strategists to explore how these advanced emotion detection models can be tailored to your specific business needs and drive measurable impact.

Book a Free Consultation

AI Research & Development

Benchmarking Psychological Lexicons and Large Language Models for Emotion Detection in Brazilian Portuguese

Executive Impact & ROI

Deep Analysis & Enterprise Applications

Enterprise Process Flow

Emotion Analysis of Political Discourse on X (formerly Twitter)

Advanced ROI Calculator

Implementation Roadmap

Phase 1: Needs Assessment & Data Collection

Phase 2: Model Selection & Initial Integration

Phase 3: Customization & Refinement

Phase 4: Scaled Deployment & Monitoring

Ready to Transform Your Enterprise with AI?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai