Semantic Error Correction and Decoding for Short Block Codes

Revolutionizing URLLC: Semantic-Enhanced Error Control for 5G and Beyond

This paper presents a semantic-enhanced receiver framework for transmitting natural language sentences over noisy wireless channels using multiple short block codes. After ASCII encoding, the sentence is divided into segments, each independently encoded with a short block code and transmitted over an AWGN channel. At the receiver, segments are decoded in parallel, followed by a semantic error correction (SEC) model, which reconstructs corrupted segments using language model context. We further propose the semantic list decoding (SLD), which generates multiple candidate reconstructions and selects the best one via weighted Hamming distance. Moreover, a semantic confidence-guided HARQ (SHARQ) mechanism is designed to replaces Cyclic Redundancy Check (CRC) with a semantic confidence score, enabling selective segment retransmission. We analyze the block error rate (BLER) for the proposed framework and discuss the tradeoff between the semantic gain from segmentation and the finite-blocklength penalty of shorter codes. Simulation results demonstrate that SEC provides approximately 0.4 dB BLER gain over plain short-code transmission, while SLD extends this to 0.8 dB. Compared to transmitting the entire sentence as a single long 5G LDPC codeword, our approach significantly improves semantic fidelity and reduces decoding latency by up to 90%. SHARQ further provides an additional 1.5 dB gain over conventional HARQ.

Schedule Your Strategy Session

Author: Jiafu Hao, Chentao Yue, Wanchun Liu, Branka Vucetic, Yonghui Li
Publication: arXiv:2604.22269v2 [cs.IT] 29 Apr 2026

Short block codessemantic error controllarge language modelshybrid automatic repeat request

Executive Impact & Core Metrics

This research redefines reliable, low-latency communication by integrating advanced semantic processing with short block codes. Key outcomes include substantial improvements in error correction, semantic fidelity, and decoding efficiency, making URLLC more attainable for critical enterprise applications.

0 Decoding Latency Reduction

0 BLER Gain (SLD)

0 SHARQ Performance Gain

0 Semantic Fidelity (BLEU)

Discuss Your Implementation

Deep Analysis & Enterprise Applications

Select a topic to dive deeper, then explore the specific findings from the research, rebuilt as interactive, enterprise-focused modules.

Short Block Codes

Semantic Error Control (SEC)

Semantic List Decoding (SLD)

Hybrid Automatic Repeat Request (HARQ)

Large Language Models (LLMs)

Ordered Statistics Decoding (OSD)

Block Error Rate (BLER)

Semantic Fidelity (BLEU/ROUGE-L)

Short Block Codes

Description: Channel codes with small block lengths, optimized for low-latency communication in URLLC scenarios, often facing finite-blocklength performance penalties.

Relevance: Core to the proposed MSC framework, addressing URLLC latency constraints.

Semantic Error Control (SEC)

Description: Utilizing linguistic context (e.g., from Large Language Models) to reconstruct or correct data segments that failed channel decoding, improving semantic fidelity.

Relevance: Introduced as a key module to leverage contextual information for error correction beyond bit-level accuracy.

Semantic List Decoding (SLD)

Description: An advanced decoding technique generating multiple semantically plausible candidate reconstructions for unreliable segments and selecting the best one based on weighted Hamming distance and channel reliability.

Relevance: Refines SEC by integrating channel information to distinguish between semantically plausible but incorrect reconstructions.

Hybrid Automatic Repeat Request (HARQ)

Description: A combination of error-correcting codes and automatic retransmission requests, where retransmissions are triggered for segments identified as unreliable, potentially guided by semantic confidence.

Relevance: Enhanced with semantic confidence (SHARQ) to replace CRC, enabling segment-selective and more efficient retransmissions.

Large Language Models (LLMs)

Description: Pre-trained neural networks, like BART, capable of understanding and generating human-like text, used here for contextual inference and semantic reconstruction.

Relevance: The core technology enabling SEC and SLD by providing contextual reasoning to correct errors.

Ordered Statistics Decoding (OSD)

Description: A near-maximum likelihood decoding algorithm for linear block codes, suitable for short block codes, involving systematic form conversion and testing error patterns.

Relevance: The chosen channel decoding method for individual segments in the MSC framework.

Block Error Rate (BLER)

Description: The probability that a transmitted block (or in this paper, a sentence) contains at least one uncorrected error after decoding. A key performance metric.

Relevance: Evaluated to quantify the reliability gains of SEC/SLD/SHARQ over conventional methods.

Semantic Fidelity (BLEU/ROUGE-L)

Description: Metrics used to evaluate the quality and accuracy of reconstructed text based on n-gram precision and longest common subsequence similarity, assessing meaning preservation.

Relevance: Crucial for evaluating the effectiveness of semantic enhancements beyond traditional bit-level accuracy.

90% Reduction in Decoding Latency

The proposed MSC-SLD framework significantly reduces per-sentence decoding latency by up to 90% compared to a single long LDPC codeword, making it suitable for URLLC applications.

Semantic-Enhanced Receiver Framework

ASCII Encoding & Segmentation

→

Parallel Short Code Encoding

→

AWGN Channel Transmission

→

Parallel OSD Decoding

→

Semantic Error Correction (SEC)

→

Semantic List Decoding (SLD)

→

Semantic Confidence-Guided HARQ (SHARQ)

→

Final Decoded Sentence

Performance Comparison: MSC vs. LC
Feature	Multi-Short Code (MSC) Framework	Long Code (LC) Baseline
Channel Coding	Multiple short block codes per segment	Single long LDPC codeword for entire sentence
Error Localization	Errors confined to individual segments, preserving context	Decoding failure corrupts entire sentence, no context for recovery
Latency	Significantly reduced (up to 90% faster) due to parallel decoding	High latency due to long blocklength and sequential processing
Semantic Recovery	Leverages intact segments for LLM context, enabling effective SEC/SLD	Limited semantic recovery due to total loss of context on failure
HARQ	Semantic confidence-guided (SHARQ) for segment-selective retransmission	CRC-based, retransmits entire message or additional parity bits
BLER Gain	SEC provides ~0.4 dB, SLD ~0.8 dB over plain MSC	Negligible BLER improvement with SEC due to context loss
Semantic Fidelity	Superior BLEU/ROUGE scores, especially at low SNR	Near-zero BLEU/ROUGE on failure due to complete corruption

1.5 dB Additional Gain from SHARQ

The Semantic Confidence-Guided HARQ (SHARQ) mechanism provides an additional 1.5 dB BLER gain over conventional HARQ at the same target BLER, eliminating CRC overhead.

LLM Integration for Enhanced Reliability

The paper demonstrates how a fine-tuned BART model, acting as a denoising autoencoder, significantly improves error correction beyond traditional channel decoding. By exploiting cross-segment context through bidirectional self-attention, the LLM can reconstruct corrupted segments with high semantic fidelity, even when channel decoding fails. This approach maintains the conventional source-channel separation architecture, making it practical for real-world deployment without replacing entire communication modules. The key is localizing channel decoding failures to provide reliable context for the LLM.

BART Model: Fine-tuned for channel-specific error patterns, outperforming pretrained models.
Contextual Reconstruction: Bidirectional self-attention allows robust reconstruction from surrounding, correctly decoded segments.
Practicality: Integrates with existing infrastructure by preserving source-channel separation.
Performance Uplift: Contributes significantly to BLER and semantic fidelity gains.

Learn More About AI in Telecom

Calculate Your Potential ROI

Estimate the efficiency gains and cost savings for your enterprise by implementing semantic-enhanced communication technologies.

Your Industry Sector

Number of Employees Impacted

Avg. Hours/Week on Communication Tasks

Avg. Hourly Cost per Employee ($)

Estimated Annual Savings $0

Annual Hours Reclaimed 0

Your Implementation Roadmap

A typical journey to integrate semantic communication into your enterprise infrastructure, tailored for efficiency and impact.

Phase 1: Discovery & Strategy

Initial assessment of existing communication systems, identification of key use cases, and development of a tailored semantic communication strategy. This phase includes a detailed ROI projection.

Phase 2: Pilot Program & Customization

Deployment of a pilot semantic-enhanced receiver framework on a small scale. Customization of LLM models (e.g., BART) to align with specific enterprise data and communication patterns.

Phase 3: Integration & Scaling

Seamless integration with existing physical layer and application-level systems. Expansion of semantic error control and decoding across enterprise-wide communication channels for maximum impact.

Phase 4: Optimization & Future-Proofing

Continuous monitoring, performance optimization, and adaptation to evolving communication standards (e.g., 6G URLLC). Training and support for your internal teams.

Start Your AI Transformation

Ready to Enhance Your Enterprise Communication?

Leverage cutting-edge semantic error correction and decoding to achieve ultra-reliable, low-latency communication. Book a free consultation to see how these innovations can benefit your organization.

Book a Free Consultation

Semantic Error Correction and Decoding for Short Block Codes

Revolutionizing URLLC: Semantic-Enhanced Error Control for 5G and Beyond

Executive Impact & Core Metrics

Deep Analysis & Enterprise Applications

Short Block Codes

Semantic Error Control (SEC)

Semantic List Decoding (SLD)

Hybrid Automatic Repeat Request (HARQ)

Large Language Models (LLMs)

Ordered Statistics Decoding (OSD)

Block Error Rate (BLER)

Semantic Fidelity (BLEU/ROUGE-L)

Semantic-Enhanced Receiver Framework

Performance Comparison: MSC vs. LC

LLM Integration for Enhanced Reliability

Calculate Your Potential ROI

Your Implementation Roadmap

Phase 1: Discovery & Strategy

Phase 2: Pilot Program & Customization

Phase 3: Integration & Scaling

Phase 4: Optimization & Future-Proofing

Ready to Enhance Your Enterprise Communication?

Ready to Get Started?

Book Your Free Consultation.

Let's Discuss Your AI Strategy!

Lets Discuss Your Needs

Select Time Zone

Big Competitive Advantage With Ai

Learn More

Our Demos

Research Center

Jobs

Contact Us

1 888 985 3025

Solutions@OwnYourAi.com

Get Your Ai