Accurate sentiment analysis in Bangla remains a significant research challenge due to limited annotated corpora, complex morphology, insufficient linguistic resources, and the absence of interpretable concept-level knowledge bases. Existing approaches often struggle to capture context-dependent sentiment, idiomatic expressions, and domain adaptability, further constrained by the low-resource nature of the language. To address these limitations, this study proposes the Bangla Concept-Level Sentiment Analysis (BCLSA) framework, introducing two dedicated algorithms: a Bangla-specific concept extraction method and the Concept-Level Sentiment Analysis for Bangla (CLSA-Bn) weighted scoring algorithm. The first extracts sentiment-bearing concepts through syntactic pattern recognition, multiword expression detection, and affective lexicon mapping, while the second refines polarity estimation via negation handling, modifier scaling, and weighted aggregation for interpretable classification. To mitigate data scarcity and morphological variation, BCLSA applies language-specific preprocessing, including Unicode normalization, phonetic correction, and lemmatization. Evaluations on 10,243 formal news articles and 12,084 informal social media comments show that CLSA-Bn outperforms the Bi-LSTM and SVM baselines, achieving 90.2 % Accuracy, 90 % Macro-F1, 85 % Matthews Correlation Coefficient (MCC), and 94 % Area Under the Curve (AUC) for formal text, and 86.8 % Accuracy, 86 % Macro-F1, and 91 % AUC for informal text. The proposed Concept-Level Polarity Accuracy (CLPA) metric confirmed semantic fidelity above 88 %. Efficiency analysis revealed that CLSA-Bn requires only 30 s initialization, 5 ms inference, and a 50 MB model. Error rate analysis further confirmed robustness with the lowest misclassification ratios (9.8 % formal, 13.2 % informal), demonstrating balanced improvement in performance and error minimization.
扫码关注我们
求助内容:
应助结果提醒方式:
