-
Bangladesh University of Engineering and Technology
- Bangladesh
- csebuetnlp.github.io
-
BanglaSocialBias Public
This is the official repository containing all the code used to generate the results reported in the paper titled "Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Rel…
-
banglanmt Public
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in…
-
IllusionVQA Public
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
-
BanglaEmotionBias Public
This is the official repository containing all the code used to generate the results reported in the paper titled "An Empirical Study of Gendered Stereotypes in Emotional Attributes for Bangla in Mult…
-
BanglaContextualBias Public
This is the official repository containing all the code used to generate the results reported in the paper titled "An Empirical Study on the Characteristics of Bias upon Context Length Variation for B…
-
normalizer Public
This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine tr…
-
CrossSum Public
This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st…
-
xl-sum Public
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Co…
-
BanglaNLG Public
This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaNLG: Benchmarks and Resources for …
-
banglabert Public
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining…
-
banglaparaphrase Public
This repository contains the code, data, and associated models of the paper titled "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset", accepted in Proceedings of the Asia-Pacific Chapter …
-
CoDesc Public
Forked from code-desc/CoDesc
A large dataset of 4.2M Java source code samples with parallel descriptions, drawn from code search and code summarization studies.
-
TransCoder Public
Forked from code-desc/TransCoder
Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf