cv
For my full CV or more details, please contact me directly.
Basics
Name | Md Mubtasim Ahasan |
mubtasimahasan@gmail.com | |
Url | https://mubtasimahasan.github.io/ |
Work
-
2024.10 - Present Dhaka, Bangladesh
-
2024.03 - 2024.10 Dhaka, Bangladesh
Research Assistant (Part-Time)
Center for Computational & Data Sciences (CCDS), Independent University
-
2021.01 - 2022.01 Dhaka, Bangladesh
Undergraduate Research Student
Computing for Sustainability and Social Good (C2SG) Lab, Brac University
Education
-
2018.01 - 2022.05 Dhaka, Bangladesh
Publications
-
2024.10.19 DM-Codec: Distilling Multimodal Representations for Speech Tokenization
Preprint
We propose two novel distillation approaches: (1) a language model (LM)-guided distillation method that incorporates contextual information, and (2) a combined LM and self-supervised speech model (SM)-guided distillation technique that effectively distills multimodal representations (acoustic, semantic, and contextual) into a comprehensive speech tokenizer, termed DM-Codec. The DM-Codec architecture adopts a streamlined encoder-decoder framework with a Residual Vector Quantizer (RVQ) and incorporates the LM and SM during the training process. Experiments show DM-Codec significantly outperforms state-of-the-art speech tokenization models, reducing WER by up to 13.46%, WIL by 9.82%, and improving speech quality by 5.84% and intelligibility by 1.85% on the LibriSpeech benchmark dataset.
-
2022.12.30 Classification of Respiratory Diseases and COVID-19 from Respiratory and Cough Sounds
IEEE
We present a comparative analysis of deep learning techniques for identifying respiratory diseases, including COVID-19, from respiratory and cough sound recordings. The study proposes methods to extract image representations of audio features, such as Mel-Spectrograms and Mel-Frequency Cepstral Coefficients (MFCC), from each recording. Following feature extraction, we compare the classification performance of ten distinct convolutional neural network (CNN) models. Additionally, we examine various model training techniques, including the 1cycle policy, transfer learning, and balanced mini-batch training, to identify the most effective training approach for disease detection.
Languages
Bengali | |
Native speaker |
English | |
Fluent |