Publications
Publications
Objective: COLT aims to reach high sparsity levels in neural networks while preserving accuracy, challenging the existing benchmarks set by the Lottery Ticket Hypothesis (LTH).
Method: In the COLT method, a dataset is partitioned into N segments, and distinct models are trained on each segment, maintaining identical initial weights. For subsequent pruning iterations, only the intersecting pruned weights from these N models are retained.
Outcome: Unlike LTH, COLT achieves optimal 'winning lottery tickets' (efficiently pruned networks) in fewer iterations, thereby cutting down on computational expenses. Through successful trials on Cifar-10, Cifar-100, tinyand TinyImageNet, COLT not only demonstrates superior performance over current leading methods but also shows remarkable dataset transferability, enhancing both the convergence and compression aspects of neural network pruning.
Precision-Driven Low-Resource Speech Synthesis For Bangla Text-To-Speech System
Authors : Tabassum Sadia Shahjahan, Md. Ismail Hossain, Kazi Rafat, Md Ruhul Amin, Fuad Rahman, Nabeel Mohammed
PML4LRS, ICLR24
Role: Researcher
paper / code
BORO-LILA-BOTI: Unlocking Bias Mitigation in Bangla Handwritten Word Recognition Through Inter-Linguistic Character-Level Teacher Model Insights
Authors : Tabassum Sadia Shahjahan, Mahima Ahsan, Mohammed Farhan Islam, Md. Ismail Hossain, Fuad Rahman, and Nabeel Mohammed.
Under Review IEEE Access
Role: Supervisor
paper / code
Objective: Mitigate bias in handwritten OCR models for morphologically rich languages.
Method: Trained a selected set of cross-lingual balanced printed graphemes (generated) using cross-entropy loss and extracted the knowledge from it.
Outcome: Applied this knowledge to an OCR model trained on word-level handwritten data with CTC loss to enhance the performance of minor classes.
Mitigating Carbon Footprint of Hyper-parameter Selection During Knowledge Distillation
Authors : Kazi Rafat,Sadia Islam, Abdullah Al Mahfug,Md. Ismail Hossain, Fuad Rahman, Sifat Momen, Shafin Rahman, and Nabeel Mohammed
PLOS ONE 2023
Role: Co-Supervisor
paper / code
Objective: To address and reduce the high carbon footprint associated with knowledge distillation in deep learning models.
Method: Implementing a stochastic approach for selecting hyperparameters, aimed at minimizing energy consumption and CO2 emissions.
Outcome: Introducing and utilizing new metrics for assessing environmental costs (like GFLOPS, energy usage, and CO2 equivalents) in deep learning and promoting eco-friendly AI practices.
Authors : Saddam Al Amin,Md. Saddam Hossain Mukta,Md Sezan Mahmud Saikat,Md. Ismail Hossain, Md. Adnanul Islam,Md Mohiuddin Ahmed and Sami Azam
IEEE Acess, 2023
Role: Research Assistant
Objective: To distinguish intentional from unintentional opioid use through the application of machine learning and deep learning methods.
Method: Utilized structured and unstructured data from the MIMIC-III database, analyzing 455 patient cases, and employed knowledge distillation techniques to transfer insights from structured to unstructured datasets.
Outcome: Achieved 95% accuracy in identifying intentional users and 64% for accidental users in tests, with a further refined knowledge-based test accuracy of 76.44% after integrating both models. The study also offers new analytical insights into opioid patient profiles.
Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models
Authors : Mohammed Rakib, Md. Ismail Hossain, Nabeel Mohammed and Fuad Rahman
ICSCA, 2023
Role: Project Co-Lead
Objective: To surpass current state-of-the-art (SOTA) pretrained Bengali Automatic Speech Recognition (ASR) models.
Method: Finetuning a pretrained wav2vec2 model on the Common Voice dataset, coupled with the integration of an n-gram language model as a post-processor.
Outcome: Conducted various experiments and hyperparameter tuning to develop a more robust Bangla ASR model, achieving superior performance compared to existing models.
Authors : Md. Ismail Hossain, Mohammed Rakib, Sabbir Mollah, Fuad Rahman, and Nabeel Mohammed.
ICPR , 2022
Role: Project Lead
Objective: Mitigate bias in handwritten OCR models for morphologically rich languages.
Method: Trained a selected set of balanced printed graphemes (generated) using cross-entropy loss and extracted the knowledge from it.
Application: Applied this knowledge to an OCR model trained on word-level handwritten data with CTC loss to enhance the performance of minor classes.
FACT: Drawing inspiration from the rich heritage of Bengali literature and the legendary mathematician Bhaskar II, who poignantly named his mathematical treatise "Lilavati" after his daughter, combined with the emotional depth in Bangladeshi author Humayun Ahmed's novel and personal story, the name "Lilaboti" was chosen for our paper.
IoT Based Air PollutionMonitoring & Prediction System
Authors : Mohammed Rakib, Sanaulla Haq,Md. Ismail Hossain, and Tanzilur Rahman
ICISET, 2022
Role: Project Co-Lead
Objective: To monitor current air pollution levels and accurately predict future environmental pollution levels.
Method: Utilization of the ARIMA model, implemented via the Statsmodels package, for forecasting the levels of various pollutants.
Outcome: This system serves as a tool for both real-time pollution detection and future pollution level prediction, enhancing environmental monitoring capabilities.
Ongoing Project
1. Dataset Distillation:
Objective: Develop an innovative distillation algorithm leveraging model uncertainty or perception.
2. Channel Pruning:
Objective: Create an efficient channel pruning algorithm to identify and remove redundant channels.
3. 3D Model Compression:
Objective: Implement a compression technique for 3D models trained on point cloud data.
Projects
OCR (optical character Recognition ) Development
Associated with: Apurba-NSU R&D Lab, Apurba Technology, EBLICT- ICT, the Government of Bangladesh.
Funded by: EBLICT- ICT, the Government of Bangladesh
Processing the raw-data
Developing an end-to-end model
Introduced two new knowledge distillation technique
Improve the performance of minor classes
Quantized the final model
Presents and write research paper
Opioid Misuse investigation
Associated with: United International University, REI Systems
Funded by: United International University
Classified misused opioid user from structured and unstructured data
Distill knowledge from structured data to unstructured data
Trustworthy modeling
ASR Development
Associated with: Government of Bangladesh
Funded by: Government of Bangladesh
Developed a state-of-the-art ASR model on Bangla.