Publication Details

Total Publications (Journals, Conferences, Book chapters): 22

Citation score: 252; H-index: 9; i10-index: 9

Journals

S. Dhar, N. D. Jana and S. Das, "GLGAN-VC: A Guided Loss based Generative Adversarial Network for Many-To-Many Voice Conversion", Accepted in IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023, doi: 10.1109/TNNLS.2023.3335119 (Index: SCI, Q1 journal, IF: 11.83). (Journal link: https://ieeexplore.ieee.org/document/10339641) [Related to my PhD thesis]
S. Dhar, N. D. Jana, S. Das, "An Adaptive-Learning-Based Generative Adversarial Network for One-to-One Voice Conversion," in IEEE Transactions on Artificial Intelligence (TAI), vol. 4, no. 1, pp. 92-106, Feb. 2023, doi: 10.1109/TAI.2022.3149858 (Index: SCOPUS, Q1 journal, IF: 5.94). (Journal link: https://ieeexplore.ieee.org/abstract/document/9709124, arXiv version: https://arxiv.org/abs/2104.12159). [Related to my PhD thesis]

Conferences

S. Dhar, N. Shah, A. Gudmalwar and P. Wasnik, "Adaptive Oscillatory Inductive Bias for Modeling Sharp Prosodic Dynamics in Diffusion-Based TTS", Accepted in Interspeech 2026.
S. Dhar, S. R. Chetupalli and P. Rao, "Speaker Anonymization for Children's Oral Reading Assessment", Accepted in the 16th Symposium on Educational Advances in Artificial Intelligence (EAAI-26, Special Track of AAAI-26, Singapore), 2026.
S. Dhar, M. Gupta and P. Rao, "LAPS-Diff: A Diffusion-Based Framework for Hindi Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning", Accepted in 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC, Singapore) 2025.
S. Dhar, M. T. Akhter, N. D. Jana, S. Das, M. Swain, S. Chowdhury, "Collective Learning Mechanism based Optimal Transport GAN with Multi-Level Fine-Grained and Global Discriminators for Voice Conversion", Accepted in 17th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC, Singapore) 2025. [Related to my PhD thesis]
A. Nandi, S. Ghosh, M. T. Akhter, S. Dhar and N. D. Jana, "Correlations of Evaluation Metrics for Voice Conversion: An Experimental Analysis", in 15th IEEE International Conference on Computing, Communication and Networking Technologies (ICCCNT-2024, IIT Mandi, India), pp. 1-7, doi: 10.1109/ICCCNT61001.2024.10725801, link: https://ieeexplore.ieee.org/abstract/document/10725801.
A. Sen, A. Mazumder, D. Dutta, U. Sen, P. Syam, and S. Dhar (2023). Comparative Evaluation of Metaheuristic Algorithms for Hyperparameter Selection in Short-Term Weather Forecasting. In Proceedings of the 15th International Joint Conference on Computational Intelligence - ECTA (Rome, Italy); ISBN 978-989-758-674-3; ISSN 2184-3236, SciTePress, pages 238-245. DOI: 10.5220/0012187300003595, link: https://www.scitepress.org/PublicationsDetail.aspx?ID=vMqLI8/hLbg=&t=1.
S. Dhar, A. Sen, A. Bandyopadhyay, N. D. Jana, A. Ghosh, and Z. Sarayloo (2023). Differential Evolution Algorithm Based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition. In Proceedings of the 15th International Joint Conference on Computational Intelligence - ECTA (Rome, Italy); ISBN 978-989-758-674-3; ISSN 2184-3236, SciTePress, pages 315-322. DOI: 10.5220/0012251500003595, link: https://www.scitepress.org/PublicationsDetail.aspx?ID=zfkC9QqRY34=&t=1.
S. Ghosh, S. Dhar, R. Yoddha, S. Kumar, A. K. Thakurb, N. D. Jana, "Melanoma Skin Cancer Detection Using Ensemble of Machine Learning Models Considering Deep Feature Embeddings", 2nd International Conference on Machine Learning and Data Engineering (ICMLDE 2023, UPES, India), doi: 10.1016/j.procs.2024.04.284, link: https://authors.elsevier.com/sd/article/S1877050924009633.
M. T. Akhter, P. Banerjee, S. Dhar, S. Ghosh, N. D. Jana, "Region Normalized Capsule Network Based Generative Adversarial Network for Non-Parallel Voice Conversion", 25th International Conference on Speech and Computer. Lecture Notes in Computer Science, vol 14338. Springer, Cham. https://doi.org/10.1007/978-3-031-48309-7_20. (SPECOM 2023, Dharwad, India), link: https://link.springer.com/chapter/10.1007/978-3-031-48309-7_20.
S. Dhar, M. T. Akhter, P. Banerjee, N. D. Jana and S. Das, "FID-RPRGAN-VC: Fréchet Inception Distance Loss based Region-wise Position Normalized Relativistic GAN for Non-Parallel Voice Conversion," 2023 15th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Taipei, Taiwan, 2023, pp. 350-356, doi: 10.1109/APSIPAASC58517.2023.10317438, link: https://ieeexplore.ieee.org/document/10317438. [Related to my PhD thesis]
S. Das, S. Dhar and N. D. Jana, "Convolutional Feature based Vision Transformer Model for Speech Command Recognition," 2023 IEEE 20th India Council International Conference (INDICON), Hyderabad, India, 2023, pp. 228-232, doi: 10.1109/INDICON59947.2023.10440809, link: https://ieeexplore.ieee.org/document/10440809.
S. Dhar, P. Banerjee, N. D. Jana and S. Das, "Voice Conversion Using Feature Specific Loss Function Based Self-Attentive Generative Adversarial Network," ICASSP 2023 - 2023 IEEE 48th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095069, link: https://ieeexplore.ieee.org/abstract/document/10095069. [Related to my PhD thesis]
M. T. Akhter, P. Banerjee, S. Dhar and N. D. Jana, "An Analysis of Performance Evaluation Metrics for Voice Conversion Models," 2022 IEEE 19th India Council International Conference (INDICON), Kochi, India, 2022, pp. 1-6, doi: 10.1109/INDICON56171.2022.10040000, link: https://ieeexplore.ieee.org/document/10040000.
S. Dhar, A. Vishwakarma, D. Ghanti and N. D. Jana, "Ensemble Learning based Plant Leaf Disease Classification Considering Deep Convolutional Features from Pre-trained CNN," 2022 IEEE 6th Conference on Information and Communication Technology (CICT), Gwalior, India, 2022, pp. 1-6, doi: 10.1109/CICT56698.2022.9997819, link: https://ieeexplore.ieee.org/document/9997819.

Book Chapters

Ghosh, S., Dhar, S., Jana, N.D. (2024). A Comprehensive Analysis on Features and Performance Evaluation Metrics in Audio-Visual Voice Conversion. In: Verma, A., Verma, P., Pattanaik, K.K., Dhurandher, S.K., Woungang, I. (eds) Advanced Network Technologies and Intelligent Computing. (ANTIC, BHU, Varanasi, India). Communications in Computer and Information Science, vol 2092. Springer, Cham. https://doi.org/10.1007/978-3-031-64070-4_19, link: https://link.springer.com/chapter/10.1007/978-3-031-64070-4_19
Jana, N.D., Dhar, S., Ghosh, S., Phukan, S., Gogoi, R., Singh, J. (2024). An Ensemble of Machine Learning Models Utilizing Deep Convolutional Features for Medical Image Classification. In: Verma, A., Verma, P., Pattanaik, K.K., Dhurandher, S.K., Woungang, I. (eds) Advanced Network Technologies and Intelligent Computing. (ANTIC, BHU, Varanasi, India) 2023. Communications in Computer and Information Science, vol 2092. Springer, Cham. https://doi.org/10.1007/978-3-031-64070-4_24, link: https://link.springer.com/chapter/10.1007/978-3-031-64070-4_24
Dhar, S., Ghosh, A., Roy, S., Mazumder, A., Jana, N.D. (2023). Hyperparameter Optimization of CNN Using Genetic Algorithm for Speech Command Recognition. In: Das, S., Saha, S., Coello Coello, C.A., Bansal, J.C. (eds) Advances in Data-driven Computing and Intelligent Systems. Lecture Notes in Networks and Systems, vol 653. Springer, Singapore. https://doi.org/10.1007/978-981-99-0981-0_10.
Dhar, S., Phukan, S., Gogoi, R., Jana, N.D. (2023). Speaker Identification Using Ensemble Learning With Deep Convolutional Features. In: Das, S., Saha, S., Coello Coello, C.A., Bansal, J.C. (eds) Advances in Data-driven Computing and Intelligent Systems. Lecture Notes in Networks and Systems, vol 653. Springer, Singapore. https://doi.org/10.1007/978-981-99-0981-0_9.
Phukan, S., Singh, J., Gogoi, R., Dhar, S., Jana, N.D. (2022). COVID-19 Chest X-ray Image Generation Using ResNet-DCGAN Model. In: Mohanty, M.N., Das, S. (eds) Advances in Intelligent Computing and Communication. Lecture Notes in Networks and Systems, vol 430. Springer, Singapore. https://doi.org/10.1007/978-981-19-0825-5_24.
Mazumder, A., Ghosh, S., Roy, S., Dhar, S., Jana, N.D. (2022). Rectified Adam Optimizer-Based CNN Model for Speaker Identification. In: Mohanty, M.N., Das, S. (eds) Advances in Intelligent Computing and Communication. Lecture Notes in Networks and Systems, vol 430. Springer, Singapore. https://doi.org/10.1007/978-981-19-0825-5_16.

ArXiv Preprints

S. Dhar, M. Gupta and P. Rao, "LAPS-Diff: A Diffusion-Based Framework for Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning", in ArXiv, 2025 (link: https://arxiv.org/abs/2507.04966).
S. Dhar, M. T. Akhter, N. D. Jana and S. Das, "Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion", in ArXiv, 2025 (link: https://arxiv.org/abs/2504.13791). [Related to my PhD thesis]
S. Dhar, N. D. Jana and S. Das, "Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements", in ArXiv, 2025 (link: https://arxiv.org/abs/2504.19197). [Related to my PhD thesis]
