Saleh Soltan
Principal Applied Scientist @Amazon
Amazon AGI Team
Email: firstname@ee.columbia.edu
Research Interests: Machine Learning, Natural Language Processing, Algorithms, Network Science
Soltan, Saleh, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, et al. “AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model.” arXiv, August 3, 2022. https://doi.org/10.48550/arXiv.2208.01448.
FitzGerald, Jack, Soltan, Saleh, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, Jin Cao, et al. “Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems.” In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2893–2902, 2022. https://doi.org/10.1145/3534678.3539173.
Zhu, Qile, Haidar Khan, Saleh Soltan, Stephen Rawls, and Wael Hamza. “Don’t Parse, Insert: Multilingual Semantic Parsing with Insertion Based Decoding.” In Proceedings of the 24th Conference on Computational Natural Language Learning, edited by Raquel Fernández and Tal Linzen, 496–506. Online: Association for Computational Linguistics, 2020. https://doi.org/10.18653/v1/2020.conll-1.40.
Soltan, Saleh, Andy Rosenbaum, Tobias Falke, Qin Lu, Anna Rumshisky, and Wael Hamza. “Recipes for Sequential Pre-Training of Multilingual Encoder and Seq2Seq Models.” In Findings of the Association for Computational Linguistics: ACL 2023, 9380–94. Toronto, Canada: Association for Computational Linguistics, 2023. https://doi.org/10.18653/v1/2023.findings-acl.598.
Soltan, Saleh, Victor Soto, Ke Tran, and Wael Hamza. “A Hybrid Approach to Cross-Lingual Product Review Summarization.” In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track, 18–28. Abu Dhabi, UAE: Association for Computational Linguistics, 2022. https://doi.org/10.18653/v1/2022.emnlp-industry.3.
Rosenbaum, Andy, Saleh Soltan, Wael Hamza, Yannick Versley, and Markus Boese. “LINGUIST: Language Model Instruction Tuning to Generate Annotated Utterances for Intent Classification and Slot Tagging.” In Proceedings of the 29th International Conference on Computational Linguistics, edited by Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, et al., 218–41. Gyeongju, Republic of Korea: International Committee on Computational Linguistics, 2022. https://aclanthology.org/2022.coling-1.18.
Rosenbaum, Andy, Saleh Soltan, Wael Hamza, Marco Damonte, Isabel Groves, and Amir Saffari. “CLASP: Few-Shot Cross-Lingual Data Augmentation for Semantic Parsing.” In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), edited by Yulan He, Heng Ji, Sujian Li, Yang Liu, and Chua-Hui Chang, 444–62. Online only: Association for Computational Linguistics, 2022. https://aclanthology.org/2022.aacl-short.56.
Saleh Soltan, Mihalis Yannakakis, and Gil Zussman, "Doubly Balanced Connected Graph Partitioning," in Proc. ACM-SIAM SODA'17, Jan. 2017. [Download] [Extended Version (arXiv)]
Jury Award from the Department of Electrical Engineering at Columbia University (highest recognition by the department awarded annually to two recent Ph.D. graduates for outstanding achievements in the areas of systems, communications, signal processing, or circuits), 2018
Second place prize at Siemens Future Makers Challenge/Hackathon at Princeton University, 2018
Armstrong Memorial Award from the Department of Electrical Engineering at Columbia University (awarded annually to one outstanding candidate for the M.S.), 2013
Exempted from Iran's National Qualification Exam for undergraduate program as an exceptional talent, 2006
Gold Medalist of the 23rd National Mathematics Olympiad (among top 12 students in the nation chosen for Iran International Mathematics Olympiad team), 2005
Bronze Medalist of the 22nd National Mathematics Olympiad (among top 42 students in Iran), 2004