M Saiful Bari (Maruf) is the Training Lead
and one of the Core Maintainers
of ALLaM, a sovereign foundational model for English and Arabic language technologies. His research focuses on understanding and advancing large language models, particularly in the areas of scaling
, training dynamics
, and systematic evaluation of frontier
(-> superintelligence) models. His lab investigates these aspects through three primary areas: (1) scaling behaviors and training dynamics of large language models in terms of data
and truthfulness
, (2) developing robust evaluation methodologies for frontier class (-> superintelligence) models with scalable-oversight
, and (3) exploring efficient learning paradigms through transfer learning at scale
. This research combines theoretical frameworks with large-scale empirical studies, leading to both methodological innovations and practical applications.
Maruf received his Ph.D. from Nanyang Technological University
under the supervision of Prof. Shafiq Joty
, where his thesis explored transfer learning for large language model adaptation, addressing the fundamental question: How can you learn so much from so little?
. During his doctoral studies, he attended three internships at Amazon Web Services
(2021-2023) and made substantial contributions to the BLOOM LLM development
, particularly in its architecture design
, pretraining
, and prompt engineering efforts
.
17 October | One paper has been accepted at EMNLP 2024 on evaluation. See you in Miami. |
---|---|
13 October | Our paper on scaling evaluation (`scalable oversight`) for frontier-class models has just dropped. Feel free to reach out to me for details. |
10 October | We announced ALLaM during the keynote of the Global AI Summit as one of the main priorities for sovereign AI in the Kingdom of Saudi Arabia. We also released the model and evaluation details for the 34B pretraining from scratch model. Keynote [YouTube Link]. |
22 July | We released our technical paper on ALLaM. Feel free to reach out to me if you have any queries. |
22 May | My lab's alignment work was recently released. ALLaM was revealed at IBM Think Keynote [Youtube Link]. ALLaM is a nationwide LLM effort of Saudi Arabia. Paper coming soon ... |
15 May | Our paper on xCodeEval has been accepted at ACL 2024. Unfortunately, I won't be traveling to Thailand :(. |
16 May | Two paper accepted at ACL'24. See you at Bangkok. |
6 Aug | Joined SDAIA as a Senior Resarch Scientist |
---|---|
7 May | Three paper accepted at ACL'23. See you at Toronto. |
15 March | Check out our recent pre-print xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval paper. |
20 Dec | Check out our recent pre-print SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning paper. |
---|---|
18 Dec | Check out our recent pre-print BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting paper. |
26 Nov | Returned from Amazon d2l summer internship. |
3 Nov | Check out our recent pre-print Crosslingual Generalization through Multitask Finetuning paper. |
1-Oct | Our paper (What Language Model to Train if You Have One Million GPU Hours?) got accepted at EMNLP'22 findings. |
05-Jul | Joined Amazon d2l team as a summer intern. |
11-Apr | Check out our new ACL'22 workshop paper What Language Model to Train if You Have One Million GPU Hours?. |
7-Feb | Check out our pre-print PromptSource paper. |
2-Feb | Check out my recent talk on T0++ paper. |
19-Nov | Our paper T0++ got accepted as a spotlight paper at ICLR'22 |
---|---|
15-Oct | Our paper Multitask Prompted Training Enables Zero-Shot Task Generalization is online. Model (T0++) , Dataset (P3) |
22-Sept | Check out my Recent talk on Finetuned Language Models Are Zero-Shot Learners at NTU-NLP lab. |
25-Aug | My internship work from Amazon got accepted as a short paper in EMNLP. |
3-May | Two paper UXLA (in main conference) and AugVic (in findings) got accepted in ACL-IJCNLP-2021 . |
6-Nov | Get back to the PhD study after completing my internship with Amazon Lex Team. |
---|---|
15-Oct | Our paper LNMap is accepted in EMNLP-2020. |
03-Aug | I will be starting my internship at Amazon, with Lex Team. |
28-Aug | Passed my PhD Qualifying Examination (QE). |
28-Apr | Preprint of our new paper LNMap released in Arxiv. |
24-Apr | Preprint of our new paper MultiMix released in Arxiv. |
25-Mar | Gave a talk on mBART (paper ) in NTU-NLP. |
13-Feb | Presenting our Cross-lingual-NER paper on AAAI-2020. |
4-Jan | Got AAAI-2020 travel scholarship. See you in New York. |
9-Nov | I will be a Teaching Assistant, for Graduate Deep Learning Course of NLP , NTU |
---|---|
11-Nov | A paper accepted on AAAI 2020. |
15-Jun | A paper accepted on ACL 2019. |
25-Jan | Shows MT system to industry and government stakeholders. |
15-Jan | Joins NTU-NLP lab as a PhD student. |