News

2024

17 October One paper has been accepted at EMNLP 2024 on evaluation. See you in Miami.
13 October Our paper on scaling evaluation (`scalable oversight`) for frontier-class models has just dropped. Feel free to reach out to me for details.
10 October We announced ALLaM during the keynote of the Global AI Summit as one of the main priorities for sovereign AI in the Kingdom of Saudi Arabia. We also released the model and evaluation details for the 34B pretraining from scratch model. Keynote [YouTube Link].
22 July We released our technical paper on ALLaM. Feel free to reach out to me if you have any queries.
22 May My lab's alignment work was recently released. ALLaM was revealed at IBM Think Keynote [Youtube Link]. ALLaM is a nationwide LLM effort of Saudi Arabia. Paper coming soon ...
15 May Our paper on xCodeEval has been accepted at ACL 2024. Unfortunately, I won't be traveling to Thailand :(.
16 May Two paper accepted at ACL'24. See you at Bangkok.

2023

6 Aug Joined SDAIA as a Senior Resarch Scientist
7 May Three paper accepted at ACL'23. See you at Toronto.
15 March Check out our recent pre-print xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval paper.

2022

20 Dec Check out our recent pre-print SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning paper.
18 Dec Check out our recent pre-print BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting paper.
26 Nov Returned from Amazon d2l summer internship.
3 Nov Check out our recent pre-print Crosslingual Generalization through Multitask Finetuning paper.
1-Oct Our paper (What Language Model to Train if You Have One Million GPU Hours?) got accepted at EMNLP'22 findings.
05-Jul Joined Amazon d2l team as a summer intern.
11-Apr Check out our new ACL'22 workshop paper What Language Model to Train if You Have One Million GPU Hours?.
7-Feb Check out our pre-print PromptSource paper.
2-Feb Check out my recent talk on T0++ paper.

2021

19-Nov Our paper T0++ got accepted as a spotlight paper at ICLR'22
15-Oct Our paper Multitask Prompted Training Enables Zero-Shot Task Generalization is online. Model (T0++) , Dataset (P3)
22-Sept Check out my Recent talk on Finetuned Language Models Are Zero-Shot Learners at NTU-NLP lab.
25-Aug My internship work from Amazon got accepted as a short paper in EMNLP.
3-May Two paper UXLA (in main conference) and AugVic (in findings) got accepted in ACL-IJCNLP-2021 .

2020

6-Nov Get back to the PhD study after completing my internship with Amazon Lex Team.
15-Oct Our paper LNMap is accepted in EMNLP-2020.
03-Aug I will be starting my internship at Amazon, with Lex Team.
28-Aug Passed my PhD Qualifying Examination (QE).
28-Apr Preprint of our new paper LNMap released in Arxiv.
24-Apr Preprint of our new paper MultiMix released in Arxiv.
25-Mar Gave a talk on mBART (paper ) in NTU-NLP.
13-Feb Presenting our Cross-lingual-NER paper on AAAI-2020.
4-Jan Got AAAI-2020 travel scholarship. See you in New York.

2019

9-Nov I will be a Teaching Assistant, for Graduate Deep Learning Course of NLP , NTU
11-Nov A paper accepted on AAAI 2020.
15-Jun A paper accepted on ACL 2019.
25-Jan Shows MT system to industry and government stakeholders.
15-Jan Joins NTU-NLP lab as a PhD student.

Experience

 
 
 
 
 
August 6, 2023 – Present
Saudi Arabia

Senior Research Scientist

National Center for Artificial Intelligence (NCAI), SDAIA

Currently:
- I’m the Core maintainer of ALLaM
- I lead the Training Team of ALLaM

My current reasearch interest is:
- Anatomy of Pretraining
- Alignment of LLMs. (see: T0, BLOOMZ, SPT)
- Robust evaluation of frontier models. (see: xCodeEval , ChatGPTEval)
 
 
 
 
 
July 5, 2022 – October 10, 2022
Santa Clara, CA

Applied Scientist Intern

Amazon Development Center U.S., Inc.

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning.
 
 
 
 
 
July 1, 2021 – January 28, 2022
East Palo Alto, CA (WFH)

Applied Scientist Intern (Part Time)

Lex Team, Amazon Web Services, Inc.

Cross-lingual Transfer learning under low resource and low parameter scenario.
 
 
 
 
 
August 3, 2020 – November 6, 2020
East Palo Alto, CA (WFH)

Applied Scientist Intern

Lex Team, Amazon Web Services, Inc.

Doing research on Cross-lingual Lex Bot.
 
 
 
 
 
September 15, 2018 – January 6, 2019
Bangladesh

Software Engineering Intern

Aubichol IT Limited

Worked on an early startup where the key responsibility was to design the architecture of sports analytic and Translation System.
 
 
 
 
 
September 15, 2017 – August 27, 2018
Singapore

Research Assistant

Nanyang Technological University

Doing Research on Natural Language Processing.
Responsibilities include:
* Research on MT, NER and Adversarial Training.
* Maintain internal GPU servers
* Maintain group web-site

Recent & Upcoming Talks

The talk summarizes how ChatGPT’s release marked a turning point in Generative AI, driven by refined integration of traditional …

This talk discusses the evolving field of transfer learning, from LSTMs to large language models, and shows new direction on the …

A talk on T0++ paper.

Publications

Quickly discover relevant content by filtering publications

The paper comprehensively evaluates ChatGPT’s performance on various academic tasks, covering 140 tasks across diverse fields, …