This talk goes through the pretraining objectives of seq2seq architectures. It also discusses how mBART's pretraining differs from that of XLM and its derivatives.