architecture

Multitask Prompted Training Enables Zero-Shot Task Generalization

A talk on T0++ paper.

Transforme-XL

This talk summarizes the paper [`Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context`](https://arxiv.org/abs/1901.02860). It assumes that audience are already familier with [`Attention Is All You Need`](https://arxiv.org/abs/1706.03762) paper and also discuss some high level concepts of it.

Structure of RNN Cells

A talk on the structures of RNN cells.