Text-Only Transformers (NLP)

Image-Only Transformers (Computer Vision)

Multimodal Transformers (Text + Image + Other Modalities)

Transformer Architecture Variants

LLaMA Papers

Sparse and Efficient Models

Diffusion Models

Fine-Tuning & Adaptation Techniques

Memory/Compute Optimization