Sharding Large Language Models: Achieving Efficient Distributed Inference

Techniques to load LLMs on smaller GPUs and enable parallel inference using Hugging Face Accelerate

Large Language Models and their Applications

Understanding the Large Language Models and their applications

2023-09-15 12090 words 57 min

Text Sampling Techniques in Natural Language Processing

Understanding various text sampling techniques in NLP, their applications, pros-cons and how they can help control various aspects in LLMs

2023-09-01 1223 words 6 min

Foundation Models from Ground Up

Understanding the core of Large Language Models, a journey through evolution of various transformer models and their applications.

2023-08-01 9100 words 43 min