Direct Preference Optimization (DPO) and SimPO paper explanation. In this article, we will discuss the Direct Preference Optimization paper and the Simple Preference Optimization paper. Both of these are… (4d ago)
PPO to GRPO in Large Language Models Alignment. In this article, I will discuss the theory behind Proximal Policy Optimization and Group Relative Policy Optimization (which was used in… (Mar 5)
DoRA paper deep dive. In this article, we will be going through the paper DoRA, which came after LoRA and QLoRA. I have discussed AdaLORA and Representation… (Feb 3)
ReFT: Representation Finetuning Paper deep dive. This article will dive deeper into the paper ReFT (Representation fine-tuning). It is a parameter-efficient finetuning (PEFT) method that… (Jan 29)
JSON vs YAML function calling Finetuning comparison. WANDB TRAINING RUNS AND CHECKPOINTS (Nov 20, 2024)
Character.ai optimized inference blog post explained. Recently, character.ai, a role-playing based LLM startup, released a blog post on their inference pipeline. The blog post mentioned three… (Jun 30, 2024)
Adaptive LoRA (AdaLORA) paper explanation. In this article, we will dive deeper into the paper AdaLORA, which is based on singular value decomposition (SVD) to dynamically choose low… (May 6, 2024)
ColBERT: Contextualized Late Interaction BERT explained with a tutorial. In this article, we will go over the ColBERT architecture, both v1 and v2. It is a neural information retrieval technique that can help us… (Mar 9, 2024)
Neo4j: Analyzing the supplier's list of Apple and Samsung. In this article, we will review Neo4j basics by getting data about Apple and Samsung supplier lists. We are analyzing the supplier list of… (Feb 3, 2024)
MAMBA and State Space Models Explained. This article will go through a new class of deep learning models called Structured State Spaces and Mamba. (Feb 1, 2024)
RoFormer paper explained and implemented in JAX. In this article, we will go through the RoFormer paper, which introduced rotary positional embedding for the transformer architecture and… (Nov 13, 2023)
vLLM: A faster inference pipeline for LLMs paper explained. In this article, we will be going over the vLLM paper, titled Efficient Memory Management for Large Language Model Serving with… (Oct 26, 2023)
SEC FILINGS QUESTION ANSWERING AND SUMMARIZATION: CHROMA DB, LANGCHAIN AND LLAMA INDEX. This article talks about my project on SEC filings question answering and summarization. Every publicly listed company has to file a yearly… (Aug 27, 2023)
QLoRA: Quantized Low-Rank Adaptation paper explained. Continuing my fine-tuning journey (you can find the first article on LoRA), let’s get into the QLoRA paper, which was released in May 2023. (Aug 7, 2023)
LoRA (Low-Rank Adaptation) paper in-depth explanation. This article series will explain the two papers on finetuning neural networks and large language models. First, we will start with LoRA and… (Aug 2, 2023)
SENTIMENT ANALYSIS OF SEC FILINGS USING Loughran-McDonald Master Dictionary. This article covers the sentiment analysis of SEC filings like 10-K (filed annually) and 10-Q (filed quarterly) using the Loughran-McDonald… (May 1, 2023)
SVB COLLAPSE AND THE RECENT BANKING TURMOIL EXPLAINED. In the book Sapiens: A Brief History of Humankind by Yuval Noah Harari, the author discussed how humans, even if they are fragile beings… (Mar 20, 2023)
Black Litterman Model: Investment Management with Python and Machine learning Specialization… This week we will cover week 3 of course 2 in the Specialization. For earlier parts, please refer here. This week, we will be covering the… (Jan 29, 2023)
Investment Management with Python and Machine learning Specialization lecture Notes Part-VII. This week we will cover week 4 of course 2 in the Specialization. For earlier parts, please refer here. Week 3 covers Black Litterman… (Jan 7, 2023)
Investment Management with Python and Machine learning Specialization lecture Notes Part-VI. This week we will cover week 2 of course 2 in the Specialization. For earlier parts, please refer here. (Jan 6, 2023)