List: Paper Explanations | Curated by Astarag Mohapatra

Feb 3, 2025
17 stories
Paper ExplanationsPaper summaries 
Astarag Mohapatra
DoRA paper deep diveIn this article, we will be going through the paper DoRA, which came after LoRA and QLoRA. I have discussed AdaLORA and Representation…
Feb 3
Feb 3
Astarag Mohapatra
ReFT: Representation Finetuning Paper deep diveThis article will dive deeper into the paper ReFT (Representation fine-tuning). It is a parameter-efficient finetuning (PEFT) method that…
Jan 29
Jan 29
Astarag Mohapatra
ColBERT: Contextualized Late Interaction BERT explained with a tutorialIn this article, we will go over the Colbert architecture, both v1 and v2. It is a neural Information Retrieval technique that can help us…
Mar 9, 2024
1
Mar 9, 2024
1
Astarag Mohapatra
Adaptive LoRA (AdaLORA) paper explanationIn this article, we will dive deeper into the paper AdaLORA, which is based on Singular value decomposition (SVD) to dynamically choose low…
May 6, 2024
1
May 6, 2024
1
Astarag Mohapatra
MAMBA and State Space Models ExplainedThis article will go through a new class of deep learning models called Structured State Spaces and Mamba.
Feb 1, 2024
1
Feb 1, 2024
1
Astarag Mohapatra
RoFormer paper explained and implemented in JAXIn this article, we will go through the RoFormer paper, which introduced rotary positional embedding for transformer architecture and…
Nov 13, 2023
Nov 13, 2023
Astarag Mohapatra
vLLM: A faster inference pipeline for LLMs paper explainedIn this article, we will be going over the paper vLLM titled Efficient Memory Management for Large Language Model Serving with…
Oct 26, 2023
Oct 26, 2023
Astarag Mohapatra
QLoRA: Quantized Low-Rank Adaptation paper explainedContinuing my fine-tuning journey, you can find the first article on LoRA, let’s get into the QLoRA paper which was released on May 2023.
Aug 7, 2023
Aug 7, 2023
Astarag Mohapatra
LoRA (Low-Rank Adaptation) paper in-depth explanationThis article series will explain the two papers on finetuning neural networks and large language models. First, we will start with LoRA and…
Aug 2, 2023
Aug 2, 2023
Astarag Mohapatra
THE REAL THREATS OF AI (PART — II)Glad you are with me and I hope that you went through the first part. In the first part, we introduced AI naively and went through some…
Jan 30, 2021
Jan 30, 2021
In
Analytics Vidhya
by
Astarag Mohapatra
DEEP REINFORCEMENT LEARNING IN ALGORITHMIC TRADING (Part- II)So now we will discuss the paper, An Application of Deep Reinforcement Learning to Algorithmic Trading.
Dec 20, 2020
Dec 20, 2020
In
Analytics Vidhya
by
Astarag Mohapatra
POLICY GRADIENTS IN DEEP REINFORCEMENT LEARNINGIn 2016, a deep learning Reinforcement agent AlphaGobeat Lee Sedol, who is a professional Go player of 9 dan rank (the highest honor in the…
Jun 13, 2021
Jun 13, 2021
In
Analytics Vidhya
by
Astarag Mohapatra
DEEP DETERMINISTIC POLICY GRADIENT FOR CONTINUOUS ACTION SPACEIn the previous article about Policy gradient methods, we discussed the shortcomings of PG-based methods. They are not sample-efficient as…
Jun 23, 2021
Jun 23, 2021
In
Analytics Vidhya
by
Astarag Mohapatra
TRUST REGION METHODS FOR DEEP REINFORCEMENT LEARNINGTRUST REGION METHODS
Jul 4, 2021
1
Jul 4, 2021
1
In
Analytics Vidhya
by
Astarag Mohapatra
SOFT ACTOR-CRITIC ALGORITHMS IN DEEP REINFORCEMENT LEARNINGIn the previous series of articles, we talked about Policy Gradient methods, DDPG, and Trust region methods. Here we also discussed the…
Jul 19, 2021
Jul 19, 2021
Astarag Mohapatra
Population-Based Algorithms for Hyperparameter Optimization in Reinforcement learningDeep learning promises to provide accurate and robust automated systems to perform human-level intelligent tasks. But in this race…
Apr 7, 2022
1
Apr 7, 2022
1
Astarag Mohapatra
Black Litterman Model: Investment Management with Python and Machine learning Specialization…This week we will cover week 3 of course 2 in the Specialization. For earlier parts, please refer here. This week, we will be covering the…
Jan 29, 2023
Jan 29, 2023