Astarag MohapatraDoRA paper deep diveIn this article, we will be going through the paper DoRA, which came after LoRA and QLoRA. I have discussed AdaLORA and Representation…Feb 3Feb 3
Astarag MohapatraReFT: Representation Finetuning Paper deep diveThis article will dive deeper into the paper ReFT (Representation fine-tuning). It is a parameter-efficient finetuning (PEFT) method that…Jan 29Jan 29
Astarag MohapatraColBERT: Contextualized Late Interaction BERT explained with a tutorialIn this article, we will go over the Colbert architecture, both v1 and v2. It is a neural Information Retrieval technique that can help us…Mar 9, 20241Mar 9, 20241
Astarag MohapatraAdaptive LoRA (AdaLORA) paper explanationIn this article, we will dive deeper into the paper AdaLORA, which is based on Singular value decomposition (SVD) to dynamically choose low…May 6, 20241May 6, 20241
Astarag MohapatraMAMBA and State Space Models ExplainedThis article will go through a new class of deep learning models called Structured State Spaces and Mamba.Feb 1, 20241Feb 1, 20241
Astarag MohapatraRoFormer paper explained and implemented in JAXIn this article, we will go through the RoFormer paper, which introduced rotary positional embedding for transformer architecture and…Nov 13, 2023Nov 13, 2023
Astarag MohapatravLLM: A faster inference pipeline for LLMs paper explainedIn this article, we will be going over the paper vLLM titled Efficient Memory Management for Large Language Model Serving with…Oct 26, 2023Oct 26, 2023
Astarag MohapatraQLoRA: Quantized Low-Rank Adaptation paper explainedContinuing my fine-tuning journey, you can find the first article on LoRA, let’s get into the QLoRA paper which was released on May 2023.Aug 7, 2023Aug 7, 2023
Astarag MohapatraLoRA (Low-Rank Adaptation) paper in-depth explanationThis article series will explain the two papers on finetuning neural networks and large language models. First, we will start with LoRA and…Aug 2, 2023Aug 2, 2023
Astarag MohapatraTHE REAL THREATS OF AI (PART — II)Glad you are with me and I hope that you went through the first part. In the first part, we introduced AI naively and went through some…Jan 30, 2021Jan 30, 2021
InAnalytics VidhyabyAstarag MohapatraDEEP REINFORCEMENT LEARNING IN ALGORITHMIC TRADING (Part- II)So now we will discuss the paper, An Application of Deep Reinforcement Learning to Algorithmic Trading.Dec 20, 2020Dec 20, 2020
InAnalytics VidhyabyAstarag MohapatraPOLICY GRADIENTS IN DEEP REINFORCEMENT LEARNINGIn 2016, a deep learning Reinforcement agent AlphaGobeat Lee Sedol, who is a professional Go player of 9 dan rank (the highest honor in the…Jun 13, 2021Jun 13, 2021
InAnalytics VidhyabyAstarag MohapatraDEEP DETERMINISTIC POLICY GRADIENT FOR CONTINUOUS ACTION SPACEIn the previous article about Policy gradient methods, we discussed the shortcomings of PG-based methods. They are not sample-efficient as…Jun 23, 2021Jun 23, 2021
InAnalytics VidhyabyAstarag MohapatraTRUST REGION METHODS FOR DEEP REINFORCEMENT LEARNINGTRUST REGION METHODSJul 4, 20211Jul 4, 20211
InAnalytics VidhyabyAstarag MohapatraSOFT ACTOR-CRITIC ALGORITHMS IN DEEP REINFORCEMENT LEARNINGIn the previous series of articles, we talked about Policy Gradient methods, DDPG, and Trust region methods. Here we also discussed the…Jul 19, 2021Jul 19, 2021
Astarag MohapatraPopulation-Based Algorithms for Hyperparameter Optimization in Reinforcement learningDeep learning promises to provide accurate and robust automated systems to perform human-level intelligent tasks. But in this race…Apr 7, 20221Apr 7, 20221
Astarag MohapatraBlack Litterman Model: Investment Management with Python and Machine learning Specialization…This week we will cover week 3 of course 2 in the Specialization. For earlier parts, please refer here. This week, we will be covering the…Jan 29, 2023Jan 29, 2023