Astarag Mohapatra – Medium

Astarag Mohapatra

Direct Preference Optimization (DPO) and SimPO paper explanation

In this article, we will discuss the Direct Preference Optimization paper and the Simple Preference Optimization paper. Both of these are…

Mar 9

Direct Preference Optimization (DPO) and SimPO paper explanation

Mar 9

PPO to GRPO in Large Language Models Alignment

In this article, I will discuss the theory behind Proximal Policy Optimization and Group Relative Policy Optimization (which was used in…

Mar 5

PPO to GRPO in Large Language Models Alignment

Mar 5

DoRA paper deep dive

In this article, we will be going through the paper DoRA, which came after LoRA and QLoRA. I have discussed AdaLORA and Representation…

Feb 3

DoRA paper deep dive

Feb 3

ReFT: Representation Finetuning Paper deep dive

This article will dive deeper into the paper ReFT (Representation fine-tuning). It is a parameter-efficient finetuning (PEFT) method that…

Jan 29

ReFT: Representation Finetuning Paper deep dive

Jan 29

JSON vs YAML function calling Finetuning comparison

WANDB TRAINING RUNS AND CHECKPOINTS

Nov 20, 2024

JSON vs YAML function calling Finetuning comparison

Nov 20, 2024

Character.ai optimized inference blog post explained

Recently, character.ai, a role-playing based LLM startup, released a blog post on their inference pipeline. The blog posts mentioned three…

Jun 30, 2024

Character.ai optimized inference blog post explained

Jun 30, 2024

Adaptive LoRA (AdaLORA) paper explanation

In this article, we will dive deeper into the paper AdaLORA, which is based on Singular value decomposition (SVD) to dynamically choose low…

May 6, 2024

Adaptive LoRA (AdaLORA) paper explanation

May 6, 2024

ColBERT: Contextualized Late Interaction BERT explained with a tutorial

In this article, we will go over the Colbert architecture, both v1 and v2. It is a neural Information Retrieval technique that can help us…

Mar 9, 2024

ColBERT: Contextualized Late Interaction BERT explained with a tutorial

Mar 9, 2024

Neo4j: Analyzing the supplier's list of Apple and Samsung

In this article, we will review Neo4j basics by getting data about Apple and Samsung supplier lists. We are analyzing the supplier list of…

Feb 3, 2024

Neo4j: Analyzing the supplier's list of Apple and Samsung

Feb 3, 2024

MAMBA and State Space Models Explained

This article will go through a new class of deep learning models called Structured State Spaces and Mamba.

Feb 1, 2024

MAMBA and State Space Models Explained

Feb 1, 2024

RoFormer paper explained and implemented in JAX

In this article, we will go through the RoFormer paper, which introduced rotary positional embedding for transformer architecture and…

Nov 13, 2023

RoFormer paper explained and implemented in JAX

Nov 13, 2023

vLLM: A faster inference pipeline for LLMs paper explained

In this article, we will be going over the paper vLLM titled Efficient Memory Management for Large Language Model Serving with…

Oct 26, 2023

vLLM: A faster inference pipeline for LLMs paper explained

Oct 26, 2023

SEC FILINGS QUESTION ANSWERING AND SUMMARIZATION: CHROMA DB, LANGCHAIN AND LLAMA INDEX

This article talks about my project on SEC filings question answering and summarization. Every publicly listed company has to file a yearly…

Aug 27, 2023

SEC FILINGS QUESTION ANSWERING AND SUMMARIZATION: CHROMA DB, LANGCHAIN AND LLAMA INDEX

Aug 27, 2023

QLoRA: Quantized Low-Rank Adaptation paper explained

Continuing my fine-tuning journey, you can find the first article on LoRA, let’s get into the QLoRA paper which was released on May 2023.

Aug 7, 2023

QLoRA: Quantized Low-Rank Adaptation paper explained

Aug 7, 2023

LoRA (Low-Rank Adaptation) paper in-depth explanation

This article series will explain the two papers on finetuning neural networks and large language models. First, we will start with LoRA and…

Aug 2, 2023

LoRA (Low-Rank Adaptation) paper in-depth explanation

Aug 2, 2023

SENTIMENT ANALYSIS OF SEC FILINGS USING Loughran-McDonald Master Dictionary

This article covers the sentiment analysis of SEC filings like 10-K (filed annually) and 10-Q (filed quarterly) using the Loughran-McDonald…

May 1, 2023

SENTIMENT ANALYSIS OF SEC FILINGS USING Loughran-McDonald Master Dictionary

May 1, 2023

SVB COLLAPSE AND THE RECENT BANKING TURMOIL EXPLAINED

In the book, Sapiens: A Brief History of Humankind by Yuval Noah Harari, the author discussed how humans, even if they are fragile beings…

Mar 20, 2023

SVB COLLAPSE AND THE RECENT BANKING TURMOIL EXPLAINED

Mar 20, 2023

Black Litterman Model: Investment Management with Python and Machine learning Specialization…

This week we will cover week 3 of course 2 in the Specialization. For earlier parts, please refer here. This week, we will be covering the…

Jan 29, 2023

Black Litterman Model: Investment Management with Python and Machine learning Specialization…

Jan 29, 2023

Investment Management with Python and Machine learning Specialization lecture Notes Part-VII

This week we will cover week 4 of course 2 in the Specialization. For earlier parts, please refer here. Week 3 covers Black Litterman…

Jan 7, 2023

Investment Management with Python and Machine learning Specialization lecture Notes Part-VII

Jan 7, 2023

Investment Management with Python and Machine learning Specialization lecture Notes Part-VI

This week we will cover week 2 of course 2 in the Specialization. For earlier parts, please refer here.

Jan 6, 2023

Investment Management with Python and Machine learning Specialization lecture Notes Part-VI

Jan 6, 2023

Astarag Mohapatra

Astarag Mohapatra

Hi Astarag here, I am interested in topics about Deep learning and other topics. If you have any queries I am one comment away

Following

Lists

Finetuning LLMs

Paper Explanations

Investment Management Specialization

9 stories7 saves

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech