NickyNicky (Nicky)

upvoted 4 articles 10 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 292

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

+7

wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk

•

Apr 29, 2025

• 44

Article

Gemma 3n fully available in the open-source ecosystem!

+6

ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif

•

Jun 26, 2025

• 121

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers

tomaarsen, arthurbresnu

•

Jul 1, 2025

• 138

upvoted an article about 1 year ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

EuroBERT

•

Mar 10, 2025

• 147

upvoted a paper about 1 year ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18, 2025 • 58

upvoted 2 articles over 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 218

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

catherinearnett

•

Sep 27, 2024

• 54

upvoted a paper over 1 year ago

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Paper • 2305.17679 • Published May 28, 2023 • 2

upvoted 7 articles over 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

Article

Welcome to Inference Providers on the Hub 🔥

+5

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c

•

Jan 28, 2025

• 495

Article

The AI tools for Art Newsletter - Issue 1

linoyts, multimodalart

•

Jan 31, 2025

• 84

Article

The N Implementation Details of RLHF with PPO

+1

vwxyzjn, tianlinliu0121, lvwerra

•

Oct 24, 2023

• 72

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

samuellimabraz

•

Jan 24, 2025

• 58

Article

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

jlzhou

•

Jan 23, 2025

• 4

Article

We now support VLMs in smolagents!

+1

m-ric, merve, albertvillanova

•

Jan 24, 2025

• 113

upvoted a collection over 1 year ago

ProLIP

Collection

Official ProLIP weights, Probabilistic Language-Image Pre-Training (ICLR 2025) • 7 items • Updated Apr 18, 2025 • 10

upvoted 3 articles over 1 year ago

Article

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

theeseus-ai

•

Dec 13, 2024

• 3

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

ariG23498

•

Jan 19, 2025

• 50

Article

Fine-tune ModernBERT for RAG with Synthetic Data

sdiazlor

•

Jan 20, 2025

• 42

Nicky

AI & ML interests

Organizations

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Gemma 3n fully available in the open-source ecosystem!

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Magma: A Foundation Model for Multimodal AI Agents

Open R1: Update #2

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts

Open-source DeepResearch – Freeing our search agents

Welcome to Inference Providers on the Hub 🔥

The AI tools for Art Newsletter - Issue 1

The N Implementation Details of RLHF with PPO

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

We now support VLMs in smolagents!

ProLIP

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Fine-tune ModernBERT for RAG with Synthetic Data

Nicky

AI & ML interests

Organizations

NickyNicky's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Gemma 3n fully available in the open-source ecosystem!

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Open R1: Update #2

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Open-source DeepResearch – Freeing our search agents

Welcome to Inference Providers on the Hub 🔥

The AI tools for Art Newsletter - Issue 1

The N Implementation Details of RLHF with PPO

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

Distributed SFT with trl and DeepSpeed Part 1: Starting Locally

We now support VLMs in smolagents!

How to Expand Your AI Music Generations of 30 Seconds to Several Minutes

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

Fine-tune ModernBERT for RAG with Synthetic Data