Emmanuel Sugutt

Sugutt

AI & ML interests

Reinforcement learning, Transformer models

Recent Activity

updated a model about 1 hour ago

Sugutt/whisper-kalenjin-small-revised

published a model 1 day ago

Sugutt/whisper-kalenjin-small-revised

updated a model 4 months ago

Sugutt/whisper-kalenjin-large

Organizations

Collections 3

spaces 1

Mistralai Mixtral 8x7B Instruct V0.1

models 9

datasets 0

None public yet

Collections 3

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement

URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

MiniCPM4: Ultra-Efficient LLMs on End Devices

models 9

Sugutt/whisper-kalenjin-small-revised

Sugutt/whisper-kalenjin-large

Sugutt/whisper-small-hi

Sugutt/finmap-expense-cat-model

Sugutt/finbert-expense-categorization

Sugutt/Taxi-V3

Sugutt/q-FrozenLake-v1-4x4-noSlippery

Sugutt/ppo-Huggy

Sugutt/lunarlander
