| Topic | Replies | Views | Last activity |
|---|---|---|---|
| What happened to DeepSite 2.0 | 1 | 12 | April 8, 2026 |
| Deprecation of assistant_only_loss | 3 | 14 | April 8, 2026 |
| Semantic matching in graph space without matrix computation, hallucinations, or a GPU | 0 | 13 | April 6, 2026 |
| How to decode CSM tokens into audio tensors for streaming | 2 | 78 | April 5, 2026 |
| How to get a list of downloaded model names? | 7 | 5768 | April 5, 2026 |
| Peft 0.18.1 crashing when fine-tuning | 3 | 55 | April 4, 2026 |
| Webhook use case | 1 | 20 | April 2, 2026 |
| Spaces not working with ZeroGPU on the paid plan | 2 | 13 | April 1, 2026 |
| Pipeline tutorial: summarization doesn't work | 3 | 69 | March 31, 2026 |
| Transformer for asynchronous multi-stream image time-series with online prediction? | 1 | 17 | March 30, 2026 |
| Found the fix for memory not being freed when switching models on Linux (it's not Python or PyTorch) | 2 | 69 | March 29, 2026 |
| Wave Field LLM: O(n log n) attention via wave equation dynamics, within 5% of a standard transformer | 4 | 5768 | March 29, 2026 |
| My Spaces stay stuck on "Starting" despite a Pro subscription and GPU hosting | 5 | 107 | March 28, 2026 |
| How do I get started with Hugging Face Transformers as a beginner? | 0 | 43 | March 27, 2026 |
| Numerical instability when fine-tuning deberta-v3-small | 2 | 40 | March 23, 2026 |
| Could Tagalog's focus system inspire a higher-level attention mechanism in Transformers? | 1 | 13 | March 19, 2026 |
| ImportError for function find_pruneable_heads_and_indices | 2 | 285 | March 16, 2026 |
| Transformers.js: retrieving the size of models in MB/GB before running | 1 | 18 | March 16, 2026 |
| Purpose of commit_hash in PreTrainedModel.from_pretrained | 1 | 28 | March 16, 2026 |
| How DEoT makes LLMs think: a new framework for open-ended reasoning | 2 | 15 | March 15, 2026 |
| AutoModel with ClinicalBERT gives UNEXPECTED warning | 3 | 45 | March 13, 2026 |
| Are biofoundation models actually used in practice, and how helpful are they? | 0 | 8 | March 10, 2026 |
| Overfitting in BERT on IMDB 50k | 2 | 1146 | March 6, 2026 |
| LLM Course code errors | 7 | 149 | March 6, 2026 |
| Different output when running inference with packing and flash attention in bf16 | 1 | 14 | March 6, 2026 |
| Why are gradient_checkpointing and training bound together? | 2 | 29 | March 2, 2026 |
| Attentions not returned from the Transformers ViT model when using output_attentions=True | 5 | 1233 | March 2, 2026 |
| Using hyperparameter search in Trainer | 102 | 38943 | March 2, 2026 |
| Issue with the summarization and translation pipeline | 3 | 76 | March 2, 2026 |
| Is the LLaMA rotary embedding implementation correct? | 8 | 9621 | February 26, 2026 |