New pipeline for zero-shot text classification

joeddav · December 15, 2020, 2:37pm

Hey @charly, here’s a previous thread about that. The main tricks are going to be:

- Use one of these distilled models, which are smaller and faster but give similar results.
- Run with the ONNX Runtime. One way you can do this is with this project created by @valhalla before he joined Hugging Face.
- If you’re classifying long sequences, you can try truncating to just part of each sequence. That’ll give you a speedup, but you’ll have to evaluate how it impacts your accuracy.
- If you have a large number of candidate labels, try to come up with a heuristic or use a super lightweight classifier to identify the most likely candidates, and then feed in only those rather than all of them.
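
To make the truncation and label-shortlisting tricks a bit more concrete, here’s a rough sketch. The helper names (`truncate_words`, `shortlist_labels`, `classify`, `build_classifier`) are made up for illustration, the word-overlap heuristic is only a placeholder for whatever cheap filter suits your data, and `valhalla/distilbart-mnli-12-1` is assumed to be one of the distilled checkpoints you’d pick:

```python
import re


def truncate_words(text, max_words=200):
    """Keep only the first `max_words` whitespace-separated words."""
    return " ".join(text.split()[:max_words])


def shortlist_labels(text, candidate_labels, top_k=5):
    """Cheap pre-filter (placeholder heuristic): rank labels by how many
    of their words also appear in the text, and keep the top `top_k`.
    Ties keep their original order because sorted() is stable."""
    tokens = set(re.findall(r"\w+", text.lower()))

    def overlap(label):
        return len(tokens & set(re.findall(r"\w+", label.lower())))

    return sorted(candidate_labels, key=overlap, reverse=True)[:top_k]


def classify(text, candidate_labels, classifier, max_words=200, top_k=5):
    """Truncate the text, shortlist the labels, then call the classifier
    on the reduced input only."""
    short_text = truncate_words(text, max_words)
    labels = shortlist_labels(short_text, candidate_labels, top_k)
    return classifier(short_text, candidate_labels=labels)


def build_classifier(model_name="valhalla/distilbart-mnli-12-1"):
    """Load a zero-shot pipeline. The default model name is an assumption;
    swap in whichever distilled NLI checkpoint you prefer."""
    from transformers import pipeline  # imported lazily: heavy dependency

    return pipeline("zero-shot-classification", model=model_name)
```

You’d call it as `classify(text, labels, build_classifier())`, building the classifier once and reusing it. Labels often don’t appear verbatim in the text, so a word-overlap filter like this can drop the right answer; compare against the full label set on a sample before relying on it.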

Btw, if it’s public, would you mind linking to your Streamlit app? It’s always fun to see the ways that people are using it.
