Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Patronus AI

Team
company
Verified
https://patronus.ai
patronusai
Activity Feed Request to join this org

AI & ML interests

LLM Evaluation

Papers

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments

View all Papers

Darshan Deshpande's profile picture Varun Gangal's profile picture Anand Kannappan's profile picture Rebecca Qian's profile picture Bartosz Mielczarek's profile picture Bartosz Mielczarek's profile picture Varun Joshi's profile picture Arek's profile picture Sky Wang's profile picture Maciej Gełdon's profile picture Shivani Jain's profile picture Hersh Mehta's profile picture Edgar Colque's profile picture Jedrzej's profile picture Chinmayee Kulkarni's profile picture Devanshu Bansal's profile picture Bartlomiej Olechno's profile picture

PatronusAI 's datasets 37

PatronusAI/lynx-70b-instruct-pubmedqa-generations

Viewer • Updated Jul 8, 2024 • 1k • 14

PatronusAI/lynx-70b-instruct-halueval-generations

Viewer • Updated Jul 8, 2024 • 10k • 15

PatronusAI/lynx-70b-instruct-financebench-generations

Viewer • Updated Jul 8, 2024 • 1k • 12

PatronusAI/lynx-70b-instruct-drop-generations

Viewer • Updated Jul 8, 2024 • 1k • 18

PatronusAI/lynx-70b-instruct-covidqa-generations

Viewer • Updated Jul 8, 2024 • 1k • 13

PatronusAI/drop-test

Viewer • Updated Jun 17, 2024 • 1k • 16

PatronusAI/financebench-test

Viewer • Updated Jun 17, 2024 • 1k • 22
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs