ChartMuseum
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models Paper • 2505.13444 • Published May 19 • 17
lytang/ChartMuseum Viewer • Updated Sep 19 • 1.16k • 1.65k • 5
MiniCheck & LLM-AggreFact
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Paper • 2404.10774 • Published Apr 16, 2024 • 6
lytang/LLM-AggreFact Viewer • Updated Dec 20, 2024 • 59.7k • 526 • 29
bespokelabs/Bespoke-MiniCheck-7B Text Classification • 8B • Updated Dec 20, 2024 • 9.62k • 77
lytang/MiniCheck-Flan-T5-Large Text Classification • Updated Dec 20, 2024 • 525 • 13