OmniGAIA Towards Native Omni-Modal AI Agents Running 4 OmniGAIA Leaderboard 🏆 4 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 11 days ago • 360 • 2.29k • 6 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated 11 days ago • 2.16k • 5.16k • 7 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B Image-Text-to-Text • 32B • Updated 11 days ago • 80 • 3
DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published Feb 11 • 57 RUC-NLPIR/DISBench Updated 15 days ago • 352 • 2 Running 2 DISBench Leaderboard 🏆 2 Submit and compare multimodal agents on DISBench
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published Feb 11 • 57
GISA Running 2 GISA Leaderboard 🏆 2 Submit model predictions and view GISA leaderboard scores GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26 RUC-NLPIR/GISA Preview • Updated Feb 13 • 818 • 3
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Running 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 3.32k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 47 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41
OmniGAIA Towards Native Omni-Modal AI Agents Running 4 OmniGAIA Leaderboard 🏆 4 Benchmarking Native Omni-Modal AI Agents RUC-NLPIR/OmniGAIA Viewer • Updated 11 days ago • 360 • 2.29k • 6 RUC-NLPIR/Omnimodal-Agent-SFT-2K Viewer • Updated 11 days ago • 2.16k • 5.16k • 7 RUC-NLPIR/OmniAtlas-Qwen3-30B-A3B Image-Text-to-Text • 32B • Updated 11 days ago • 80 • 3
GISA Running 2 GISA Leaderboard 🏆 2 Submit model predictions and view GISA leaderboard scores GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26 RUC-NLPIR/GISA Preview • Updated Feb 13 • 818 • 3
GISA: A Benchmark for General Information-Seeking Assistant Paper • 2602.08543 • Published Feb 9 • 26
DeepImageSearch Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published Feb 11 • 57 RUC-NLPIR/DISBench Updated 15 days ago • 352 • 2 Running 2 DISBench Leaderboard 🏆 2 Submit and compare multimodal agents on DISBench
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper • 2602.10809 • Published Feb 11 • 57
OmniEval An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Running 7 OmniEval 🥇 7 Official Leaderboard for OmniEval OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41 RUC-NLPIR/OmniEval-KnowledgeCorpus Updated Dec 19, 2024 • 3.32k • 5 RUC-NLPIR/OmniEval-AutoGen-Dataset Updated Dec 19, 2024 • 47 • 6
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper • 2412.13018 • Published Dec 17, 2024 • 41