Evaluate and compare AI model performance on ChinaTravel benchmark tasks
Engage in multimedia chat with LLMs and ML models