Xu Zhihao
naiweizi
AI & ML interests
Trustworthy AI
Recent Activity
submitted
a paper
2 days ago
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text
authored
a paper
4 days ago
Uncovering Safety Risks of Large Language Models through Concept
Activation Vector
authored
a paper
4 days ago
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training
and Deployment
Organizations
None yet