UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics β’ 448 items β’ Updated 3 days ago β’ 66
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper β’ 2512.16093 β’ Published Dec 18, 2025 β’ 93
view post Post 6144 Introducing Anim Lab AIβ‘ My submission for the MCP 1st Birthday HackathonTurn any math concept or logic into a clear video explanation instantly using AI.π Try it now: MCP-1st-Birthday/anim-lab-aiDemo outputs are attached π See translation π₯ 10 10 β€οΈ 2 2 π 2 2 π 1 1 π 1 1 + Reply
Running 311 Robot Learning: A Tutorial π 311 Learn about modern robot learning techniques and applications
view article Article LeRobot v0.4.0οΌε ¨ι’ζεεΌζΊζΊε¨δΊΊηε¦δΉ θ½ε +7 Oct 24, 2025 β’ 13
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper β’ 2506.03143 β’ Published Jun 3, 2025 β’ 53
GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding Paper β’ 2511.00810 β’ Published Nov 2, 2025 β’ 3
Running 215 FineVision: Open Data is All You Need π 215 A new open-source dataset for training VLMs
view article Article A failed experiment: Infini-Attention, and why we should keep trying? +1 Aug 14, 2024 β’ 74
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper β’ 2408.16725 β’ Published Aug 29, 2024 β’ 53