view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! 3 days ago β’ 24
view article Article How to Build an MCP Server with Gradio By abidlabs and 1 other β’ Apr 30 β’ 166
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text β’ 4 items β’ Updated 2 days ago β’ 18
Reward Bench 2 Collection Datasets, spaces, and models for Reward Bench 2 benchmark and paper! β’ 11 items β’ Updated 6 days ago β’ 9
view article Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings By manu and 1 other β’ 6 days ago β’ 23
view article Article AI Policy @π€: Response to the 2025 National AI R&D Strategic Plan By evijit and 2 others β’ 6 days ago β’ 12
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other β’ 12 days ago β’ 46
view article Article Bigger isn't always better: how to choose the most efficient model for context-specific tasks π±π§πΌβπ» By sasha β’ 11 days ago β’ 19
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ 13 days ago β’ 40
view changelog Changelog Xet is now the default storage option for new users and organizations 16 days ago β’ 58
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ 17 days ago β’ 122
Falcon Edge series Collection A series of powerful, universal and fine-tunable small Language Models β’ 7 items β’ Updated 19 days ago β’ 22
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper β’ 2505.09568 β’ Published 25 days ago β’ 92