A pipeline gathering real developer MCP examples to generate vast synthetic tool-calling data, judged by an LLM rubric and refined via reinforcement learning to optimize agentic tool use.