Microsoft proposes a "generalist agentic system designed to solve such tasks. Magentic-One employs a multi-agent architecture where a lead agent, the Orchestrator, directs four other agents to solve.... complete complex, multi-step tasks across a wide range of scenarios people encounter in their daily lives." Examples of such tasks: find and edit missing citations in a paper; oprder a shawarma sandwich; describe trends in the S&P 500; and count the number of members of MSR-HAX. What's interesting is the use of WebArena to test the tool; this is a service that emulates interactive websites like GitHub, Reddit, and others. And here you see the current real implementations of such systems: automating reviews and comparing prices on vendor websites and autoposting to Reddit forums. Or, you know, maximizing prices.
Today: 5 Total: 196 [Share]
] [