Tech »  Topic »  Google finds that AI agents learn to cooperate when trained against unpredictable opponents

Google finds that AI agents learn to cooperate when trained against unpredictable opponents


Training standard AI models against a diverse pool of opponents — rather than building complex hardcoded coordination rules — is enough to produce cooperative multi-agent systems that adapt to each other on the fly. That's the finding from Google's Paradigms of Intelligence team, which argues the approach offers a scalable and computationally efficient blueprint for enterprise multi-agent deployments without requiring specialized scaffolding.

The technique works by training an LLM agent via decentralized reinforcement learning against a mixed pool of opponents — some actively learning, some static and rule-based. Instead of hardcoded rules, the agent uses in-context learning to read each interaction and adapt its behavior in real time.

Why multi-agent systems keep fighting each other

The AI landscape is rapidly shifting away from isolated systems toward a fleet of agents that must negotiate, collaborate, and operate in shared spaces simultaneously. In multi-agent systems, the success of a task depends on the ...


Copyright of this story solely belongs to venturebeat . To see the full text click HERE