Sakana AI's new algorithm lets large language models work together to solve complex problems

7 hours ago 1
ARTICLE AD BOX

The Japanese AI startup Sakana AI has developed a new method that lets multiple large language models, such as ChatGPT and Gemini, work together on the same problem. Early tests suggest this collaborative approach outperforms individual models working alone.

The technique, called AB-MCTS (Adaptive Branching Monte Carlo Tree Search), is an algorithm that enables several AI models to tackle a problem at the same time. The models exchange and refine their suggestions, working together much like a human team.

AB-MCTS combines two different search strategies: it can either refine an existing solution (depth search) or try out entirely new approaches (breadth search). A probability model continuously decides which direction to pursue next. In the multi-LLM version (Multi LLM AB-MCTS), the system dynamically picks which model - such as ChatGPT, Gemini, or DeepSeek - is best suited for the current task. This selection adapts on the fly depending on which model delivers the strongest results for a particular problem.

AB-MCTS gets results in ARC-AGI-2

During tests on the challenging ARC-AGI-2 benchmark, Multi-LLM AB-MCTS solved more problems than any single model working alone (Single-LLM AB-MCTS). In several cases, only the combination of different models led to the right answer.

Ad

THE DECODER Newsletter

The most important AI news straight to your inbox.

✓ Weekly

✓ Free

✓ Cancel at any time

There are still some limitations. When allowed unlimited guesses, the system finds a correct answer about 30 percent of the time. But in the official ARC-AGI-2 benchmark, where submissions are usually limited to one or two answers, the success rate drops significantly. To address this, Sakana AI plans to develop new methods for automatically identifying and selecting the best suggestions. One idea is to use an additional AI model to evaluate the options. The approach could also be combined with systems where AI models discuss solutions with each other.

Sakana AI has released the algorithm as open source software under the name TreeQuest, so other developers can apply the method to their own problems.

Read Entire Article
LEFT SIDEBAR AD

Hidden in mobile, Best for skyscrapers.