Skip to the content.

Tool use in Google Search AI Mode

AI Mode project

Overview

My current work at Google Search focuses on direct tool use of Gemini. In Google Search AI Mode, Gemini calls various search tools to enhance their capabilities.

My recent work is building infrastructure to stress test tools for their reliability, such that Gemini can be tuned to perform best on tool use.

→ Learn more about tool use in Gemini

Challenges

Language models use tools to find data, execute code, and interact with other systems. Gemini, Google’s LLM, can call functions of Google-internal tools (Search, Maps, Flights, …) or third-party tools.

In post-training, LLMs can be tuned to call function and follow function interfaces. This helps the LLM learn to decide when to call functions and which function to pick.

At inference time, LLMs can be given function declarations as context. This allows to expose new functions to LLMs without having to do any additional training.

Misdeclerations and ambiguities in the function I/O give LLMs a hard time.

Thus, defining clear interfaces and reliable responses are key for building strong tools for function calling.

Technical Details