{"id":195831,"date":"2025-03-16T04:00:27","date_gmt":"2025-03-16T09:00:27","guid":{"rendered":"https:\/\/narcolepticnerd.com\/2025\/03\/16\/beyond-the-chatbot-agentic-ai-with-gemma\/"},"modified":"2025-03-16T04:00:27","modified_gmt":"2025-03-16T09:00:27","slug":"beyond-the-chatbot-agentic-ai-with-gemma","status":"publish","type":"post","link":"https:\/\/narcolepticnerd.com\/2025\/03\/16\/beyond-the-chatbot-agentic-ai-with-gemma\/","title":{"rendered":"Beyond the Chatbot: Agentic AI with Gemma"},"content":{"rendered":"
<\/p>\n
<\/p>\n
Gemma is a family of lightweight, generative artificial intelligence (AI) open models, built from the same research and technology used to create the Gemini models. In a blog post last year, we showcased building a text-based adventure game with Gemma. In this post, you will learn how to use Gemma for agentic AI, which offers a different way of working with Large Language Models (LLMs).

Most common AIs today are **reactive**. They respond to specific commands, like a smart speaker playing music when asked. They are useful, but they can only do what they are told.

In contrast, **agentic AI** is proactive and autonomous. It makes its own decisions to reach goals, and a key feature is its use of external tools, such as search engines, specialized software, and other programs, to gather information beyond its built-in knowledge. This lets agentic AI work and solve problems largely on its own.

Here, we provide a practical guide to constructing a Gemma 2 based agentic AI system, covering key technical concepts such as **function calling**, **ReAct**, and **few-shot prompting**. The system will serve as a dynamic lore generator for a fictional game, actively expanding the game's history and giving players a distinct, continually evolving narrative landscape.

## Bridging the Gap

Before we dive into the code, let's look at Gemma's agentic capabilities. You can experiment with them directly in Google AI Studio, which offers several Gemma 2 models. The 27B model is recommended for the best results, but smaller models such as the 2B can also be used, as shown below. In this example, we tell Gemma that a `get_current_time()` function exists and ask for the current time in Tokyo and Paris.
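The exact prompt from the original screenshot isn't reproduced here, but it was along these lines (a sketch, not the verbatim text):

```text
You have access to this function:

get_current_time(city) - returns the current time in the given city

What time is it now in Tokyo and Paris?
```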
*(Screenshot: Gemma's response in Google AI Studio.)*
This result shows that Gemma 2 does not suggest calling the `get_current_time()` function. The missing capability is called **function calling**, a key feature for enabling AI to interact with external systems and APIs to retrieve data.

Gemma's built-in function calling capabilities are limited, which restricts its ability to act as an agent. However, its strong instruction-following capabilities can compensate for the missing functionality. Let's see how we can harness them to expand what Gemma can do.

We will implement a prompt based on the ReAct (Reasoning and Acting) prompting style. ReAct defines the available **tools** and a specific **format** for interaction. This structure enables Gemma to engage in cycles of **Thought** (reasoning), **Action** (using a tool), and **Observation** (analyzing the tool's output).
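The full prompt lives in the Cookbook notebook; a minimal ReAct-style prompt in the same spirit looks roughly like this (the wording and tool list are illustrative):

```text
Answer the following question as best you can. You have access to the
following tools:

get_current_time(city): returns the current local time in the given city.

Use this format:

Question: the input question
Thought: reason about what to do next
Action: the tool call to make, e.g. get_current_time(Tokyo)
Observation: the result of the action
... (Thought/Action/Observation can repeat)
Final Answer: the final answer to the question

Question: What time is it in Tokyo and Paris?
```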
*(Screenshot: Gemma's ReAct-formatted response in Google AI Studio.)*

As you can see, Gemma now attempts to use the `get_current_time()` function for both Tokyo and Paris. A Gemma model cannot execute the call by itself: to make this operational, you need to run the generated call yourself, or as part of your system. Even without doing so, you can still proceed and observe Gemma's response, similar to the one below.

*(Screenshot: Gemma's response.)*

Awesome! You have now witnessed Gemma's function calling in action. This ability lets the model carry out operations in the background, completing tasks without direct user interaction.
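Closing the loop is your program's job: detect the `Action:` line, execute the real function, and feed the result back to the model as an `Observation:`. A minimal sketch of that plumbing (the regex and tool set here are illustrative, not from the original notebook):

```python
import re
from datetime import datetime
from zoneinfo import ZoneInfo

# Map city names the model may mention to IANA time zones.
CITY_ZONES = {"Tokyo": "Asia/Tokyo", "Paris": "Europe/Paris"}

def get_current_time(city: str) -> str:
    """Return the current local time in the given city as HH:MM."""
    return datetime.now(ZoneInfo(CITY_ZONES[city])).strftime("%H:%M")

def run_action(model_output: str) -> str | None:
    # Look for a line like: Action: get_current_time(Tokyo)
    match = re.search(r"Action:\s*get_current_time\((\w+)\)", model_output)
    if match:
        return f"Observation: {get_current_time(match.group(1))}"
    return None  # no tool call found; treat the output as a final answer
```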
## Demo Setup

Let's get our hands dirty with the actual demo: building a history AI agent!

All the prompts below are in the "Agentic AI with Gemma 2" notebook in Gemma's Cookbook. One difference between using Gemma in Google AI Studio and calling it directly from Python on Colab is that you must wrap prompts in Gemma's chat template yourself, using markers such as `<start_of_turn>` and `<end_of_turn>`.
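Concretely, a raw prompt in Gemma's standard turn format looks like this:

```python
# Gemma's chat template: each turn is wrapped in start/end markers,
# and the prompt ends by opening the model's turn.
prompt = """<start_of_turn>user
Write a book.<end_of_turn>
<start_of_turn>model
"""
```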
Let's imagine a fictional game world where AI agents craft dynamic content. These agents, designed with specific objectives, can generate in-game content such as books, poems, and songs in response to a player's choices or significant events in the game's narrative.

A key feature of these AI agents is their ability to break complex goals down into smaller, actionable steps. They can analyze different approaches, evaluate potential outcomes, and adapt their plans based on new information.

Where agentic AI truly shines is that these agents are not just passively producing text. They can interact with digital (and potentially physical) environments, execute tasks, and make decisions autonomously to achieve their programmed objectives.

## So, how does it work?

### Zero-shot prompting

Here's an example ReAct-style prompt designed for an AI agent that generates in-game content, with the capability to use function calls to retrieve historical information. A sketch of it follows.
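(The full prompt is in the Cookbook notebook; everything here except `get_person_info` is illustrative.)

```text
You are an AI History Writer in a fantasy game. Your goal is to write
in-game content grounded in the world's lore. You have access to the
following tools:

get_person_info(name): retrieves details about a notable person

Use this format:

Question: the request from the game
Thought: think about what information you need
Action: the tool call to make
Observation: the result of the action
... (Thought/Action/Observation can repeat)
Final Answer: the finished content

Question: Write a book.
```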
Let's try to write a book. See the example output below:

*(Screenshot: the model's zero-shot output.)*

As you can see, Gemma may struggle with function calling due to a lack of training in that area.

### One-shot prompting

To address this limitation, we can employ **one-shot prompting**, a form of in-context learning in which a demonstration is embedded within the prompt. The example serves as a guide for Gemma, letting it see the intended task and improve its performance through contextual learning.

(Note: in the notebook, the green section is the provided example; the actual prompt comes after it.)
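Schematically, the prompt now embeds one worked episode before the real request (the demonstration below is invented for illustration):

```text
[Provided example]
Question: Write a poem.
Thought: I should look up a notable figure to write about.
Action: get_person_info("name: Elara, the Founder")
Observation: Elara founded the city of Eldoria three centuries ago ...
Final Answer: (a poem about Elara)

[Actual request]
Question: Write a book.
```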
Notably, the model now performs better: the `Action` contains the correct input.
### Few-shot prompting

For more complex tasks, use **few-shot prompting**. It works by providing a small set of examples (usually 2 to 5, but sometimes more) that demonstrate the desired input-output relationship, allowing the model to grasp the underlying pattern.

Running the few-shot prompt, we receive a function name, `get_person_info`, and parameter values, `"name: Anya, the Rebel Leader"`. To fulfill the request, the game must connect to an API and call the function. We will use a synthetic response payload for this API interaction.
Note that the agent used the provided information to create a book about Eldoria's Rebel Leader.

## The Future is Agentic

We're still in the early stages of agentic AI development, but progress is rapid. As these systems become more sophisticated, we can expect them to play an increasingly significant role in our lives. The potential applications are broad, focused primarily on gaming but with implications well beyond it.

The era of passive, reactive AI is gradually giving way to a future where AI is proactive, goal-oriented, and capable of independent action. This is the dawn of agentic AI, and it's a future worth getting excited about.
## Next steps