Developers today need more than just basic speech-to-text—they need to unlock the full potential of long-form video. With vlm-1, you can transcribe hours of video and extract deep, structured insights in a single API call: from scene changes and chapter segmentation to temporal grounding and visual context. This empowers you to build smarter search, content discovery, and analytics tools for podcasts, lectures, interviews, and more—at scale, with minimal effort.

Analyzing Video Content

Let’s look at a video analysis example to see how vlm-1 can be used to extract structured insights from video content. In this example, we’ll use vlm-1 to transcribe and analyze the full 1-hour and 40 minute Google Cloud Next 25 Opening Keynote, generating segmented chapters with start and end timestamps, visual scene descriptions, and corresponding full transcript.

Let’s look at the few lines of code to transcribe the video with vlm-1:

from pathlib import Path
from vlmrun.client import VLMRun
from vlmrun.client.types import PredictionResponse, GenerationConfig

# Initialize the client
client = VLMRun(api_key="<VLMRUN_API_KEY>")

# Submit the video file for transcription
prediction: PredictionResponse = client.video.generate(
    file=Path("path/to/video.mp4"),
    domain="video.transcription",
    batch=True,
)

# Wait for the prediction to complete (with a timeout of 600 seconds)
prediction: PredictionResponse = client.predictions.wait(id=prediction.id, timeout=600)
print(prediction.response.model_dump())

Understanding the Output

Here’s an example of the output in JSON format, for the entire 8 minute video:

Example Video Transcription
{
  "metadata": {
    "content": null,
    "topics": null,
    "duration": 6004.006893
  },
  "segments": [
    {
      "start_time": 0,
      "end_time": 22,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "The video begins with a white screen, which then transitions to a vibrant and colorful scene featuring the text \"Vtex.\" The letters are rendered in a playful, 3D style with a gradient effect, transitioning from blue to red. Surrounding the text are various abstract shapes and forms, including spheres, rings, and other geometric elements in bright colors like red, blue, green, and yellow. These shapes are floating and moving around the text, creating a dynamic and lively atmosphere."
      }
    },
    {
      "start_time": 22,
      "end_time": 75.33,
      "audio": {
        "content": " you know and you know and you're I'm I'm The Thank you. Why not?"
      },
      "video": {
        "content": "A person is toasting bread in a yellow toaster. The camera focuses on the bread as it cooks, showing the golden-brown color forming on the surface. The scene then transitions to a close-up of a slot machine, where the number 11 appears on the screen. The next scene shows a person wakeboarding, performing a trick over a wave. The wakeboarder is wearing a black wetsuit and a helmet. The final scene depicts a bright explosion in space, with colorful particles and light beams radiating outward."
      }
    },
    {
      "start_time": 76.33,
      "end_time": 96.33,
      "audio": {
        "content": " Just two words, but those two words challenge everything and can change anything. Why not help find a cure? Bring it here and even there. Wait, really? Yes, really."
      },
      "video": {
        "content": "The video begins with two individuals in a sterile environment, likely a laboratory or medical facility. They are wearing protective gear, including hairnets, masks, and lab coats, indicating a controlled and clean workspace. The setting is well-lit with fluorescent lighting, and the background shows various pieces of equipment and machinery typical of such environments."
      }
    },
    {
      "start_time": 96.33,
      "end_time": 120.33,
      "audio": {
        "content": " We're building the most helpful AI. So you can turn an idea into an enterprise. Get the right crops into this box and breakfast on the table. Inspect 800. and breakfast on the table. Inspect 800,000 packages a day and help protect our power grids. Because once we turn this into that, we ask, what else can we do?"
      },
      "video": {
        "content": "The video begins with a view from space, showing a vast expanse of Earth below. A large, cylindrical object is seen floating near the surface of the planet. The scene then transitions to an indoor setting where two individuals wearing hairnets and gloves are pushing a cart loaded with yellow containers through a sterile environment. The next scene shifts to a highway where a red truck is driving on the road. The truck suddenly collides with a large, colorful object, causing it to burst into flames. The final scene features a close-up of a glowing, spinning object, possibly a particle accelerator or a similar scientific device."
      }
    },
    {
      "start_time": 120.33,
      "end_time": 142,
      "audio": {
        "content": " Find out where the wild things are? Uh, wilder. Spot patterns and crime data? Catch fishing attacks. Take a thousand customer service calls an hour. Help coders, well, code. Let's make it happen. I'm"
      },
      "video": {
        "content": "The video begins with a vibrant and surreal scene featuring a watermelon that appears to be floating in space. The watermelon is depicted with a shiny, reflective surface, giving it an almost otherworldly appearance. A rainbow arcs across the sky, adding to the dreamlike quality of the scene. The watermelon is shown in various stages of being sliced, with pieces falling away, revealing its juicy interior. The background features a gradient of colors, transitioning from blue to pink, enhancing the fantastical atmosphere."
      }
    },
    {
      "start_time": 142,
      "end_time": 171.67,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "A baby is seen smiling and laughing while being held up by an adult. The baby's hands are raised in the air, and the adult's hands are visible holding the baby securely. The baby appears to be enjoying the moment, with a joyful expression on its face."
      }
    },
    {
      "start_time": 142,
      "end_time": 198.33,
      "audio": {
        "content": " Ha ha ha ha ha. Please welcome CEO of Google Cloud, Thomas Currian. Thank you. Wow. Hello, everyone. Welcome to Google Cloud Next. Hello everyone. Welcome to Google Cloud Next. Just one year ago, we stood here and talked about the future of AI for organizations. Today, that future is being built by all of us. In 2024, we shipped more than 3,000 product advances across Google Cloud and Workspace."
      },
      "video": {
        "content": "A baby is seen smiling and laughing while being held up by an adult. The baby's hands are raised in the air, and the adult's hands are visible holding the baby securely. The baby appears to be enjoying the moment, with its mouth open and eyes wide with joy."
      }
    },
    {
      "start_time": 198.33,
      "end_time": 224,
      "audio": {
        "content": " We expanded Google Cloud to 42 regions, including Sweden, Mexico, and South Africa, and are rapidly expanding to countries like Malaysia, Thailand, and Kuwait. We expanded our 2 million mile, terrestrial and subsea fiber network by announcing new subsea cables like Umoja, Bosen, and Proa."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then transitions to a large screen displaying a world map with various locations marked by blue dots. The text on the screen reads \"Expanded infrastructure footprint to 42 regions.\" The map highlights several specific regions such as Mexico, Sweden, South Africa, Kuwait, Thailand, Malaysia, and others. The presentation continues with another slide showing a sunset over the ocean, with the text \"2 million miles of terrestrial and subsea cables.\" The man continues to speak, emphasizing the information displayed on the screen."
      }
    },
    {
      "start_time": 224,
      "end_time": 250.67,
      "audio": {
        "content": " Google's AI momentum is exciting. We're seeing more than 4 million developers using Gemini, a 20 times increase in Vertex AI usage last year, driven by the strong adoption of Gemini Flash, Gemini 2.0, Imagine 3.0, and most recently, VO, our advanced video generation model,"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a gradient of light blue to white, with vertical lines creating a modern and professional atmosphere. The scene transitions to a large screen displaying the text \"4 million+ developers use Gemini.\" The man continues to speak, and the camera shifts to show him from different angles, emphasizing his gestures and expressions. The final shot shows the man standing confidently on the stage, with the screen behind him displaying the words \"Gemini Imagen Veo.\""
      }
    },
    {
      "start_time": 250.67,
      "end_time": 275.33,
      "audio": {
        "content": " and over 2 billion AI assist monthly to business users right within Google Workspace. But even more exciting is the momentum with you are customers. Here next, we'll be sharing over 500 customer stories showcasing real business innovation impact from AI adoption."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage with a blue curtain backdrop. He is speaking to an audience, gesturing with his hands as he presents information. The scene transitions to a large screen displaying text about Google Workspace's AI assists, followed by a slide showing customer stories from various companies. The man continues to speak, emphasizing the benefits of Google Workspace's AI capabilities."
      }
    },
    {
      "start_time": 277.51,
      "end_time": 320.33,
      "audio": {
        "content": " Google is building for a unique moment. We're investing in the technology and the ecosystem to power your growth and transformation. Let's hear more from a special guest, a warm welcome for the CEO of Google and Alphabet, Sundaphi. Thank you. Thank you, Thomas. Good to be with you all here in Vegas. Last year, I joked in my remarks about how I was auditioning for the sphere."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, speaking to an audience. He is wearing a white dress shirt and a pocket square. The background features vertical light panels that change colors from white to blue. The man gestures with his hands as he speaks, occasionally adjusting his posture. The lighting highlights his presence, creating a professional and formal atmosphere."
      }
    },
    {
      "start_time": 320.33,
      "end_time": 342,
      "audio": {
        "content": " Well, it turns out I got the gig. Last night, I was on stage at the sphere to share a new collaboration. We are introducing the visit of us to a new generation using Google AI, transforming one of the greatest films of all time for one of the largest screens in the world."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then transitions to a large screen displaying various images. The first image shows an empty stadium with rows of seats and a stage at the front. The next image features a couple in formal attire, possibly at a wedding, with the woman in a white dress and the man in a suit. The third image depicts a boat on a river surrounded by lush greenery. The final image shows a close-up of a hand holding a small object."
      }
    },
    {
      "start_time": 342,
      "end_time": 365.82,
      "audio": {
        "content": " It's a huge ongoing effort and not something we could have attempted even 18 months ago. Shows how rapidly technology is evolving and how it can enable us to rethink what's possible. I think that's a fitting theme for Cloud Next. The chance to improve lives and reimagine things is why Google has been investing in AI for"
      },
      "video": {
        "content": "A man in a blue blazer stands on a stage, addressing an audience. The stage is well-lit with a modern design featuring vertical light panels. Behind him, a large screen displays various images: first, two men standing in front of a whiteboard filled with diagrams and notes; then, a close-up of a green, furry creature with a surprised expression; followed by a scene from a movie with characters walking through a tunnel; finally, a group of people in futuristic attire walking down a corridor."
      }
    },
    {
      "start_time": 365.82,
      "end_time": 388.04,
      "audio": {
        "content": " more than a decade. We see it as the most important way we can advance our mission, to organize the world's information and make it universally accessible and useful. With Google Cloud, we see AI as the most important way we can help advance your mission. The opportunity with AI is as big as it gets."
      },
      "video": {
        "content": "A speaker stands on a stage at an event, addressing an audience. The stage is modern and sleek, with a large screen displaying various images and videos. The speaker gestures with his hands as he speaks, emphasizing points. The audience is seated in darkness, focused on the speaker. The screen behind the speaker shows different scenes, including close-ups of hands working, a person in a space suit, and two individuals engaged in an activity, possibly related to agriculture."
      }
    },
    {
      "start_time": 388.66,
      "end_time": 408.66,
      "audio": {
        "content": " That's why we are investing in the full stack of AI innovation. Starting with the infrastructure that powers it all. We are making big investments now and for the future. In 2025, and for the future. In 2025, we plan to invest around $75 billion in total CAPEX. This investment will be in total CAPEX."
      },
      "video": {
        "content": "A speaker stands on a stage, presenting to an audience. The stage is illuminated with blue lighting, creating a modern and professional atmosphere. The speaker is dressed in a dark suit and white shirt, gesturing with his hands as he speaks. Behind him, a large screen displays various images and text related to AI products and platforms. The screen transitions through different slides, highlighting different aspects of AI technology, such as products and platforms, models and tooling, world-class research, and AI infrastructure."
      }
    },
    {
      "start_time": 408.66,
      "end_time": 430.66,
      "audio": {
        "content": " This investment will be directed towards our servers and data centers, which includes powering our AI compute and cloud business. So this will greatly benefit our customers like all of you. We need our infrastructure to move at Google speed with near-zero latency, supporting services like search, Gmail and photos"
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing with his hands as he speaks. The background is a gradient of blue shades, and there is a large screen behind him displaying a graph titled \"Alphabet Capital Expenditure\". The graph shows a line indicating an increase in expenditure from 2020 to 2025, with the approximate value of $75 billion marked at the end. The scene then transitions to a large screen showing an aerial view of a city with various buildings and infrastructure."
      }
    },
    {
      "start_time": 430.66,
      "end_time": 452.66,
      "audio": {
        "content": " for billions of users worldwide. And we use it for training our most capable model, Gemina. Google's backbone network is unparalleled, as Thomas just mentioned, spanning more than 200 countries and territories powered by over 2 million miles of fiber. Today, I'm pleased to announce that we are making"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue blazer over a white shirt and black pants, with a belt. He gestures with his hands as he speaks, indicating he is delivering a presentation. The background is a gradient of blue shades, creating a calm and professional atmosphere. The lighting focuses on him, highlighting his presence against the darker backdrop."
      }
    },
    {
      "start_time": 452.66,
      "end_time": 480.05,
      "audio": {
        "content": " Google's global private network available to enterprises around the world. We call it cloud wide area network or van. Cloud van leverages Google's planet scale network. It's optimized for application performance and delivers over 40% faster performance while reducing total"
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing as he speaks. The background features a large screen displaying an image of a network of lights and lines, symbolizing a global network. The text \"New Cloud Wide Area Network\" appears on the screen, indicating the introduction of a new service. The man continues to speak, emphasizing the importance of this new network for businesses."
      }
    },
    {
      "start_time": 480.05,
      "end_time": 507,
      "audio": {
        "content": " cost of ownership by up to 40%. Companies like Citadel Securities and Nestle are already using this network for faster, more reliable solutions, and it will be available to all Google Cloud customers later this month. This builds on our legacy of opening up our technical infrastructure for others to use. We do this with our custom AI chips called tensor processing units or TPUs."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, addressing an audience. He gestures with his hands as he speaks, emphasizing points about the Cloud Wide Area Network. The background is a simple, light-colored curtain, and the lighting focuses on him, highlighting his presence. The scene transitions to a large screen displaying text about the Cloud Wide Area Network's performance compared to the public internet, showing a 40% improvement. The video then cuts back to the speaker, who continues his presentation."
      }
    },
    {
      "start_time": 507,
      "end_time": 533.48,
      "audio": {
        "content": " Since 2013, we've invested heavily in this specialized hardware and we continue to make massive improvements in performance and efficiency at scale. Today I'm proud to announce our seven-generation TPU,"
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, speaking to an audience. He gestures with his hands as he talks. The scene then cuts to a close-up of a TPU v5p chip on a circuit board. The video returns to the man on stage, who continues to speak. The final shot shows a new Ironwood device displayed on a screen, with the text \"New Ironwood\" and \"Coming Soon\" visible."
      }
    },
    {
      "start_time": 534.2,
      "end_time": 558.33,
      "audio": {
        "content": " Ironwood achieves 3,600 times better performance, an incredible increase. It's the most powerful chip we have ever built and will enable the next frontier of AI models. In the same period, we've also become 29x more energy efficient, and Amin will share more later today."
      },
      "video": {
        "content": "A man in a blue blazer stands on a stage, presenting information about Google TPU progress. The background features a large screen displaying a bar graph titled \"Google TPU Progress: Performance by Generation.\" The graph shows performance metrics (in exaFLOPS) for different years: 2018 (0.01), 2020 (0.13), 2022 (1.13), and 2023 (4.11). The presenter gestures towards the screen as he explains the data, emphasizing the significant increase in performance from 2023 to 2025. The final frame highlights the projected performance for 2025, showing a dramatic rise to 42.53 exaFLOPS."
      }
    },
    {
      "start_time": 558.33,
      "end_time": 580.33,
      "audio": {
        "content": " This progress is laying the foundation for breakthroughs across multiple fields. Quantum computing is a great example. Our newest quantum chip willow cracked a key challenge in quantum error correction that has eluded researchers for three decades. It can reduce errors exponentially as we scale up using more cubits."
      },
      "video": {
        "content": "A speaker stands on a stage, presenting information about advanced technology. The background features large screens displaying various images and text related to quantum computing and AI. The first screen shows a chip labeled \"Ironwood\" with a power efficiency rating of 29x. The second screen displays \"Quantum AI\" with an image of a futuristic, interconnected network. The third screen highlights a \"State-of-the-art quantum chip\" with 105 qubits and mentions it as a benchmark for quantum error correction and random CI. The speaker gestures towards the screens, emphasizing the details being presented."
      }
    },
    {
      "start_time": 580.33,
      "end_time": 606,
      "audio": {
        "content": " The willow chip really paves the way for a useful large-scale quantum computer down the row. Our infrastructure enables the next layer of the stack, research and model. Over the last decade, our research teams have pushed the boundaries of AI forward. And today, they are accelerating science and discovery. From our alpha alpha fold breakthrough with protein folding to weather next our"
      },
      "video": {
        "content": "A man in a blue suit stands on a stage, presenting information about a quantum chip called Willow. The screen behind him displays details about the chip, including its state-of-the-art status, 105 qubits, and performance metrics. The presenter gestures with his hands as he speaks, emphasizing the advancements and capabilities of the chip."
      }
    },
    {
      "start_time": 606,
      "end_time": 630.66,
      "audio": {
        "content": " state-of-the-art weather forecasting models. World-class research is what enables us to push the frontier with our Gemini models. In December, we introduced Gemini 2.0 with new advances in multimodality, like native image and audio output as well as native tool use. This new generation has also pushed the frontiers of another capability called thinking."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing as he speaks. The background features a large screen displaying vibrant, colorful visuals, possibly related to technology or innovation. The scene transitions to a close-up of the man, emphasizing his gestures and expressions. The video then shifts to a wide shot of the stage, highlighting the \"Gemini 2.0\" logo on the screen behind him. The audience is visible in the foreground, attentively watching the presentation."
      }
    },
    {
      "start_time": 630.66,
      "end_time": 654.66,
      "audio": {
        "content": " A couple weeks ago we released a new model, Gemini 2.5, a thinking model that can reason through its thoughts before responding. It's our most intelligent AI model ever and it's the best model in the world according to the chatbot arena leadable."
      },
      "video": {
        "content": "A man stands on a stage, delivering a presentation. The background is dark with blue lighting accents, and the word \"Thinking\" is prominently displayed on a large screen behind him. The man is dressed in a blue blazer over a white shirt and black pants, with a belt. He gestures with his hands as he speaks, occasionally clapping them together."
      }
    },
    {
      "start_time": 659.33,
      "end_time": 675.67,
      "audio": {
        "content": " It's state of the art across a range of benchmarks requiring advanced reasoning that included the highest score ever on humanity's last exam, one of the hardest industry benchmarks that's designed to capture the human frontier of knowledge and reasoning. There's a lot of impressive words, but let me show you what it can do."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing as he speaks. The background is a gradient of blue hues, and the stage features a large screen displaying the text \"Gemini 2.5 Pro\" along with various statistics and comparisons. The man appears to be presenting or explaining something related to the content displayed on the screen."
      }
    },
    {
      "start_time": 675.67,
      "end_time": 697.33,
      "audio": {
        "content": " Take a look at this Rubik's Cube, quoted by developer Matt Berman. You might think of it as a toy, but it's actually a really complex reasoning challenge. Adjustable dimensions, scrambling the squares, keyboard controls, and Gemini 2.5 Pro can simulate it all. It's a significant leap and shows the ability to produce robust interactive core."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing with his hands as he speaks. The background is a gradient of blue shades. The scene transitions to a large screen displaying a Rubik's Cube. The cube rotates slowly, showing different colored faces. The man continues to speak, occasionally gesturing towards the screen."
      }
    },
    {
      "start_time": 697.33,
      "end_time": 720.33,
      "audio": {
        "content": " That's a fun one. Let's look at one other example. With a series of prompts, developer John Modern used 2.5 Pro, to create a series of physics simulations, like the Earth's magnetic field and general relativity. You can see how the model turns really complex concepts into stunning and interactive visuals."
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. The background features a large screen displaying a 3D cube made up of smaller cubes, alternating between blue and red. The scene transitions to a mesmerizing display of a cosmic event, showing a bright, glowing sphere surrounded by a swirling pattern of light and particles. The man continues to speak, emphasizing his points with hand movements."
      }
    },
    {
      "start_time": 720.33,
      "end_time": 750.66,
      "audio": {
        "content": " These are just a few brief examples, but we are excited about the possibilities, and we can't wait to see what you'll build with it. Gemini 2.4 is now available for everyone in AI studio, Vertex AI, and in the Gemini Act. I'm also excited to announce Gemini 2.5 Flash are low latency and most cost-e efficient model with thinking built in."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, addressing an audience. The background is a large screen displaying a futuristic, grid-like landscape with a yellow sphere at its center. The man gestures with his hands as he speaks, maintaining a professional demeanor."
      }
    },
    {
      "start_time": 754.66,
      "end_time": 772.66,
      "audio": {
        "content": " With 2.5 Flash, you can control how much the model reasons and balance performance with your budget. 2.5 Flash is coming soon in AI Studio, Vertex AI and in the Gemini app. We'll be sharing more details on the model"
      },
      "video": {
        "content": "A speaker stands on a stage, addressing an audience at what appears to be a tech conference. The stage is illuminated with blue lighting, creating a modern and professional atmosphere. The speaker, dressed in a dark suit and white shirt, gestures with his hands as he speaks, emphasizing points about the new Gemini 2.5 Flash technology. The screen behind him displays the text \"New Gemini 2.5 Flash in AI Studio, Vertex AI, and the Gemini app Coming Soon.\" The audience is seated in darkness, focusing their attention on the speaker."
      }
    },
    {
      "start_time": 772.66,
      "end_time": 794.94,
      "audio": {
        "content": " and its performance soon. I'm pretty excited by it and can't wait for you to see it for yourselves. Our goal is to always bring our latest AI advances into the fourth layer of our stack, products and platforms. Today, all 15 of our half a billion user products, including seven with 2 billion users,"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue blazer over a white shirt with a black bow tie, and dark pants. He is speaking to an audience, gesturing with his hands as he talks. The background is a gradient of blue shades, creating a calm and professional atmosphere."
      }
    },
    {
      "start_time": 797.33,
      "end_time": 817.53,
      "audio": {
        "content": " are powered by our Gemini models. AI deployed at this scale requires world-class inference, which enterprises can benefit requires world-class inference, which enterprises can benefit to build your own AI-powered applications. Gemini is also helping us create net new products and experiences. Notebook LM is one example, now used by over 100,000 businesses."
      },
      "video": {
        "content": "A man in a blue blazer and white shirt stands on a stage, gesturing with his hands as he speaks. The background is a gradient of blue shades, and there is a large screen behind him displaying various app icons, including Google, Gmail, Android, Chrome, Play Store, YouTube, and Maps. The man appears to be giving a presentation or speech."
      }
    },
    {
      "start_time": 817.53,
      "end_time": 839.45,
      "audio": {
        "content": " It uses long context, multimodality, and our latest thinking models to show information in powerful ways. Gemini is not our only industry leading model. VO2 is the leading video generation model. Major film studios, entertainment companies, as well as the top advertising agencies in the world, are"
      },
      "video": {
        "content": "A man in a blue suit stands on a stage with a blue background, clapping his hands together. The scene transitions to a large screen displaying a digital interface named \"NotebookLM.\" The screen shows various topics such as \"Edison's Spark: The Lightbulb Legacy,\" \"Engineering Illumination,\" \"Invention of the Lightbulb,\" and \"CS History.\" The man continues to speak, gesturing with his hands. The screen then displays a search bar with the text \"olive green muscle car approach.\""
      }
    },
    {
      "start_time": 839.45,
      "end_time": 861.33,
      "audio": {
        "content": " using it to bring their stories to life. Getting advances into the hands of both consumers and enterprises is something we are really focused on. This is why we are able to innovate at the cutting edge and push the boundaries of what's possible for us and for you. The result, better, faster, and more innovation for everyone."
      },
      "video": {
        "content": "A speaker stands on a stage, addressing an audience. The stage is dimly lit with blue lighting, creating a modern and professional atmosphere. The speaker gestures with his hands as he speaks, emphasizing points in his presentation. A large screen behind him displays various images and text, including a car drifting and a stack of three blue blocks with different symbols on them. The speaker appears to be explaining the significance of these images and symbols, likely in the context of technology or innovation."
      }
    },
    {
      "start_time": 861.33,
      "end_time": 894.66,
      "audio": {
        "content": " It's exciting to see how that's helping companies of all sizes do more with AI and translate those benefits to customers. I'm delighted to introduce Chris Kempchinski, CEO of McDonald's, to tell you more. But first, thank you for having me and enjoy a week together in Las Vegas. Over to you, Chris. McDonald's is undergoing a once-in-a-generation transformation."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue blazer over a white shirt and dark pants. He is wearing glasses and has a microphone attached to his blazer. The background is a gradient of blue shades, giving a professional and modern appearance. The man appears to be speaking or presenting, as he gestures with his hands while looking directly at the audience."
      }
    },
    {
      "start_time": 894.66,
      "end_time": 915.83,
      "audio": {
        "content": " We have about 65 million people that come to our restaurants every single day and it's how do we make their experience even better? Google's a big part of that, particularly as more and more of those customer interactions are happening in a digital world. That's why we're transforming our restaurant experience with the help of Google Cloud. Behind the counter, our restaurant team's jobs are becoming increasingly complex."
      },
      "video": {
        "content": "A man in a dark suit and light blue shirt stands in an office-like setting with large windows and modern decor. He appears to be speaking, with his hands clasped together in front of him. The background includes a McDonald's sign and some balloons, suggesting a celebratory or promotional context. The scene transitions to a close-up of a smartphone screen displaying a message from McDonald's, indicating that an order has been received."
      }
    },
    {
      "start_time": 915.83,
      "end_time": 937.33,
      "audio": {
        "content": " With edge computing from Google distributed cloud, capabilities will readily improve stability, security, and performance in our restaurants, all while giving us the space and power to test several new concepts we weren't able to do previously. For example, shift leaders will be able to leverage an AI powered assistant to help spot issues in the restaurant quickly. Our restaurant managers will be able to receive alerts"
      },
      "video": {
        "content": "A man in a dark suit and light blue shirt stands in an office-like environment with large windows showing a cityscape outside. He gestures with his hands as he speaks, emphasizing his points. The word \"security\" appears on the screen, highlighting the topic of discussion. The scene then transitions to a close-up of a person's hands holding a McDonald's bag, with a smartphone displaying a queue analysis app showing customer and traffic data."
      }
    },
    {
      "start_time": 937.33,
      "end_time": 964.55,
      "audio": {
        "content": " on their devices based on real-time data, say from their freezer or friars, along with guidance for from their freezer or friars, along with guidance for predictive maintenance. And with Gemini on Vertex AI, we can centralize all this information from restaurants in real time, making it easier for the right people to get answers with a simple question or prompt, improving the work environment in our restaurants across the globe for our more than 2 million team members. That's the magic. That's the power of AI and what Google Cloud has brought to McDonald's."
      },
      "video": {
        "content": "A person wearing blue gloves uses a tool to crack eggs into small metal rings on a griddle. The scene then transitions to a McDonald's kitchen where a worker is seen handling food items. A warning message appears on the screen indicating that the fryer oil requires changing. The video then cuts to a man in a suit standing in front of a McDonald's restaurant, followed by a map of the world with McDonald's logos scattered across it."
      }
    },
    {
      "start_time": 964.55,
      "end_time": 985.99,
      "audio": {
        "content": " Oh, oh, uh, uh Uh Uh Thank you, Chris. McDonald's is a great example of a company integrating AI into the very core of its operations."
      },
      "video": {
        "content": "A man in a dark suit and light blue shirt stands confidently in an office-like environment with large windows and modern decor. He has short, neatly styled hair and is looking directly at the camera with a serious expression. The background features a McDonald's sign and some balloons, suggesting a casual yet professional setting."
      }
    },
    {
      "start_time": 985.99,
      "end_time": 1007.99,
      "audio": {
        "content": " Customers around the world are choosing to work with Google for three important reasons. First, Google Cloud offers an AI-optimized platform with leading price, performance, precision, and quality. And new today, everything you need to build and manage multi-agent systems."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. He appears to be giving a presentation or speech. The background is a simple, light-colored curtain with vertical stripes. The man's gestures change slightly throughout the video, indicating he is emphasizing different points in his speech."
      }
    },
    {
      "start_time": 1007.99,
      "end_time": 1031.66,
      "audio": {
        "content": " Our AI platform offers advanced infrastructure and databases, world-class research leading models, and grounding for model responses with Google quality search. Vertex AI, a robust developer platform, including the broadest range of enterprise ready tools with which you can build AI agents"
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a conference or seminar. The stage is well-lit with blue lighting, and the audience is seated in rows, attentively watching the presentation. The speaker, dressed in a suit, stands confidently as he addresses the crowd. The background features large screens displaying Google Cloud branding and text that reads \"AI optimized platform built for multi-agent systems.\" The presentation includes diagrams and text slides that highlight key points about AI optimization and multi-agent systems."
      }
    },
    {
      "start_time": 1031.66,
      "end_time": 1053.66,
      "audio": {
        "content": " and enable a multi-agent ecosystem. And the most comprehensive portfolio of purpose-built agents. Second, Google Cloud offers an open multi-cloud platform that allows you to adopt AI agents while connecting them with your existing IT landscape,"
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing as he speaks to an audience. The stage is illuminated with blue lighting, and a large screen behind him displays various logos and text related to Google Cloud services. The man appears to be presenting or giving a speech, moving his hands expressively as he talks. The audience is visible in the foreground, attentively listening to the speaker."
      }
    },
    {
      "start_time": 1053.66,
      "end_time": 1077.97,
      "audio": {
        "content": " including your databases, your document stores, enterprise applications, and interoperating with models and agents from other providers. You get value faster from your AI investments. And third, Google Cloud offers an enterprise-ready AI platform built for interoperability."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark blue suit with a white shirt and a patterned tie. He gestures with his hands as he speaks, occasionally bringing them together in front of him. The background is a simple, light-colored curtain with vertical stripes, and the lighting is bright, highlighting the speaker."
      }
    },
    {
      "start_time": 1077.97,
      "end_time": 1099.49,
      "audio": {
        "content": " It enables you to adopt AI deeply while addressing the evolving concerns around sovereignty, security, privacy, and regulatory requirements. You can adopt AI while we protect your data and your intellectual property and enable you to maintain compliance. Powering"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark blue suit with a white dress shirt and a black bow tie. He is gesturing with his hands as he speaks, indicating that he is likely delivering a presentation or speech. The background consists of vertical light panels, creating a modern and professional atmosphere."
      }
    },
    {
      "start_time": 1099.49,
      "end_time": 1125.63,
      "audio": {
        "content": " this offering is our advanced infrastructure core for AI. To share the latest, please join me in welcoming Amin Vadat. You know, Thank you, Thomas."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored backdrop with vertical lines. The man appears to be addressing an audience, possibly at a conference or event. The lighting is bright, highlighting the speaker and the stage."
      }
    },
    {
      "start_time": 1125.63,
      "end_time": 1147.43,
      "audio": {
        "content": " Demand for AI compute for training and inference is growing at an unprecedented rate. For over eight years, it has increased by over 10 times year over year, a factor of a hundred million in just eight years. We're continuing to offer leading power efficiency, performance, and networking for training and inference workloads, starting with the hardware."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a black suit jacket over a vibrant pink shirt. He appears to be giving a presentation or speech. His body language is animated as he gestures with his right hand, emphasizing points in his dialogue. The background is simple, featuring vertical light strips that create a subtle gradient effect from white to blue. As he speaks, the camera occasionally shifts focus to a large screen behind him, which displays a close-up image of a mechanical device, possibly related to the topic of his talk."
      }
    },
    {
      "start_time": 1147.43,
      "end_time": 1171.66,
      "audio": {
        "content": " Today, we introduced our seventh generation of TPUs, Ironwood TPUs are our largest and most powerful TPU pods to date, more than a 10x improvement from our most recent high-performance TPU with over 9,000 chips per pod."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a black suit jacket over a pink shirt. He gestures with his right hand as he speaks, likely addressing an audience. The background is a gradient of blue shades, giving a modern and professional ambiance to the scene."
      }
    },
    {
      "start_time": 1171.66,
      "end_time": 1193.66,
      "audio": {
        "content": " To meet the exponentially growing demands of the most demanding thinking models like Gemini 2.5. This delivers a staggering 42.5 exoflops of compute per pod. To give you a sense of the scale, the world's number one supercomputer supports 1.7 X-flops. Ironwood pods offer more than 24 times that compute power."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, addressing an audience. The stage is illuminated with blue lighting, creating a professional and modern atmosphere. A large screen behind him displays various images and text related to computing technology. The man gestures with his hands as he speaks, emphasizing points about the technical aspects being discussed. The audience is seated in darkness, focusing their attention on the speaker and the visuals on the screen."
      }
    },
    {
      "start_time": 1193.66,
      "end_time": 1217.33,
      "audio": {
        "content": " Ironwood T-s are truly built for the next generation of AI workloads. TPUs are an incredible achievement, but they are just one piece of our overall infrastructure. After all, a chip is only as powerful as a system that surrounds it. Our AI hypercomputer is a super-competing system designed to simplify AI deployment,"
      },
      "video": {
        "content": "A man in a dark suit and red shirt stands on a stage, gesturing as he speaks. The background is a deep blue with vertical light strips. A large screen behind him displays an image of a computer chip labeled \"Ironwood\" and text that reads \"24x more compute power than the world's best supercomputer.\" The scene then transitions to a different part of the stage where the same man continues his presentation, now with a backdrop featuring various icons and text such as \"Agents,\" \"Vertex AI,\" \"Research and Models,\" and \"AI Hypercomputer.\" The lighting highlights these elements, emphasizing the technological theme."
      }
    },
    {
      "start_time": 1217.33,
      "end_time": 1239.43,
      "audio": {
        "content": " improve performance, and optimize costs. It supports the best hardware platforms and brings together a single unified software stack and consumption model that enables you to use the hardware that best meets your needs and easily transition from one hardware generation to the next. Vital as we innovate so rapidly."
      },
      "video": {
        "content": "A man stands on a stage with a blue background, gesturing as he speaks. He is dressed in a dark suit over a red shirt. The camera then pans to show the entire stage, which is dimly lit with blue lighting. A large screen behind him displays text about an AI Hypercomputer, highlighting features such as flexible consumption, open software, and performance-optimized hardware. The man continues to speak, occasionally pointing towards the screen."
      }
    },
    {
      "start_time": 1239.43,
      "end_time": 1266.99,
      "audio": {
        "content": " We have enhanced our GPU portfolio with the availability of A4X and A4 VMs, powered by NVIDIA's GB200 and B200 Blackwell GPUs. We were the first cloud provider to offer both options. We're also pleased that Google Cloud will be among the first offer Nvidia's next generation, Vera Rubin GPUs, which offer up to 15 X-flops of FP4 inference performance per rack."
      },
      "video": {
        "content": "A man in a dark suit and pink shirt stands on a stage, presenting information to an audience. The stage is illuminated with blue and white lights, creating a modern and professional atmosphere. The presenter gestures with his hands as he speaks, emphasizing key points. Behind him, large screens display text related to AI hypercomputers, performance-optimized hardware, and new VMs. The presentation includes slides about NVIDIA Vera Rubin GPUs being offered on Google Cloud."
      }
    },
    {
      "start_time": 1266.99,
      "end_time": 1287.99,
      "audio": {
        "content": " We also introduced cluster-directors. per rack. We also introduce Cluster Director, which enables you to deploy and manage a large number of accelerators as a single unit to compute to improve performance, efficiency, and resilience. Storage is also vital to reduce the bottlenecks for training and inference. We are introducing new storage innovations."
      },
      "video": {
        "content": "A man in a dark suit and red shirt stands on a stage, gesturing animatedly as he speaks. The background is a large screen displaying the text \"New Cluster Director\" and \"Easily deploy and manage large compute clusters.\" The audience is seated in front of the stage, attentively watching the presentation."
      }
    },
    {
      "start_time": 1287.99,
      "end_time": 1311.9,
      "audio": {
        "content": " Hyperdisc X-Pers. storage innovations. Hyperdisc exopools offer the highest aggregate performance and capacity per AI cluster of any hyperscaler. Anywhere Cash keeps data close to your accelerators with up to 70% improvement in storage latency to reduce training time. And rapid storage, our first zonal storage solution offers five times lower latency for random reads and rights compared to the fastest"
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a tech conference. The stage features a large screen displaying slides about AI hypercomputing and cloud storage solutions. The speaker, dressed in a dark suit with a red shirt and black bow tie, gestures towards the screen as he explains the features of the new Hyperdisk Exapools and Cloud Storage Anywhere Cache. The audience is visible in the foreground, attentively watching the presentation."
      }
    },
    {
      "start_time": 1311.9,
      "end_time": 1334.66,
      "audio": {
        "content": " comparable cloud alternative. Software is how we orchestrate and simplify access to this powerful hardware. And today we're introducing three enhancements for AI inference. First, we're introducing new inference capabilities in Google Kubernetes Engine, including Gen AI Aware-aware, scaling and load balancing features, which help reduce serving costs by up to 30%."
      },
      "video": {
        "content": "A man stands on a stage, presenting to an audience. He gestures towards a large screen behind him, which displays various slides related to AI hypercomputing and storage solutions. The slides highlight features such as flexible consumption, open software options like JAX, Keras, PyTorch, vLLM, Pathways on Google Cloud, XLA, and Google Kubernetes Engine and Compute Engine. The presentation emphasizes the performance-optimized hardware, including networking (Jupiter, OCS). The speaker appears engaged and informative, moving slightly between slides while explaining the benefits and capabilities of the new storage solution."
      }
    },
    {
      "start_time": 1334.66,
      "end_time": 1357.32,
      "audio": {
        "content": " Tail latency by up to 60% and increased throughput by up to 40%. Second, we're announcing that Pathways, Google's own distributed ML runtime, Powering Gemini, is now available for the first time for cloud customers. Developed by Google DeepMind, Pathways enable state-of-the-art multi-host inferencing for dynamic scaling with high performance at optimal costs."
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a conference or tech summit. The stage is well-lit with a large screen displaying text about new features or products. The speaker, dressed in a dark suit and a vibrant pink shirt, gestures animatedly as he speaks. The audience is seated in front of the stage, attentively watching the presentation. The background includes a mix of blue and white lighting, creating a modern and professional atmosphere."
      }
    },
    {
      "start_time": 1357.32,
      "end_time": 1377.56,
      "audio": {
        "content": " Now you can scale out model serving to hundreds of accelerators for the best combination of batch efficiency and low latency. Third, we're bringing VLLM to TPUs. This allows customers who optimize PITORC with VLM for GPUs to easily and cost-efficiently run their workloads on TPUs."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark suit over a red shirt and black pants. He gestures with his right hand as he speaks, indicating an engaging presentation. The background is a simple, modern design with vertical light panels, creating a professional and focused atmosphere. As the video progresses, the camera shifts to show a wider view of the stage, revealing a large screen displaying the text \"New vLLM on TPU\" along with additional information about the technology. The audience is seated in darkness, attentively watching the presentation."
      }
    },
    {
      "start_time": 1377.56,
      "end_time": 1398,
      "audio": {
        "content": " All of these AI hypercomputer hardware and software enhancements together enable us to deliver more intelligence or useful AI output at a consistently low price. This is one reason why Gemini 2.0 Flash, powered by AI hypercomputer, achieves 24 times higher intelligence per dollar compared to GPT-40,"
      },
      "video": {
        "content": "A man in a dark suit and pink shirt stands on a stage, gesturing as he speaks. Behind him is a large screen displaying text about new AI hypercomputer enhancements, including \"vLLM on TPU\" and \"GKE Inference Gateway.\" The stage is well-lit with blue lights, and the audience is visible in the background."
      }
    },
    {
      "start_time": 1398,
      "end_time": 1420.32,
      "audio": {
        "content": " and five times higher than DeepSeek R1. We're truly seeing tremendous momentum across our AI infrastructure portfolio, tripling the number of TPU and GPUR is consumed by our cloud customers just over the past year. And we're seeing tremendous customer momentum with AI unicorns like Anthropic, any scale, arise, and contextual AI."
      },
      "video": {
        "content": "A man in a suit stands on a stage, addressing an audience. The background features large screens displaying text about Google Cloud's products and services. The first screen shows information about Gemini 2.0 Flash, highlighting its superior intelligence per dollar compared to competitors. The second screen indicates a 3x increase in TPU and GPU hours consumed by cloud customers. The third screen emphasizes that leading AI unicorns trust Google Cloud's infrastructure."
      }
    },
    {
      "start_time": 1420.32,
      "end_time": 1442.99,
      "audio": {
        "content": " And enterprises. Toyota deployed ML models for factory workers. Schrodinger uses cloud GPUs for advanced drug discovery. TSMC protects its critical data for mission critical workloads. An Airbus deployed an AI platform to advance aircraft performance, safety, and reliability."
      },
      "video": {
        "content": "A man in a suit stands on a stage, presenting to an audience. The background features a large screen displaying various slides related to Google Cloud's infrastructure and its benefits for leading AI unicorns. The slides include text such as \"Leading AI Unicorns trust Google Cloud's infrastructure\" and logos of companies like Anthropic, Anyscale, Arize, Contextual AI, Toyota, Airbus, and TSMC. The presentation highlights the use of GPUs for advanced drug discovery and data protection for mission-critical workloads."
      }
    },
    {
      "start_time": 1442.99,
      "end_time": 1465.99,
      "audio": {
        "content": " Beyond optimizing training and inference in the cloud, we know that many AI workloads need to be run on-premises. As you heard from Chris Kipchensky at McDonald's, Google's distributed cloud brings our hardware and software to your environments. So you can bring AI capabilities closer to where data is generated for low latency and highly sensitive data in particular."
      },
      "video": {
        "content": "A man in a black suit and pink shirt stands on a stage, gesturing as he speaks. The background is a gradient of light blue to white, with vertical light strips on either side. He moves slightly from side to side, emphasizing his points. The scene transitions to a large screen displaying information about Google's AI Hypercomputer, including open software options like JAX, Keras, PyTorch, vL.M., Pathways on Google Cloud, XLA, and performance-optimized hardware. The screen also features the Google Distributed Cloud logo."
      }
    },
    {
      "start_time": 1465.99,
      "end_time": 1489.66,
      "audio": {
        "content": " Today we are announcing that Gemini can run on Google distributed cloud locally in air-gapped environments, as well as connected environments. This all comes with the support for NVIDIA's confidential computing and Blackwell systems, DGXB200 and HGXB200 platforms with Dell as a key partner."
      },
      "video": {
        "content": "A speaker stands on a stage at a conference, addressing an audience. The stage is well-lit with a large screen behind him displaying text about Google Distributed Cloud and Gemini. The speaker gestures with his hands as he speaks, emphasizing points about running Gemini locally in air-gapped and connected environments. The audience is seated, attentively listening to the presentation."
      }
    },
    {
      "start_time": 1489.66,
      "end_time": 1520.4,
      "audio": {
        "content": " This complements our Google Distributed Cloud AirGap product, which is now authorized for U.S. government's secret and top secret missions, and on which Gemini is now available, providing the highest levels of security and compliance. Nvidia is an important partner for Google and our customers. Let's hear directly from CEO Jensen Wang. Building Advanced AI Infrastructure is deep computer science. No company is better at every single layer of"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a black suit jacket over a pink shirt. He gestures with his right hand as he speaks, occasionally adjusting his jacket. The background is a gradient of light blue to white, with vertical lines on the left side. The scene transitions to a wider shot of the stage, where the man continues to speak. The screen behind him displays the logos of Google Cloud and NVIDIA, followed by the text \"Advancing AI together.\""
      }
    },
    {
      "start_time": 1520.4,
      "end_time": 1542.99,
      "audio": {
        "content": " computing than Google and Google Cloud. Between Nvidia and Google Cloud, this super partnership includes capabilities that covers literally every single layer and every single aspect of computing. Every industry, every company, every country wants to get their hands on AI. However, everything has to be fundamentally confidential and secure."
      },
      "video": {
        "content": "A man stands in a modern, futuristic interior space with geometric glass walls and a high ceiling. He is dressed in a black leather jacket over a black shirt and dark pants, complemented by black shoes. His hair is gray, and he wears glasses. He gestures with his hands as if explaining something, moving them from his chest outward to emphasize his points. The lighting is soft and ambient, creating a calm and focused atmosphere."
      }
    },
    {
      "start_time": 1542.99,
      "end_time": 1574.66,
      "audio": {
        "content": " And so we're announcing something utterly gigantic today. Google distributed cloud with Gemini and Nvidia are going to bring state-of-the-art AI to the world's regulated industries and countries. Now, if you can't come to the cloud, Google Cloud will bring AI to you. Thank you. Thank you, Jensen. You know, we really value our deep engineering relationship with"
      },
      "video": {
        "content": "The video begins with a close-up shot of a server rack, showcasing various components such as power supplies, fans, and network interfaces. The camera pans across the rack, highlighting the intricate details and the organized layout of the hardware. The scene then transitions to a black screen with the text \"Accelerated by NVIDIA Blackwell\" displayed prominently in white letters against a black background. Following this, the video cuts to a man standing on a stage, gesturing with his hands as he speaks. He is dressed in a black leather jacket over a black shirt. The background features a futuristic, geometric design with green and black colors. The final scene shows the same man on stage, now wearing a suit, continuing his presentation. The stage is well-lit with blue lighting, and a large screen behind him displays the logos of Google Cloud and NVIDIA."
      }
    },
    {
      "start_time": 1574.66,
      "end_time": 1595.76,
      "audio": {
        "content": " Nvidia. Building on the ground-bracing research of Google DeepMind, we're delivering rapid innovation across many AI models, starting with Gemini, our most capable family of AI models. In the last year alone, we released Gemini a first native multi-modal model. We delivered the native multimodal model."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage with a blue background. He gestures with his hands as he speaks, occasionally clapping them together. The scene transitions to a large screen displaying a diagram of three stacked blocks labeled \"Research and Models.\" The man continues to speak, and the camera shifts to show him from different angles, highlighting the blue lighting and the audience in the foreground."
      }
    },
    {
      "start_time": 1597.56,
      "end_time": 1617.66,
      "audio": {
        "content": " We delivered the first 2 million token context window. We built the live API for live bidirectional voice and video interaction, led in price performance with our flash models, and we recently launched Gemini 2.5 Pro, which is state-of-the-art on a wide range of benchmarks,"
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing towards a large screen behind him. The screen displays a timeline with various stages labeled as 'Gemini', 'Live API', 'Gemini Flash models', and 'Gemini 2.5 Pro'. The man appears to be explaining these stages, likely in the context of a presentation or speech about technological advancements."
      }
    },
    {
      "start_time": 1617.66,
      "end_time": 1640.32,
      "audio": {
        "content": " and I'm pleased to say number one on chat bat arena. Batarina. Gemini is providing best in class AI for many companies around the world, including our close partners, Box and Palo Alto networks who are using Gemini 2.5 to deliver new applications."
      },
      "video": {
        "content": "A man stands on a stage with a blue curtain backdrop. He is dressed in a dark suit jacket, white dress shirt, and black tie. His hands are clasped together in front of him as he speaks. The lighting highlights his upper body and face. The scene then transitions to a large screen displaying a list of company names and logos under the heading \"Gemini customers around the world.\" The screen also includes the text \"and many more...\""
      }
    },
    {
      "start_time": 1640.32,
      "end_time": 1660.36,
      "audio": {
        "content": " It's also integrated across our own products, including Google Workspace, where Gemini powers features in Gmail, docks, drive, and meat, and is now included in all subscriptions. Gemini and Workspace is helping customers,"
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then transitions to a large screen displaying the text \"Google Workspace with Gemini.\" Various Google Workspace icons, including Drive, Docs, Sheets, Calendar, and Gmail, appear on the screen. The man continues to speak, and the audience is visible in the foreground, attentively watching him."
      }
    },
    {
      "start_time": 1660.36,
      "end_time": 1681.32,
      "audio": {
        "content": " like EV manufacture Rivian, fresh fields to enhance legal work and expedite tasks like document drafting and research, and companies of the Schwartz Group, Europe's largest retailer. Today I'm pleased to announce three new innovations with Gemina and workspace."
      },
      "video": {
        "content": "A man in a suit stands on a stage, addressing an audience. The stage is illuminated with blue lights, creating a modern and professional atmosphere. Behind him, a large screen displays various slides with text and images. The first slide shows a blue arrow symbol, followed by another slide that reads \"Setting a new standard for legal due diligence with Gemini.\" The next slide features a woman working on a laptop in a grocery store, with the text \"Migrating to Google Workspace provides their employees a secure way to work.\" The man gestures towards the screen as he speaks, emphasizing the points being made."
      }
    },
    {
      "start_time": 1681.32,
      "end_time": 1706.63,
      "audio": {
        "content": " Help me analyze Gemina and Workspace. Help me analyze in Google Sheets, which guides you through your data to complete expert-level analysis. Audio overviews in Google Docs, where you can interact with docs in an entirely new way by creating high quality audio versions of your content. And Google Workspace Flow of your content and Google workspace flows to help you automate time-consuming"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, addressing an audience. He gestures with his hands as he speaks, emphasizing his points. The background is a large screen displaying a presentation slide about Google Workspace. The slide features text that reads \"New Help me analyze Uncover insights from your data\" and \"Coming Soon.\" Another slide below it reads \"New Audio overviews in Docs Create high-quality audio versions of your docs\" and also indicates \"Coming Soon.\" The stage has a modern design with vertical light panels and a dark backdrop."
      }
    },
    {
      "start_time": 1706.63,
      "end_time": 1727.99,
      "audio": {
        "content": " repetitive tasks and to make decisions with more context. Let's see how Google Workspace is helping businesses around the world. We We are scattered all over the globe. It's just bonkers to try to get everyone to collaborate."
      },
      "video": {
        "content": "A large screen displays an announcement about Google Workspace Flows, highlighting its ability to automate work with agents in the loop. The scene transitions to a man standing on a stage, dressed in a dark suit and white shirt, addressing an audience. He gestures with his hands as he speaks, emphasizing key points. The camera then shifts focus to a hand interacting with a futuristic device, which appears to be part of a demonstration or presentation."
      }
    },
    {
      "start_time": 1727.99,
      "end_time": 1756.32,
      "audio": {
        "content": " Keeping up with my email everyone to collaborate. Keeping up with my email has always been really, really difficult. Is there anything more intimidating than a blank white page? But where do you begin? It's more important than ever that you have tools to work more quickly and efficiently together. Using Help Me Write, we were able to save 35 hours a month for product descriptions for our website."
      },
      "video": {
        "content": "A group of people are gathered around a table in what appears to be an office or a meeting room. They are engaged in a discussion, with one person speaking while others listen attentively. The setting is modern and well-lit, with large windows allowing natural light to flood the space. The atmosphere is collaborative and focused."
      }
    },
    {
      "start_time": 1759.32,
      "end_time": 1777.12,
      "audio": {
        "content": " We are leveraging meat to take notes, summarize them, and generate action items after the meeting so that we can be really present and focus on the content. When you're developing a DAC or a pitch for a client and you're coming up with ideas"
      },
      "video": {
        "content": "Focus on the meeting, not the notes.\n\nGemini is taking notes\nIt can take a few minutes for notes to appear\nStop taking notes\n\nYael Burla Vimeo\n\nDeliver on time, every time with visuals in seconds."
      }
    },
    {
      "start_time": 1777.12,
      "end_time": 1797.32,
      "audio": {
        "content": " and you need to visually manifest those ideas that used to take days. And now with the right prompts, we can do that in hours. We have to worry less about security because we know Google has our back. Security played a big part of the decision to move from the legacy on-premise tools set to Google Workspace."
      },
      "video": {
        "content": "The video begins with three women sitting around a table, intently looking at a laptop screen. They appear to be engaged in a discussion or collaborative work session. The scene then transitions to a woman sitting at a desk, working on a computer. The focus shifts to a close-up of various makeup products, including lipsticks and blushes, arranged on a table. The video then cuts to a black screen with the text \"Keep everything secure.\" followed by a shot of a large satellite dish inside a warehouse. The final frame shows a person standing in front of the satellite dish."
      }
    },
    {
      "start_time": 1797.32,
      "end_time": 1838.32,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "The video begins with a serene cityscape at dusk, featuring tall buildings with illuminated windows against a pinkish-orange sky. The scene transitions to an indoor climbing gym where a group of six people, dressed in athletic gear, pose together. They stand in front of a colorful climbing wall, smiling and looking directly at the camera. The video then shifts to a conference or presentation setting, where a man in a suit stands on a stage, addressing an audience. The background features a large screen displaying the text \"Google Workspace with Gemini\" and \"Imagen 3,\" along with various images and graphics related to the topic."
      }
    },
    {
      "start_time": 1797.32,
      "end_time": 1862.25,
      "audio": {
        "content": " The impact of Gemini for workspace in our business has been really transformative. Thank you. Beyond Gemini, over the last year, we've made huge improvements to imagine three, our highest quality texture image model, which generates images with better detail, richer lighting, and fewer distracting artifacts than previous models. Imagine delivers accurate prompt adherence, bringing your creative vision to life with incredible precision. We also introduce CHRP 3 to help you create custom voices with just 10 seconds of input and to weave AI-powered narration into your existing recordings."
      },
      "video": {
        "content": "The video begins with a panoramic view of a city skyline at dusk, featuring numerous tall buildings with illuminated windows. The sky is painted in hues of orange and pink, indicating either sunrise or sunset. The scene then transitions to an indoor setting where a man in a suit stands on a stage, addressing an audience. The background is dark, with blue lighting accentuating the stage area. The man gestures as he speaks, emphasizing his points. The video captures the essence of a professional presentation or conference."
      }
    },
    {
      "start_time": 1863.67,
      "end_time": 1888.65,
      "audio": {
        "content": " Today, we're making Luria available on Google Cloud to transform text prompts into 30 second music clips and with the first hyperscaler to offer this capability. Let's hear a clip from Lyria. VO2 V-O-2"
      },
      "video": {
        "content": "A man in a suit stands on a stage, addressing an audience. The stage is modern and sleek, with a large screen displaying the word \"Lyria.\" The man gestures with his hands as he speaks, emphasizing his points. The audience is seated in rows, attentively listening to the presentation."
      }
    },
    {
      "start_time": 1890.65,
      "end_time": 1912.79,
      "audio": {
        "content": " is our industry-leading video generation model. It generates video generation model. It generates many minutes of 4K video, watermarked with synth ID, to ensure they can be identified as AI generators. It gives creators unprecedented creative control with new editing tools including camera"
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing as he speaks to an audience. The stage is illuminated with a modern design featuring large, vertical light panels. A large screen behind him displays various images, including a wedding cake, a golden guitar, and a cowboy in a Western town. The man appears to be presenting or demonstrating something related to these images."
      }
    },
    {
      "start_time": 1912.79,
      "end_time": 1932.89,
      "audio": {
        "content": " presets to direct shot composition and camera angles without complex prompting, first and last shot control to define the beginning and the end of a video sequence with VO seamlessly bridging the gap and dynamic in-painting and outpainting for video"
      },
      "video": {
        "content": "A man in a cowboy hat and vest is walking down a dirt road in front of a wooden building. The camera pans to the right, revealing more of the desert landscape with cacti and mountains in the background. The scene then transitions to an animated character with blue hair standing on a city street at night. The character looks surprised as he turns around and walks away from the camera."
      }
    },
    {
      "start_time": 1932.89,
      "end_time": 1968.07,
      "audio": {
        "content": " editing and scaling. With Gemini, imagine, chirp, Lyria, and Vio. Google is the only company that offers generative media models across all modality. And all of them are available to you today on Vertex AI. Thank you. We've seen great examples from our customers. Craft Hines is speeding up campaign creation. A go-down creates unique"
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing with his hands as he speaks. The background is a large screen displaying various text and images. The scene transitions to a close-up of the man, emphasizing his gestures and expressions. The lighting is bright, highlighting the speaker and the screen behind him."
      }
    },
    {
      "start_time": 1968.07,
      "end_time": 2002.99,
      "audio": {
        "content": " visuals of travel destinations. Bending spoon makes 60 million photos every day more fun and L'Oreal Group generates diverse cinematic shots using our models. Please join me now in welcoming Nenshad Bodily Walla for a demo of our models in action using vertex AI. Hello, everyone."
      },
      "video": {
        "content": "A professional speaker stands on a stage, presenting to an audience. The stage is well-lit with a large screen displaying various images and text related to technology and innovation. The speaker gestures with his hands as he explains the features and benefits of a new product or service. The background includes a large screen showing images of a man holding a box of cereal and another image processing 60 million photos daily using Imagen 3. The speaker's attire is formal, consisting of a dark suit and white shirt."
      }
    },
    {
      "start_time": 2002.99,
      "end_time": 2024.79,
      "audio": {
        "content": " How many of you have heard about our Cloud Next concert already? already. That's not as many I was hoping to hear about and I think that's because we've been missing a teaser video. Now let me tell you, it wasn't easy to pick the artist this year because it turns out that even though he looks very demure and very mindful,"
      },
      "video": {
        "content": "A man stands behind a podium on a stage, delivering a presentation. The background is dark with blue lighting accents. A large screen behind him displays his name, \"Nenshad Bardoliwalla,\" and his title, \"Director, Product Management, Vertex AI Platform.\" He gestures with his hands as he speaks, occasionally looking at a laptop on the podium. The setting appears to be a formal conference or event."
      }
    },
    {
      "start_time": 2024.79,
      "end_time": 2050.65,
      "audio": {
        "content": " Thomas Corian is a massive chapel ran fanboy. Yes, I have seen the video that he sent chapel directly of him going, H-O-T-O-G-O, Thomas wants you in the show. Yeah, but we waited for weeks to get her response from chapel,"
      },
      "video": {
        "content": "A man stands behind a podium on a stage, addressing an audience. The background features a large screen displaying his name, Nenshad Bardoliwalla, along with his title as Director, Product Management, for Vertex AI Platform. He is dressed in a dark blue blazer over a white shirt, with a microphone clipped to his blazer. The stage is well-lit, with blue lighting accents and a modern design."
      }
    },
    {
      "start_time": 2050.65,
      "end_time": 2072.29,
      "audio": {
        "content": " and then she broke Thomas's heart with just three words. Good luck, babe. So, we're going to use Vertex Media to pump Thomas up and create a teaser video that's going to get you as amped up as I am. Now we've already created our final video. I'm just going to show you how we got there. Given where we are, we're going"
      },
      "video": {
        "content": "A man stands behind a podium on a stage, addressing an audience. He is dressed in a dark blue blazer over a white shirt with a patterned tie. The background features vertical blinds, and there is a laptop on the podium displaying a presentation slide titled \"Vertex AI Media Studio.\" The slide lists various generative media models: Imagen (generate images), Veo (generate videos), Chirp (generate voices), and Lyria (generate music). The man gestures with his hands as he speaks, emphasizing points about the technology being discussed."
      }
    },
    {
      "start_time": 2072.29,
      "end_time": 2101.2,
      "audio": {
        "content": " to use Las Vegas Skyline as a perfect backdrop for what we're going to do with Vertex AI Media Studio. So let's go ahead. We're going to start by bringing in the Las Vegas skyline image. Really high quality, beautiful image. We're going to generate video, but here's the new hotness. Check it out. Camera presets built right into Vio, panning left, panning right, time lapse, tracking shots,"
      },
      "video": {
        "content": "A man is standing at a podium, speaking into a microphone. He is dressed in a dark suit and white shirt. The background is a plain, dark-colored wall. The man appears to be addressing an audience, gesturing with his hands as he speaks."
      }
    },
    {
      "start_time": 2101.2,
      "end_time": 2123.99,
      "audio": {
        "content": " and even drone shots. So let's go ahead and submit a drone shot in drone shot of the city skyline. There we go. We'll go and submit this. Now normally this would take a few seconds. I ran this earlier today so it's cached, so it's going to be a little quicker than normal."
      },
      "video": {
        "content": "A man is sitting at a desk in a studio, speaking into a microphone. He gestures with his hands as he talks. The scene then transitions to a computer screen displaying a media studio interface. The interface shows an image of a city skyline at sunset, featuring prominent buildings and a fountain. The camera preset is set to 'Drone shot,' and the frame rate is set to 24 fps. The video length is set to 8 seconds. The man continues to speak, and the screen displays the text 'drone shot of the city skyline' in the prompt box."
      }
    },
    {
      "start_time": 2123.99,
      "end_time": 2144.99,
      "audio": {
        "content": " All right, let's look at video number one. Absolutely spectacular. We have the ability to see the fountains, the Eiffel Tower. Now let's go ahead and take a look at video number two. A different angle that Vio creates for us."
      },
      "video": {
        "content": "A man in a suit is standing at a podium, speaking into a microphone. The background shows a cityscape with illuminated buildings and a prominent tower. The man gestures with his hands as he speaks, emphasizing his points. The video then transitions to a screen displaying a drone shot of the city skyline, showing the same illuminated buildings and tower. The video continues to play, showing the drone footage in detail."
      }
    },
    {
      "start_time": 2144.99,
      "end_time": 2167.99,
      "audio": {
        "content": " Again, stunning imagery. You can see the clouds in the background and look at the cars driving up and down Las Vegas Boulevard. Absolutely incredible. Now, one video is not going to do it for the concert promo we want to do, so I want to show you some of the other videos that I created. I have one here of the stage being set up all through the power of VO."
      },
      "video": {
        "content": "A man in a suit is standing at a podium, speaking into a microphone. The background is a plain, dark curtain. The camera remains stationary throughout the video."
      }
    },
    {
      "start_time": 2167.99,
      "end_time": 2189.65,
      "audio": {
        "content": " I have one of the band. I even have one of the audience actually clapping for what they're about to see. This will be a good reminder for all of you. Now, something very interesting happened. It turns out that Vio can do something that my 12 year old can do,"
      },
      "video": {
        "content": "A man in a suit is standing at a podium, speaking into a microphone. The camera then cuts to a stage with bright lights and a band performing. The audience is visible, clapping and enjoying the performance. The scene shifts to a close-up of a hand holding a microphone, followed by a shot of a woman clapping her hands. The video then shows a panoramic view of a cityscape at dusk, with the Las Vegas Strip in the background. Finally, the video ends with a shot of a guitar case on stage."
      }
    },
    {
      "start_time": 2189.65,
      "end_time": 2216.32,
      "audio": {
        "content": " and that is be an expert in photo bombing. It turns out that this great video we just saw has a crew member, and we love our crew members. However, in this case, I'd like to feature the guitar because the guitar is the most important part of the band. So let's go ahead and use VO's new in-painting capability. And I'm sorry, sir, I apologize."
      },
      "video": {
        "content": "A man is standing at a podium, speaking into a microphone. He gestures with his hands as he talks. The background is a dark stage with a spotlight shining on him. The camera remains stationary throughout the video."
      }
    },
    {
      "start_time": 2216.32,
      "end_time": 2243.99,
      "audio": {
        "content": " I know you're very good at your job, but I am going to have to remove you from this image. We will send flowers to you and your family though, sir. Let's use the new in-painting capability, wait a couple of seconds, and let's see what we see. Now if this does what I think it does, it should preserve every single aspect of what we saw before just without our stage hand."
      },
      "video": {
        "content": "A man in a suit is sitting at a desk, looking at a laptop screen. The scene then transitions to a stage with a guitar and a suitcase. The man in the suit appears to be interacting with the guitar and the suitcase."
      }
    },
    {
      "start_time": 2243.99,
      "end_time": 2266.99,
      "audio": {
        "content": " Look at that. Okay, so we got some video clips. Now we need some music. Let's try the first clip I created with Lyrion and see how we like it. You know, that's not quite my tempo."
      },
      "video": {
        "content": "A man is standing at a podium, speaking into a microphone. The background is a dark stage with a large screen displaying an image of a guitar and a suitcase. The man appears to be giving a presentation or speech. The video then transitions to a screen showing the Google Cloud Next 25 logo and sound waveforms."
      }
    },
    {
      "start_time": 2266.99,
      "end_time": 2292.25,
      "audio": {
        "content": " I need music that's going to make all of you feel like I'm never going to give you up. I'm never going to let you down. I'm never going to run around and desert you. So let's try clip number two and see how that works. All right, we have the recipe."
      },
      "video": {
        "content": "A man is standing at a podium, speaking into a microphone. He is dressed in a dark suit jacket over a white shirt. The background is a plain, light-colored curtain. The man appears to be addressing an audience, gesturing with his hands as he speaks. The video is likely from a conference or event, possibly related to technology or business."
      }
    },
    {
      "start_time": 2292.25,
      "end_time": 2324.99,
      "audio": {
        "content": " I like that tune better. We've got the videos. we've got the music, let's pull it all together and see what it looks like. Here we go. Play it, Sam. Hey, and you know I'm What do you think? Absolutely amazing."
      },
      "video": {
        "content": "A man in a suit is standing at a podium, gesturing with his right hand as he speaks. The background is dark, and there is a large screen behind him displaying the Google Cloud Next 25 logo. The screen also shows two sound waveforms: one blue and one green. The man appears to be giving a presentation or speech."
      }
    },
    {
      "start_time": 2324.99,
      "end_time": 2353.65,
      "audio": {
        "content": " We've seen the amazing capabilities of Vio, the ability to create incredible shots with very little prompting, the ability to have editing capabilities that are easy to use and the cinematic quality. We're gonna see you tomorrow night when Thomas does a stage dive into the Mosh pit at Allegiance Stadium for the Killers. THE KILLER THANES!"
      },
      "video": {
        "content": "A man stands behind a podium, gesturing with his hands as he speaks. He is dressed in a dark blue blazer over a white shirt with a black bow tie. The background consists of vertical blinds, and there is a laptop on the podium with a cloud logo sticker. The man appears to be giving a presentation or speech."
      }
    },
    {
      "start_time": 2353.65,
      "end_time": 2379.12,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "A man in a dark blue suit and white shirt is standing on a stage, gesturing with his right hand as he speaks. He appears to be addressing an audience, possibly at a conference or seminar. The background features vertical blinds, and there are two computer monitors on either side of him. The man occasionally turns his head to look at the audience, maintaining a confident and engaging demeanor."
      }
    },
    {
      "start_time": 2353.65,
      "end_time": 2399.32,
      "audio": {
        "content": " Welcome back to the stage, my friend and spiritual advisor, Thomas Corian. Thank you, Nanshad. I am also very excited for the concert tomorrow night. We're also bringing AI models to the physical world. Our partners like Samsung are using Gemini models for their exciting new AI companion robot, Bolly. And Google DeepMind recently introduced two new AI models for a new generation of helpful robots."
      },
      "video": {
        "content": "A man in a dark blue suit and white shirt is standing on a stage, gesturing with his right hand as he speaks. He appears to be addressing an audience, possibly at a conference or event. The background features a large screen displaying the Samsung logo and some text related to Google Cloud. The man's expression and body language suggest he is engaged in delivering a presentation or speech."
      }
    },
    {
      "start_time": 2399.32,
      "end_time": 2420.32,
      "audio": {
        "content": " Now let's talk about Vertex AI, a comprehensive AI platform. Vertex helps you discover enterprise-ready foundation models to customize, evaluate, and deploy applications built with the best foundation models and to build and manage AI agents at scale."
      },
      "video": {
        "content": "A speaker is presenting at a tech conference, standing on a stage with a large screen behind him displaying various slides. The first slide shows a robot in a lab environment with the Google DeepMind logo. The next slide introduces Vertex AI, followed by a slide listing features like Agent Builder, Model Garden, and Open Software. The speaker gestures with his hands as he explains the features."
      }
    },
    {
      "start_time": 2420.32,
      "end_time": 2443.98,
      "audio": {
        "content": " Let's hear how Intuit is making tax preparation even easier with Document AI, which is part of Vertex CI. Last year, Intuit TurboTax processed 44 million returns and $107 billion in refunds with the help of AI. Yet some customers with complex 1099 forms spaced hours of manual data entry."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then transitions to a dimly lit room where a person is sitting at a desk, wearing glasses and a yellow cap. The final scene shows a man sitting outside a house, holding a smartphone, with text overlaying the image that reads \"$107 billion in refunds.\""
      }
    },
    {
      "start_time": 2443.98,
      "end_time": 2470.97,
      "audio": {
        "content": " This year, Intuit unlocked higher quality data comprehension and auto fill with Google Cloud Document AI. This done-for-you experience simplifies tax filing for millions, freeing up time for living life. Intuit built a new way to make taxes easier. Tens of thousands of companies are building with Vertex AI and Gemini."
      },
      "video": {
        "content": "A woman is sitting at a desk, looking at her phone with a smile on her face. She is wearing a white shirt and has long brown hair. The scene then cuts to a man sitting at a desk with a laptop, smiling as he looks at something on the screen. A woman stands behind him, holding a piece of paper and smiling. The scene then cuts to a white screen with the Google Cloud logo. Finally, a man in a suit stands on a stage, speaking to an audience."
      }
    },
    {
      "start_time": 2471.55,
      "end_time": 2491.65,
      "audio": {
        "content": " Nokia built a tool to speed up application coding and development. Wayfair updates product attributes five times faster. AES, an energy company, reduces audit costs by 99% and audit time from 14 days to just one hour."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a gradient of light blue and white, with vertical lines creating a subtle pattern. The man appears to be addressing an audience, possibly at a conference or presentation."
      }
    },
    {
      "start_time": 2493.65,
      "end_time": 2512.65,
      "audio": {
        "content": " Combeance Bank is creating AI-assisted summaries of investment advisory calls. Seattle Children's Hospital makes thousands of pages of clinical guidelines instantly searchable by their pediatricians. United Wholesale Mortgage is transforming"
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, speaking to an audience. The background is a simple, light-colored curtain. The scene then transitions to a large screen displaying a presentation slide. The slide features a woman in a white shirt interacting with a child, with text that reads \"1,000+ pages of clinical guidelines instantly searchable with Gemini.\" The man continues to speak, gesturing with his hands as he presents."
      }
    },
    {
      "start_time": 2512.65,
      "end_time": 2534.95,
      "audio": {
        "content": " the mortgage experience, more than doubling underwriter productivity and Honeywell has incorporated Gemini into their product development. Honeywell and Google Cloud designed a new way to manage product life cycles. It will revolutionize how they handle millions of products."
      },
      "video": {
        "content": "A man in a suit stands on a stage, addressing an audience. The background features a large screen displaying a presentation slide with the text \"UWM 2x loan underwriter productivity with Vertex AI.\" The man gestures with his hands as he speaks, emphasizing points about the benefits of using Vertex AI for loan underwriting. The scene then transitions to a wide shot of the stage, showing the audience seated in darkness, with the screen displaying an image of an airplane flying at sunset."
      }
    },
    {
      "start_time": 2535.32,
      "end_time": 2573.65,
      "audio": {
        "content": " Built with Vertex AI, Big Query, and Gemini, this agentic framework accelerates spec and model creation, connects with their global install accelerate spec and model creation. Connects with their global install-based to uncover performance improvement insights and extends life cycles by re-engineering products. Estimated to help their engineering by re-engineering products, estimated to help their engineers deliver results up to 70% faster. With AI agents, Honeywell is introducing a new way to optimize millions of products. In just the last year, we've seen over 40 times growth in Gemini use and Vertex AI, now with billions of API calls each month."
      },
      "video": {
        "content": "The video begins with a black screen displaying the text \"millions of products\" in white and red letters. The scene transitions to a close-up of a blue square button with a white icon of a camera, surrounded by red squares. The word \"Connects\" appears below the button. The video then cuts to a white screen with the logos of Google Cloud and Honeywell side by side. Finally, a man in a suit is shown standing on a stage, gesturing with his hands as he speaks."
      }
    },
    {
      "start_time": 2573.65,
      "end_time": 2596.32,
      "audio": {
        "content": " Vertex AI gives you easy access to over 200 curated foundation models through a model gardens. We offer all of Google's models, Gemini, Vio, Imagine, and our latest research models, curated popular third party models, and open source models, all now on Vertex AI."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a large screen displaying text about curated foundational models. The scene transitions to a wide shot of the stage, showing the audience seated in rows. The camera then focuses on a screen displaying various AI models and their features."
      }
    },
    {
      "start_time": 2596.32,
      "end_time": 2618.32,
      "audio": {
        "content": " New vertex dashboards help you monitor usage, throughput, latency, after troubleshoot errors. New tuning methods help you optimize the model's performance for your applications. We are excited to announce the general availability of MetaSlamma 4 on Vertex AI."
      },
      "video": {
        "content": "A user is navigating through a software interface, likely related to machine learning or AI development tools. The interface displays various features and services available within the platform, such as Vertex AI Studio, which includes options like API availability, open source, notebook support, pipeline support, one-click deployment, deploy on GKE, and demo availability. The user interacts with different sections, such as 'Prepare Data,' 'Model Development,' and 'Deploy & Use,' showcasing functionalities like creating datasets, training models, and deploying predictions. The user also explores recent endpoints and service usage metrics, indicating ongoing operations and costs."
      }
    },
    {
      "start_time": 2618.32,
      "end_time": 2641.98,
      "audio": {
        "content": " And last week, we announced that AI's full portfolio of open models are also accessible on the Vertex AI model card. With vertex AI, you can be sure your model has access to the right information at the right time. You can connect to any data source or any vector database on any cloud."
      },
      "video": {
        "content": "A speaker is presenting on a stage at a tech conference. The background features a large screen displaying the text \"New Llama 4 on Vertex AI\" and \"Google Cloud Ai2.\" The speaker, dressed in a suit, gestures towards the screen as he explains the topic. The audience is seated in front of the stage, attentively watching the presentation."
      }
    },
    {
      "start_time": 2641.98,
      "end_time": 2664.98,
      "audio": {
        "content": " And announcing today, you can build agents directly on our existing net app storage without requiring any data duplication. You can connect to a broad range of applications, including Oracle, SAP, Service Now, and workday. And for model factuality,"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then cuts to a large screen displaying the logos of Google Cloud and NetApp. The man continues to speak, now standing with his hands clasped together at his waist. The video then returns to the initial scene, where the man resumes his gestures while speaking."
      }
    },
    {
      "start_time": 2664.98,
      "end_time": 2703.98,
      "audio": {
        "content": " we offer the most comprehensive approach to grounding on the market today. Combining grounding with Google Search, grounding with your own enterprise data, Google Maps, and third-party sources. Let's hear from Deutsche Bank CEO Christian Seweig. For over 150 years, our clients have looked to Deutsche Bank to support their lasting success and financial security."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, addressing an audience. He is positioned centrally, with his hands clasped together in front of him. The background is a simple, light-colored curtain. The scene then transitions to a large screen displaying a presentation slide about Google Maps. The text on the slide reads: \"Google Cloud Grounding with Google Maps brings fresh, factual information with 100 million updates every day.\" The screen also features a Google Maps icon. The video then cuts back to the man on stage, who continues to speak. The final shot shows a 3D animation of tall buildings against a blue sky."
      }
    },
    {
      "start_time": 2703.98,
      "end_time": 2738.32,
      "audio": {
        "content": " And they need us now more than ever as advisor and risk manager in a world marked by uncertainty and shifting geopolitics. Technology plays a key role. Our partnership with Google Cloud place a key role. Our partnership with Google Cloud enables us to take advantage of the latest tools. DBLumina is our AI-powered research agent, built on Gemini and Vertex AI. It maintains data privacy and improves our productivity, while operating in one of the most regulated industries, where trust is built in years and lost in seconds."
      },
      "video": {
        "content": "A man in a blue suit and glasses is speaking passionately in an outdoor setting with modern glass buildings in the background. The scene then transitions to an office where three individuals are gathered around a desk, engaged in a discussion. The video concludes with a graphic of a laptop displaying the word \"Brainstorming\"."
      }
    },
    {
      "start_time": 2738.32,
      "end_time": 2762.98,
      "audio": {
        "content": " A tool like DB Lumina allows us to be ahead of our competitors and provide faster, more accurate analysis of data. Recently, there was a big report in the markets which was 400 pages. We put it into DB Lumina and gave it some prompts and within seconds it gave us a three-page summary. We were able to give that to traders and our clients to help them process that information."
      },
      "video": {
        "content": "A man named Jim is sitting at a desk in a modern office setting. He is wearing a dark blue suit jacket over a white shirt and is using a laptop computer. The room has a contemporary design with light-colored walls, a large window with curtains, and some framed artwork on the wall. Jim appears to be engaged in work or study, occasionally looking up and speaking, possibly explaining something or reacting to information on the screen."
      }
    },
    {
      "start_time": 2762.98,
      "end_time": 2793.15,
      "audio": {
        "content": " Through our partnership with Google Cloud, we have seen a real breakthrough. And this is just the beginning. We see a future where Generative AI is integrated into basically every process we run, making our employees life easier while meeting the changing expectations of our clients. Thanks so much, Christian."
      },
      "video": {
        "content": "The video begins with a scene of two women working at their desks in an office environment. One woman is wearing headphones and appears to be engaged in a conversation or listening to something on her computer. The other woman is typing on her keyboard, focused on her work. The camera then shifts to show another woman sitting at a desk, also working on her computer. The scene transitions to a man walking through a modern, glass-walled corridor. He is dressed in a blue suit and white shirt, gesturing as if he is explaining something. The final frame shows a futuristic, illuminated room with the words \"The new way to cloud\" displayed prominently."
      }
    },
    {
      "start_time": 2793.15,
      "end_time": 2814.53,
      "audio": {
        "content": " We're thrilled to see how quickly you at Deutsche Bank have moved AI from pilot to production. Now let's talk about agents. Agents are intelligent systems that show reasoning, planning, memory, and the ability to use tools. They're able to think multiple steps ahead,"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a gradient of light blue to white, with vertical lines creating a modern and professional atmosphere. The man appears to be giving a presentation or speech, using hand movements to emphasize his points."
      }
    },
    {
      "start_time": 2819.82,
      "end_time": 2839.32,
      "audio": {
        "content": " use tools including working with software and systems to get something done on your behalf and under your supervision. Agents work alongside employees to drive efficiencies, to help with decision-making, and increase innovation. A great example of a company working with Google Cloud to develop agents is Salesforce."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing as he speaks to an audience. The stage is illuminated with blue lighting, creating a modern and professional atmosphere. Behind him, a large screen displays various graphics and text related to the topic of discussion. The man appears to be explaining or presenting information, using hand movements to emphasize his points."
      }
    },
    {
      "start_time": 2839.32,
      "end_time": 2861.06,
      "audio": {
        "content": " Let's hear from CEO Mark Benny Off. Salesforce and Google, two of the world's most innovative companies. We've been on an incredible journey together. And today, well, that partnership has never been stronger."
      },
      "video": {
        "content": "A man in a dark suit stands confidently on a stage, hands clasped in front of him. The background is a simple, light curtain. The scene transitions to a dark screen with the words \"Customer Innovation\" displayed prominently. The next frame shows the word \"Agentic AI\" in bold, colorful letters against a black background."
      }
    },
    {
      "start_time": 2861.06,
      "end_time": 2881.32,
      "audio": {
        "content": " Right now, we are really at the start of the biggest shift any of us have ever seen in our careers. I'll tell you, that's why we are so excited about Agent Force and our expanded partnership now with Google. I just love Gemini. I use it every single day. Whether it's Gemini inside Agent Force,"
      },
      "video": {
        "content": "A man stands in front of a large window with a panoramic view of a cityscape, including tall buildings and a body of water. He is wearing a dark blue zip-up jacket over a black shirt and blue jeans. The man gestures with his hands as he speaks, appearing to explain something. The scene then cuts to a close-up of a computer screen displaying a help page from Salesforce, with the text \"How can Agentforce help?\" visible. The video returns to the man, who continues to speak and gesture."
      }
    },
    {
      "start_time": 2881.32,
      "end_time": 2913.65,
      "audio": {
        "content": " whether it's Gemini inside agent force, whether it's all the integrations between Google and Salesforce. Together we're leading the digital labor revolution. That's the future that's gonna drive massive gains in human augmentation and productivity, efficiency, the fundamental KPIs of our business and ultimately our customer success. And we're looking forward to doing even more between Salesforce and Google. Thank you very much, Mark."
      },
      "video": {
        "content": "The video begins with a black screen displaying the word \"Agentforce\" in light blue text. The scene then transitions to a man standing in front of a large window with a panoramic view of a cityscape. The man is wearing a dark blue zip-up jacket over a black shirt and blue jeans. He has a beard and is looking directly at the camera. He appears to be speaking and gesturing with his hands as he talks."
      }
    },
    {
      "start_time": 2913.65,
      "end_time": 2937.31,
      "audio": {
        "content": " We're excited to build together and continue Thank you very much, Mark. We're excited to build together and continue this journey with you at Salesforce. You know, with Google Cloud, starting today today you can build and manage multi-agent systems with Vertex AI and our new agent development kit. You can scale the adoption of agents across your enterprise"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark blue suit jacket over a white dress shirt, with a black belt and a pocket square. He gestures with his hands as he speaks, indicating an engaging presentation. The background is a gradient of light blue to gray, with vertical lines adding depth. The scene transitions to a large screen displaying a Google Cloud logo and text about building and managing multi-agent systems, emphasizing open and comprehensive agent platforms. The man continues to speak, maintaining a professional demeanor."
      }
    },
    {
      "start_time": 2937.31,
      "end_time": 2957.47,
      "audio": {
        "content": " with our newly released Google agent space. And you can accelerate deployment with packaged AI agents that are ready for use today. You know, following the ready for use today. Following the introduction of Vertex AI agent builder last year, we're now saying today a new agent development kit."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored curtain. The scene then cuts to a large screen displaying the Google Cloud logo and text about the platform's features: \"The most open and comprehensive agent platform\". It mentions building and managing multi-agent systems, scaling adoption, and accelerating deployment. The man continues to speak, emphasizing the points on the screen."
      }
    },
    {
      "start_time": 2958.11,
      "end_time": 2978.31,
      "audio": {
        "content": " It is a new open source framework that simplifies the process of building sophisticated multi-agent systems. Now you can build sophisticated Gemini powered agents, help them use tools, do complex multi-step tasks, including reasoning or thinking."
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a conference or seminar. The stage is well-lit with a large screen displaying the text \"New Agent Development Kit\" along with the subtitle \"Open Source framework to build multi-agent systems with simplicity.\" The speaker, dressed in a dark suit and white shirt, gestures with his hands as he speaks, emphasizing points about the new development kit. The audience is seated in front of the stage, attentively watching the presentation."
      }
    },
    {
      "start_time": 2978.31,
      "end_time": 3011.65,
      "audio": {
        "content": " You can also discover other agents, learn their skills and enable agents to work together while maintaining precise control. Agent Development Kit supports the Model Context Protocol, which provides a unified way for AI models to access and interact with various data sources and tools rather than requiring custom integrations for each and every one. We're also introducing a new agent-to-agent protocol"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. He appears to be presenting at an event, possibly a conference or seminar. The background is a simple, light-colored curtain with vertical stripes. The man's gestures suggest he is explaining something important, likely related to the topic of the presentation."
      }
    },
    {
      "start_time": 3011.65,
      "end_time": 3031.81,
      "audio": {
        "content": " that allows agents to communicate with each other, regardless of the underlying model and framework they were developed with. This protocol is supported by many leading partners who share our vision to allow agents to work across the multi-agent ecosystem"
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing with his hands as he speaks. The background is a large screen displaying the text \"Agent2Agent Protocol\" and \"A collaborative way to help agents communicate with each other.\" The scene then transitions to a list of partners contributing to the Agent2Agent protocol, displayed on the screen. The man continues to speak, and the camera zooms out to show the entire stage and audience."
      }
    },
    {
      "start_time": 3031.81,
      "end_time": 3052.65,
      "audio": {
        "content": " and with agents built on other agent frameworks, including Lange graph and crew AI. Today, we're putting AI agents in the hands of every employee with Google agent space. Employees using Google agents. in space. Employees using Google Agent Space can now find"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark suit jacket over a white dress shirt, with a black belt and a pocket square. He gestures with his hands as he speaks, moving them from an open position to a more closed gesture. The background is a simple, light-colored curtain with vertical stripes."
      }
    },
    {
      "start_time": 3052.65,
      "end_time": 3074.65,
      "audio": {
        "content": " and synthesize information from within their organization, converse with AI agents, and have these agents take action on their behalf for their enterprise applications. Google Agent Space combines Google Quality Enterprise Search, conversational AI or chat, and Gemini and third-party agents."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark suit with a white shirt and a pocket square. He is gesturing with his hands as he speaks, indicating that he is likely delivering a speech or presentation. The background features vertical light panels, creating a modern and professional atmosphere."
      }
    },
    {
      "start_time": 3074.65,
      "end_time": 3096.11,
      "audio": {
        "content": " It also includes a broad set of tools, including purpose-built connectors to search and transacts with documents and databases as well as SaaS applications with advanced security and compliance to protect your data and your intellectual property. Let's take a look at Agents' Face in Action."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored backdrop with vertical lines. The scene then transitions to a large screen displaying the Google Agentspace interface, which includes options for Google-quality search, conversational AI, Gemini and 3p agents, 100+ document repositories and database systems, and security, infrastructure, and compliance controls. The man continues to speak, using hand gestures to emphasize his points."
      }
    },
    {
      "start_time": 3096.11,
      "end_time": 3117.87,
      "audio": {
        "content": " Please welcome Gabe Weiss. Thanks, Thomas. So for the next few minutes, I'm going to be a relationship manager at a bank. Starting with a quick tour, this is my homepage, authenticated and personalized just for me."
      },
      "video": {
        "content": "A man in a dark blue suit with a white shirt and a black bow tie is standing on a stage. He is gesturing with his right hand as he speaks. The background is a plain, light-colored curtain."
      }
    },
    {
      "start_time": 3117.87,
      "end_time": 3139.23,
      "audio": {
        "content": " The agent gallery lets me see my company's approved selection of purpose-built agents, including ones powered by third-party models like Lama and Claude. You see we've got some Google made agents. We have agents that my bank has made available to me, either ones we've created or ones built by partners. And then the best part, my own personal agents, which I can build directly"
      },
      "video": {
        "content": "A person is standing behind a podium on a stage, gesturing with their hands as they speak. The background features a large screen displaying a Google Agentspace interface. The screen shows various options such as 'Deep Research,' 'Idea Generation,' 'Looker,' 'Cash flow,' 'Client analytics,' 'Credit memo,' and 'Policy checker.' The person appears to be explaining or demonstrating these features."
      }
    },
    {
      "start_time": 3139.23,
      "end_time": 3160.31,
      "audio": {
        "content": " inside agent space with this button over here, or even easier just from having a little conversation. Let's see how easy it is to create an agent to automate a daily task. Now it's critical for me to stay on top of what's going on with my clients. So I start every morning with a portfolio analysis. I'm going to use a clipboard because no one wants to watch me typing."
      },
      "video": {
        "content": "A man with long hair and a green shirt is standing in front of a laptop, gesturing with his hands as he speaks. The screen then transitions to a Google Agentspace interface, displaying an 'Agent gallery' section with various customizable agents. The man appears to be explaining or demonstrating something related to these agents. The interface shows options like 'Cash flow,' 'Client analytics,' 'Credit memo,' and 'Policy checker.' The man continues to speak and gesture, likely providing instructions or highlighting features of the agents."
      }
    },
    {
      "start_time": 3160.31,
      "end_time": 3181.31,
      "audio": {
        "content": " Run an analysis of my client portfolio and identify potential risks and opportunities. This only uses information that I have permission to access. It knows which clients are mine and summarizes top points from my data sources like OneDrive, Salesforce, are done in Bradstreet's. If I have questions, I have a direct link to my sources here,"
      },
      "video": {
        "content": "A man with long hair and a green shirt is standing in front of a laptop screen. He appears to be giving a presentation or lecture. The screen displays a chat interface labeled \"Google Agentspace.\" The chat interface shows a conversation with the text \"Hello, Gabe. What can I help you with?\" followed by several questions about AI models, fraud detection, blockchain technology, and client portfolio analysis. The man gestures with his hands as he speaks, emphasizing his points."
      }
    },
    {
      "start_time": 3181.31,
      "end_time": 3205.8,
      "audio": {
        "content": " and if I need even more I have a direct link to my sources here. And if I need even more control, I can refine that list of sources. But agent space doesn't just summarize information. It's interpreting my question and surfacing what matters most. For example, in this chart, I can see Agent Spaces flag that Acme General Contracting might have some cash flow problems in the future. Already, it's given me a massive report which is going to save me a ton of"
      },
      "video": {
        "content": "A man with long hair and a beard is standing in front of a screen, gesturing with his hands as he speaks. The screen displays a document titled \"Client Risk & Opportunity Analysis.\" The document includes an executive summary that outlines the purpose of the analysis, which is to identify key trends, potential risks, and opportunities for relationship deepening and revenue growth. The analysis focuses on identifying unmet needs and proactively offering solutions tailored to each client's specific financial situation and business goals. The portfolio demonstrates a healthy mix of industries, but proactive engagement is crucial to maximizing client lifetime value and mitigating potential risks."
      }
    },
    {
      "start_time": 3205.8,
      "end_time": 3225.98,
      "audio": {
        "content": " manual research and I can go ahead and read this later, but for now let's set up an agent so I can keep an eye on acne. Agent space automatically generates an agent plan based on our conversation so far. And this is good, but I think I want more. I'm going to have it generate an audio summary and send it to my inbox"
      },
      "video": {
        "content": "A man with long hair and a beard is standing in front of a laptop screen, gesturing with his hands as he speaks. The screen displays a document titled \"Client Risk & Opportunity Analysis\" from Google Agentspace. The document outlines key trends and observations, relationship deepening opportunities, and risk mitigation recommendations. The man appears to be explaining these points, possibly during a presentation or tutorial."
      }
    },
    {
      "start_time": 3225.98,
      "end_time": 3246.1,
      "audio": {
        "content": " so I can listen to it on my morning commute. And just like that, to it on my morning commute. And just like that, I have built my own custom agent to use whenever I want without writing a single line of code. Now, Agent Space has identified a cash flow problem with Acme General contracting. I need to dig into that. Maybe this is a problem with construction in general and not specific to Acme."
      },
      "video": {
        "content": "A man with long hair and a green shirt is speaking into a microphone while standing in front of a laptop. The screen displays a Google Agentspace interface with a task titled \"Client Risk & Opportunity Analysis.\" The man appears to be explaining or presenting something related to this task. The interface shows a progress bar indicating that an answer is being generated. After a few moments, the interface updates to show a new task titled \"Client Risk & Opportunity Agent\" with a description and suggested follow-ups. The man continues to speak and gesture as he explains the task."
      }
    },
    {
      "start_time": 3246.92,
      "end_time": 3269.65,
      "audio": {
        "content": " Agent Space has already identified that possibility as a suggested follow-up. So now let's go ahead and deep dive into general contracting industry trends. This activates Google's enterprise deep research agent, which starts by telling me what it plans to research and in what order. At this point, I could edit this plan if I wanted to, but it looks pretty good, so I'll start the research."
      },
      "video": {
        "content": "A man with long hair and a green shirt is standing in front of a laptop screen, gesturing with his hands as he speaks. The screen displays a Google Agentspace interface titled \"Client Risk & Opportunity Analysis.\" The interface shows an updated \"@Client Opportunity Agent\" created based on a plan above. Below this, there are suggested follow-ups: \"Deep dive into general contracting industry trends,\" \"Review Treasury Management offerings,\" and \"Deep dive into electrical/plumbing growth.\" The interface also includes a \"Research Plan\" section with questions about the current state of the general contracting industry, key drivers and trends, challenges and opportunities, technological advancements and adoption, regulatory and legal landscape, and competitive landscape."
      }
    },
    {
      "start_time": 3269.65,
      "end_time": 3293.65,
      "audio": {
        "content": " Now I do want to call out. We've cashed the plan and the results here. Normally this would take a little bit longer. This agent is pulling in real-time information from Google Search to build its report. But even cooler, it's also searching my internal enterprise data and adjusting this plan in real time, adding additional questions based on what it's going to find along the way. And again, an incredibly insightful analysis,"
      },
      "video": {
        "content": "A man with long hair and a beard is standing in front of a laptop screen, gesturing with his hands as he speaks. The screen displays a Google Agentspace interface titled \"Client Risk & Opportunity Analysis.\" The interface includes a search bar at the bottom where questions can be typed. The man appears to be explaining or presenting information related to the US general contracting industry, discussing various aspects such as size, growth rate, health, major factors driving growth, challenges, regulations, adoption rates of BIM, AI, and drones, competitive strategies, and forecasts for future growth."
      }
    },
    {
      "start_time": 3293.65,
      "end_time": 3315.71,
      "audio": {
        "content": " including some source links, but thankfully here at the bottom, it also is going to give me a great succinct executive summary. Let's take a quick look at this. succinct executive summary. Let's take a quick look at this. Yep, I can see Acme General Contracting is likely being affected by rising material costs, supply chain disruptions, and regulatory complexities that pose significant hurdles."
      },
      "video": {
        "content": "A man with long hair and a beard is standing at a podium, speaking into a microphone. He gestures with his hands as he talks, emphasizing his points. The background is a plain, light-colored wall. On the right side of the screen, there is a Google Agentspace interface displaying information about the US general contracting industry, including key drivers, trends, and an executive summary. The text on the screen provides details about the industry's growth, challenges, and future projections."
      }
    },
    {
      "start_time": 3315.91,
      "end_time": 3336.31,
      "audio": {
        "content": " That's really great. I mean, okay, maybe not for acne. But the analysis is really great. I don't want Acme to be surprised by this at all. So I'm going to have our bank's cash flow agent do some forecasting across the next three quarters for me. This agent uses Google's new time series forecasting model, which is specifically trained for scenarios just like this."
      },
      "video": {
        "content": "A man with long hair and a green shirt is standing at a podium, gesturing with his hands as he speaks. He appears to be giving a presentation or lecture. The background is dark, and there is a screen displaying an executive summary about the US general contracting industry. The summary mentions significant growth but also challenges such as labor shortages, rising material costs, supply chain disruptions, and regulatory complexities. It also discusses technological advancements beyond BIM, AI, and drones, which are gaining traction but face adoption challenges. Regional variations in growth exist, with the South showing particularly strong performance. Future projections indicate continued growth across all sectors, although precise rates vary depending on the source and sector. Major players employ diverse competitive strategies to succeed in this competitive market. More precise market size data by sector is needed for a more complete picture."
      }
    },
    {
      "start_time": 3336.31,
      "end_time": 3366.98,
      "audio": {
        "content": " And again, I'm going to get a super clear, very clear summary with at the bottom, some great recommended steps for Acme. And I need them to see it right away so I can ask agent space, draft me an email to Acme General Contracting CEO requesting a meeting for next week. And, just like that, I've got the draft ready to go, and even better, I can send it off directly from within agent space, so I don't even have to switch to Outlook or Gmail."
      },
      "video": {
        "content": "A man with long hair and a beard is standing at a podium, speaking into a microphone. He is wearing a green polo shirt and appears to be giving a presentation. The background is a plain, dark-colored wall. On the right side of the screen, there is a chat interface labeled 'Google Agentspace' with a conversation about client risk and opportunity analysis. The text in the chat mentions calculating Acme General Contracting's cash flow and liquidity needs for the next three quarters. The conversation also includes recommendations for Acme, such as delayed client payments, negotiating with suppliers, securing a line of credit, and improving project management. The chat interface also shows a draft email being composed to Acme General Contracting's CEO, Sophia, requesting a meeting for the next week."
      }
    },
    {
      "start_time": 3366.98,
      "end_time": 3387.98,
      "audio": {
        "content": " I'm all set. An agent space has saved my session. I'm all set. An agent space has saved my session so I can prep for that meeting right where I left off whenever I'm ready. Let's go ahead and recap. While I don't actually work for the Let's go ahead and recap. While I don't actually work for a bank, the value that Agent Space adds is very real. It's so easy to interact with all of your enterprise data and tools in one place and build and use agents directly from that conversational workflow."
      },
      "video": {
        "content": "A man with long hair and a green shirt is standing in front of a screen, gesturing with his hands as he speaks. The screen displays a Google Agentspace interface with various options and prompts. The man appears to be explaining something, possibly related to business or technology, as he interacts with the interface."
      }
    },
    {
      "start_time": 3387.98,
      "end_time": 3413.65,
      "audio": {
        "content": " Powered by Gemini 2. that conversational workflow. Powered by Gemini 2.5 and Google search technology. Agent Space is the only hyper-scaler platform on the market that can connect to third-party data and tools and offers interoperability with third-party agents and models. For companies with strict regulatory needs, like a bank, agent space provides stringent access controls at the employee level and can operate within your own VPC,"
      },
      "video": {
        "content": "A man with long curly hair and a beard is standing behind a podium, speaking into a microphone. He is wearing a green polo shirt over a white t-shirt. The background consists of vertical blinds. The man gestures with his hands as he speaks, occasionally clapping them together or pointing with one finger. He appears to be engaged in delivering a speech or presentation."
      }
    },
    {
      "start_time": 3413.65,
      "end_time": 3437.31,
      "audio": {
        "content": " ensuring that your data stays yours while meeting all of your requirements. Agent Space is a game changer, and we can't wait to see how you all put it to work. Thanks. Back to the time. and we can't wait to see how you all put it to work. Thanks. Back to you, Gabe. Today we're excited to announce that Agent Space is integrated with your Chrome browser"
      },
      "video": {
        "content": "A man with long hair and a beard is standing on a stage, wearing a green polo shirt. He is speaking and gesturing with his hands, indicating that he is giving a presentation or speech. The background is a plain curtain, and there is a laptop on a stand in front of him."
      }
    },
    {
      "start_time": 3437.31,
      "end_time": 3461.31,
      "audio": {
        "content": " to allow users to search and access your enterprise data directly from the search box in Chrome. Employees can use agent space to access Google built expert AI agents, including Notebook LN, an AI-powered notetaking and research agent that allows users to upload up to 50 documents with 25 million words,"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, addressing an audience. He is positioned centrally, with his hands clasped together in front of him. The background features a large screen displaying text about Google Agentspace integrated into Chrome. The stage is well-lit, with a modern design featuring vertical light panels and a sleek, curved floor. The audience is seated in rows, attentively watching the presentation."
      }
    },
    {
      "start_time": 3461.31,
      "end_time": 3484.98,
      "audio": {
        "content": " and then query them using AI effectively turning notes and sources into a virtual research assistant. You can also use our idea generation agent, which accelerates innovation, brainstorming, and problem solving. It uses a tournament style framework to rank ideas based on employee defined criteria, refine them, and generate new ones."
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing as he speaks to an audience. The stage is well-lit, with a large screen behind him displaying various applications and tools. The screen transitions between different interfaces, including a notebook application named \"NotebookLM\" and a tool called \"Idea Generation Agent.\" The man appears to be explaining these features, likely during a presentation or conference."
      }
    },
    {
      "start_time": 3484.98,
      "end_time": 3517.31,
      "audio": {
        "content": " And our enterprise deep research agent, which Kate just showed you, researches complex topics on your behalf and provides you with findings in a comprehensive, easy-to-read report. Customers and partners around the world are already using agent space. KPMG is building Google AI into their newly formed KPMG law firm and implementing agent space to enhance their own workplace operations."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, modern design with vertical light panels. The scene transitions to a screen displaying text about blockchain technology and its implications in the fintech industry. The text includes mentions of regulatory bodies like the Basel Committee on Banking Supervision (BCBS) and the Financial Stability Board (FSB). The video then returns to the man on stage, who continues to speak and gesture."
      }
    },
    {
      "start_time": 3517.31,
      "end_time": 3539.31,
      "audio": {
        "content": " Cohesity is integrating with agent space to provide employees with greater data discovery for better decision-making while also increasing security and threat protection. Gordon Food Services is simplifying insight discovery and recommending next steps. Rubrik is leveraging agents to develop deeper"
      },
      "video": {
        "content": "A man in a dark suit stands confidently on a stage, addressing an audience. The background is a simple, modern design with vertical light panels. As he speaks, the scene transitions to a large screen behind him displaying a presentation slide. The slide features the text \"COHESITY\" and \"Enabling better decision making with Agentspace.\" It also includes images of a man working at a desk with two laptops and some documents, suggesting a professional setting. The presenter continues to speak, and the slide changes to another presentation slide titled \"Gordon FOOD SERVICE\" with the subtitle \"Transforming decision-making with Agentspace.\" This slide shows a woman in a yellow shirt and black apron using a tablet in a food service environment, surrounded by fresh produce. The presenter remains on stage throughout, engaging with the audience."
      }
    },
    {
      "start_time": 3539.31,
      "end_time": 3561.13,
      "audio": {
        "content": " customer insights and prepare for impactful sales interaction. An agent space will provide Wells Fargo Bank the unique opportunity to modernize and simplify banking. We're now going to dive deep into five categories of agents where we're already seeing tremendous business impact."
      },
      "video": {
        "content": "A professional speaker is presenting on a large stage at a conference or event. The stage features a large screen displaying text and images related to technology and business solutions. The speaker, dressed in a suit, gestures with his hands as he addresses an audience. The background includes a large screen with the Rubrik logo and the text \"Agentspace enables faster, smarter sales support.\" The scene transitions to another screen showing the Wells Fargo logo and the text \"Modernizing and simplifying banking.\" The speaker continues to present, moving around the stage and engaging with the audience."
      }
    },
    {
      "start_time": 3561.13,
      "end_time": 3585.98,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, hands clasped in front of him. The background is a simple, light-colored curtain. The scene then transitions to a wide shot of a large stage with a green and blue abstract design on the screen behind the speaker. The speaker, a woman in a green jumpsuit, walks across the stage, gesturing as she speaks. The camera focuses on her as she continues her presentation."
      }
    },
    {
      "start_time": 3561.13,
      "end_time": 3606.22,
      "audio": {
        "content": " Please welcome Lisa O'Malley. Thank you. Thanks Thomas. Let's start with customer agents. They can synthesize and reason across all types of multimodal information, including text, audio, images, and video. Communicate and engage naturally with human-like speech and dialogue. Connect to cross the enterprise applications and take actions on behalf of the user. And be used in the contact center and on the web, on devices, in stores, in cars, and more."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, hands clasped in front of him. The background is a simple, light-colored backdrop with vertical lines. The scene then transitions to a woman walking across the stage. She is wearing a green jumpsuit and glasses. The text \"Lisa O'Malley\" appears on the screen."
      }
    },
    {
      "start_time": 3607.48,
      "end_time": 3638.31,
      "audio": {
        "content": " Customer agents built with Vertex AI search are helping customers to quickly find answers and the right products using both text and images in search queries. Let's hear from Reddit chief product officer, Polly Bot. A dread At Reddit, our mission is to empower communities"
      },
      "video": {
        "content": "A woman stands on a stage, addressing an audience. She is dressed in a green outfit and wears glasses. The background features a large screen displaying the text \"Customer Agents.\" The scene transitions to a darker setting where the words \"finding answers\" appear prominently."
      }
    },
    {
      "start_time": 3638.31,
      "end_time": 3660.31,
      "audio": {
        "content": " and make their knowledge accessible to all. We've been working on this mission for nearly 20 years, which in turn has made Reddit one of the Internet's largest sources of authentic conversations. With that vast amount of conversations and perspectives, we wanted to build a unique search product that's powered with AI, but still grounded in all of the real conversations and perspectives that are available on Reddit."
      },
      "video": {
        "content": "A man named Pali Bhat, identified as the Chief Product Officer of Reddit, stands in an office environment. He is dressed in a dark blue shirt and black pants, with his hands clasped together in front of him. The office has a modern design with large glass walls, colorful chairs, and a cozy seating area. The lighting is bright, creating a professional atmosphere. The video transitions to a live chat interface where a user is asking about AI-powered search features on Reddit. The video then cuts to a black screen with the text \"AI powered search\" followed by the Reddit and Cloudinary logos."
      }
    },
    {
      "start_time": 3660.31,
      "end_time": 3681.17,
      "audio": {
        "content": " This is why we introduced Reddit Answers, a new AI-powered way to get information, recommendations, and discussions on virtually any topic. It provides powerful AI that's grounded in Redator's existing posts and conversations. So it shows you more of what real humans think versus creating unverifiable perspectives on its own."
      },
      "video": {
        "content": "A person is holding a smartphone and typing on it. The screen displays a Reddit page with various posts and comments. The user searches for information about how to avoid jet lag when traveling. The search query appears prominently on the screen, and the user scrolls through the results, revealing different tips and suggestions from other users."
      }
    },
    {
      "start_time": 3681.17,
      "end_time": 3702.98,
      "audio": {
        "content": " Red answers is different from any other generative AI product on the market. It leverages Vertex AI search to make finding the answers and perspectives people seek faster and more relevant. We've seen awesome results so far because the users who have been able to access this product and tested out really love the experience. This gets them to the heart of the conversations that they were looking for right away."
      },
      "video": {
        "content": "The video begins with a black screen featuring a colorful speech bubble icon. The icon consists of three vertical bars: a teal bar on the left, a red bar in the middle, and a yellow bar on the right. The icon rotates slightly, revealing its three-dimensional appearance against the dark background."
      }
    },
    {
      "start_time": 3702.98,
      "end_time": 3726.98,
      "audio": {
        "content": " That's the magic of Reddit answers. It combines AI with the power of Reddit. Thank you, Polly. We've also introduced Vertix AI search for healthcare and retail,"
      },
      "video": {
        "content": "A man in a green polo shirt is seen pointing at something off-screen. The scene then transitions to an office environment where a man is looking at a neon sign on the wall. The video then cuts to a logo that reads \"The new way to cloud.\" followed by a woman standing on a stage, smiling and speaking."
      }
    },
    {
      "start_time": 3726.98,
      "end_time": 3747.06,
      "audio": {
        "content": " making it super easy for doctors, nurses, and providers to rapidly search and analyze patient data, including x-rays, scans, images, and medical histories. Retailers can add product discovery to their websites, powered by Google Search. This helps them deliver hyper-relevant results"
      },
      "video": {
        "content": "A woman stands on a stage, presenting about Vertex AI Search. The background features a large screen displaying the Vertex AI Search logo and images of medical professionals using the technology. The presenter gestures towards the screen as she speaks, emphasizing the benefits and applications of the AI search system. The audience is visible in the foreground, attentively listening to her presentation."
      }
    },
    {
      "start_time": 3747.06,
      "end_time": 3769.94,
      "audio": {
        "content": " and personalized recommendations for each customer, boosting conversion rates and maximizing revenue per shopper. We're seeing huge momentum for Vertex AI search with billions of daily queries executed by our customers. For example, by our customers. For example, Lowe's is revolutionizing product discovery with Vertix AI Search to generate dynamic product recommendations"
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit with a belt and white sneakers. She is speaking and gesturing with her hands, occasionally clasping them together in front of her chest. The background features vertical light panels that create a modern and minimalist atmosphere. As she continues to speak, the camera zooms out to reveal a large screen behind her displaying a blue and white logo."
      }
    },
    {
      "start_time": 3769.94,
      "end_time": 3792.64,
      "audio": {
        "content": " and address customers complex queries. Globo created a recommendations experience inside its streaming platform that more than doubled click-through play. And let's hear how Mercado Libre is transforming how customers discover products that they love. Mercado Libre. Latin America's e-commerce"
      },
      "video": {
        "content": "A woman stands on a stage, presenting to an audience. The background features a large screen displaying images of smartphone interfaces. The first image shows a smartphone screen with a search bar and product listings from Lowe's, highlighting the company's use of Vertex AI Search for product discovery. The text on the screen reads \"Revolutionizing product discovery, powered by Vertex AI Search.\" The second image displays a smartphone screen from Globoplay, showing a list of video titles with the text \"2x more video plays.\" The woman gestures towards the screen as she speaks, emphasizing the benefits of the technology being showcased."
      }
    },
    {
      "start_time": 3792.64,
      "end_time": 3813.3,
      "audio": {
        "content": " leader has deployed a verdicts AI search across 150 million items in three pilot countries. This multimodal search technology understands deep meaning across text and images, not just keywords. It is helping their 100 million customers find the products they love faster. Already delivering millions of dollars in incremental revenue."
      },
      "video": {
        "content": "The video begins with a view of Earth from space, with the word \"e-commerce\" prominently displayed in the center. The scene then transitions to a collage of images featuring various items, including a security camera, a person wearing a tie, and a smartphone. The text \"150M items\" is displayed in large yellow letters, emphasizing the vast number of products available online. The next scene shows a close-up of a chocolate chip cookie on a yellow background, with the text \"What ingredients do I need for choc-chip cookies?\" appearing above it. The final scene captures a person opening a box, revealing a product inside."
      }
    },
    {
      "start_time": 3813.68,
      "end_time": 3833.64,
      "audio": {
        "content": " Mercado Libre is delivering a new way to shop. Google Cloud's own purpose-built customer engagement suite is transforming customer service. Grounded in your company's data, it provides out-of-the-box functionality"
      },
      "video": {
        "content": "A woman in a green jumpsuit stands on a stage, gesturing as she speaks. The background is a large screen displaying the Google Cloud logo and the Mercado Libre logo. The scene transitions to a wide shot of the stage, showing the audience and the large screen behind the speaker. The video then cuts back to the woman, who continues her presentation."
      }
    },
    {
      "start_time": 3833.64,
      "end_time": 3858.31,
      "audio": {
        "content": " to build agents and works across web, mobile, call center, in store, and with third-party telephony and CRM systems. These unique capabilities have led to rapid growth with an increase in conversational AI agent usage. DBS, a leading Asian financial services group Group is reducing customer call handling times by 20%."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit with a black belt and white sneakers. She is speaking to an audience, gesturing with her hands as she explains something. The background is a simple, light-colored curtain with vertical stripes. The scene then transitions to a large screen displaying the text \"Customer Engagement Suite with Google AI.\" The woman continues to speak, now standing in front of this screen, which is part of a larger stage setup with a dark floor and illuminated panels."
      }
    },
    {
      "start_time": 3860.31,
      "end_time": 3879.31,
      "audio": {
        "content": " Love Holidays saved 20% of their customer service cost per year. And our very own YouTube achieved a 75% reduction in calls abandoned while waiting to speak to a representative. Now, let's hear how Verizon is improving their customer experience using AI agents."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green button-up shirt and matching pants. She is wearing glasses and has short hair. The background is a simple, light-colored backdrop with vertical lines. The woman appears to be speaking, using hand gestures as she moves slightly from side to side. At one point, a large screen behind her displays a red graphic with text that reads \"YouTube 75% reduction of calls abandoned in the queue.\""
      }
    },
    {
      "start_time": 3881.31,
      "end_time": 3904.98,
      "audio": {
        "content": " Verizon is transforming how they serve over 115 million connections with Google Cloud's Customer Engagement Suite. with Google Cloud's customer engagement suite. Their personal research assistant uses AI to provide 28,000 care representatives with instant personalized information about a customer's unique needs leading to faster and more satisfying resolutions for even the most complex inquiries."
      },
      "video": {
        "content": "A large screen displays an image of the United States map, with red vertical lines on either side. A person walks across a stage in front of this screen. The scene transitions to a woman talking on her phone in a cozy living room. The text \"115 million connections\" appears on the screen. The next scene shows a group of people wearing headsets, with the text \"27,999 care representatives.\" The final frame shows a black screen with the text \"Does T.\""
      }
    },
    {
      "start_time": 3904.98,
      "end_time": 3929.64,
      "audio": {
        "content": " With Customer Engagement Suite, Verizon is elevating its service experience, reducing wait times, and delivering exceptional support at massive scale. Verizon developed a new way to personalize customer service. The business impact that Verizon experienced is nothing short of extraordinary."
      },
      "video": {
        "content": "A woman is sitting at a desk in a modern kitchen, smiling as she talks on her phone while working on her laptop. The scene transitions to a different setting where the same woman is standing in a living room, talking on her phone. The video then cuts back to the kitchen scene, showing the woman continuing her conversation. The final scene shows a woman standing on a stage, speaking to an audience."
      }
    },
    {
      "start_time": 3929.64,
      "end_time": 3953.64,
      "audio": {
        "content": " Today, we're announcing our next generation of customer engagement suite, which will include human-like voices, comprehension, and the ability to understand emotions. so agents can adapt better during the conversation. Streaming video support, so virtual agents can respond, can interpret and respond to what they see"
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit with a belt, addressing an audience. The background is a simple, dark setting with vertical light panels. The scene transitions to a large screen displaying the text \"New Customer Engagement Suite Human-like voices\" with a \"Coming Soon\" button. The screen then changes to \"New Customer Engagement Suite Understanding emotions\" with another \"Coming Soon\" button. Finally, the screen shows \"New Customer Engagement Suite Streaming video support\" with yet another \"Coming Soon\" button."
      }
    },
    {
      "start_time": 3953.64,
      "end_time": 3978.31,
      "audio": {
        "content": " in real time through customer devices. AI assistance to build custom agents in a no-code interface and the ability to use a variety of tools through API calls to interact and perform specific tasks for your application like look up products, add to cart, or checkout. An integration with data sources, CRM systems,"
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit with a black belt and white shoes. She gestures with her hands as she speaks, likely addressing an audience. The background is simple, featuring vertical light panels that create a modern and professional atmosphere. As she continues to speak, the camera shifts to reveal a large screen behind her displaying text about a new customer engagement suite. The text highlights features such as AI assistance to build agents and the ability to interact with other applications. The audience is visible in the foreground, attentively listening to the presentation."
      }
    },
    {
      "start_time": 3978.31,
      "end_time": 4002.47,
      "audio": {
        "content": " and popular business messaging platforms. Now, let's see a demo of all this cool stuff in action. Welcome my teammate, Patrick Marlow, to the stage. All right, thanks, Lisa. Hey everyone, I'm Patrick Marlow, a product manager here at Google Cloud,"
      },
      "video": {
        "content": "A woman stands on a stage, presenting information about new customer engagement suite integrations. The scene transitions to a man standing at a podium, preparing to speak. The stage is well-lit with spotlights and a large screen displaying the name \"Patrick Marlow\" and his title as a Product Manager for Applied AI."
      }
    },
    {
      "start_time": 4002.47,
      "end_time": 4022.97,
      "audio": {
        "content": " and I am stoked to be here today showcasing our next generation customer engagement suite in action. To be honest, I'm even more excited to screws up our keynote stage. I was thinking some greenery and flowers might be nice. You know, I've already made a couple of trips to the hardware store this morning, and I still forgot to pick up potting soil. Classic. forgot to pick up potting soil. Classic."
      },
      "video": {
        "content": "A man with a beard and tattoos on his arms is standing on a stage, speaking to an audience. He is wearing a dark blue t-shirt and black pants. The background is a plain, light-colored wall with vertical blinds. To his left, there is a podium with a microphone and a laptop. In front of him, there is a small wooden crate filled with colorful flowers. The man gestures with his hands as he speaks, occasionally looking at the audience."
      }
    },
    {
      "start_time": 4022.97,
      "end_time": 4043.97,
      "audio": {
        "content": " So let's see how a next-gen agent can hopefully help me get this last order, correct. We're going to start a brand new voice interaction with our agent here. Hi there, welcome to Simple Home and Garden. Is this Patrick? Hey, yeah, this is Patrick. Good morning. How are you? Good morning. How are you? Good morning to you too, Patrick."
      },
      "video": {
        "content": "A man with a beard and tattoos on his arms is standing on a stage, holding a wooden crate filled with flowers. He is wearing a dark blue t-shirt and has a microphone attached to his shirt. The background is a plain, light-colored wall with vertical blinds. The man appears to be speaking or presenting something."
      }
    },
    {
      "start_time": 4043.97,
      "end_time": 4065.97,
      "audio": {
        "content": " Doing great. Thanks for asking. It looks like you've been shopping with us recently. I see you have a few things in your card already. What can I help you with today? Yeah, so I just bought some plants and I'm trying to pick out potting soil and fertilizer, and I'm not sure if I'm getting the right stuff. Could you help me with that? I can definitely help with that, Patrick. To give you the best recommendations, could you show me the plants you bought?"
      },
      "video": {
        "content": "A customer is shopping on a gardening website during springtime, taking advantage of a 20% discount offer. The screen shows various sections such as 'Tools and accessories,' 'Plants and supplies,' 'Plant care and maintenance,' and 'Landscaping services.' The customer adds two items to their cart: Standard Potting Soil and Standard Fertilizer. The cart summary indicates a subtotal of $42.97, tax of $3.60, and a total of $46.57. The customer then proceeds to checkout."
      }
    },
    {
      "start_time": 4065.97,
      "end_time": 4087.97,
      "audio": {
        "content": " I can take a look and help you choose the best soil and fertilizer. I'm sending a video request to your browser now. Great. I can see you. Just hold the plants up to the camera so I can help identify them. Yeah, so I have about 40 of these in various different colors. Those are beautiful petunias."
      },
      "video": {
        "content": "A customer service agent is assisting a customer over a video call on a shopping app. The customer's screen shows a cart with two items: Standard Potting Soil priced at $12.99 each and Standard Fertilizer priced at $16.99 each. The total amount is $46.57. The customer is considering whether to proceed with the purchase by checking out. The customer service agent is asking if they can access the camera for better communication."
      }
    },
    {
      "start_time": 4087.97,
      "end_time": 4108.64,
      "audio": {
        "content": " They're a great choice for vibrant color. Let me take a closer look at the items in your cart. Okay, I see you have standard potting soil and general purpose fertilizer. That could work, but petunias really thrive in a rich well-draining soil, and a fertilizer specifically for flowering plants. Okay, I've found some recommendations for your petunias. Yeah, I'm taking a look at these now."
      },
      "video": {
        "content": "A man with a beard is standing behind a podium, speaking into a microphone. He appears to be giving a presentation or demonstration. The screen then transitions to a shopping cart interface where he is shown selecting items. The items in the cart include Standard Potting Soil and Standard Fertilizer. The man then navigates to a product page for Bloom Booster Potting Mix, adding it to his cart."
      }
    },
    {
      "start_time": 4108.64,
      "end_time": 4131.64,
      "audio": {
        "content": " They look fantastic, thanks. Great! Would you like me to remove the standard potting soil and general purpose fertilizer and add the Bloom Booster potting mix and flower power fertilizer to your cart? Yeah, let's go ahead and do that. Okay, I've updated your cart. Is there anything else I can help you with today? I mean, not unless you can come over Anything else I can help you with today? I mean, not unless you can come over and plant these for me."
      },
      "video": {
        "content": "A customer service agent is interacting with a customer on a mobile app. The customer service agent is asking if they should update the items in the customer's cart. The customer service agent shows two products: Bloom Booster Potting Mix and Flower Power Fertilizer. The customer service agent then asks if they should update the items in the cart."
      }
    },
    {
      "start_time": 4131.64,
      "end_time": 4153.44,
      "audio": {
        "content": " Funny, you should mention that. We actually do offer professional landscaping services. Would you be interested in a quote? Yeah, I mean how much would something like that cost. For planting 40 petunias, I would suggest a four-hour appointment which would come out to $200. Okay, that's interesting. I'm curious, do you also price match?"
      },
      "video": {
        "content": "A customer is shopping on a gardening website. The screen shows a summary of their cart, which includes two items: Bloom Booster Potting Mix and Flower Power Fertilizer. The customer has added these items to their cart twice each. The subtotal is $77.94, with a tax of $6.53, making the total $84.47. The customer is considering adding a landscaping service offering worth $200.00 to their cart."
      }
    },
    {
      "start_time": 4154.84,
      "end_time": 4182.97,
      "audio": {
        "content": " Yes, we do price match on occasion, but it's usually on a case-by-case basis. Oh, in that case, I most definitely saw an ad from one of your competitors, Greenscapes, where they were offering 50% off first-time landscaping services for new customers, you should totally give me that 50% discount. Ologies, but it looks like that amount is beyond what I'm authorized to approve. Let me check with my supervisor to see what we can do. Just a moment."
      },
      "video": {
        "content": "A customer is shopping on an online gardening store. The screen shows a shopping cart with two items: Bloom Booster Potting Mix priced at $15.99 each and Flower Power Fertilizer priced at $22.98 each. The subtotal is $77.94, with a tax of $6.53, making the total $84.47. The customer has the option to add a promo code and proceed to checkout. On the right side, there is a customer service agent offering a landscaping service for $200.00."
      }
    },
    {
      "start_time": 4182.97,
      "end_time": 4204.03,
      "audio": {
        "content": " So I'm going to step away from being the human user in this conversation for a moment, and we're going to go behind the scenes over here to our call center. And so what you're going to see here in our CRM sales force is that our virtual assistant, our virtual agent, has escalated up to a human call center manager, also played by me, and what I can do here is guide the agent through this portion of the conversation without fully taking over the"
      },
      "video": {
        "content": "A man with a beard and tattoos is standing behind a podium, speaking into a microphone. He gestures with his hands as he talks. The scene then transitions to a computer screen displaying a customer service chat interface. The chat interface shows a conversation with a customer named Patrick Marlow, who has started a chat. The chat component includes options to message, call, or view history. The background features a large screen displaying the same man on stage, with a modern, illuminated desk in front of him."
      }
    },
    {
      "start_time": 4204.03,
      "end_time": 4225.97,
      "audio": {
        "content": " conversation. So you can see here, Patrick is requesting a price match for 50% offer. And I'll say something like, you know, we can't do that, but we can do 20, oh, say we can do 20% instead. And I'm going to send this message, and then we're going to pick up the conversation"
      },
      "video": {
        "content": "A man with a beard and a bald head is sitting at a desk, looking at a computer screen. He is wearing a dark blue shirt and has tattoos on his arms. The screen shows a chat interface with messages from a virtual agent. The man types a message to the virtual agent, which reads: 'we can't do that but we can d...'. The chat interface also displays other messages and options such as 'Share', 'Call Logs', 'Text Posts', and 'Status Changes'. On the right side of the screen, there is a sidebar with related information like 'Attachments', 'Activity History', and 'Open Activities'. The background is a simple indoor setting with a curtain."
      }
    },
    {
      "start_time": 4225.97,
      "end_time": 4251,
      "audio": {
        "content": " on the other side. Good news, Patrick. My supervisor has approved a discount. We can offer you 20% off our landscaping services. I've applied a 20% discount to the landscaping services. Well, that's awesome. I'll take it. Fantastic. Let's get that scheduled. I can have someone out tomorrow from 8 a.m. to 12 p.m. or between 1 p.m. and 5 p.m."
      },
      "video": {
        "content": "A man with a beard is standing behind a podium, speaking into a microphone. He is wearing a dark blue shirt and has tattoos on his arms. The background is a dark stage with red and orange lighting. On the screen behind him, there is a chat component showing a conversation between a customer and an agent. The customer is requesting a price match with a 50% off offer for planting services. The agent responds that they can only offer a 20% discount instead. The chat component also shows the agent's name, Patrick Marlow, and the task assigned to them. The video then cuts to a screenshot of a shopping cart on a website, displaying two items: Bloom Booster Potting Mix and Flower Power Fertilizer. The total cost is $84.47. The video ends with a screenshot of a customer service agent's chat window, showing appointment times available this Thursday."
      }
    },
    {
      "start_time": 4251.76,
      "end_time": 4272.64,
      "audio": {
        "content": " Which would you prefer? Let's definitely do the afternoon time because I have a pickleball tournament in the morning. Great! You're all set. You'll receive a confirmation email in just a moment with all the details. Is there anything else I can assist you with today, Patrick? Yeah, do I actually have to pay for the landscaping services right now?"
      },
      "video": {
        "content": "A customer is shopping on an online gardening store. The screen shows a shopping cart with two items: Bloom Booster Potting Mix and Flower Power Fertilizer. The customer adds these items to their cart and proceeds to checkout. A customer service agent appears on the right side of the screen, confirming the appointment time for pickup. The customer selects the confirmation email option."
      }
    },
    {
      "start_time": 4272.64,
      "end_time": 4300.31,
      "audio": {
        "content": " No, we'll just add this to your account. And you can pay once the service is complete. Anything else? No, I'm just going to finish checking out here. You've been really wonderful today. Thanks for your help. You're very welcome. Thanks for choosing Symbol, Home, and Garden. Have a great day and good luck at your pickleball tournament. Now that was pretty amazing, right?"
      },
      "video": {
        "content": "A man with a beard and tattoos is standing behind a podium, speaking into a microphone. He is wearing a dark blue shirt and has his hands clasped together. The background is a plain, light-colored wall with some vertical lines. The man appears to be giving a presentation or speech."
      }
    },
    {
      "start_time": 4300.31,
      "end_time": 4325.97,
      "audio": {
        "content": " That entire thing was 100% real and live. All of the tools needed to build experiences just like that are available for you to start using today. Thanks, everyone, and back to you, Lisa. Pretty amazing. We're also helping to improve conversational customer experiences beyond the call center"
      },
      "video": {
        "content": "A man with a beard and tattoos on his arms is standing behind a podium, speaking to an audience. He is wearing a blue shirt and has a microphone attached to his shirt. He gestures with his right hand, pointing upwards and then moving it down to his side. The background is a plain, light-colored curtain. The scene then cuts to a woman walking across the stage. She is wearing a green outfit and has short hair. She smiles as she walks."
      }
    },
    {
      "start_time": 4325.97,
      "end_time": 4349.38,
      "audio": {
        "content": " by offering purpose-built agents that address specific industry use cases, including food ordering, automotive, and retail. For example, Wendy's AI drive-through ordering system handles 60,000 orders daily. Mercedes-Benz provides conversational search and navigation in the new CLLA series."
      },
      "video": {
        "content": "A speaker stands on a stage at an event, addressing an audience. The stage is well-lit with a large screen behind the speaker displaying text and images. The text on the screen reads \"Purpose built industry agents.\" The speaker gestures with their hands as they speak, emphasizing points about purpose-built industry agents. The audience is seated in darkness, focusing their attention on the speaker and the screen."
      }
    },
    {
      "start_time": 4357.31,
      "end_time": 4384.97,
      "audio": {
        "content": " And the Home Depot has built Magic Apron, an agent that offers expert home improvement guidance 24-7. And we have tremendous partnerships. For example, Service Now CRM works with customer engagement suite, helping to automate and personalize customer interactions across systems. Now, let's talk about creative agents that are being used to superpower creative teams, including those in media production, marketing, advertising, design, and more."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit with a black belt and white sneakers. She is speaking to an audience, gesturing with her hands as she addresses them. The background features vertical light panels that change color from blue to white. The scene transitions to a large screen displaying the logos of Google Cloud and ServiceNow, followed by another screen showing the text \"Creative Agents.\" The woman continues to speak, occasionally turning her body to face different directions."
      }
    },
    {
      "start_time": 4384.97,
      "end_time": 4409.64,
      "audio": {
        "content": " In some cases, agents are augmenting creative teams to enable content production at massive scale. In others, they're helping reimagine how stories can be told for a new generation of audiences. One of the most amazing examples is the enriching of the Wizard of Oz at the Las Vegas sphere and how VEO2 helped to bring it to life."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green shirt and pants, with her hands clasped together in front of her. She appears to be speaking or presenting, as indicated by the microphone attached to her shirt. The background is a plain, light-colored backdrop, suggesting a formal or professional setting."
      }
    },
    {
      "start_time": 4409.64,
      "end_time": 4438.31,
      "audio": {
        "content": " Let's hear from CEO of Sphere, Jim Dolan, and the visionaries who made it happen. The sphere is an experiential medium. We looked for content that would accentuate all of the different capabilities inside of the venue."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green outfit with a belt, addressing an audience. The scene transitions to a wide shot of the stage, illuminated by spotlights and featuring large screens displaying the word \"Sphere.\" The camera then focuses on a person walking through an empty auditorium, looking around at the rows of seats."
      }
    },
    {
      "start_time": 4438.31,
      "end_time": 4459.93,
      "audio": {
        "content": " That was our criteria for choosing the Wizard of Ours. We knew that it was really hard to tackle using traditional means, but it is possible to do it with AI. We ultimately came to the conclusion that Google was the only company that was actually capable of doing this. Alongside Google's deep mind researchers, we trained AI models"
      },
      "video": {
        "content": "The video begins with a wide shot of a large stadium at night, illuminated by numerous bright lights arranged in a curved pattern along the top of the stands. The atmosphere is dark, with a thick layer of fog or mist enveloping the scene, creating a dramatic and somewhat mysterious ambiance. The lights cast a warm glow on the fog, highlighting the structure of the stadium and the rows of seats. The camera remains stationary throughout this initial shot, capturing the grandeur and scale of the venue."
      }
    },
    {
      "start_time": 4459.93,
      "end_time": 4481.31,
      "audio": {
        "content": " so that when you see Dorothy dancing, you see her full body dancing, down to her ruby slippers. For Google to bring VO2, we were really excited because that infrastructure that exists and that compute power was needed to do everything we're doing. Sphere Studios has partners with Google Cloud, and we've successfully deployed"
      },
      "video": {
        "content": "The video begins with a close-up shot of a person working on a computer. The screen displays a video editing software interface, showing a scene of a forest with a dog walking through it. The person is focused on the screen, likely adjusting settings or trimming footage. The camera then transitions to a wide shot of a stage set up in a forest-like environment. The stage features a large, animated character in a pink dress, surrounded by other characters and vibrant decorations. The scene is lively, with people operating control panels and monitors, indicating a live performance or event being managed from behind the scenes. The video then cuts to a woman speaking in an interview setting. She is gesturing with her hands as she talks, suggesting she is explaining something important. Finally, the video transitions to an animated sequence. Three characters are walking down a dark, futuristic hallway with glowing blue walls. The hallway has a metallic, industrial design, and the characters appear to be exploring or moving towards a destination."
      }
    },
    {
      "start_time": 4481.31,
      "end_time": 4502.03,
      "audio": {
        "content": " finishing technology in Google's cloud infrastructure, turnaround times, transmission times, all of the things that sometimes can slow a traditional studio down. Now we're able to do them a lot faster and with a lot more impact using Google Cloud. My hope for it is that we keep exploring different ways to create this kind of content"
      },
      "video": {
        "content": "The video begins with a view of a control room filled with multiple screens displaying various scenes. The screens are arranged in a grid pattern, showing different angles and perspectives of an event or performance. The lighting is dim, with blue and yellow lights illuminating the screens, creating a professional and high-tech atmosphere.\n\nThe scene then transitions to a woman standing in front of a backdrop with the word \"transmission\" prominently displayed. She appears to be speaking or presenting, possibly discussing the technical aspects of the event being shown on the screens behind her. Her expression and body language suggest she is engaged and informative.\n\nNext, the video shifts to a woman working in a server room. She is wearing a white shirt and is focused on handling equipment, likely managing or troubleshooting network infrastructure. The server racks are visible in the background, indicating a high-tech environment.\n\nFinally, the video shows a man sitting in a modern office space. He is dressed in a dark suit and is gesturing with his hands as he speaks. The office has large windows that let in natural light, and there are plants and decorative elements in the background, creating a professional and well-lit environment."
      }
    },
    {
      "start_time": 4502.03,
      "end_time": 4526.97,
      "audio": {
        "content": " and to take great performances from the past and bring them to life today, I think the world is going to be amazed. Beyond entertainment, AI is helping creative agencies revolutionize marketing for their clients."
      },
      "video": null
    },
    {
      "start_time": 4526.97,
      "end_time": 4550.14,
      "audio": {
        "content": " WPP built open as a platform powered by Google models that all of its employees worldwide can use to concept, produce, and measure campaigns. Monks.Flo is using Google AI to help localize creative for campaigns. And the BrandTech Group built Pencil, a generative AI platform for brands to create ads,"
      },
      "video": {
        "content": "A woman stands on a stage in front of a large screen displaying various slides. The screen shows text prompts and images related to AI and technology. The woman gestures as she speaks, likely explaining the content displayed on the screen. The audience is seated in front of her, attentively watching the presentation."
      }
    },
    {
      "start_time": 4550.14,
      "end_time": 4570.76,
      "audio": {
        "content": " like this recent mock-up for Japan Airlines. Customers are increasing marketing performance and reducing production time with creative agents. Mondalise quickly generates visuals for global brands like Oreo and Cadbury. Bloomberg Connects is making museums more accessible. And we're absolutely thrilled"
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a green jumpsuit and white shoes, addressing an audience. She gestures with her hands as she speaks, occasionally looking at a large screen behind her. The screen displays text about creating ads, predicting performance, and optimizing campaigns. The stage is well-lit, with vertical light panels on either side, and the audience is visible in the background."
      }
    },
    {
      "start_time": 4570.76,
      "end_time": 4598.3,
      "audio": {
        "content": " to partner with Adobe, the leader in creativity, to bring our advanced Imagine 3 and V-O-2 models to applications like Adobe Express. Now, please welcome Brad Calder to the stage to talk about data agents. I'm Thanks, Lisa."
      },
      "video": {
        "content": "A woman stands on a stage, addressing an audience. The background features a large screen displaying various slides related to Bloomberg Connects and Adobe products. The first slide shows two people looking at paintings, with the text \"Adapting arts & culture content to reach wider audiences.\" The second slide highlights Adobe's \"Adobe Express\" feature, emphasizing its ability to give creators and enterprises more ways to ideate. The woman continues her presentation, occasionally gesturing with her hands. Another person walks across the stage, adding movement to the scene."
      }
    },
    {
      "start_time": 4598.3,
      "end_time": 4620.3,
      "audio": {
        "content": " Data agents know what data to utilize and what questions to ask. They enable data teams to effectively manage data and business teams to activate it. Mattel is an iconic brand, making toys from Barbies to Hot Wheels. Let's hear from Mattel's CEO, Enon Crys,"
      },
      "video": {
        "content": "A man stands on a stage, addressing an audience. He is dressed in a blue long-sleeve shirt and dark pants, with a microphone clipped to his shirt. The background features a large screen displaying the name \"Brad Calder\" and the title \"VP & GM, Google Cloud.\" The screen also shows the Google Cloud logo and the Mattel logo. The stage is well-lit, and the audience is seated in front of him."
      }
    },
    {
      "start_time": 4620.3,
      "end_time": 4645.97,
      "audio": {
        "content": ""
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. The background is a simple, light-colored backdrop with vertical lines. The scene then transitions to a dark room with red lights forming a circular pattern on the wall. The camera pans around the room, revealing a futuristic, illuminated stage with a sleek design. The lighting shifts to a vibrant pink hue, and various colorful objects float in the air, creating a dynamic and visually engaging scene. The video concludes with a shot of a sign that reads \"Mattel\" in bold letters."
      }
    },
    {
      "start_time": 4620.3,
      "end_time": 4670.24,
      "audio": {
        "content": " on how they harness their data with Gemini. Matt Mattel. Our mission is to create innovative products and experiences that inspire fans and entertain audiences and develop children through play. This year, we're celebrating 80 years, and while we first made our mark as a toy company, Mattel today is a global brand management company home to one of the most iconic portfolios in the world. Our partnership with Google Cloud has helped us synthesize millions of points of consumer feedback. From phone calls and emails to online, and social media comments,"
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. The background is a simple, dark setting with vertical light strips. The scene transitions to a close-up of a screen displaying the word \"Gemini\" in blue letters, accompanied by a star symbol. The video then cuts to a different setting where the same man is seen in an office environment, wearing a white shirt and a dark jacket. Finally, the video shows a colorful display of various toy brand logos, including Mattel, Hot Wheels, Fisher-Price, Barbie, American Girl, and others."
      }
    },
    {
      "start_time": 4670.24,
      "end_time": 4690.82,
      "audio": {
        "content": " delivering insights and opportunities to deepen our relationship with Mattel fans. Before partnering with Google Cloud, teams identified patterns manually. Now we can analyze sentiment and consumer preferences in real time. We can instantly identify key issues and trends improving both efficiency"
      },
      "video": {
        "content": "The video begins with a montage of social media posts featuring reviews and testimonials about a product. The posts are displayed on a dark background, with various user avatars and comments visible. A prominent red sticker with the word \"MATTEL\" is superimposed over the center of the screen, drawing attention to the brand. The scene then transitions to a man standing in an office environment. He is dressed in a white shirt and a dark jacket, and he appears to be speaking or presenting. The background includes a large, colorful sign that reads \"Barbie.\" The man's expression and body language suggest he is engaged in delivering a message or presentation."
      }
    },
    {
      "start_time": 4690.82,
      "end_time": 4713.28,
      "audio": {
        "content": " and innovation. For example, we improved the right mechanism in the Barbie Dreamhouse elevator and enhanced the interactive features in the Fisher-Price Kick-and and Play Piano Gym. These are two of our top-selling global products that have been made even better through data-driven insights. We see Google Cloud as a true partner"
      },
      "video": {
        "content": "The video begins with a series of customer reviews displayed on a screen. The reviews are from different users, each providing feedback on various aspects of a product. The reviews include comments such as \"The elevator ride could be smoother?\" and \"Nice learning toy for all ages.\" The reviews are accompanied by star ratings, indicating the user's satisfaction level with the product."
      }
    },
    {
      "start_time": 4713.28,
      "end_time": 4734.64,
      "audio": {
        "content": " in bringing the magic of play to life for every Mattel fan. Thanks, Enon. My kids love playing with Fisher-Price gyms. Our data platform, BigQuery, has five times more customers"
      },
      "video": {
        "content": "A man is standing in an office environment, speaking to the camera. He is wearing a white shirt under a dark blue sweater. The background features a modern office setting with large windows and a blurred view of other people and equipment."
      }
    },
    {
      "start_time": 4734.64,
      "end_time": 4756.3,
      "audio": {
        "content": " than the two leading independent data cloud companies. With BigQuery, you can activate all your data for AI, combining structured and unstructured data, such as tables, text, logs, images, and video. You can also work with open formats like Apache Iceberg directly integrated into BigQuery."
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a conference or seminar. The stage features a large screen displaying various slides related to data platforms and analytics. The speaker, dressed in a blue shirt and dark pants, gestures with his hands as he explains the content on the screen. The background includes a series of vertical light panels that change colors, adding a dynamic visual element to the presentation."
      }
    },
    {
      "start_time": 4756.3,
      "end_time": 4784.69,
      "audio": {
        "content": " And you can use BigQuery to access data in any storage system or in any SaaS application on any cloud. And multimodal analysis with Gemini and BigQuery has grown more than 16 times this past year. And now, if you're a big Oracle customer, the full range of Oracle database services running on OCI are integrated with BigQuery, Gemini, and Vertex AI."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue long-sleeve shirt and dark pants. He is speaking and gesturing with his hands, indicating he is explaining something. The background consists of vertical light panels that change color from blue to white as the video progresses. The lighting on the stage is bright, focusing on the speaker."
      }
    },
    {
      "start_time": 4784.69,
      "end_time": 4806.97,
      "audio": {
        "content": " They're being deployed natively in 20 Google Cloud locations serving customers such as Macy's and Sabre. And today we're very excited to announce specialized agents for every member of your data team. Now, for data engineering teams, we deliver agents for all aspects of the data engineering life cycle,"
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. He is wearing a blue long-sleeve shirt and dark pants. The background features vertical stripes, and the lighting is focused on him, creating a professional and engaging atmosphere. The scene then transitions to a wide shot of the stage, showing the audience seated in darkness. A large screen behind the speaker displays the text \"New Data Agents.\" The speaker continues to address the audience, maintaining eye contact and using hand gestures to emphasize points."
      }
    },
    {
      "start_time": 4806.97,
      "end_time": 4827.97,
      "audio": {
        "content": " from catalog automation to metadata generation, to maintaining data quality to data pipeline generation. And for data science teams, our AI agent acts as a comprehensive coding partner in your data science notebook, accelerating every step of your workflow from data loading and feature engineering"
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. He is dressed in a blue long-sleeve shirt and dark pants, with a microphone clipped to his shirt. The background features vertical light panels that create a modern and professional atmosphere. As he continues speaking, the camera shifts focus to a large screen behind him, displaying the text \"New Data Agents Data Science\" with a \"Preview\" button below it. The man then resumes his speech, maintaining a confident and engaging demeanor."
      }
    },
    {
      "start_time": 4827.97,
      "end_time": 4849.72,
      "audio": {
        "content": " to predictive modeling. And for data analysts and business users, our conversational analytics agent performs powerful, trustworthy analysis entirely in natural language. And you can also embed this agent in line in your own web or mobile application."
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. He is dressed in a blue long-sleeve shirt and dark pants, with a microphone clipped to his shirt. The background is a simple, light-colored curtain with vertical lines. The scene transitions to a large screen displaying the text \"New Data Agents Data Analysis\" with a \"Preview\" button below it. The man continues to speak, occasionally pointing towards the screen."
      }
    },
    {
      "start_time": 4852.02,
      "end_time": 4870.64,
      "audio": {
        "content": " Now, for over a decade, Spotify has partnered with Google Cloud to cost effectively handle massive scale. They use BigQuery to harness enormous amounts of data to deliver personalized experiences to over 675 million users worldwide, including many of us here."
      },
      "video": {
        "content": "A speaker is presenting on a stage at an event, likely a conference or seminar. The background features a large screen displaying information about Spotify, highlighting personalized experiences for 675 million users enjoying music, audio, podcasts, and audiobooks. The screen also shows images of smartphones with Spotify interfaces. The speaker is dressed in a blue shirt and dark pants, gesturing with his hands as he speaks. The audience is seated in front of the stage, attentively watching the presentation."
      }
    },
    {
      "start_time": 4870.64,
      "end_time": 4891.5,
      "audio": {
        "content": " Unilever uses BigQuery to reach millions of retailers in emerging markets. Buyer built an agent that predicts flu trends. Now, customers are also taking advantage of our databases with AI. For example, Nero, an autonomous driving company"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue polo shirt and gray pants, with his hands clasped together in front of him. He appears to be speaking or presenting, as he gestures occasionally with his hands. The background is a simple, dark curtain with vertical light strips, creating a professional and focused atmosphere."
      }
    },
    {
      "start_time": 4891.5,
      "end_time": 4913.64,
      "audio": {
        "content": " uses Allo ADB to identify challenging scenarios on the road. And public sector organizations like State of Nevada are using agents to speed up benefit claims. Let's find out more. The Nevada Department of Employment, Training, and Rehabilitation provides critical unemployment and job placement services."
      },
      "video": {
        "content": "A man stands on a stage, gesturing with his hands as he speaks. The background is a large screen displaying an image of a self-driving car driving on a road lined with trees. The text on the screen reads \"nuro Identifying challenging scenarios on the road with vector search in AlloyDB.\" The scene then transitions to a wide shot of the Las Vegas Strip, with the word \"Nevada\" prominently displayed in the foreground."
      }
    },
    {
      "start_time": 4913.64,
      "end_time": 4945.73,
      "audio": {
        "content": " To support limited staff in a regulated space, Nevada Dieter developed an appeals AI assistant, powered by BigQuery and Vertex AI. It synthesizes case data to help appeals referees make fair approvals four times faster, surpassing DOJ standards. Nevada Department of Employment, Training, and Rehabilitation is creating a new way to serve constituents. And now, let's see all of this in action."
      },
      "video": {
        "content": "The video begins with a scene showing a man entering an office where three women are seated at a table. The text \"unemployment\" is displayed prominently on the screen. The scene then transitions to a black screen with the text \"Appeals AI Assistant\" and the BigQuery logo. Next, the video shows two men shaking hands in an office setting, with the Nevada Department of Employment, Training, and Rehabilitation logo visible. The final scene features the Google Cloud and DETR logos side by side."
      }
    },
    {
      "start_time": 4945.73,
      "end_time": 4970.64,
      "audio": {
        "content": " Please welcome Yasmin Ahmad. Thank you, Brad. I'm here to show you the future of data signs made easy. All it takes is BigQuery, co-lab, and Vertex AI, now powered with Gemini."
      },
      "video": {
        "content": "A man is standing on a stage, gesturing with his hands as he speaks. He is wearing a blue long-sleeve shirt and dark pants. The background is a plain, light-colored curtain. The scene then transitions to a woman walking down a hallway. She is wearing a black outfit with a maroon headscarf and a white shirt underneath. The hallway has a modern design with a large Google Cloud logo on the wall behind her. The woman continues walking towards a stage where she stands at a podium. The stage is well-lit with spotlights and has a sleek, futuristic design. The woman appears to be preparing to speak, holding a laptop in front of her."
      }
    },
    {
      "start_time": 4970.64,
      "end_time": 4991.64,
      "audio": {
        "content": " So say you're running a consumer goods company. Sales are booming, but cash flow is slowing down. Why? Well, to answer this question, we need to see everything, from sales to invoices to customer signal. So let's take a look at our data. Now, traditionally, data is siloed,"
      },
      "video": {
        "content": "A woman stands behind a podium on a stage, presenting financial data. The screen behind her displays a graph titled \"Revenue & Net Cash Flow\" for the last 12 months, showing a steady increase in revenue and net cash flow. Key metrics include cash on hand at $345.6M, accounts payable at $14.2M (89% of target), and an inventory level of 24 days. The presentation also includes a sales conversion rate funnel, cash flow by category, and territory performance map."
      }
    },
    {
      "start_time": 4991.64,
      "end_time": 5014.64,
      "audio": {
        "content": " but now BigQuery helps me connect everything, including SAP deep integration and real-time feeds from Salesforce and even Google Ads. See? Easy. Let's ask our data engineering agent to now do the heavy lift. I'm going to add my first prompt from the clipboard."
      },
      "video": {
        "content": "A woman wearing a hijab is standing in front of a screen displaying a Google Cloud interface. She appears to be presenting or explaining something related to data engineering. The interface shows various tabs such as 'Orders, Sales & Advertising,' 'Finance,' and 'Data Engineering Agent.' The woman gestures with her hands while speaking, indicating she is engaged in a discussion about data-related topics."
      }
    },
    {
      "start_time": 5014.64,
      "end_time": 5041.97,
      "audio": {
        "content": " And to do this cash flow analysis, we're going to combine invoices, sales, and audience data from all of these sources into a single multimodal data table. Instantly, we have a unified view of everything. No complex integrations and no waiting. Easy. Of course, our new table is a little messy. Look at those dates. Well, it's a good thing now"
      },
      "video": {
        "content": "A woman wearing a red hijab is standing at a podium, speaking into a microphone. She gestures with her hands as she talks. The background is a plain, dark-colored wall. On the right side of the screen, there is a computer interface displaying a data engineering project. The interface shows three queries: \"Inventory & Payables (SAP),\" \"Sales Orders & Payments (Salesforce),\" and \"Campaign & Conversions (Google Ads).\" A message from a \"Data Engineering Agent\" appears, asking to pull together data for a cash flow analysis. The agent then creates a pipeline that joins three tables and stores the results in a new table called \"invoice_orders_ads.\" The interface also includes options to apply transformations and extract street, city, state, and zip code from an address."
      }
    },
    {
      "start_time": 5041.97,
      "end_time": 5062.87,
      "audio": {
        "content": " with BigQuery, we have access to Gemini-powered recommendations. And look at that. A clean data set. Super easy. Let's now move to BigQuery Data Canvas to do some analysis. Here we can see our structured data is ready to go."
      },
      "video": {
        "content": "A person wearing a red hijab is standing behind a podium, speaking into a microphone. The background is dark, and there is a small cloud logo visible on the podium. The person appears to be giving a presentation or speech."
      }
    },
    {
      "start_time": 5063.67,
      "end_time": 5082.97,
      "audio": {
        "content": " However, to do a true cash flow analysis, I need to extract signals from my PDF invoices. Not so easy. But here, I can ask our data science agent to help me extract buyer and payment information and group buyers into segments."
      },
      "video": {
        "content": "A woman wearing a maroon hijab is standing behind a podium, speaking into a microphone. She appears to be presenting or giving a speech. The background is dark, and there is a small logo on the podium. The woman's hands are occasionally gesturing as she speaks."
      }
    },
    {
      "start_time": 5082.97,
      "end_time": 5106.97,
      "audio": {
        "content": " In the past, this would have taken hours of manual effort. Definitely not easy. But now with BigQuery's new AI query engine, without having to review each PDF, I can automatically extract key information and we can group buyers into segments using Gemini's real-world knowledge."
      },
      "video": {
        "content": "A woman wearing a maroon hijab and a black blazer is standing in front of a laptop screen. She appears to be presenting or explaining something related to data analysis. The screen shows a Google Cloud interface with SQL queries and a visualization of buyer categories. The woman gestures with her hands as she speaks, indicating she is engaged in the presentation."
      }
    },
    {
      "start_time": 5106.97,
      "end_time": 5128.63,
      "audio": {
        "content": " Again, super easy. So what exactly is causing our cash flow drop from January to March. Here, our data science agent uses Gemini's new thinking model and BigQuery machine learning to build an automated data science workflow right before our eyes."
      },
      "video": {
        "content": "A woman wearing a maroon hijab is standing at a podium, speaking into a microphone. She gestures with her hands as she talks. The background is a plain, dark-colored wall. On the right side of the screen, there is a Google Cloud interface displaying a bar chart titled \"Buyer Category.\" The chart shows five categories: Retailer, Wholesale, Foodservice, Hospitality, and Other. The chart indicates that Retailers are the most common category, with 78,340 products, while NGO is the least common, with only 4,520 products. Below the chart, there is a text box labeled \"Data Science Agent\" that provides additional information about the data visualization."
      }
    },
    {
      "start_time": 5128.63,
      "end_time": 5151.63,
      "audio": {
        "content": " It's analyzing hundreds of dimensions in mere seconds. And it looks like we have our answer. Payment terms. It looks like our new 36-month payment promotion offer while boosting sales has caused the recent cash flow dip. Hmm. I wonder how this is impacting my cash forecast."
      },
      "video": {
        "content": "A woman wearing a maroon hijab and a black blazer is standing at a podium, speaking into a microphone. She gestures with her hands as she explains something. The background is blurred, but it appears to be an indoor setting, possibly a conference room or a studio. The video then cuts to a computer screen displaying a Google Cloud interface. The screen shows a SQL query being run, with results displayed in a table format. The query analyzes data related to sales and advertising, showing contributors, metric tests, control, differences, and relative differences. Insights are provided, highlighting that Payment Terms of 36 Months is the biggest contributor with a 32% influence on cash flow, followed by Buyer City Depot with 17% influence. Logistics United has an 8% influence, and buyers in Wholesale and Retail categories are the highest contributing buyers to cash flow. Another insight states that 36 month payment terms and City Depot are the second largest contributors for January to March."
      }
    },
    {
      "start_time": 5151.63,
      "end_time": 5173.75,
      "audio": {
        "content": " To do this, I want to jump to code, so I'm going to extract my analysis here into a big query notebook. And in our notebook, we can ask our data science agent to again help us write some code. So here, we'll ask our agent to build a forecast for the next three months broken down"
      },
      "video": {
        "content": "A woman wearing a hijab is standing at a podium, speaking into a microphone. She appears to be presenting or giving a speech. The background is blurred, focusing on her as she gestures with her hands while speaking."
      }
    },
    {
      "start_time": 5173.75,
      "end_time": 5196.3,
      "audio": {
        "content": " by buyer category. BigQuery now uses Google's new pre-trained time series forecasting model to build this out. And we can reveal the big insight. It looks like wholesalers taking those long 36-month payment terms, are causing the issue. And finding this answer?"
      },
      "video": {
        "content": "A person is standing at a podium, speaking into a microphone. The background is a plain, dark-colored wall. The person is wearing a black outfit with a maroon scarf. On the right side of the screen, there is a computer interface displaying code and a line chart titled 'Cash Flow Forecast by Buyer Category.' The code includes imports and functions related to forecasting cash flow. The line chart shows various lines representing different buyer categories (Wholesale, Foodservice, Retailer, Hospitality, Other) over time, indicating percentage growth. A message from the Data Science Agent states that the forecast predicts wholesalers will have the largest cash flow change from April to June."
      }
    },
    {
      "start_time": 5196.3,
      "end_time": 5221.3,
      "audio": {
        "content": " Easy. We can make this analysis even more powerful. We can ask our data science agent to include product category as a breakdown. Big Cree Colab Composer instantly gets to work updating all of our code. And as we see forecasts, now including buyer segment and product category, we can answer"
      },
      "video": {
        "content": "A woman wearing a maroon hijab is standing at a podium, speaking into a microphone. She appears to be presenting or explaining something. The background is dark, and there is a screen displaying a line chart titled 'Cash Flow Forecast by Buyer Category.' The chart shows various lines representing different buyer categories, such as Wholesale, Foodservice, Retailer, Hospitality, and Other, with data points over time. The woman gestures with her hands while speaking, emphasizing her points. The video also includes a text box labeled 'Data Science Agent' that provides information about the forecasted cash flow changes for different buyer categories."
      }
    },
    {
      "start_time": 5221.3,
      "end_time": 5260.3,
      "audio": {
        "content": " a huge range of potential questions. So easy. In fact, it looks like our 36-month terms are impacting fast-moving segments like food and beverage, and not other segments, for example, medication. This insight allows for surgical precision. Instead of a blunt action like removing the 36-month promo offer entirely, we can make a data-driven, targeted, easy decision. Pretty incredible, right? This whole process used to take months of manual work, but today it took just a few minutes."
      },
      "video": {
        "content": "A woman wearing a maroon hijab is standing in front of a laptop screen, which displays a code editor with Python code. The code is being executed, and the resulting heatmap of cash flow growth forecast is shown on the screen. The woman appears to be explaining the code and the results."
      }
    },
    {
      "start_time": 5260.3,
      "end_time": 5286.97,
      "audio": {
        "content": " Gemini and Vertix AI have made BigQuery a complete data science platform, unlocking new insights faster than ever with both natural language and code. And that, my friends, is the future of data science made? Say it with me. Easy. Back to you, Brad. Thanks, Yasmin."
      },
      "video": {
        "content": "A woman wearing a maroon hijab and a black blazer stands behind a podium, presenting data on a large screen behind her. The screen displays a heatmap titled \"Heatmap of Cash Flow Growth Forecast.\" The heatmap categorizes different retailers and their respective cash flow growth percentages across various product categories. The woman gestures with her hands as she speaks, emphasizing points about the data. The background is a simple, dark curtain, and the lighting focuses on the presenter and the screen."
      }
    },
    {
      "start_time": 5286.97,
      "end_time": 5311.63,
      "audio": {
        "content": " That was amazing. Now, just like with data, Gemini's fast performance, large context window, and reasoning make it highly effective for coding agents. We offer Gemini Code Assist in Google Cloud, Android Studio, Firebase Studio, and your favorite IDEE. Our enterprise version understands your code-based standards and conventions."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue long-sleeve shirt and dark pants, with a microphone clipped to his shirt. He gestures with his hands as he speaks, moving them from his chest outward and then back together. The background is a simple, light-colored curtain. The text \"Brad Calder\" appears in orange letters on the left side of the screen."
      }
    },
    {
      "start_time": 5311.63,
      "end_time": 5336.9,
      "audio": {
        "content": " And companies like Amp here from No Group, Broadcom, CME group, PayPal, and LibPro use codicists today. And today, we're announcing new code assist agents to help with everything from modernizing code to helping with the full software development lifecycle. Developers can interact with our agents on the con bond board, which"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a blue shirt and dark pants, gesturing with his hands as he speaks. The background is a simple, modern design with vertical light panels. The scene transitions to a large screen displaying logos of companies like Renault, Broadcom, CME Group, PayPal, and Wipro. The screen then shows a new feature called \"Gemini Code Assist Agents\" with a subtitle indicating assistance across the development life cycle. The man continues to speak, and the audience is visible in the foreground."
      }
    },
    {
      "start_time": 5336.9,
      "end_time": 5360.3,
      "audio": {
        "content": " provides a real-time display of the task codices working on, as well as the ability for developers to interact with our agents. Kodysis also has integrations with dozens of partners, such as Atlantean, Sentry, Sneak, and many more coming soon. Now, outside of Google, Gemini is also available"
      },
      "video": {
        "content": "A speaker stands on a stage, presenting to an audience. The background features a large screen displaying a Kanban board with various tasks labeled as 'Action Needed,' 'Running,' and 'Completed.' The speaker gestures towards the screen while explaining the process. The screen transitions to show a list of companies under the heading 'Gemini Code Assist Partner Ecosystem,' highlighting their logos and names."
      }
    },
    {
      "start_time": 5360.3,
      "end_time": 5393.63,
      "audio": {
        "content": " for your development needs in ATER, Cursor, GitHub Copilot, Replit, Replit, TapLine, and WindSurf. If you want to see more, join me tomorrow at the developer keynote to see Code Assistant Action. Now, to share what's new and security Code Assistant Action. Now, to share what's new in security, please welcome Sandra Joyce. Thank you. Thank you. Thank you. Thank you. Thanks, Brad."
      },
      "video": {
        "content": "A man stands on a stage, addressing an audience. He is dressed in a blue shirt and dark pants, with his hands clasped together in front of him. The stage features a large, illuminated screen displaying the text \"Developer Keynote\" along with the date and time: \"Thursday, April 10 • 2:30pm-3:45pm.\" The background is dark, with blue lighting accents and a few spotlights illuminating the speaker. The atmosphere suggests a formal presentation or conference setting."
      }
    },
    {
      "start_time": 5393.63,
      "end_time": 5418.94,
      "audio": {
        "content": " Security agents can dramatically increase in the speed and effectiveness of security analysts. The integration of AI across our security products is just one reason why organizations around the world are making Google part of their security team. We offer critical cyber defense capabilities in today's challenging threat environment, such as threat intelligence drawn from"
      },
      "video": {
        "content": "A woman stands on a stage, delivering a presentation. She is dressed in a bright pink blazer over a white blouse and blue jeans. Her hair is long and dark, and she wears a microphone attached to her blazer. The background features a large screen displaying her name, "
      }
    },
    {
      "start_time": 5418.94,
      "end_time": 5440.3,
      "audio": {
        "content": " Mandient investigations, Google operations, virus total, and more. So you know who's targeting you and where you're exposed. A comprehensive where you're exposed. A comprehensive security operations platform that applies our intelligence for proactive threat detection, investigation and response, anywhere you operate."
      },
      "video": {
        "content": "A speaker stands on a stage in front of an audience, presenting information about Google Threat Intelligence. The background features a large screen displaying a diagram with concentric circles labeled \"Google Threat Intelligence.\" The speaker gestures towards the screen as they explain the concept. The audience is seated, attentively listening to the presentation."
      }
    },
    {
      "start_time": 5440.3,
      "end_time": 5465.97,
      "audio": {
        "content": " Cloud security and Risk Management that uses virtual red teaming to find risks that other solutions can't, protecting your workloads and AI across all your clouds, and Mandian services that provide expertise before, during, and after security incidents. Today we're introducing new security agents that analyze malware and triage alerts to speed up investigations."
      },
      "video": {
        "content": "A woman stands on a stage in front of a large screen displaying various logos and text related to Google Security Operations and Google Cloud Security Command Center. She is dressed in a pink blazer, white shirt, and blue jeans, holding a microphone in her right hand. The background features a dark auditorium with rows of seats filled with an audience. The lighting focuses on the speaker, creating a professional and engaging atmosphere."
      }
    },
    {
      "start_time": 5465.97,
      "end_time": 5492.63,
      "audio": {
        "content": " And our capabilities have been adopted by thousands of organizations, like Charles Schwab, who uses Google SecOps to stay proactive in responding to cyber threats. They've gained new visibility across their entire environment while reducing investigation and resolution time. Averted, who are detecting more events and closing investigations faster with Google SecOps."
      },
      "video": {
        "content": "A woman stands on a stage, presenting information about new security agents. The screen behind her displays options such as \"Malware Analysis\" and \"Alert Triage.\" She gestures with her hands as she speaks, emphasizing the features being discussed. The presentation includes a slide that reads \"Proactively responding to threats with Google SecOps.\" The stage is well-lit, with a modern design featuring vertical light panels and a large screen displaying the presentation slides."
      }
    },
    {
      "start_time": 5492.63,
      "end_time": 5513.75,
      "audio": {
        "content": " Dunn and Bradstreet, who are using Security Command Center to centralize monitoring of AI security threats, and vote a phone. AI security threats. And Vodafone, who used Vertex AI, along with open source tools and Google Cloud security foundation to establish an AI security governance layer."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a bright pink blazer over a white blouse and blue jeans, paired with beige shoes. She gestures with her hands as she speaks, likely delivering a presentation. The background is simple, featuring vertical light panels that create a modern and professional atmosphere. As she continues to speak, the scene transitions to a large screen behind her, displaying a slide from Dun & Bradstreet's Security Command Center. The slide shows a woman in glasses using a tablet, with the text \"Centralized monitoring of AI security threats using Security Command Center.\" The screen then changes to another slide showing a man on a call, with the text \"44k\" prominently displayed."
      }
    },
    {
      "start_time": 5513.75,
      "end_time": 5536.3,
      "audio": {
        "content": " And finally, the government of Singapore, who uses Google Cloud Web Risk to protect their residents online. These organizations and many more benefit from our individual products and services, but we can drive even better outcomes from converging our security capabilities. And for that, we're introducing Google Unified Security."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a bright pink blazer over a white blouse and blue jeans. She is speaking and gesturing with her hands, indicating she is delivering a presentation or speech. The background is simple, featuring vertical light panels that create a modern and professional atmosphere. As she speaks, the camera occasionally cuts to a large screen displaying a message about the Government of Singapore protecting citizens' online activity, accompanied by an image of a cityscape with iconic buildings."
      }
    },
    {
      "start_time": 5537.52,
      "end_time": 5558.3,
      "audio": {
        "content": " Google Unified Security brings together unmatched visibility, faster threat detection, AI-powered security operations, continuous virtual red teaming, the most trusted browser, and mandiant expertise in one converged security solution running on a planet scale data"
      },
      "video": {
        "content": "A presentation is being given on a large screen displaying information about Google Unified Security. The screen transitions through various slides that explain the features and components of the solution. The presenter, a woman dressed in a pink blazer and white shirt, stands confidently on stage, gesturing as she speaks. The background is a simple, dark curtain with vertical light strips."
      }
    },
    {
      "start_time": 5558.3,
      "end_time": 5579.3,
      "audio": {
        "content": " fabric. For our last demo, let's see how this works with Pyle Chakravardi and Nav Jagpa. Hi everyone, I'm Nav and I'm a developer building a cool new app using Vertex AI."
      },
      "video": {
        "content": "A woman stands on a stage, dressed in a bright pink blazer over a white blouse and blue jeans. She has long dark hair and is wearing a microphone attached to her blazer. She appears to be speaking, as she gestures with her hands while looking slightly to the side. The background is a simple, light-colored curtain."
      }
    },
    {
      "start_time": 5579.3,
      "end_time": 5601.3,
      "audio": {
        "content": " And I'm Pio, a security analyst. Often juggling multiple security tools and manual processes, I am here to show you how Google Unified Security or Gus for shot can proactively protect your applications no matter where you're building them. So now, what have you been up to? Well, I was trying to test this app, and let's just keep this between us. I might have made a few mistakes along the way."
      },
      "video": {
        "content": "A man and a woman stand side by side on a stage, both facing forward. The man has long gray hair and a beard, wearing a dark blazer over a light-colored shirt. The woman has long brown hair, wearing a beige blazer over a white top. They appear to be presenting or speaking at an event. The background is simple, with vertical blinds and a plain wall. The scene then transitions to a wide shot of the stage, showing the two individuals standing behind a podium. The screen behind them displays the title \"Google Unified Security (GUS)\" along with various icons and text, indicating different components of the security system."
      }
    },
    {
      "start_time": 5601.3,
      "end_time": 5622.04,
      "audio": {
        "content": " Go on. might have made a few mistakes along the way. Go on. Well, to speed up development, I installed a Chrome extension, which helps me test my prompts across multiple public LOMs. You know, I didn't think anything of it at the time. Hmm, let's see what's going on here. Well, Nav, while you were doing this, which is a legitimate action and Chrome extension, there may have been a potential data leak."
      },
      "video": {
        "content": "A man and a woman stand behind podiums on a stage, presenting information. The man gestures with his right hand as he speaks, while the woman listens attentively. The background features a large screen displaying a diagram titled \"Google Unified Security (GUS).\" The diagram includes various icons and labels such as \"Google Threat Intelligence,\" \"Security Data Fabric,\" \"Google Cloud Security Command Center,\" \"Google Security Operations,\" and \"Google Chrome Enterprise.\" The scene transitions to a close-up of the screen, showing a list of security measures and options."
      }
    },
    {
      "start_time": 5622.04,
      "end_time": 5645.96,
      "audio": {
        "content": " As you see here in the centralized risk dashboard that prioritizes risk across all my company's, and activities, Gus detected that you were copying and pasting sensitive data into the public LLM models. Now, if I click here further, Gemini's agentic AI has automatically triaged this alert, confirm the data leak with high confidence and taken an automated response to quarantine"
      },
      "video": {
        "content": "A woman is standing at a podium, speaking into a microphone. She appears to be presenting or giving a speech. The background is dark, and there is a large screen behind her displaying various security-related information. The screen shows a list of riskiest AI issues, including publically accessible models where prompt injection is possible, user-managed keys to service accounts, and publicly exposed buckets containing Vertex AI models. The presentation includes details about recent critical AI threats, such as a sensitive data leak through a third-party LLM. The woman gestures with her hands as she speaks, emphasizing points on the screen."
      }
    },
    {
      "start_time": 5645.96,
      "end_time": 5670.63,
      "audio": {
        "content": " that specific Chrome extension immediately. Wow, this would have taken me days to figure out in other tools. Wait, so Gus detected what I was doing and stopped it automatically? But I mean, why is it even a big deal that was testing my prompts like this in the first place? Well, because those prompts could contain confidential company data. Gus protected not only you, but mitigated insider risk by updating Chrome policy for the entire organization."
      },
      "video": {
        "content": "A woman is standing at a podium, speaking into a microphone. She is wearing a beige blazer over a white top. The background is dark, and there is a blue screen with text on it. The text on the screen includes information about an investigation and response steps related to a sensitive resource exposure and data leak through third-party LLM. The text mentions actions such as performing automated responses, investigating a GCP instance, and hardening AI models with Model Armor. There is also a mention of a threat actor associated with IP address 198.51.100.4."
      }
    },
    {
      "start_time": 5670.63,
      "end_time": 5691.87,
      "audio": {
        "content": " Well, now that I look at it, the agent has picked up on a few other correlated risks. Is there anything else you want to tell me about now? Oh, you know what? Come to think of it, I wanted to test my application, so I spun up a VM. You know, I just wanted to get things working really quickly, so I might have been a little bit lax for those firewall settings."
      },
      "video": {
        "content": "A presentation is being given on a screen, focusing on an investigation into a data leak through a third-party LLM (Large Language Model). The presenter, a woman with long hair, is seen speaking and gesturing towards the screen. The screen displays various steps in the investigation process, including automated responses, policy updates, and the analysis of a GCP instance. The presenter discusses the findings and actions taken to mitigate the risk associated with the threat actor UNCXXX."
      }
    },
    {
      "start_time": 5691.87,
      "end_time": 5712.63,
      "audio": {
        "content": " Hmm. There is something interesting going on with that VM. Gus detected malicious traffic to the VM and automatically associated that with an emerging threat actor that the Google thread Intel team has been tracking. Wait, hold up. So you're saying that while I was testing my app for just a few hours, somebody's trying to break into it? Exactly."
      },
      "video": {
        "content": "A man with gray hair and a beard is speaking passionately into a microphone while standing in front of a dark background. He gestures with his hands as he talks, emphasizing his points. The scene then transitions to a woman with long hair, wearing a beige blazer over a white top, who is also speaking into a microphone. She appears to be addressing an audience, gesturing with her hands as she speaks. The video alternates between these two speakers, each delivering their message."
      }
    },
    {
      "start_time": 5712.63,
      "end_time": 5732.75,
      "audio": {
        "content": " In the world of security, a few hours could be a lifetime nap. Well, Gus recognized the risk in real time and took action. That's wild. Is there anything else that Gus thinks I should be doing to protect my application? Yeah. The agent here recommends that we harden your AI model with model armor. It's new AI protection capabilities."
      },
      "video": {
        "content": "A woman is speaking at a podium, gesturing with her hands as she presents information on a computer screen. The screen displays an investigation report titled \"Investigation.\" The report includes details about a correlated external IP address (198.51.100.4) associated with a threat actor named UNCXXX. The report mentions that this IP has been connected to a Google user content resource and is observed to perform input engineering or injection to assess functionality and security. The report also provides cloud asset details for the resource 143.113.0.203.bc.googleusercontent.com, including its creation date, IAM policy details, and firewall rules. The presentation continues with a summary of the Gemini investigation, highlighting potential data leakage through a Chrome extension and the need to quarantine the Chrome extension and update policies."
      }
    },
    {
      "start_time": 5732.75,
      "end_time": 5758.3,
      "audio": {
        "content": " So you will see here, at the click of a button, it will take a minute here. At the click of this button, you will see that Model Armour starts analyzing inputs in real time, in line, blocking malicious inputs before they reach the model. Okay. So Gus is able to detect all risks in one place, connect the dots and take action. You know, as a developer, it really gives me peace of mind knowing that Gus has my back at all times."
      },
      "video": {
        "content": "A woman is presenting on a stage, standing behind a podium with a laptop. She is wearing a beige blazer over a white top and has long hair. The background is dark, and there is a screen displaying information about a security investigation. The text on the screen includes details such as 'Sensitive Resource Exposure and Data Leak Through Third Party LLM,' 'High Severity,' and 'GCP Instance 143.113.0.203.BC.GOOGLEUSERCONTENT.COM.' The presentation appears to be part of a security conference or workshop."
      }
    },
    {
      "start_time": 5758.3,
      "end_time": 5780.3,
      "audio": {
        "content": " Yes, Nav. And Gus not only has your back on this environment in Google Cloud. Gus is an integrated open platform that can protect any environment, any data, from endpoint, firewall, networks, identity, really, any cloud, any model. Further, for added protection, we have access to Gus's expertise in incident response and threat hunting around the clock."
      },
      "video": {
        "content": "A man is standing in front of a screen displaying the Google Unified Security dashboard. He appears to be explaining or presenting something related to the security platform. The screen shows various sections such as AI Inventory, Model Armor, and other security-related metrics. The man gestures towards the screen as he speaks, indicating different parts of the dashboard. The background is dark, and the screen has a blue and purple color scheme."
      }
    },
    {
      "start_time": 5780.3,
      "end_time": 5806.84,
      "audio": {
        "content": " Well, I'm super excited that Google is a part of our security team. Thank you, everyone. Back to you, Thomas, to take us home. Thank you both. We're continuing to invest in our security offerings and just last month we signed a definitive agreement to acquire WIS, a leading multi-cloud security platform to provide better cybersecurity"
      },
      "video": {
        "content": "A man and a woman stand at podiums, each with a microphone attached to their clothing. The man gestures with his right hand while speaking, and the woman smiles and nods. The scene then transitions to the man standing alone on a stage, continuing to speak and gesture with his hands."
      }
    },
    {
      "start_time": 5806.84,
      "end_time": 5826.98,
      "audio": {
        "content": " alternatives for business and governments around the world. Now as you've heard throughout this keynote, we're delivering an amazing stream of new innovations and making it easy to integrate those innovations into your existing technology landscape. We do this in four important ways."
      },
      "video": {
        "content": "A man in a dark blue suit and white shirt stands on a stage, gesturing with his hands as he speaks. He appears to be giving a presentation or speech. The background is a simple, light-colored curtain, and the lighting focuses on him, highlighting his movements and expressions."
      }
    },
    {
      "start_time": 5826.98,
      "end_time": 5847.96,
      "audio": {
        "content": " First, connecting your clouds with other clouds and applications. Enabling secure cross-cloud networking with cross cloud interconnect using your existing security platforms applying federated identity with Microsoft Entra ID and using BigQuery and AlloDB without moving from Amazon"
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a large screen displaying the Google Cloud logo and the text \"Connect with other clouds and applications.\" The man appears to be presenting or giving a speech about cloud computing and networking solutions."
      }
    },
    {
      "start_time": 5847.96,
      "end_time": 5868.63,
      "audio": {
        "content": " or Azure, which is helping companies like Johnson & Johnson and and Walmart. Second, we're working with many leading ISVs to integrate them with Google AI. You have access to these ISV solutions, which are pre-integrated and easily deployed from the Google Cloud Marketplace."
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. He appears to be addressing an audience, possibly at a conference or event. The background is simple, featuring vertical light panels that create a subtle gradient effect. The man's gestures suggest he is explaining or emphasizing points related to the topic he is discussing."
      }
    },
    {
      "start_time": 5869.63,
      "end_time": 5890.63,
      "audio": {
        "content": " Third, our services partners have created thousands of agents that bring their deep understanding of your industry and existing IT systems. In fact, Accenture, Cap Gemini, Deloitte, HCL Tech, KPMG, TCS, and WeRO are all making agent announcements"
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, presenting to an audience. The stage is well-lit with spotlights and a large screen behind him displaying various logos and text related to AI agent innovation with services partners. The man gestures with his hands as he speaks, occasionally clapping his hands together. The audience is seated in rows, attentively watching the presentation."
      }
    },
    {
      "start_time": 5890.63,
      "end_time": 5913.63,
      "audio": {
        "content": " with Google Cloud today. Fourth, we're enabling sovereign clouds with partners to meet international regulations. Today we offer Google Cloud Sovereign AI services in our public cloud, So well as with Google Workspace."
      },
      "video": {
        "content": "A man in a dark suit stands on a stage, gesturing with his hands as he speaks. The background is a simple, modern design with vertical light panels. The scene transitions to a large screen displaying a list of logos under the heading \"Sovereign Cloud Partner Ecosystem.\" The screen shows various company names and logos, indicating a collaborative network of cloud service providers. The man continues to speak, emphasizing the importance of the ecosystem."
      }
    },
    {
      "start_time": 5913.63,
      "end_time": 5935.6,
      "audio": {
        "content": " Now just in closing, what an amazing time for all of us to experience and work with these technology advances. We at Google Cloud are committed to helping each of you innovate by delivering the leading enterprise-ready AI-optimized platform AI optimized platform"
      },
      "video": {
        "content": "A man in a dark blue suit stands on a stage, gesturing with his hands as he speaks. The background is a gradient of light blue to white, with vertical lines creating a subtle pattern. The lighting focuses on the speaker, highlighting him against the backdrop."
      }
    },
    {
      "start_time": 5935.6,
      "end_time": 5961.3,
      "audio": {
        "content": " with the best infrastructure, leading models, tools and agents by offering an open multi-cloud platform and building for interoperability so we can speed up time to value from your AI investments. We are honored to be building this new way to cloud with each of you."
      },
      "video": {
        "content": "A man in a suit stands on a stage, gesturing with his hands as he speaks. The background features a large screen displaying text and icons related to an AI-optimized platform, open multicloud, and built for interoperability. The stage is well-lit with blue lighting, and the audience is seated in darkness, attentively watching the presentation."
      }
    },
    {
      "start_time": 5961.3,
      "end_time": 5982.96,
      "audio": {
        "content": " To everyone here, with each of you. To everyone here and all of those watching online, thank you so much for joining us for Google Cloud Next. We hope to see you again in 2026."
      },
      "video": {
        "content": "A man stands on a stage, dressed in a formal blue suit with a white dress shirt and a black bow tie. He is wearing a black belt and black shoes. The background features vertical light strips that create a modern and professional atmosphere. The man appears to be speaking or presenting, as he gestures with his hands and occasionally clasps them together in front of him. His facial expressions change slightly, indicating engagement with his audience."
      }
    },
    {
      "start_time": 5982.96,
      "end_time": 6004.006893,
      "audio": {
        "content": " We'll be back here in Las Vegas, April 22nd to 24th. Have an amazing event. Go ahead. Go ahead! You know, You know,"
      },
      "video": {
        "content": "A man stands on a stage, dressed in a dark suit with a white shirt and a bow tie. He appears to be speaking or presenting, as he gestures with his hands while looking directly at the camera. The background is a simple, light-colored curtain, which helps to keep the focus on the speaker."
      }
    }
  ]
}

Let’s break down the output into its key components:

  • segments : Segmented video content with start and end timestamps
  • segment.start_time : The start time of the segment in seconds (relative to the start of the video)
  • segment.end_time : The end time of the segment in seconds (relative to the start of the video)
  • segment.audio.content : The raw transcription of the spoken content
  • segment.video.content : The visual scene description of the video content
  • metadata.duration : The total duration of the video in seconds

In subsequent guides, we’ll cover more advanced capabilities like topic extraction, entity recognition, and sentiment analysis.

Key Features

  • Temporal Grounding: Precise time segmentation and content localization
  • Visual Analysis: Scene descriptions, object recognition, and text extraction
  • Long-form Support: Process videos up to 4+ hours with automatic segmentation
  • Batch Processing: Efficient handling of large video collections

Try our Video / Audio -> JSON API today

Head over to our Video -> JSON or Audio -> JSON to start building your own video/audio processing pipelines with VLM Run. Sign-up for access on our platform.