Google’s AI Learns to Use Computers

AND: AI shopping traffic might break the internet

Welcome, Humans!

Ready for your daily dose of AI chaos? I’ve rounded up Today’s Top AI Headlines for those who like to stay ahead – and for the curious, I’ve got some eyebrow-raising stories Beyond the Headlines. Let’s dive in.

In a Nutshell:

  • Google’s AI now clicks, drags, and scrolls like us

  • xAI’s video model adds sound and speed

  • IBM embraces Claude, Anthropic goes global

  • AI shopping surge could crash Black Friday

  • Steve Jobs unintentionally predicts ChatGPT

🚀Today’s Top AI Headlines:

  1. Google’s AI now clicks, drags, and scrolls like us: Google has unveiled Gemini 2.5 Computer Use, an advanced AI model that can interact with computers like a human, clicking buttons, filling out forms, scrolling, and even navigating websites or apps autonomously. The model builds on Google’s push toward agentic AI, enabling it to perform multi-step workflows seamlessly across web and mobile environments. Early tests show it running tasks 50% faster and with 18% higher accuracy for complex operations, such as completing multi-page forms or handling file uploads. Google claims Gemini 2.5 outperforms all “leading alternatives” on web and mobile benchmarks, achieving low latency and high reliability. This release positions Gemini as a direct competitor to OpenAI’s upcoming “Agent Builder” and Anthropic’s Claude Ops. Developers are particularly excited about Gemini’s API-level access and built-in safety controls that limit unauthorized actions, making it suitable for enterprise use. Analysts call it a step toward fully autonomous digital agents, capable of handling repetitive tasks like scheduling, data entry, or customer support, without human supervision.
    Source: Google

    🤖 Robi: “Finally, an AI that can fill out HR forms so I don’t have to.”

  2. xAI’s video model adds sound and speed: Elon Musk’s xAI has officially launched Imagine v0.9, its first AI-powered video and image generation model built on the company’s proprietary Grok Aurora engine. The new system can produce high-quality, sound-synced videos in as little as 15–20 seconds, setting a new benchmark for speed in generative media. Imagine v0.9 also debuts a voice-first interface, letting users describe scenes out loud through “Voice Mode” to generate videos hands-free, a feature designed for maximum accessibility and creative flow. Early demos reveal cinematic-quality visuals with fluid motion, realistic lighting, and ambient soundscapes, all generated in near real time. The technology is aimed at content creators, educators, marketers, and filmmakers seeking fast video production without the burden of manual editing or rendering. Musk described Imagine as “a step toward fully conversational creativity,” hinting that future updates will include interactive storytelling, scene editing, and 3D video generation. Analysts say the release positions xAI to compete directly with video models like Sora, Veo, and Pika, but with a distinct emphasis on voice-driven generation. If successful, Imagine v0.9 could help Musk’s AI ecosystem, centered around Grok and X, evolve into a broader, creator-focused platform for visual and narrative content.
    Source: xAI

    🤖 Robi: “Great, now even my imaginary friends are making movies faster than me.’’

  3. IBM embraces Claude, Anthropic goes global: IBM has partnered with Anthropic to integrate the Claude AI model across its developer tools and enterprise applications, marking a major milestone in responsible AI adoption for business environments. The integration will enable IBM’s clients to harness Claude’s advanced reasoning, summarization, and conversational intelligence directly within platforms like Watsonx, Cloud Pak, and IBM Developer Tools. This move reinforces IBM’s focus on trustworthy AI systems, a value shared with Anthropic’s constitutional AI framework. Anthropic’s CEO, Dario Amodei, said the collaboration will “help enterprises deploy AI that is powerful, explainable, and aligned with their values.” Developers using IBM’s ecosystem can now build safer, compliant workflows, improving tasks such as document analysis, code review, and customer support automation. Simultaneously, Anthropic announced a global expansion, with its first India office opening to tap into one of the world’s largest pools of AI and engineering talent. This follows a series of enterprise partnerships, including Deloitte’s global rollout of Claude to more than 470,000 employees. Analysts view this partnership as a strategic move for Anthropic to deepen its global footprint while strengthening its enterprise credibility, and for IBM, it’s a way to stay competitive in the AI productivity and governance race.

    Source: Tech Crunch

    🤖 Robi: “Claude’s world tour begins. First stop: corporate PowerPoint hell.”

🔍Beyond the Headlines:

  1. AI shopping surge could crash Black Friday: According to a new Adobe Analytics report, AI-assisted shopping is expected to skyrocket by 520% this holiday season, with traffic potentially spiking 1,000% on Black Friday. More than half of surveyed consumers plan to use AI for gift discovery, price comparison, and deal hunting. Platforms such as ChatGPT, Perplexity, and Google Shopping are already integrating AI couponing, personalized recommendations, and automated checkout tools to meet the demand. Analysts say this surge reflects a growing shift toward AI-driven consumer behavior, where shoppers rely on intelligent assistants to streamline decision-making and optimize savings during peak retail events.
    Source: Axios

    🤖 Robi: “I, for one, welcome my coupon-clipping overlords.”

  2. Steve Jobs unintentionally predicts ChatGPT: A 1985 interview with Steve Jobs has gone viral after viewers noted his eerily accurate prediction of modern AI chatbots. In the video, Jobs spoke about “computers that understand language and think alongside us,” a description that mirrors today’s conversational AI tools like ChatGPT and Gemini. Fans and tech historians are calling it one of Jobs’ most prophetic moments, showcasing his deep foresight into human-computer collaboration decades before generative AI existed. The clip has reignited conversations about Apple’s philosophical influence on the evolution of personal computing, and how Jobs’ vision continues to shape the AI era we live in today.

    Source: X

    🤖Robi: “Back in his day, they called it “vision.” Now it’s called a VC pitch.”

🤖Prompt of the Day:

Corporate Communication Strategy

Prompt: You are a corporate communications consultant specializing in stakeholder alignment. Your task is to develop a comprehensive communication strategy for a [company size/type] with [number] employees across [regions].
Your framework should include: (1) internal and external communication segmentation, (2) leadership communication cadence, (3) media and public relations strategy, (4) crisis communication protocols, (5) feedback and engagement mechanisms, and (6) KPIs such as engagement rate, sentiment score, and message consistency index.

🤖AI Tools You Didn’t Know You Needed:

Problem: Automating tasks across web, desktop, and mobile environments often requires complex scripting and integrations.

AI Solution: AI-powered agents can perform UI interactions using natural language prompts, eliminating the need for traditional coding.

AI Tool: Caesr.ai is an AI-driven platform that enables users to automate tasks by controlling applications across various devices using plain-English instructions.

Helpful Features

  • Cross-Platform Automation: Operates on web, desktop, and mobile devices.

  • Visual Interface Control: Interacts with user interfaces as a human would.

  • Natural Language Processing: Executes tasks based on simple text commands.

  • Multi-Step Workflow Support: Handles complex sequences across different applications.

Robi’s Hot Take on X