Google is reportedly preparing to unveil “Project Jarvis,” its own version of a large action model, by December. This tool, designed to perform tasks like gathering research, making purchases, and booking flights, operates within a browser environment, tailored specifically for Chrome, and leverages an advanced version of Google’s Gemini AI.
Jarvis is crafted to streamline common web-based tasks by interacting with on-screen elements—taking screenshots, clicking buttons, and entering text—though each action currently takes a few seconds to complete.
This development places Google alongside other tech giants enhancing task automation via AI. Microsoft’s upcoming “Copilot Vision,” for example, will enable conversations about viewed webpages, while Apple Intelligence is expected to handle multi-app interactions by next year. Anthropic recently released a beta for Claude, though it has faced challenges with efficiency, and OpenAI is reportedly developing a similar model.
.
.
.
.
.
.
.
#Initiatemagazine #initiate #initiator #GoogleAI #ProjectJarvis #AIAutomation #TaskAutomation #DigitalAssistant #ChromeBrowser #GoogleGemini #FutureTech #AIInnovation #DigitalProductivity #TechNews
CIC ART ,charityinchina.cn