"Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku."
For safety reasons, the last thing we'd allow an AI to do is take full control of a computer: looking at the screen, typing keys, moving the mouse, and clicking, just like a human, which would let it do literally everything on a computer that a human can do. Oh wait...
"Available today on the API, developers can direct Claude to use computers the way people do -- by looking at a screen, moving a cursor, clicking buttons, and typing text. Claude 3.5 Sonnet is the first frontier AI model to offer computer use in public beta. At this stage, it is still experimental -- at times cumbersome and error-prone. We're releasing computer use early for feedback from developers, and expect the capability to improve rapidly over time."
"Asana, Canva, Cognition, DoorDash, Replit, and The Browser Company have already begun to explore these possibilities, carrying out tasks that require dozens, and sometimes even hundreds, of steps to complete. For example, Replit is using Claude 3.5 Sonnet's capabilities with computer use and UI navigation to develop a key feature that evaluates apps as they're being built for their Replit Agent product."
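To make "looking at a screen, moving a cursor, clicking buttons, and typing text" a bit more concrete, here is a minimal sketch of the observe-and-act loop that such a setup implies. It is not Anthropic's API and makes no network calls: `FakeScreen`, `scripted_model`, `run_agent`, and the action names are all hypothetical stand-ins I made up for illustration, and the "model" just replays a fixed plan.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class FakeScreen:
    """Hypothetical stand-in for a real desktop; it only records what was done."""
    cursor: tuple = (0, 0)
    typed: str = ""
    clicks: list = field(default_factory=list)

    def apply(self, action: dict) -> str:
        """Carry out one action and return a text observation."""
        kind = action["action"]
        if kind == "screenshot":
            # A real system would return an image; here it is a text summary.
            return f"cursor={self.cursor} typed={self.typed!r}"
        if kind == "mouse_move":
            self.cursor = tuple(action["coordinate"])
            return "moved"
        if kind == "left_click":
            self.clicks.append(self.cursor)
            return "clicked"
        if kind == "type":
            self.typed += action["text"]
            return "typed"
        raise ValueError(f"unknown action: {kind}")


def scripted_model(observations: List[str]) -> Optional[dict]:
    """Hypothetical 'model': replays a fixed plan, one action per turn."""
    plan = [
        {"action": "screenshot"},
        {"action": "mouse_move", "coordinate": [120, 40]},
        {"action": "left_click"},
        {"action": "type", "text": "hello"},
    ]
    step = len(observations)
    return plan[step] if step < len(plan) else None


def run_agent(screen: FakeScreen,
              model: Callable[[List[str]], Optional[dict]],
              max_steps: int = 10) -> List[str]:
    """Ask the model for an action, apply it, feed the result back, repeat."""
    observations: List[str] = []
    for _ in range(max_steps):  # hard cap so a confused model cannot loop forever
        action = model(observations)
        if action is None:  # the model has nothing more to do
            break
        observations.append(screen.apply(action))
    return observations


if __name__ == "__main__":
    screen = FakeScreen()
    for line in run_agent(screen, scripted_model):
        print(line)
```

The step cap is the part worth noticing: with tasks that run to dozens or hundreds of steps, something outside the model has to decide when to stop.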
But unlike me, everyone else seems to be reacting very positively.
"It doesn't get said enough: Not only is Claude the most capable LLM, but they also have the best character. Great work Claude and Team!"
"Just imagine the accessibility possibilities. For those with mobility or visual impairments, Claude can assist with tasks by simply asking, like helping in usage with apps and systems that often lack proper accessibility features."
That's a good point, actually.
Still, you might want to run it in a VM for now?
"Wow, this is going to be quite game-changing!"
"Impressive to see Claude navigating screens like a human! Though still in beta, this could be a game-changer for automating tedious tasks. Can't wait to see how it develops!"
"What I found particularly noteworthy in this demo was that the information wasn't copied from the CRM, but typed letter by letter. Purely speculating, but perhaps because there are rare cases where websites do not accept copied input, which often also affects password managers."
"This is RPA-like functionality. Wow, Will this be a game-changer?"
RPA stands for Robotic Process Automation.
"What are the security implications of this? Could a bad actor use this to ask Claude to go into other people's computers and access their confidential information?"
Ok, at least one person besides me is feeling a little worried.
"That's epic, you guys have the best AI. This company is something special."
"Computer Use is truly a pivotal advancement. Enabling AI to interact with computers like humans do is a significant leap towards AGI. Exciting times ahead!"
"Looks like Siri on screen awareness but two (or more) years early and available for use now (but meanwhile, on server.) WOW. Well done guys."
"Absolutely incredible -- Super excited to build with this & see what others build!"
"Immediately prompting: 'Do all my work'"
If Claude can do all your work, why would you still get paid?
"This could be huge for companies struggling with legacy systems and modernization."
"This is one more pivotal point in AI's evolution. In 2025, more innovation and use cases will emerge, and human involvement is slowly being eliminated. It looks like a small improvement, but it's huge at its core and will significantly impact how AI will be used in a few years. Kudos Claude Team!"