Elon Musk’s AI chatbot, Grok, can now “understand” images, including informative diagrams. Sorry, but isn’t everyone using the platform once known as Twitter to conduct multidisciplinary research and optimize their workflow?
Launched as Grok-1.5V, or Grok 1.5 “Vision,” xAI’s “first-generation multimodal model” can not only respond to the pictures and screenshots you upload, but also reason over complex documents, scientific diagrams, screenshots, and photos, the company said. In addition, Grok-1.5V will gain “real-world spatial understanding” to better interpret the physical world depicted in user-uploaded images.
“Increasing our multimodal understanding and generation capabilities are important steps toward building beneficial general artificial intelligence capable of understanding the universe,” the company wrote in a statement. It added that in the coming months it expects to make significant improvements to both capabilities across various modalities, including images, audio, and video.
Example use cases include converting diagrams into Python code, turning a child’s drawing into a bedtime story, pinpointing the largest object in a group of objects, and telling a driver if there is enough room to maneuver around an obstacle.
Grok-1.5V was released alongside xAI’s RealWorldQA, a dataset of images and prompts designed to test how other GenAI models compare with Grok’s real-world reasoning.
However, competition is the least of Grok’s worries. Despite continued investment in xAI, Grok has struggled to retain early users and employees: a new report says its own developers are struggling to use the slow xAI API. The same report, published by Fortune this week, highlighted concerns among X employees that Musk has suggested Grok write paying users’ posts for them, despite warnings from developers and employees. Grok was also criticized last week for creating false news headlines describing an alternate reality in which Iran attacked Tel Aviv with its military arsenal, and it was not the first time.
While it stands to reason that GenAI chatbots hallucinate and generate fake news, Grok’s gaffe points to another site-wide problem. The bot is the standard-bearer of Musk’s coarse response to ChatGPT, and it is being integrated into a platform that has slowly chipped away at its safeguards against AI gone wrong. Coupled with X’s poor reputation for moderation and the CEO’s own refusal to help the site’s “citizen journalists” tackle misinformation, Grok occupies a precarious position in the platform’s beleaguered information ecosystem.
Grok-1.5V will be available to early testers and selected users soon.