Robotics developer Figure made waves on Wednesday with a video demonstration of its first humanoid robot holding a real-time conversation, thanks to AI from OpenAI.
“With OpenAI, Figure 01 can now have full conversations with people,” Figure said on Twitter, highlighting the robot's ability to understand and respond to human interactions.
The company recently explained that its partnership with OpenAI would bring advanced visual and language intelligence to its robots, enabling “speedy, low-level, dexterous robot actions.”
In the video, Figure 01 interacts with Corey Lynch, a senior AI engineer at Figure, who has the robot perform several tasks in a makeshift kitchen, including identifying apples, plates, and cups.
When Lynch asks the robot for something to eat, it identifies the apple as food and hands it over. Lynch then has Figure 01 collect trash into a basket while answering questions at the same time, demonstrating the robot's ability to multitask.
On Twitter, Lynch explained the Figure 01 demo in more detail.
“Our robot can describe its visual experience, plan future actions, reflect on its memory, and explain its reasoning verbally,” he wrote in an extensive thread.
According to Lynch, the team feeds images from the robot's cameras, along with transcribed speech captured by onboard microphones, into a large multimodal model trained by OpenAI.
Multimodal AI refers to artificial intelligence that can understand and generate different types of data, such as text and images.
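To make that setup concrete, here is a minimal sketch of what such a pipeline could look like, assuming a conversation history that interleaves camera frames with transcribed speech. The `History`, `transcribe`, and `MultimodalModel` names are illustrative stand-ins, not Figure's or OpenAI's actual API:

```python
# Hypothetical sketch: camera frames and transcribed speech accumulate
# into one history that a multimodal model consumes on each turn.
from dataclasses import dataclass, field

@dataclass
class History:
    turns: list = field(default_factory=list)

    def add(self, kind: str, payload) -> None:
        self.turns.append({"kind": kind, "payload": payload})

def transcribe(audio: bytes) -> str:
    """Stand-in for the onboard speech-to-text step."""
    return audio.decode("utf-8", errors="ignore")

class MultimodalModel:
    """Stand-in for a large multimodal model trained on text and images."""
    def respond(self, turns: list) -> str:
        images = sum(1 for t in turns if t["kind"] == "image")
        return f"(reply conditioned on {images} image(s) and the dialogue so far)"

history = History()
history.add("image", b"\x89PNG...")  # latest onboard camera frame
history.add("text", transcribe(b"Can I have something to eat?"))
print(MultimodalModel().respond(history.turns))
```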
Lynch emphasized that Figure 01's behavior is learned, runs at normal speed, and is not remotely controlled.
“The model processes the entire history of the conversation, including past images, to come up with language responses, which are then spoken back to the person via text-to-speech,” Lynch said. “The same model is responsible for deciding which learned, closed-loop behavior to run on the robot to fulfill a given command, loading particular neural network weights onto the GPU and executing a policy.”
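In other words, a single model call produces both what the robot says and which behavior it runs. A hedged sketch of that control flow, with `plan`, `speak`, and the `POLICIES` table as hypothetical placeholders rather than Figure's actual code, might look like this:

```python
# One model call yields both an utterance and the id of a learned,
# closed-loop behavior; the matching policy is then looked up and run.
POLICIES = {
    "hand_over_apple": lambda: print("[policy] executing hand_over_apple"),
    "place_trash_in_basket": lambda: print("[policy] executing place_trash_in_basket"),
}

def plan(history: list) -> tuple:
    """Stand-in for the multimodal model: returns (utterance, behavior id)."""
    return "Sure thing!", "hand_over_apple"

def speak(text: str) -> None:
    """Stand-in for text-to-speech back to the person."""
    print(f"[tts] {text}")

utterance, behavior = plan([{"kind": "text", "payload": "I'm hungry"}])
speak(utterance)          # the language response, spoken aloud
POLICIES[behavior]()      # run the selected learned, closed-loop behavior
```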
Lynch explained that Figure 01 is designed to describe its surroundings succinctly and can apply “common sense” to decisions, such as guessing that dishes on a table will likely go into a drying rack next. It can also translate vague requests, like someone saying they are hungry, into actions, such as offering an apple.
The presentation caused quite a stir on Twitter, with many commenters impressed by Figure 01's capabilities – and more than a few adding it to their list of milestones on the road to the singularity.
“Please tell me your team has seen every Terminator movie,” one replied.
“We need to get John Connor as soon as possible,” added another.
For AI developers and researchers, Lynch provided several technical details.
“All behaviors are driven by neural network visuomotor transformer policies, mapping pixels directly to actions,” said Lynch. “These networks take in onboard images at 10hz and generate 24-DOF actions (wrist poses and finger joint angles) at 200hz.”
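Those two rates imply roughly 20 action ticks per camera frame, with the policy reusing the latest observation between frames. The loop below illustrates that arithmetic with a placeholder policy; none of it is Figure's actual code:

```python
# Illustrative two-rate loop: images at 10 Hz, 24-DOF actions at 200 Hz.
import math

IMAGE_HZ, ACTION_HZ, DOF = 10, 200, 24
STEPS_PER_IMAGE = ACTION_HZ // IMAGE_HZ   # 20 action ticks per camera frame

def policy(frame_id: int, t: int) -> list:
    """Placeholder visuomotor policy: one observation in, 24 joint targets out."""
    return [math.sin(t / ACTION_HZ + j) for j in range(DOF)]

for frame_id in range(3):                 # simulate three 10 Hz camera frames
    for tick in range(STEPS_PER_IMAGE):   # inner 200 Hz control loop
        action = policy(frame_id, frame_id * STEPS_PER_IMAGE + tick)
        assert len(action) == DOF         # wrist poses plus finger joint angles
```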
Figure 01's impressive debut comes as policymakers and global leaders grapple with the spread of AI tools into mainstream circulation. While much of the discussion has centered on large language models like OpenAI's ChatGPT, Google's Gemini, and Anthropic's Claude AI, developers are also looking for ways to give AI a physical form, including humanoid robots.
Figure AI and OpenAI did not immediately respond to Decrypt's request for comment.
“One is utility, which is what Elon Musk and others are pursuing,” UC Berkeley industrial engineering professor Ken Goldberg previously told Decrypt of the motivations behind humanoid robots. “A lot of the work that's going on right now – why people are investing in these companies, like Figure – the hope is that these things can work and adapt,” he said, particularly in the field of space research.
Along with Figure, others working to integrate AI into robotics include Hanson Robotics, which debuted its Desdemona AI robot in 2016.
“Even a few years ago, I would have thought we'd have to wait decades to see a humanoid robot hold a full conversation while planning and carrying out fully learned behaviors,” Lynch said on Twitter. “Obviously, a lot has changed.”
Edited by Ryan Ozawa.