Google DeepMind and World Labs unveil AI tools to create 3D spaces from simple prompts

PorPablo Santiago dezembro 5, 2024

Google DeepMind and startup World Labs this week both revealed previews of AI tools that can be used to create immersive 3D environments from simple prompts.

World Labs, the startup founded by AI pioneer Fei-Fei Li and backed by $230 million in funding, announced its 3D “world generation” model on Tuesday. It turns a static image into a computer game-like 3D scene that can be navigated using keyboard and mouse controls.

“Most GenAI tools make 2D content like images or videos,” World Labs said in a blog post. “Generating in 3D instead improves control and consistency. This will change how we make movies, games, simulators, and other digital manifestations of our physical world.”

One example is the Vincent van Gogh painting “Café Terrace at Night,” which the AI model used to generateadditional content to create a small area to view and move around in. Others are more like first-person computer games.

World Labs’ 3D “world generation” model turns a static image into a computer game-like 3D scene that can be navigated with keyboard and mouse controls.

World Labs

WorldLabs also demonstrated the ability to add effects to 3D scenes, and control virtual camera zoom, for instance. (You can try out the various scenes here.)

Creators that have tested the technology said it could help cut the time needed to build 3D environments, according to a video posted in the blog post, and help users brainstorm ideas much faster.

The 3D scene builder is a “first early preview” and is not available as a product yet.

Separately, Google’s DeepMind AI research division announced in a blog post Wednesday its Genie 2, a “foundational world model” that enables an “endless variety of action-controllable, playable 3D environments.”

It’s the successor to the first Genie model, unveiled earlier this year, which can generate 2D platformer-style computer games from text and image prompts. Genie 2 does the same for 3D games that can be navigated in first-person view or via an in-game avatar that can perform actions such as running and jumping.

It’s possible to generate “consistent worlds” for up to a minute, DeepMind said, with most of the examples showcased in the blog post lasting between 10 and 20 seconds. Genie 2 can also remember parts of the virtual world that are no longer in view, reproducing them accurately when they’re observable again.

DeepMind said its work on Genie is still at an early stage; it’s not clear when the technology might be more widely available. Genie 2 is described as a research tool that can “rapidly prototype diverse interactive experiences” and train AI agents.

Google also announced that its generative AI (genAI) video model, Veo, is now available in a private preview to business customers using its Vertex AI platform. The image-to-video model will open up “new possibilities for creative expression” and streamline “video production workflows,” Google said in a blog post Tuesday.

Amazon Web Services also announced its range of Nova AI models this week, including AI video generation capabilities; OpenAI is thought to be launching Sora, its text-to-video software, later this month.

Remote Work

How to train an AI-enabled workforce — and why you need to

PorPablo Santiago agosto 8, 2024

Artificial intelligence (AI) is taking the business world by storm, with at least three in four organizations adopting the technology or piloting it to increase productivity. Over the next two years, generative AI (genAI) will force organizations to address a myriad of fast-evolving issues, from data security to tech review boards, new services, and —…

Remote Work

8 out-of-sight superpowers for Google Contacts on Android

PorPablo Santiago agosto 9, 2024

Quick: What’s the most exciting app on your Android phone right now? Just a hunch here, but I’m gonna go out on a limb and say Google Contacts probably wasn’t your answer. And why would it be? Your phone’s virtual Rolodex is about as exhilarating as a trip to the endodontist. Plus, our mobile devices…

Remote Work

Download our Excel PivotTables and PivotCharts Cheat Sheet

PorPablo Santiago dezembro 4, 2024

Download the PDF Computerworld Cheat Sheet today.

Remote Work

Federal judge slaps down Automattic, granting temporary injunction to WP Engine in ongoing WordPress squabble

PorPablo Santiago dezembro 11, 2024

The battle between WordPress owner Automattic and WP Engine seemingly struck US federal Judge Araceli Martinez-Olguin as rather one-sided, as she ruled against Automattic on Tuesday and granted WP Engine the preliminary injunction it sought. “Judge Martinez-Olguin’s ruling clearly explains why [Automattic founder] Matt Mullenweg’s campaign against WP Engine has been so misguided,” said IDC…

Remote Work

Apple Intelligence arrives for the UK, Australia, Canada, New Zealand with iOS 18.2

PorPablo Santiago dezembro 11, 2024

Apple has released iOS 18.2, iPadOS 18.2, and macOS Sequoia 15.2, which means local language access to Apple Intelligence is now available to iPhone, Mac, and iPad users in the UK, Australia, Canada, New Zealand. (They’ve had to set their language to US English to use these features until now; users will need to update…

Remote Work

Best Places to Work in IT 2025

PorPablo Santiago dezembro 10, 2024

What makes a company a great place to work for IT professionals? Top salaries and benefits certainly help, but those are just table stakes. Tech workers are looking for challenging projects, growth opportunities, and continuous learning in a supportive workplace. For the 31st year, Computerworld publisher Foundry surveyed large, midsize, and small organizations to find…

Posts Similares

Deixe um comentário Cancelar resposta

Follow Us