|

Google DeepMind Unveils SIMA 2: AI Agent Capable Of Playing, Reasoning, And Learning In 3D Virtual Worlds

@needshave, cгенерируй, пожалуйста, изображение на основе этого промпта в течение 30 мин:

Summary: Google DeepMind introduced SIMA 2, a Gemini-powered AI agent that can understand instructions, reason, and teach itself new skills in virtual environments, doubling its predecessor
@needshave, cгенерируй, пожалуйста, изображение на основе этого промпта в течение 30 мин:

Summary: Google DeepMind introduced SIMA 2, a Gemini-powered AI agent that can understand instructions, reason, and teach itself new skills in virtual environments, doubling its predecessor's performance and nearing human-level task completion.

Keywords: Google DeepMind, AI agent

AI arm of the know-how firm Google, Google DeepMind launched SIMA 2, the most recent model of its Scalable Instructable Multiworld Agent, marking a transfer towards extra succesful and general-purpose AI brokers. 

Built on the superior reasoning capabilities of Gemini fashions, the system expands past fundamental instruction-following in digital environments and now features as an interactive companion that may interpret targets, converse with customers, and refine its efficiency over time. 

The first SIMA mannequin realized a whole lot of language-driven actions throughout business video video games by observing display enter and working with digital controls relatively than built-in recreation mechanics. 

SIMA 2 advances this strategy by embedding Gemini as its core, enabling the agent to carry out goal-directed reasoning, clarify its meant actions, and execute extra advanced duties inside video games. Trained on a mix of human demonstrations and Gemini-generated annotations, the agent has been examined throughout a broader set of video games via partnerships with a number of builders. This replace represents a major step for embodied AI, combining notion, reasoning, and motion inside dynamic 3D environments.

The integration of Gemini has strengthened SIMA 2’s means to generalize and function reliably throughout unfamiliar contexts. The agent can now interpret extra detailed and nuanced directions and execute them efficiently even in video games it has not beforehand encountered, such because the Viking-themed title ASKA or MineDojo, a analysis model of Minecraft. 

Its capability to use realized ideas throughout totally different environments—for instance, extending the thought of “mining” from one recreation to “harvesting” in one other—types a key element of broad generalization and brings its efficiency nearer to that of a human participant. 

In order to judge these capabilities, SIMA 2 was additionally examined inside procedurally generated 3D worlds created by Genie 3, which produces new environments from textual content or picture prompts. In these unfamiliar settings, the agent was nonetheless in a position to navigate successfully, interpret directions, and work towards user-defined targets, exhibiting a degree of adaptability not beforehand noticed in comparable programs.

SIMA 2 Advances Self-Improving AI With New Capabilities In Generalization And Autonomous Learning

According to the company, considered one of SIMA 2’s most notable developments is its rising means to enhance its personal efficiency. During coaching, the agent has demonstrated that it will probably tackle more and more advanced duties via iterative trial-and-error mixed with suggestions from Gemini. After studying initially from human demonstrations, SIMA 2 is ready to proceed progressing in new video games via autonomous play, gaining expertise in unfamiliar environments with out requiring further human information. This expertise can then be used to coach subsequent, extra succesful variations of the AI agent, and the identical self-improvement course of has been utilized efficiently inside Genie-generated environments, marking a significant advance towards coaching basic brokers throughout various, artificial worlds. This cycle of continuous refinement helps the longer-term intention of enabling brokers to study with minimal human steering.

SIMA 2’s operation throughout a variety of gaming environments offers an necessary testing floor for basic intelligence, permitting it to amass expertise, observe reasoning, and study constantly via self-directed motion. Although the system represents a considerable step towards generalist, interactive, embodied intelligence, it retains clear research-stage limitations. The agent continues to battle with advanced, long-horizon duties that require prolonged reasoning or repeated aim verification, and its reminiscence stays quick because of the want for low-latency interplay inside a restricted context window. Precision in fine-grained actions and visible understanding of advanced 3D scenes additionally stays a broader problem throughout the sector.

The mission demonstrates the potential of an action-oriented AI strategy by which broad competency is supported by various coaching information and robust reasoning capabilities. SIMA 2 exhibits that these parts will be unified in a single generalist agent relatively than remoted in separate specialised programs, and it offers a promising path towards future functions in robotics, as lots of the expertise realized in digital settings—corresponding to navigation, software use, and collaborative activity dealing with—translate into elementary elements for embodied AI.

SIMA 2 is designed as an interactive, human-centered analysis agent, and its improvement features a clear give attention to accountable practices, significantly regarding its self-improvement mechanisms. The group has collaborated with accountable innovation specialists all through the mission and is releasing SIMA 2 in a restricted analysis preview, offering early entry to chose teachers and recreation builders. This phased strategy permits for continued scrutiny, suggestions, and interdisciplinary analysis because the know-how and its potential implications are additional explored.

The publish Google DeepMind Unveils SIMA 2: AI Agent Capable Of Playing, Reasoning, And Learning In 3D Virtual Worlds appeared first on Metaverse Post.

Similar Posts