Test AI agent performance on multimodal inputs including text, sketches, emojis, and multiple languages
SIMA 2 can understand and execute tasks from multimodal prompts including user-drawn sketches, emoji commands, and instructions in different languages, enabling flexible and intuitive human-AI interaction beyond text-only interfaces.
