A powerful AI model processing text, audio, and images with multimodal capabilities.