A single content part within a multimodal message.
int height
Image height after preprocessing (0 = not yet processed)
int width
Image width after preprocessing (0 = not yet processed)
std::string image_path
Local file path (type == IMAGE)
std::string text
Text content (type == TEXT)
std::string image_url
URL or data URI (type == IMAGE)