Multimodal embedding models

The Voyage multimodal embedding endpoint returns vector representations for a given list of multimodal inputs consisting of text, images, or an interleaving of both modalities.

Important: Starting December 8, 2025, the following constraints apply to all URL parameters (e.g., image_url)
  • Limit the number of redirects.
  • Require that responses include a content-length header.
  • Respect robots.txt to prevent unauthorized scraping.
Language
Credentials
Header