Discussions

Ask a Question
Back to All

Is there support for asynchronous requests?

For my use case, I need to make multiple non-blocking embeddings calls in parallel. I can accomplish this using an asynchronous HTTP client, like aiohttp, but I'm wondering if it's doable with the Python client.

I'm also wondering how spawning multiple requests at once will affect the rate limit. Is the limit applied on a per-second or per-minute level?

Thanks in advance for the help!