Actually, I'm curious about streaming too. The last time I ran some tests, I was close to writing a blog post titled "Streaming Is All You Need" because I saw that streaming calls for the same requests typically had somewhat lower latencies. But I should rerun the tests and confirm! @Siddharth | Portkey we can do this as part of the LLMs in Prod series too!