Not All Llama 2s Are Created Equal

Llama 2 models from different providers like Together, Anyscale, and Perplexity may seem identical on paper, but the same query on the same Llama 2 model can yield different responses depending on the inference provider.
Why? It often comes down to how the provider serves the model. For instance, some providers use quantization to make the model run faster and consume fewer resources, but this can subtly alter the quality of the output.
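A quick way to see this for yourself is to send the same prompt, with temperature set to 0, to two different Llama 2 endpoints and compare the responses. Below is a minimal sketch assuming both providers expose an OpenAI-compatible chat completions API; the base URLs, environment variable names, and model id are placeholders you would swap for your actual providers' values.

```python
# Minimal sketch: compare greedy (temperature=0) responses to the same prompt
# from two Llama 2 inference providers. Assumes both expose an OpenAI-compatible
# API; the base URLs, env var names, and model id below are placeholders.
import os
from openai import OpenAI

PROMPT = "Explain the difference between a list and a tuple in Python."
MODEL = "meta-llama/Llama-2-70b-chat-hf"  # placeholder model id; check each provider's catalog

providers = {
    "provider_a": OpenAI(
        base_url="https://api.provider-a.example/v1",  # hypothetical endpoint
        api_key=os.environ["PROVIDER_A_API_KEY"],
    ),
    "provider_b": OpenAI(
        base_url="https://api.provider-b.example/v1",  # hypothetical endpoint
        api_key=os.environ["PROVIDER_B_API_KEY"],
    ),
}

responses = {}
for name, client in providers.items():
    completion = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": PROMPT}],
        temperature=0,  # remove sampling randomness so differences point at the serving stack
    )
    responses[name] = completion.choices[0].message.content
    print(f"--- {name} ---\n{responses[name]}\n")

# Persistent differences here (with sampling disabled) suggest the serving stacks
# themselves differ: quantization, kernels, batching, or default parameters.
print("Identical outputs:", len(set(responses.values())) == 1)
```

Even with sampling disabled you won't always get bit-identical text, but consistent differences in style or correctness are a good hint that the serving stack differs between providers.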
I recently read this blog post by Together AI about their inference engine, and in particular the chart comparing its performance to the vanilla HuggingFace implementation.
Here's a snippet from their blog:
"The improvements to performance with the Together Inference Engine come without any compromise to quality. These changes do not involve techniques like quantization which can change the behavior of the model, even if in a modest way."
This got me thinking:
- How do these subtle differences impact our work?
- How does this affect your choice of provider for Llama 2 models?
Would love to hear your thoughts on this!