But I'm assuming you folks would like something more flexible. Can you tell me what your use case for this is? I'll look into whether it's possible with some changes
Right, we would need to whitelist the possible models as @bgeneto mentioned, not override to a specific one
It would be to avoid having to manage these controls in the cloud provider's platform
That's right, more flexibility would be nice. The use cases here can be broad: allowing only the newest models, excluding embedding models, etc.
Cool... I prefer the much more feature-rich Open WebUI (it has a whitelist, by the way, but it's global, not per provider or key). I can write a custom function for Open WebUI to add metadata to the request so we can track costs in portkey.ai. I could also write a tutorial/article... The problem is, I've just checked and my trial account in portkey.ai doesn't have spend tracking enabled, so I can't test.
oh yes, I use openwebui myself xd
in LibreChat you can restrict models per provider in the librechat.yml
you might want to try tweaking the openwebui code for this
would you like us to enable a month-long upgraded trial for you? @elentaure. has been a super active user in the forum, so we'd be happy to do this!
cc: @morsczx
Sure! I'm just arriving at portkey.ai (leaving litellm). It would be nice if you could activate spend/cost tracking for my account (me@bgeneto.com), even if it's only a trial. This account is only for testing purposes. I need to be sure that I can use portkey.ai in production, i.e. that it has all the features we need for our Gen AI projects. @morsczx
Thanks @sega, no need in my case; as far as I know we already have an enterprise agreement at our company. We also have an open line of communication about this with Ayush
Hi @sega, I was thinking that if I get some time I'd try creating a PR with a plugin that takes a list of allowed models as a guardrail. Do you think that would be a good place for it? Also, another question: how can I debug plugins with a local instance of the gateway, without access to the UI to configure the guardrail? Can the before_request_hooks be passed in full in the config object in the headers, for example, instead of pointing to an id?
Thanks @sega, but that means having the logic somewhere else and exposing it via a webhook. What I meant was writing a plugin in the gateway codebase,
similar to the regex one or the schema validator and so on, but with logic to set up a list of allowed models and check it against the model selected in the body
It should be quite straightforward, I guess, but I'm not sure how to set up the dev environment for this case to be able to debug
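(A minimal sketch of what such an allowed-models plugin could look like, assuming it mirrors the handler contract of the existing default plugins such as the regex matcher and the JSON schema validator: a handler that receives `(context, parameters, eventType)` and returns `{ error, verdict, data }`. The local `PluginContext`/`PluginParameters` shapes, the `context.request.json` field, and the event type strings below are assumptions rather than the gateway's actual interfaces; check the plugins folder before opening the PR.)

```ts
// Hypothetical plugins/default/allowedModels.ts -- a sketch only, not the real file.
// Local stand-ins for the gateway's plugin types (assumed shapes):
interface PluginContext {
  request?: { json?: Record<string, any> };
}

interface PluginParameters {
  models?: string[]; // the allow-list configured on the guardrail check
}

type HookEventType = 'beforeRequestHook' | 'afterRequestHook'; // assumed names

export const handler = async (
  context: PluginContext,
  parameters: PluginParameters,
  eventType: HookEventType
) => {
  let error: unknown = null;
  let verdict = false;
  let data: unknown = null;

  try {
    // Only meaningful before the request is forwarded to the provider.
    if (eventType !== 'beforeRequestHook') {
      return { error, verdict: true, data };
    }

    const allowed = parameters.models ?? [];
    const requestedModel = context.request?.json?.model;

    // Pass only when the model in the request body is on the allow-list.
    verdict =
      typeof requestedModel === 'string' && allowed.includes(requestedModel);
    data = { requestedModel, allowed };
  } catch (e) {
    error = e;
  }

  return { error, verdict, data };
};
```

(The check itself is just an `includes()` on the request body's model, so most of the PR would presumably be the manifest/parameter schema rather than the logic.)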
Ok, I see that you can pass the object with the check and so on in the config for the hook
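(For debugging against a local gateway without the UI, something like the following sketch should work: build the config object with the hook and its check inline and send it via the x-portkey-config header. The header names and the local port 8787 follow the gateway README; the exact config keys (before_request_hooks, checks, deny) and the default.allowedModels id are assumptions for illustration only.)

```ts
// Sketch: exercise the hook on a locally running gateway (assumed to listen on
// http://localhost:8787 as in the README) by inlining the guardrail check in
// the config object instead of referencing a saved guardrail id.
const config = {
  before_request_hooks: [
    {
      type: 'guardrail', // assumed discriminator
      checks: [
        {
          id: 'default.allowedModels', // hypothetical plugin id from the sketch above
          parameters: { models: ['gpt-4o', 'gpt-4o-mini'] },
        },
      ],
      deny: true, // assumed: reject the request when the check fails
    },
  ],
};

async function callLocalGateway(): Promise<void> {
  const res = await fetch('http://localhost:8787/v1/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'x-portkey-provider': 'openai',
      Authorization: 'Bearer <your-openai-key>', // placeholder
      'x-portkey-config': JSON.stringify(config),
    },
    body: JSON.stringify({
      // Not on the allow-list above, so the hook should fail this request.
      model: 'gpt-3.5-turbo',
      messages: [{ role: 'user', content: 'ping' }],
    }),
  });
  console.log(res.status, await res.json());
}

callLocalGateway();
```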
@sega why are hooks skipped for embeddings?
We implemented it for the request and response body, but we're working on making it generic now
OK. When that's available, will the plugins work directly with the embed function or will it require code modification? Also, any ETA on that?
Haven't planned a timeline, but yes, they should; the idea is to have generic pre-request and post-request methods that can be used as hooks
@bgeneto dming you for some details
sweet! I'll check it out!