Welcome to Portkey Forum

Updated 11 months ago

The Gateway

Awesome! Look forward to your thoughts on the gateway!
I got past the auth problem, but now I'm getting a response from the gateway complaining about the format in which function calls are sent. I'll try to include relevant files here.
Partial request JSON:
Plain Text
"function": {
                "name": "GeneratedMessages",
                "parameters": {
                    "$schema": "http://json-schema.org/draft-07/schema",
                    "type": "object",
                 

Gateway response:
Plain Text
{
   "error":{
      "message":"Invalid JSON payload received. Unknown name \"$schema\" at 'tools[0].function_declarations[0].parameters': Cannot find field.\nInvalid JSON payload received. Unknown name \"if\" at 'tools[0].function_declarations[0].parameters.properties[0].value.items.properties[2].value': Cannot find field.\nInvalid JSON payload received. Unknown name \"else\" at 'tools[0].function_declarations[0].parameters.properties[0].value.items.properties[2].value': Cannot find field.\nInvalid JSON payload received. Unknown name \"definitions\" at 'tools[0].function_declarations[0].parameters': Cannot find field.",
      "type":"INVALID_ARGUMENT",
      "param":null,
      "code":"INVALID_ARGUMENT"
   },
   "provider":"google"
}
If I send the request without functions or tool calls then I get:
Plain Text
{"error":{"message":"Please ensure that multiturn requests ends with a user role or a function response.","type":"INVALID_ARGUMENT","param":null,"code":"INVALID_ARGUMENT"},"provider":"google"}

For a request that contains two messages for assistant and user roles
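Gemini's ordering constraint can be sketched as a quick pre-flight check. This is an illustrative snippet, not gateway code; `ends_validly` is a hypothetical helper name:

```python
def ends_validly(messages):
    """Gemini rejects multiturn requests unless the final message is a
    user turn or a function/tool response (illustrative check only)."""
    if not messages:
        return False
    return messages[-1]["role"] in ("user", "function", "tool")

# A conversation that ends on an assistant turn triggers the
# INVALID_ARGUMENT error above:
ends_validly([{"role": "user", "content": "hi"},
              {"role": "assistant", "content": "hello"}])  # False
```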
Looking into this issue.
BTW, what I'm thinking with these tests is to provide an integration module in https://xef.ai to run Portkey as a gateway server so Xef can interface with multiple models.
Gemini expects the function arguments in an OpenAPI-compatible schema, so the "$schema" field in the function is causing the issue. This should ideally be handled in the gateway. Until we implement a better solution, can you try removing the $schema field from the parameters and trying once?

Reference link: https://ai.google.dev/docs/function_calling#how_it_works
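The suggested workaround can be sketched in a few lines; `strip_schema_field` is a hypothetical helper, not part of Portkey or xef:

```python
def strip_schema_field(tool):
    """Drop the '$schema' marker that Gemini's function-declaration
    parser rejects (workaround sketch)."""
    tool.get("function", {}).get("parameters", {}).pop("$schema", None)
    return tool

tool = {"function": {"name": "GeneratedMessages",
                     "parameters": {"$schema": "http://json-schema.org/draft-07/schema",
                                    "type": "object"}}}
strip_schema_field(tool)
# tool["function"]["parameters"] is now just {"type": "object"}
```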
Thanks! I removed the $schema field, and it jumps to the next error:
Plain Text
{"error":{"message":"Invalid JSON payload received. Unknown name \"if\" at 'tools[0].function_declarations[0].parameters.properties[0].value.items.properties[2].value': Cannot find field.\nInvalid JSON payload received. Unknown name \"else\" at 'tools[0].function_declarations[0].parameters.properties[0].value.items.properties[2].value': Cannot find field.\nInvalid JSON payload received. Unknown name \"definitions\" at 'tools[0].function_declarations[0].parameters': Cannot find field.","type":"INVALID_ARGUMENT","param":null,"code":"INVALID_ARGUMENT"},"provider":"google"}
Seems like our JsonSchema library uses:
Plain Text
private fun JsonObjectBuilder.applyJsonSchemaDefaults(
  descriptor: SerialDescriptor,
  annotations: List<Annotation>,
  skipNullCheck: Boolean = false,
  skipTypeCheck: Boolean = false
) {
  if (descriptor.isNullable && !skipNullCheck) {
    // Nullable types are encoded as an if/else conditional schema
    this["if"] = buildJson { it["type"] = descriptor.jsonLiteral }
    this["else"] = buildJson { it["type"] = "null" }
  } else {
    if (!skipTypeCheck) {
      this["type"] = descriptor.jsonLiteral
    }
  }
}

which is fine with OpenAI, but seems to not work in this case.
Also it's not happy with definitions, which holds the refs in the schema so they don't repeat when nested. My guess is that whatever you're using on the gateway side for JSON validation (Zod?) may target an older JSON Schema draft than the one I'm using.
Looks like Gemini is yet to add full support for all JSON Schema capabilities. There are other open issues related to capabilities like maxItems and minItems:
https://github.com/google/generative-ai-docs/issues/210
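Until the gateway handles this, the remaining unsupported keywords can be pruned the same way as $schema. A sketch, assuming only the keyword list from the errors above (`prune_for_gemini` is a hypothetical name; note that dropping `definitions` is only safe when no `$ref` in the schema points into it):

```python
GEMINI_UNSUPPORTED = {"$schema", "if", "else", "definitions"}

def prune_for_gemini(schema):
    """Recursively drop JSON Schema keywords that Gemini's
    OpenAPI-style parameter format rejects (illustrative only)."""
    if isinstance(schema, dict):
        return {k: prune_for_gemini(v) for k, v in schema.items()
                if k not in GEMINI_UNSUPPORTED}
    if isinstance(schema, list):
        return [prune_for_gemini(v) for v in schema]
    return schema
```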
I see, thanks! Do you have any recommendations for testing with other models that support function calling, other than Google's Gemini Pro?
Currently, the gateway supports function calling for openai, azure-openai, anyscale, and google. You can try out Anyscale models with function calling. Please let me know if you need any help with that.
I tried with Anyscale and mistralai/Mistral-7B-Instruct-v0.1. The gateway responds with a 422 Unprocessable Entity and the following response:
Plain Text
{"error":{"param":null,"code":null},"provider":"anyscale"}
Is there a way to connect the gateway to ollama or have it download and serve a local LLM through some other tool?
When I try with together-ai I get:
Plain Text
{"error":{"message":"(unknown path)\n  Error: Unable to call `content[\"trim\"]`, which is undefined or falsey","type":null,"param":null,"code":null},"provider":"together-ai"}
That one is a 500.
Can you please share the request body that you are using for anyscale and together-ai? ollama support is on the roadmap; it should be available before the end of this week.
This is an example request body that you can use for anyscale function call
Plain Text
{
    "model": "mistralai/Mistral-7B-Instruct-v0.1",
    "stream": false,
    "max_tokens": 100,
    "messages": [
        {
            "role": "user",
            "content": "What is weather in London ? "
        }
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "parameters": {
                    "type": "object",
                    "required": [
                        "location"
                    ],
                    "properties": {
                        "unit": {
                            "enum": [
                                "celsius",
                                "fahrenheit"
                            ],
                            "type": "string"
                        },
                        "location": {
                            "type": "string",
                            "description": "The city and state, e.g. San Francisco, CA"
                        }
                    }
                },
                "description": "Get the current weather in a given location"
            }
        }
    ]
}
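For reference, the same payload can be built programmatically before posting it to the gateway's chat completions endpoint (`weather_request` is just an illustrative helper, not a Portkey API):

```python
def weather_request(model="mistralai/Mistral-7B-Instruct-v0.1"):
    """Build the sample function-calling payload shown above."""
    return {
        "model": model,
        "stream": False,
        "max_tokens": 100,
        "messages": [{"role": "user", "content": "What is weather in London ? "}],
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "description": "Get the current weather in a given location",
                "parameters": {
                    "type": "object",
                    "required": ["location"],
                    "properties": {
                        "unit": {"type": "string",
                                 "enum": ["celsius", "fahrenheit"]},
                        "location": {"type": "string",
                                     "description": "The city and state, e.g. San Francisco, CA"},
                    },
                },
            },
        }],
    }
```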
The request for example to anyscale looks like:
Plain Text
{
  "messages": [
    {
      "content": [
        {
          "text": " ...",
          "type": "text"
        }
      ],
      "role": "user"
    }
  ],
  "model": "mistralai/Mistral-7B-Instruct-v0.1",
  "max_tokens": 500,
  "temperature": 0.4,
  "user": "user"
}
And that results in:
Plain Text
{"error":{"param":null,"code":null},"provider":"anyscale"}
The same request to together-ai returns:
Plain Text
{"error":{"message":"(unknown path)\n  Error: Unable to call `content[\"trim\"]`, which is undefined or falsey","type":null,"param":null,"code":null},"provider":"together-ai"}
I think the problem here is that the gateway does not support the latest OpenAI specification:
https://github.com/openai/openai-openapi/blob/master/openapi.yaml#L5421-L5429
The structure of messages is incorrect. The one you are sending is the vision-model request body structure, and Anyscale does not currently support any vision models. You can use the OpenAI chat completions message structure to hit Anyscale models. Something like this:
Plain Text
"messages": [
        {
            "role": "user",
            "content": "Hello world"
        }
    ]
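A client that already emits the content-parts form could shim it down to the plain-string form for providers that only accept strings. A sketch (`flatten_content` is a hypothetical name, not a gateway API):

```python
def flatten_content(message):
    """Collapse an OpenAI content-parts array into the plain-string
    form (text parts only; image parts would be lost)."""
    content = message.get("content")
    if isinstance(content, list):
        text = " ".join(part["text"] for part in content
                        if part.get("type") == "text")
        return {**message, "content": text}
    return message

flatten_content({"role": "user",
                 "content": [{"type": "text", "text": "Hello world"}]})
# → {"role": "user", "content": "Hello world"}
```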
Text messages do not need to be just of type string according to the OpenAI spec. We generate our client from the YAML spec file, and that format works fine with OpenAI.
Also, if that's the case, it tells me you are currently not supporting image URLs in the vision API, because that changes the format of the content to optionally be an array, so you can pass text plus images.
That structure also works in OpenAI with other models, and some clients use it. I was using that structure with GPT-3.5 before trying Portkey.
This is the structure for ChatCompletionRequestMessage:
Plain Text
ChatCompletionRequestMessage:
      oneOf:
        - $ref: "#/components/schemas/ChatCompletionRequestSystemMessage"
        - $ref: "#/components/schemas/ChatCompletionRequestUserMessage"
        - $ref: "#/components/schemas/ChatCompletionRequestAssistantMessage"
        - $ref: "#/components/schemas/ChatCompletionRequestToolMessage"
        - $ref: "#/components/schemas/ChatCompletionRequestFunctionMessage"
All those different types are now supported, and each one has slight variations in format.
All for the chat completions endpoint.
It's not just part of the vision APIs; it also applies to regular chat completions.
The gateway currently supports this new structure only for OpenAI; it is not yet supported for other providers. Will you be able to create a GitHub issue for this? Otherwise I will create one by tomorrow and start working on it.
yes, creating one now, thanks!
Image URLs are supported for OpenAI models currently, but the new message structure is not yet supported for other providers.
thank you for all your help, I created https://github.com/Portkey-AI/gateway/issues/116 , please let me know if I can help in any way.
Would also like to understand if there is any reason you prefer the new structure even for non-vision models, because the OpenAI API docs only suggest the new structure for image input.
No particular reason; our library xef gets an autogenerated client, and it follows the types in the spec. The spec says ChatCompletionRequestMessage has multiple types, and we can't control how our users structure the message since we publicly export the OpenAI client.
We just want to be on par with the OpenAI endpoints, and currently they support this feature for all chat completions, not just images.
I get that supporting that spec with so many combinations is hard; we had pains for months until we decided to autogenerate the client from the YAML using openapi-generator.
Understood. Thanks for explaining!