Welcome to Portkey Forum

Updated 2 days ago

Vertex structured outputs

Hi everyone!
I'm working with the openai client + portkey and I can't send structured_output type queries to vertex ai gemini models.
However sending the same queries to Azure Openai hosted model o1 works fine.
I'm attaching the request I send and the error I receive.
I'm using the openai==1.61.1 package version & 1.9.5 gateway proxy.
Will be glad for your assistance πŸ™‚
25 comments
Hi @artyabra, checking this!
Hey, did you guys have a chance to take a look?
@Vrushank | Portkey ?
Hi @artyabra so sorry for the delay here - let us get back to you with the docs on this shortly. @sega
Hi, @artyabra. Is it possible for you to share the Python code you're using? Or at least share the pydantic model you're using for the response_format value.

I'll also try taking a look from my side and maybe share a working example as well.
Hi, there's no problem in sharing. I'll send you the response_format first, since it's easier:

Python
class Step(BaseModel):
    explanation: str
    output: str

class MathReasoning(BaseModel):
    steps: list[Step]
    final_answer: str


I'm sending response_format=MathReasoning
It's an example from openai docs
Sure, thanks for sharing the info. I'll get back to you with a solution soon.
Ah, and the message, if you want it, is also like the one in the openai docs:

Python
messages = [
        {"role": "system", "content": "You are a helpful math tutor. Guide the user through the solution step by step."},
        {"role": "user", "content": "how can I solve 8x + 7 = -23"}
]
Just tested the code, it's working fine for me.
Python
from portkey_ai import Portkey
from pydantic import BaseModel

portkey = Portkey(
    api_key="api_key",
    virtual_key="key",
)


class Step(BaseModel):
    explanation: str
    output: str


class MathReasoning(BaseModel):
    steps: list[Step]
    final_answer: str


completion = portkey.beta.chat.completions.parse(
    model="gemini-2.0-pro-exp-02-05",
    messages=[
        {
            "role": "system",
            "content": "You are a helpful math tutor. Guide the user through the solution step by step.",
        },
        {"role": "user", "content": "how can I solve 8x + 7 = -23"},
    ],
    response_format=MathReasoning,
)

print(completion)
This is the code I tried.
Plain Text
ParsedChatCompletion[MathReasoning](id='portkey-25129568-7600-4fc7-9d7d-4e5d20b4a707', choices=[ParsedChoice[MathReasoning](finish_reason='STOP', index=0, logprobs=None, message=ParsedChatCompletionMessage[MathReasoning](content='{\n  "final_answer": "-3.75",\n  "steps": [\n    {\n      "explanation": "The goal is to isolate x on one side of the equation. First, subtract 7 from both sides.",\n      "output": "8x + 7 - 7 = -23 - 7"\n    },\n    {\n      "explanation": "Simplify both sides of the equation.",\n      "output": "8x = -30"\n    },\n    {\n      "explanation": "Divide both sides by 8 to isolate x.",\n      "output": "8x / 8 = -30 / 8"\n    },\n    {\n      "explanation": "Simplify to get the value of x.",\n      "output": "x = -3.75"\n    }\n  ]\n}', refusal=None, role='assistant', function_call=None, tool_calls=[], parsed=MathReasoning(steps=[Step(explanation='The goal is to isolate x on one side of the equation. First, subtract 7 from both sides.', output='8x + 7 - 7 = -23 - 7'), Step(explanation='Simplify both sides of the equation.', output='8x = -30'), Step(explanation='Divide both sides by 8 to isolate x.', output='8x / 8 = -30 / 8'), Step(explanation='Simplify to get the value of x.', output='x = -3.75')], final_answer='-3.75')))], created=1739961346, model='Unknown', object='chat_completion', service_tier=None, system_fingerprint=None, usage=CompletionUsage(completion_tokens=188, prompt_tokens=52, total_tokens=240), provider='vertex-ai')

This is the response I got.
portkey-ai==1.9.1 SDK Version.
I'm not using the portkey client, I'm using the openai client.
got it, let me check that too.
yea, I got a response with openai as well.
openai==1.63.2
can you send me the request it's generating for you and being sent to the gateway proxy?
Python
from portkey_ai import createHeaders
from openai import OpenAI
from pydantic import BaseModel

client = OpenAI(
    api_key="api-key",  # placeholder to skip the client-side error
    base_url="gateway_url",
    default_headers=createHeaders(
        api_key="key",
        virtual_key="key",
    ),
)


class Step(BaseModel):
    explanation: str
    output: str


class MathReasoning(BaseModel):
    steps: list[Step]
    final_answer: str


completion = client.beta.chat.completions.parse(
    model="gemini-2.0-pro-exp-02-05",
    messages=[
        {
            "role": "system",
            "content": "You are a helpful math tutor. Guide the user through the solution step by step.",
        },
        {"role": "user", "content": "how can I solve 8x + 7 = -23"},
    ],
    response_format=MathReasoning,
)

print(completion)

This is the code block I used.
Let me know if that worked or not; we can also get on a call to quickly understand more.
I'm not familiar with the createHeaders method.
Can you try with these default_headers?

Plain Text
{
  "strategy" : {
    "mode" : "loadbalance"
  },
  "request_timeout" : 60000,
  "targets" : [ {
    "provider" : "vertex-ai",
    "vertex_region" : "us-central1",
    "override_params" : {
      "model" : "gemini-2.0-pro-exp-02-05",
      "safety_settings" : [ {
        "category" : "HARM_CATEGORY_DANGEROUS_CONTENT",
        "threshold" : "BLOCK_NONE"
      }, {
        "category" : "HARM_CATEGORY_HARASSMENT",
        "threshold" : "BLOCK_NONE"
      }, {
        "category" : "HARM_CATEGORY_HATE_SPEECH",
        "threshold" : "BLOCK_NONE"
      }, {
        "category" : "HARM_CATEGORY_SEXUALLY_EXPLICIT",
        "threshold" : "BLOCK_NONE"
      } ]
    },
    "api_key" : api_key,
    "vertex_project_id" : vertex_project_id,
    "weight" : 1
  } ]
}
I also don't have a virtual key.
I sent the x-portkey-config key in default_headers
If you're not using the createHeaders method, you have to format your header keys manually.
request_timeout would be x-portkey-request-timeout, and similarly for the others in kebab-case.
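Following that convention, the manual header setup might be sketched like this (the helper name `to_portkey_headers` is illustrative, not part of any Portkey SDK; the header names follow the kebab-case x-portkey-* convention described above):

```python
import json

# Hypothetical helper: build the kebab-case x-portkey-* headers manually,
# mirroring what createHeaders would produce. This is an illustrative
# sketch of the convention, not the official Portkey SDK.
def to_portkey_headers(api_key: str, config: dict, request_timeout_ms: int) -> dict:
    return {
        "x-portkey-api-key": api_key,
        # The config object is passed as a JSON string in a single header.
        "x-portkey-config": json.dumps(config),
        # Header values must be strings, so numeric settings are stringified.
        "x-portkey-request-timeout": str(request_timeout_ms),
    }

headers = to_portkey_headers(
    api_key="your-portkey-api-key",
    config={"strategy": {"mode": "loadbalance"}},
    request_timeout_ms=60000,
)
print(headers["x-portkey-request-timeout"])  # prints 60000
```

These headers can then be passed as `default_headers` when constructing the OpenAI client, in place of the createHeaders call shown earlier.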
The root cause is that we're using the pydantic v1 package.
When we install pydantic v2, it works.
However, it's hard for us to migrate to pydantic v2.
Thank you @b4s36t4 for identifying the root cause!