Langchain ChatOpenAI errors out with cached responses w...

At a glance

GGriphook | Portkey

openai stream does not send token count

27 comments

ddeepanshu_11

@sega does that mean I can't cache with streaming?

GGriphook | Portkey

you can cache

GGriphook | Portkey

cache is based on request

ddeepanshu_11

but this code is giving the error

Plain Text

config = {
    "cache": {
        "mode": "semantic",
    },
    "retry" : {
        "attempts": 3
    },
}


portkey_headers = createHeaders(api_key= os.getenv("PORTKEY_API_KEY"),
                                provider="openai",
                                metadata={"_user": "m"},
                                config=config
                                )


def init_openai_chat(temperature):
    OPENAI_API_KEY = os.getenv("OPENAI_API_KEY")
    return ChatOpenAI(
        openai_api_key=OPENAI_API_KEY, streaming=True, temperature=temperature, model='gpt-4o',
        base_url=PORTKEY_GATEWAY_URL, default_headers=portkey_headers
    )

Error:

Plain Text

    total_tokens = oai_token_usage.get("total_tokens", input_tokens + output_tokens)
                                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: unsupported operand type(s) for +: 'NoneType' and 'int'

@sega any solutions for this?

GGriphook | Portkey

I'm able to replicate this

ddeepanshu_11

yes this is bit urgent @sega lemme know once fixed, by when do you think you can roll the update?

GGriphook | Portkey

I'm able to replicate the issue, I'm still trying to understand what's causing it

ddeepanshu_11

Okay sure

GGriphook | Portkey

Hey @deepanshu_11 just add this option stream_usage=True when initializing the client. We'll make this the default behaviour in the opensource gateway

Plain Text

...
    return ChatOpenAI(
        openai_api_key=OPENAI_API_KEY, streaming=True, temperature=temperature, model='gpt-3.5-turbo', default_headers=portkey_headers, base_url=PORTKEY_GATEWAY_URL, max_tokens=10,
        stream_usage=True
    )
...

GGriphook | Portkey

Let me know, if this doesn't fix the issue

ddeepanshu_11

sure lemme try

GGriphook | Portkey

Langchains ChatOpenAI errors out with cached responses

GGriphook | Portkey

Langchain ChatOpenAI errors out with cached responses

GGriphook | Portkey

Langchain ChatOpenAI errors out with cached responses while streaming

ddeepanshu_11

Also @sega since the documents are very long, its resulting into "openai error: This model's maximum context length is 128000 tokens. "