Closed
Description
I built an agent 2 weeks ago. The agent has 4 python tools and one MCP server.
It's kyc_agent.py here
Today, the same agent with the same tools and the same sequence of questions is taking over one minute and sometimes up to two minutes to return an answer.
I can see in traces that the tools are executing fast (1-2 seconds at the most) but I can't tell what is taking so long to produce the final answer that is sent by the agent.
No code changes in the last 2 weeks.
I can provide traces showing the huge difference in latency.
I tried changing default model (gpt-4o) to o3-mini, o4-mini and same result
What would you suggest to troubleshoot this problem?