Skip to content

Commit 88b7ee1

Browse files
committed
replace o3-mini with o4-mini
1 parent a074073 commit 88b7ee1

File tree

7 files changed

+7
-7
lines changed

7 files changed

+7
-7
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ docker compose up --build
3131

3232
We use a simple AI agent with tool use. An LLM attempts to answer a user question with 3 tools – `text_to_sql`, `web_search`, and `pdf_with_citations`.
3333

34-
The model then recursively asks questions using one of these tools until it is satisfied that it has enough context to answer the users question. By default, we use `o3-mini` for text to SQL, `gemini-2.0-flash` for web search, and `claude-3-7-sonnet` for both PDF analysis and orchestration.
34+
The model then recursively asks questions using one of these tools until it is satisfied that it has enough context to answer the users question. By default, we use `o4-mini` for text to SQL, `gemini-2.0-flash` for web search, and `claude-3-7-sonnet` for both PDF analysis and orchestration.
3535

3636
<img width="730" alt="image" src="https://github.com/user-attachments/assets/1b719e12-e4ea-4e85-82ee-5ac09f07f27a" />
3737

backend/query_routes.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ async def generate_sql_query_route(request: GenerateSQLQueryRequest):
2929
previous_context=request.previous_context,
3030
hard_filters=request.hard_filters,
3131
num_golden_queries=request.num_golden_queries,
32-
model_name=request.model_name or "o3-mini",
32+
model_name=request.model_name or "o4-mini",
3333
)
3434
if resp is None:
3535
return JSONResponse(

backend/tests/backend_routes/test_tools.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -412,7 +412,7 @@ async def ticket_summary_tool(input: TicketSummaryInput):
412412
json={
413413
"token": admin_token,
414414
"question": user_question,
415-
"model": "o3-mini"
415+
"model": "o4-mini"
416416
},
417417
headers={"Content-Type": "application/json"},
418418
)

backend/tools/tool_routes.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ async def answer_question_from_database_route(
4646
"""
4747
question = request.question
4848
db_name = request.db_name
49-
model = request.model or "o3-mini"
49+
model = request.model or "o4-mini"
5050
return await generate_report_from_question(
5151
question=question,
5252
db_name=db_name,

backend/utils_join_hints.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,7 @@ async def infer_join_hints(
4242
]
4343
LOGGER.debug(f"Join hints user prompt: {user_prompt}")
4444
response = await chat_async(
45-
model="o3-mini",
45+
model="o4-mini",
4646
messages=messages,
4747
max_completion_tokens=16384,
4848
response_format=JoinHints,

backend/utils_sql.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -523,7 +523,7 @@ async def generate_sql_query(
523523
previous_context: list[Dict[str, str]] = None,
524524
hard_filters: list[HardFilter] = None,
525525
num_golden_queries: int = 4,
526-
model_name: str = "o3-mini",
526+
model_name: str = "o4-mini",
527527
):
528528
"""
529529
Generate SQL query for a given question, using an LLM.

backend/utils_table_descriptions.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -156,7 +156,7 @@ async def infer_table_descriptions(
156156
{"role": "user", "content": user_prompt},
157157
]
158158
response = await chat_async(
159-
model="o3-mini",
159+
model="o4-mini",
160160
messages=messages,
161161
response_format=TableDescriptions,
162162
)

0 commit comments

Comments
 (0)