|
61 | 61 | "\n",
|
62 | 62 | "### OpenAI Model Evolution \n",
|
63 | 63 | "\n",
|
64 |
| - "\n", |
| 64 | + |
| 65 | + "\n", |
| 66 | + |
65 | 67 | "\n",
|
66 | 68 | "### Key Characteristics\n",
|
67 | 69 | "\n",
|
|
79 | 81 | "\n",
|
80 | 82 | "## 3A. Use Case: Long-Context RAG for Legal Q&A\n",
|
81 | 83 | "\n",
|
82 |
| - "\n", |
| 84 | + |
| 85 | + "\n", |
| 86 | + |
83 | 87 | "## 🗂️ TL;DR Matrix\n",
|
84 | 88 | "\n",
|
85 | 89 | "This table summarizes the core technology choices and their rationale for **this specific Long-Context Agentic RAG implementation**.\n",
|
|
133 | 137 | "id": "db9bad1b",
|
134 | 138 | "metadata": {},
|
135 | 139 | "source": [
|
136 |
| - "\n", |
| 140 | + |
| 141 | + "\n", |
| 142 | + |
137 | 143 | "\n",
|
138 | 144 | "\n",
|
139 | 145 | "## Agentic RAG System: Model Usage\n",
|
|
1815 | 1821 | "================================================================================\n",
|
1816 | 1822 | "\n",
|
1817 | 1823 | "## 3B. Use Case: AI Co-Scientist for Pharma R&D\n",
|
1818 |
| - "\n", |
| 1824 | + |
| 1825 | + "\n", |
| 1826 | + |
1819 | 1827 | "\n",
|
1820 | 1828 | "This section details how to build an AI system that functions as a \"co-scientist\" to accelerate experimental design in pharmaceutical R&D, focusing on optimizing a drug synthesis process under specific constraints.\n",
|
1821 | 1829 | "\n",
|
|
1855 | 1863 | "\n",
|
1856 | 1864 | "The system employs a multi-agent architecture that emulates a high-performing scientific team. Different AI components, acting in specialized roles (such as ideation, critique, and learning from outcomes), collaborate using various models and tools to execute the workflow.\n",
|
1857 | 1865 | "\n",
|
1858 |
| - "\n", |
| 1866 | + |
| 1867 | + "\n", |
| 1868 | + |
1859 | 1869 | "\n",
|
1860 | 1870 | "### 2.1. **Scientist Input & Constraints:** \n",
|
1861 | 1871 | "The process starts with the scientist defining the goal, target compound, and constraints."
|
|
2463 | 2473 | "\n",
|
2464 | 2474 | "## 3C. Use Case: Insurance Claim Processing\n",
|
2465 | 2475 | "\n",
|
2466 |
| - "\n", |
| 2476 | + |
| 2477 | + "\n", |
| 2478 | + |
2467 | 2479 | "\n",
|
2468 | 2480 | "Many businesses are faced with the task of digitizing hand filled forms. In this section, we will demonstrate how OpenAI can be used to digitize and validate a hand filled insurance form. While this is a common problem for insurance, the same techniques can be applied to a variety of other industries and forms, for example tax forms, invoices, and more.\n",
|
2469 | 2481 | "\n",
|
|
2493 | 2505 | "\n",
|
2494 | 2506 | "The high level basic architecture of the solution is shown below.\n",
|
2495 | 2507 | "\n",
|
2496 |
| - "\n", |
| 2508 | + |
| 2509 | + "\n", |
| 2510 | + |
2497 | 2511 | "\n",
|
2498 | 2512 | "This task is complex and requires a wide variety of model capabilities, including vision, function calling, reasoning, and structured output. While `o3` is capable of doing all of these at once, we found during experimentation that `o4-mini` alone was not sufficient to achieve the necessary performance. Due to the higher relative costs of `o3`, we instead opted for a two-stage approach.\n",
|
2499 | 2513 | "\n",
|
|
2503 | 2517 | "\n",
|
2504 | 2518 | "To demonstrate concretely how this works, let's look at a sample image of an insurance form.\n",
|
2505 | 2519 | "\n",
|
2506 |
| - "\n", |
| 2520 | + |
| 2521 | + "\n", |
| 2522 | + |
2507 | 2523 | "\n",
|
2508 | 2524 | "While the form itself is fairly straightforward, there is missing data and ambiguous information that will be difficult for a traditional OCR system to fill out correctly. First, notice that the zip code and county have been omitted. Second, the email address of the user is ambiguous \\-- it could be `[email protected]` or `[email protected]`. In the following sections, we will walk through how a well-designed solution can handle these ambiguities and return the correct form results.\n",
|
2509 | 2525 | "\n",
|
|
3186 | 3202 | "\n",
|
3187 | 3203 | "## Adaptation Decision Tree\n",
|
3188 | 3204 | "\n",
|
3189 |
| - "\n", |
| 3205 | + |
| 3206 | + "\n", |
| 3207 | + |
| 3208 | + |
3190 | 3209 | "\n",
|
3191 | 3210 | "## Communicating Model Selection to Non-Technical Stakeholders\n",
|
3192 | 3211 | "\n",
|
|
3286 | 3305 | "\n",
|
3287 | 3306 | "## Contributors\n",
|
3288 | 3307 | "\n",
|
3289 |
| - "- Kashyap Coimbatore Murali\n", |
3290 |
| - "- Nate Harada \n", |
3291 |
| - "- Sai Prashanth Soundararaj \n", |
3292 |
| - "- Shikhar Kwatra " |
| 3308 | + |
| 3309 | + "- [Kashyap Coimbatore Murali](https://www.linkedin.com/in/kashyap-murali/)\n", |
| 3310 | + "- [Nate Harada](https://www.linkedin.com/in/nate-harada/) \n", |
| 3311 | + "- [Sai Prashanth Soundararaj](https://www.linkedin.com/in/saiprashanths/)\n", |
| 3312 | + "- [Shikhar Kwatra](https://www.linkedin.com/in/shikharkwatra/)" |
| 3313 | + |
3293 | 3314 | ]
|
3294 | 3315 | }
|
3295 | 3316 | ],
|
|
0 commit comments