Add evals for the bindings server and a hyperdrive binding #117

deloreyj · 2025-04-30T20:54:14Z

This PR adds tools for Hyperdrive and introduces evals to the workers-bindings MCP server

cmsparks · 2025-04-30T20:57:45Z

apps/workers-bindings/evals/accounts.eval.ts

+				input: 'List all my Cloudflare accounts.',
+				expected: 'The accounts_list tool should be called to retrieve the list of accounts.',
+			},
+			{


I would split these into separate describe eval scripts. Like

describeEval("List accounts", ...) describeEval("Set account", ...)

Then you don't need to have the if statements below, because to me that feels like a bit of a code smell in tests.

cmsparks

LGTM

cmsparks · 2025-04-30T20:58:54Z

apps/workers-bindings/evals/kv_namespaces.eval.ts

+			const client = await initializeClient(/* Pass necessary mocks/config */)
+			const { promptOutput, toolCalls, fullResult } = await runTask(client, model, input)
+
+			if (input.includes('List all my Cloudflare KV Namespaces')) {


Same here, I'd split this into a separate describe eval.

deloreyj and others added 11 commits April 30, 2025 14:15

feat: add hyperdrive bindings and evals

63876e9

chore: fix dev mode and change CI to use dev mode

49bea21

chore: package version updates

90754fd

chore: fix formatting

03f51f8

chore: do not fail with no tests

9469ec1

fix: make evals work

bcada55

fix: formatting

5f471f3

fix: change port

5885a45

fix: override inspector port

b924b00

chore: remove console.logs

e5fdecf

chore: fix formatting

11c91ae

cmsparks reviewed Apr 30, 2025

View reviewed changes

cmsparks requested changes Apr 30, 2025

View reviewed changes

Maximo-Guk approved these changes Apr 30, 2025

View reviewed changes

cmsparks self-requested a review April 30, 2025 20:59

cmsparks approved these changes Apr 30, 2025

View reviewed changes

chore: PR feedback

9c85534

deloreyj merged commit 38aa001 into main Apr 30, 2025
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add evals for the bindings server and a hyperdrive binding #117

Add evals for the bindings server and a hyperdrive binding #117

deloreyj commented Apr 30, 2025

cmsparks Apr 30, 2025 •

edited

Loading

cmsparks left a comment

cmsparks Apr 30, 2025

Add evals for the bindings server and a hyperdrive binding #117

Add evals for the bindings server and a hyperdrive binding #117

Conversation

deloreyj commented Apr 30, 2025

cmsparks Apr 30, 2025 • edited Loading

Choose a reason for hiding this comment

cmsparks left a comment

Choose a reason for hiding this comment

cmsparks Apr 30, 2025

Choose a reason for hiding this comment

cmsparks Apr 30, 2025 •

edited

Loading