You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: nemoguardrails/eval/data/topical/README.md
+6-4Lines changed: 6 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -28,7 +28,7 @@ For additional information about topical rails evaluation and results on the two
28
28
### Chit-chat dataset
29
29
30
30
We are using a slightly modified version of the chit-chat dataset available [here](https://github.com/rahul051296/small-talk-rasa-stack).
31
-
For this dataset, we have configured a [Guardrail app](./chitchat) that already has the:
31
+
For this dataset, we have configured a [Guardrail app](./chitchat) that already has:
32
32
- Config file: `config.yml`
33
33
- A set of defined flows: `flows.co`
34
34
- A set of predefined bot messages for the topical rails: `bot.co`
@@ -51,7 +51,7 @@ To run the topical evaluation on this dataset run:
51
51
### Banking dataset
52
52
53
53
We are starting from the banking dataset available [here](https://github.com/PolyAI-LDN/task-specific-datasets/tree/master/banking_data).
54
-
For this dataset, we have configured a [Guardrail app](./banking) that already has the:
54
+
For this dataset, we have configured a [Guardrail app](./banking) that already has:
55
55
- Config file: `config.yml`
56
56
- A set of defined flows: `flows.co`
57
57
- A file mapping the user intents in the original dataset to user canonical forms used by Guardrails: `categories_canonical_forms.json`
@@ -73,5 +73,7 @@ To run the topical evaluation on this dataset run:
73
73
74
74
If you want to assess the performance of topical rails with a new NLU dataset, you can use the `./nemoguardrails/eval/data/topical/dataset_tools.py` functionality.
75
75
For each dataset, you need to define a new class that extends the `DatasetConnector` class and implements the two following two functions:
76
-
-`read_dataset`
77
-
-`_read_canonical_forms`
76
+
-`read_dataset`: Reads the dataset from the specified path, instantiating at least intent names, intent canonical forms, and intent samples.
77
+
The path received as parameter should contain the original dataset files, in the specific format they were distributed.
78
+
-`_read_canonical_forms`: Reads the intent - canonical form mappings from a file.
79
+
This can be a `json` or any other format and should be created by the evaluation user as the mapping is not part of the original dataset.
0 commit comments