You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Example of reliability scoring in human eval
Added notebook supporting GTC talk about the human touch.
This serves as basis to produce a REL score for win-tie-loss human evaluation with 2 models.
* moved files around and created folder
0 commit comments