You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For now we can only see evaluations of traces(grouped by dataset) in experiments, but we can not view thread evaluations in experiments to see overview performance.
This is inconvenient to use thread evaluations(although we can we the score for each single thread in thread)