mirror of
https://github.com/hwchase17/langchain.git
synced 2025-06-03 21:54:04 +00:00
Notebook shows preference scoring between two chains and reports wilson score interval + p value I think I'll add the option to insert ground truth labels but doesn't have to be in this PR |
||
---|---|---|
.. | ||
deployments | ||
evaluation | ||
model_laboratory.ipynb |