mirror of
https://github.com/kennethreitz/instructor.git
synced 2026-06-05 22:50:18 +00:00
add evals
This commit is contained in:
@@ -1,5 +1,9 @@
|
||||
We would love for you to contribute to `Instructor`.
|
||||
|
||||
## [Evals](https://github.com/jxnl/instructor/tree/main/tests/openai/evals)
|
||||
|
||||
We invite you to contribute evals in pytest as a way to monitor the quality of the openai models and the instructor library. To get started check out the [jxnl/instructor/tests/evals](https://github.com/jxnl/instructor/tree/main/tests/openai/evals) and contribute your own evals in the form of pytest tests. These evals will be run once a week and the results will be posted.
|
||||
|
||||
## Issues
|
||||
|
||||
If you find a bug, please file an issue on [our issue tracker on GitHub](https://github.com/jxnl/instructor/issues).
|
||||
|
||||
@@ -0,0 +1,9 @@
|
||||
# How to Contribute: Writing and Running Evaluation Tests
|
||||
|
||||
We welcome contributors to expand our suite of evaluation tests for data extraction. This guide provides instructions on creating tests with `pytest`, `pydantic`, and other tools, focusing on broad coverage and failure modalities understanding.
|
||||
|
||||
## Define Test Scenarios
|
||||
|
||||
Identify data extraction scenarios relevant to you. Create test cases with inputs and expected outputs.
|
||||
|
||||
Reference the `test_extract_users.py` which contains a test case for extracting users, using all models and all modes. The test case is parameterized with the model and mode, and the test function is parameterized with the input and expected output.
|
||||
Reference in New Issue
Block a user