quivr

mirror of https://github.com/StanGirard/quivr.git synced 2024-12-03 06:24:15 +03:00

History

Stan Girard 38589d32cf feat(backend): add RAG evaluation using Ragas (#2429 ) # Description Please include a summary of the changes and the related issue. Please also include relevant motivation and context. ## Checklist before requesting a review Please delete options that are not relevant. - [ ] My code follows the style guidelines of this project - [ ] I have performed a self-review of my code - [ ] I have commented hard-to-understand areas - [ ] I have ideally added tests that prove my fix is effective or that my feature works - [ ] New and existing unit tests pass locally with my changes - [ ] Any dependent changes have been merged ## Screenshots (if appropriate): <!-- ELLIPSIS_HIDDEN --> ---- \| <a href="https://ellipsis.dev" target="_blank"><img src="https://avatars.githubusercontent.com/u/80834858?s=400&u=31e596315b0d8f7465b3ee670f25cea677299c96&v=4" alt="Ellipsis" width="30px" height="30px"/></a> \| 🚀 This PR description was created by [Ellipsis](https://www.ellipsis.dev) for commit `5350e0a071`. \| \|--------\|--------\| ### Summary: This PR adds a new script for evaluating the RAG model using the Ragas library, with results saved as a JSON file and printed to the console. Key points: - New script `run_evaluation.py` added to `backend/tests/ragas_evaluation/`. - The script processes documents, generates replies using a QuivrRAG chain, and evaluates the replies using specified metrics. - Results are saved as a JSON file and printed to the console. - The script can be run from the command line with various options. ---- Generated with ❤️ by [ellipsis.dev](https://www.ellipsis.dev) <!-- ELLIPSIS_HIDDEN --> --------- Co-authored-by: Damien Mourot <damien.mourot@gmail.com> Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>	2024-04-15 06:53:48 -07:00
..
run_evaluation.py	feat(backend): add RAG evaluation using Ragas (#2429 )	2024-04-15 06:53:48 -07:00

feat(backend): add RAG evaluation using Ragas (#2429 )

# Description

Please include a summary of the changes and the related issue. Please
also include relevant motivation and context.

## Checklist before requesting a review

Please delete options that are not relevant.

- [ ] My code follows the style guidelines of this project
- [ ] I have performed a self-review of my code
- [ ] I have commented hard-to-understand areas
- [ ] I have ideally added tests that prove my fix is effective or that
my feature works
- [ ] New and existing unit tests pass locally with my changes
- [ ] Any dependent changes have been merged

## Screenshots (if appropriate):


<!--
ELLIPSIS_HIDDEN
-->


----

| <a href="https://ellipsis.dev" target="_blank"><img
src="https://avatars.githubusercontent.com/u/80834858?s=400&u=31e596315b0d8f7465b3ee670f25cea677299c96&v=4"
alt="Ellipsis" width="30px" height="30px"/></a> | 🚀 This PR
description was created by [Ellipsis](https://www.ellipsis.dev) for
commit 5350e0a071. |
|--------|--------|

### Summary:
This PR adds a new script for evaluating the RAG model using the Ragas
library, with results saved as a JSON file and printed to the console.

**Key points**:
- New script `run_evaluation.py` added to
`backend/tests/ragas_evaluation/`.
- The script processes documents, generates replies using a QuivrRAG
chain, and evaluates the replies using specified metrics.
- Results are saved as a JSON file and printed to the console.
- The script can be run from the command line with various options.


----
Generated with ❤️ by [ellipsis.dev](https://www.ellipsis.dev)



<!--
ELLIPSIS_HIDDEN
-->

---------

Co-authored-by: Damien Mourot <damien.mourot@gmail.com>
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>

2024-04-15 06:53:48 -07:00

run_evaluation.py feat(backend): add RAG evaluation using Ragas (#2429 ) 2024-04-15 06:53:48 -07:00