-
Notifications
You must be signed in to change notification settings - Fork 966
feat: Add context evaluation to eval framework [COG-1366] #586
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
53 commits
Select commit
Hold shift + click to select a range
468de67
refactor: utils folder in retrieval
lxobr d789dd0
feat: add base_retriever.py
lxobr 49c2355
feat: add retriever classes
lxobr 7619df2
fix: include generate_completion function
lxobr 5a5eb5e
feat: add search comparison script, compare summaries
lxobr 8f0cbee
feat: enable context dumping
lxobr beacdea
fix: improve context getting and completion
lxobr 4b71081
feat: add all searches and context comparisons
lxobr 7631b11
Merge branch 'dev' into feat/COG-1365-unify-retrievers
lxobr 62f8ac3
Update cognee/tasks/completion/query_completion.py
lxobr 58c7eaf
feat: context dumping error handling
lxobr afd5ca4
feat: expose aggregate metrics, enable saving
lxobr 8bf5aae
feat: add modal example
lxobr fd7f837
delete: metrics_dashboard.py
lxobr 416eed1
fix: dashboard generation
lxobr d9fcb12
feat: add get_golden_context flag
lxobr 36dbdf7
feat: implement get_golden_context for hotpot_qa
lxobr c07cf22
chore: added todos
lxobr 2ef174a
chore: added a todo
lxobr 5910fb7
Merge branch 'dev' into feat/COG-1365-unify-retrievers
lxobr 65784e1
Merge branch 'dev' into feat/COG-1364-golden-contexts
lxobr bdaea29
feat: simplify twowikimultihop, get golden context
lxobr 32d5829
feat: add golden context to musique_adapter.py
lxobr ec3b753
Merge branch 'dev' into feat/COG-1331-modal-run-eval
lxobr 2f70de4
fix: update tests
lxobr 3d0b839
Merge branch 'dev' into feat/COG-1365-unify-retrievers
lxobr 4903d7e
feat: update code retriever
lxobr e98c12e
refactor: rename variables
lxobr af5d7c6
Merge branch 'dev' into feat/COG-1364-golden-contexts
lxobr 0ece58a
refactor: add metadata_field_name property
lxobr cb0fccd
Merge remote-tracking branch 'origin/feat/COG-1331-modal-run-eval' in…
alekszievr 1eb5e71
Merge remote-tracking branch 'origin/feat/COG-1364-golden-contexts' i…
alekszievr 30927d7
First render.
soobrosa b02231d
Small fixes.
soobrosa 2d90221
coderabbit don't be smart
soobrosa 68a4584
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr 8521fa8
Merge remote-tracking branch 'origin/feature/cog-1403-transition-to-n…
alekszievr 3906bf5
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr aae1237
Calculate context relevancy score
alekszievr 4cffd4b
Adjust dashboard tests
alekszievr 2e2beb3
Adjust answer generation test
alekszievr 7a574e3
adjust deepeval adapter test
alekszievr a17a5c8
Fix type hinting
alekszievr 3f10725
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr df5ba7b
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr 4c09877
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr d9b007a
ruff format
alekszievr 5691a1f
fix
alekszievr 634a7fa
fix: add comment to new param
borisarzentar 3453ede
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
borisarzentar 5b9a64d
Merge branch 'dev' into feat/cog-1366-add-context-evaluation
alekszievr 6b2b6f2
Pass system prompt in question answering
alekszievr 1a3371e
Adjust tests
alekszievr File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
38 changes: 18 additions & 20 deletions
38
cognee/eval_framework/answer_generation/answer_generation_executor.py
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.