chandar-lab / epik-eval Goto Github PK
View Code? Open in Web Editor NEWBenchmark to evaluate the capability of language models to consolidate and recall information from multiple training documents.
Home Page: https://gabprato.github.io/epik-eval/
License: MIT License