qherreros / toolqa Goto Github PK
View Code? Open in Web Editor NEWThis project forked from night-chen/toolqa
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.
Home Page: https://arxiv.org/pdf/2306.13304.pdf
License: Apache License 2.0