Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
benchmark information-retrieval evaluation dataset bias evaluation-framework large-language-models llm llms large-language-model llm4ir source-bias
-
Updated
Jun 4, 2024 - Python