Skip to content

[bug] ERROR - Benchmark directory not found: benchmarks/DPBench-e2e/eval_dataset/DPBench/layout #148

@msmygit

Description

@msmygit

I'm trying to follow this page https://github.com/docling-project/docling-eval/blob/main/docs/DP-Bench_benchmarks.md and hit with this at the Create the evaluation report: step,

$ docling-eval evaluate \
  --modality layout \
  --benchmark DPBench \
  --output-dir ./benchmarks/DPBench-e2e
2025-08-20 13:28:13,211 - ERROR - Benchmark directory not found: benchmarks/DPBench-e2e/eval_dataset/DPBench/layout

I tried to create the below directories manually,

  • mkdir -p benchmarks/DPBench-e2e/eval_dataset/DPBench
  • mkdir -p benchmarks/DPBench-e2e/eval_dataset/DPBench/layout
    and it fails with below,
(docling-eval) ubuntu@ip-172-31-90-179:~/docling-eval$ mkdir -p benchmarks/DPBench-e2e/eval_dataset/DPBench
(docling-eval) ubuntu@ip-172-31-90-179:~/docling-eval$ mkdir -p benchmarks/DPBench-e2e/eval_dataset/DPBench/layout

(docling-eval) ubuntu@ip-172-31-90-179:~/docling-eval$ docling-eval evaluate   --modality layout   --benchmark DPBench   --output-dir ./benchmarks/DPBench-e2e
2025-08-20 14:09:25,358 - INFO - Loading the split 'test' from: 'benchmarks/DPBench-e2e/eval_dataset/DPBench/layout'
2025-08-20 14:09:25,358 - INFO - #-files: 0
Generating test split: 0 examples [00:00, ? examples/s]
Traceback (most recent call last):
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/builder.py", line 1887, in _prepare_split_single
    num_examples, num_bytes = writer.finalize()
                              ^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/arrow_writer.py", line 669, in finalize
    raise SchemaInferenceError("Please pass `features` or at least one example when writing data")
datasets.arrow_writer.SchemaInferenceError: Please pass `features` or at least one example when writing data

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/ubuntu/docling-eval/.venv/bin/docling-eval", line 10, in <module>
    sys.exit(app())
             ^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/typer/main.py", line 341, in __call__
    raise e
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/typer/main.py", line 324, in __call__
    return get_command(self)(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/click/core.py", line 1442, in __call__
    return self.main(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/typer/core.py", line 757, in main
    return _main(
           ^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/typer/core.py", line 195, in _main
    rv = self.invoke(ctx)
         ^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/click/core.py", line 1830, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/click/core.py", line 1226, in invoke
    return ctx.invoke(self.callback, **ctx.params)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/click/core.py", line 794, in invoke
    return callback(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/typer/main.py", line 699, in wrapper
    return callback(**use_params)
           ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/docling_eval/cli/main.py", line 1310, in evaluate_cmd
    evaluate(
  File "/home/ubuntu/docling-eval/docling_eval/cli/main.py", line 577, in evaluate
    evaluation = layout_evaluator(  # type: ignore
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/docling_eval/evaluators/layout_evaluator.py", line 192, in __call__
    ds = load_dataset("parquet", data_files={split: split_files})
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/load.py", line 2084, in load_dataset
    builder_instance.download_and_prepare(
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/builder.py", line 925, in download_and_prepare
    self._download_and_prepare(
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/builder.py", line 1001, in _download_and_prepare
    self._prepare_split(split_generator, **prepare_split_kwargs)
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/builder.py", line 1742, in _prepare_split
    for job_id, done, content in self._prepare_split_single(
  File "/home/ubuntu/docling-eval/.venv/lib/python3.12/site-packages/datasets/builder.py", line 1898, in _prepare_split_single
    raise DatasetGenerationError("An error occurred while generating the dataset") from e
datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions