Hyperfine 2.0 #788

@sharkdp

See also: Hyperfine 2.0 milestone

This is an (unfinished) planning document for a Hyperfine 2.0 release. This release will probably include some breaking changes (where they can't be avoided), so it makes sense to plan ahead. There are three major features that I would like to include:

  • Additional performance metrics
  • Interleaved benchmark execution
  • Saved benchmarks for later comparison

And there are two behavioral changes that might make sense (motivated by choices made by poop):

  • Make --shell=none the default
  • Use the first command as a --reference by default

These new features and behavioral changes might lead to breaking changes regarding (1) Hyperfine's presentation of results in the terminal, (2) the JSON output format, and (3) the command line interface. We will discuss these changes on a high level to gauge their overall impact, and I would love to get feedback on this. Detailed discussions of a particular feature should go in the respective tickets, though.

Additional performance metrics

The major new feature I am planning to add is support for additional performance metrics. We have received many feature requests for this over the years.

Recently, poop implemented this in a separate tool, seemingly because Hyperfine lacks this ability (maybe it will inspire the Hyperfine maintainers to add the extra data points). Here, I'm not so much interested in which exact new metrics we support, but rather in the fact that we need some architectural/output-format changes to be able to support multiple metrics. Ideally, wall clock time and user/system time would not be handled any differently from more advanced metrics like CPU counters and peak memory usage.

We probably want to invest in an abstraction that can handle these measurements in a uniform way. This abstraction would handle things like taking the actual measurement, providing/handling the different units of measurement, I/O, etc.
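
As a rough sketch (not a concrete API proposal), such an abstraction could be a trait that every supported metric implements. All of the names below are placeholders:

```rust
/// Unit of a measured quantity, used for formatting and for the JSON output.
enum Unit {
    Second,
    Byte,
    Count, // e.g. cache misses, context switches
}

/// A single measured value from one benchmark run.
struct Sample {
    value: f64,
    unit: Unit,
}

/// Placeholder for whatever per-run data the executor collects
/// (exit status, rusage, perf counters, ...).
struct FinishedRun {
    wall_clock_secs: f64,
    max_rss_bytes: u64,
}

/// Anything that can be measured for a single run of a command.
trait Metric {
    /// Machine-readable identifier, e.g. "wall_clock" or "peak_rss".
    fn id(&self) -> &'static str;
    /// Human-readable name for terminal output.
    fn name(&self) -> &'static str;
    /// Whether a smaller value is better (true for time and memory; a metric
    /// like instructions-per-cycle would be "higher is better").
    fn lower_is_better(&self) -> bool;
    /// Extract this metric's value from one finished run.
    fn measure(&self, run: &FinishedRun) -> Sample;
}

/// Example: wall clock time expressed through the same interface as any
/// other metric.
struct WallClock;

impl Metric for WallClock {
    fn id(&self) -> &'static str {
        "wall_clock"
    }
    fn name(&self) -> &'static str {
        "Wall clock time"
    }
    fn lower_is_better(&self) -> bool {
        true
    }
    fn measure(&self, run: &FinishedRun) -> Sample {
        Sample { value: run.wall_clock_secs, unit: Unit::Second }
    }
}
```

A design along these lines would also help with the wording problem mentioned below, since each metric carries its own notion of "better".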

We should probably also change the JSON output format so that all of these measurements are available in the same structure (which is basically what #660 is asking for).
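
For illustration only (this is a sketch of the idea, not the format discussed in #660), a unified result entry could keep every metric in the same keyed structure, assuming serde with the derive feature:

```rust
use std::collections::BTreeMap;

use serde::{Deserialize, Serialize};

/// Per-metric statistics; every metric uses the same shape.
#[derive(Serialize, Deserialize)]
struct MetricSummary {
    unit: String, // e.g. "s" or "bytes"
    mean: f64,
    stddev: f64,
    min: f64,
    max: f64,
    values: Vec<f64>, // one entry per run
}

/// One benchmarked command with all of its metrics in a single map,
/// keyed by metric id ("wall_clock", "user_time", "peak_rss", ...).
#[derive(Serialize, Deserialize)]
struct BenchmarkResult {
    command: String,
    metrics: BTreeMap<String, MetricSummary>,
}
```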

We also need to design a CLI that allows users to include/exclude additional performance metrics. We need to choose a good default. And we probably want a way to select one of these metrics as the target metric for comparisons.
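
To make the discussion concrete, here is one hypothetical shape such a CLI could take, sketched with clap's derive API. The flag names --measure and --compare-by are invented placeholders, not existing or agreed-upon options:

```rust
use clap::Parser;

#[derive(Parser)]
struct Cli {
    /// Metrics to collect in addition to a sensible default set,
    /// e.g. `--measure peak-rss --measure cache-misses`.
    #[arg(long = "measure", value_name = "METRIC")]
    measure: Vec<String>,

    /// The metric used as the target for relative comparisons.
    #[arg(long = "compare-by", value_name = "METRIC", default_value = "wall-clock")]
    compare_by: String,

    /// The commands to benchmark.
    #[arg(required = true)]
    commands: Vec<String>,
}

fn main() {
    let cli = Cli::parse();
    println!(
        "measuring {:?} in addition to the defaults, comparing by {}",
        cli.measure, cli.compare_by
    );
}
```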

We also need to become more generic w.r.t. wording/naming in lots of places. It's not necessarily the "fastest" command we're looking for, but maybe the most memory-efficient, or the one with the least cache misses.

Interleaved benchmark execution

Ticket: #21

I think this would mainly affect how we display intermediate benchmark results while they are still running. A disadvantage would be that we would not be able to show the finished results for one command as soon as they are done. But an advantage might be that we could show current estimates for all commands, not just a single one.

An important open question is how this interacts with warmup/setup/prepare/etc.
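
To illustrate that interaction, here is a rough sketch of a round-robin scheduling loop with one possible placement of warmup and prepare runs. The types and helpers are placeholders, not hyperfine internals:

```rust
struct Command {
    cmd: String,
    prepare: Option<String>,
}

/// Placeholder: execute a command line and record its measurements.
fn run(cmd: &str) {
    let _ = cmd;
}

fn interleaved_benchmark(commands: &[Command], warmup_runs: usize, runs: usize) {
    // Warmup could stay per-command and happen up front ...
    for command in commands {
        for _ in 0..warmup_runs {
            run(&command.cmd);
        }
    }

    // ... while the measured runs are interleaved round-robin. Note that a
    // `--prepare` command now has to be re-run before *every* measured run,
    // because other benchmarks execute in between.
    for _round in 0..runs {
        for command in commands {
            if let Some(prepare) = &command.prepare {
                run(prepare);
            }
            run(&command.cmd);
            // Intermediate estimates for *all* commands could be updated and
            // redrawn here, since every round touches every command.
        }
    }
}
```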

Saved benchmarks for later comparison

Ticket: #607

This will also be important for the design of the JSON output format. Ideally, we would save baseline results as JSON and then simply load everything we need from the JSON file again. This might require us to store more information than we currently do. It might be useful to use the same format (not as JSON, but as a struct) internally as well to make sure that everything is available.
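
As a sketch of what that round trip could look like (assuming the result type derives serde's traits; the struct and the file layout below are placeholders):

```rust
use std::fs;

use serde::{Deserialize, Serialize};

/// Placeholder for a saved benchmark. Storing the full per-run values means
/// a later comparison can redo the statistics instead of relying on
/// precomputed summaries.
#[derive(Serialize, Deserialize)]
struct SavedBenchmark {
    command: String,
    wall_clock_values: Vec<f64>,
}

fn save_baseline(path: &str, results: &[SavedBenchmark]) -> std::io::Result<()> {
    let json = serde_json::to_string_pretty(results).expect("serialization failed");
    fs::write(path, json)
}

fn load_baseline(path: &str) -> std::io::Result<Vec<SavedBenchmark>> {
    let json = fs::read_to_string(path)?;
    Ok(serde_json::from_str(&json).expect("malformed baseline file"))
}
```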

Make --shell=none the default

Ticket: #787
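
For context, the difference between the two modes boils down to whether the command string is passed through a shell or spawned directly. The sketch below shows the principle, not hyperfine's actual executor:

```rust
use std::process::{Command, ExitStatus};

/// --shell=sh (the current default): the command string goes through a shell,
/// so pipes, globs and redirections work, but every run pays the shell
/// startup overhead, which has to be measured and subtracted.
fn run_with_shell(command_line: &str) -> std::io::Result<ExitStatus> {
    Command::new("/bin/sh").arg("-c").arg(command_line).status()
}

/// --shell=none: split the string into program + arguments and spawn it
/// directly. No shell overhead, but no shell features either. (Proper word
/// splitting needs real quoting rules; whitespace splitting is a stand-in.)
fn run_without_shell(command_line: &str) -> std::io::Result<ExitStatus> {
    let mut parts = command_line.split_whitespace();
    let program = parts.next().expect("empty command line");
    Command::new(program).args(parts).status()
}
```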

Use the first command as a --reference

Ticket: —

This might require a redesign of the --reference option.
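
For illustration, reporting every result relative to a fixed reference (the first command) instead of relative to the fastest command could look roughly like this; the names are placeholders:

```rust
struct Summary {
    command: String,
    mean: f64, // mean of whichever metric is selected for comparison
}

/// Print each result relative to the first command. Assumes at least one
/// result and that lower values are better for the chosen metric.
fn print_relative_to_reference(results: &[Summary]) {
    let reference = &results[0];
    for result in results {
        // ratio > 1.0: slower (or "worse") than the reference,
        // ratio < 1.0: faster (or "better") than the reference.
        let ratio = result.mean / reference.mean;
        println!(
            "{}: {:.2} x relative to '{}'",
            result.command, ratio, reference.command
        );
    }
}
```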
