An idiomatic and mostly safe API wrapper for the awesome and very fast library libsais
by Ilya Grebnov.
This crate exposes the whole functionality of libsais
. It might be useful to also check out the documentation of the original library.
- Suffix array construction for
u8
/u16
/i32
/i64
texts andi32
/i64
output arrays - Generalized suffix arrays
- Longest common prefix arrays and permuted longest common prefix arrays
- Burrows-Wheeler-Transform and reversal
- Optional multithreading support via the
openmp
feature (enabled by default)
This crate provides generic builder-like APIs for the whole functionality listed above. The following is a simple example of how to use this library to construct a suffix array in parallel:
use libsais::{SuffixArrayConstruction, ThreadCount};
let text = b"barnabasbabblesaboutbananas";
let suffix_array: Vec<i32> = SuffixArrayConstruction::for_text(text)
.in_owned_buffer()
.multi_threaded(ThreadCount::openmp_default())
.run()
.expect("The example in the README should really work")
.into_vec();
Please consult the documentation and the examples for more details on how to use this library.
This library only adds a few safety checks, which should not impact performance in a relevant way. A notable exception is the suffix array construction for i32
and i64
inputs. Please consult the documentation for details.
Below are the results of a small benchmark of suffix array construction algorithms available on crates.io. The input was the human genome, truncated to 2 GB. Details about this benchmark can be found here.
The excellent performance of libsais
is one of the main reasons why this API wrapper crate was created.