Reed-Solomon Erasure Coding in Haskell, with speeds exceeding multiple GB/s/cpu core implemented in pure Haskell (and some SIMD C/assembler).
For an introduction on erasure coding, see the post on the Backblaze blog.
Package home: https://github.com/NicolasT/reedsolomon
Performance depends mainly on the number of parity shards. In rough terms, doubling the number of parity shards will double the encoding time.
Here are the throughput numbers with some different selections of data and parity shards. For reference each shard is 1MB random data, and 1 CPU core is used for encoding.
|Data||Parity||Parity||SSSE3 MB/s||AVX2 MB/s|
Example of performance on Intel(R) Core(TM) i7-4600U CPU @ 3.30GHz - 2 physical cores, 4 logical cores (note:
/proc/cpuinfo mentions 2.10GHz only). The example uses 10 blocks with 16MB data each and 4 parity blocks.
|Threads||SSSE3 MB/s||AVX2 MB/s||Speed|
- Backblaze Open Sources Reed-Solomon Erasure Coding Source Code.
- GolangReedSolomon. Compatible Go library by Klaus Post.
- JavaReedSolomon. Compatible java library by Backblaze.
- go-erasure. A similar library using cgo, slower in my tests.
- Screaming Fast Galois Field Arithmetic. Basis for SSE3 optimizations.