Skip to content

Instantly share code, notes, and snippets.

@zamazan4ik
Created September 21, 2024 22:05
Show Gist options
  • Save zamazan4ik/44992d2ca69a3fd98131a3ad7555553d to your computer and use it in GitHub Desktop.
Save zamazan4ik/44992d2ca69a3fd98131a3ad7555553d to your computer and use it in GitHub Desktop.
raptorq: Release
Running benches/codec_benchmark.rs (target/release/deps/codec_benchmark-14b90cb555383273)
WARNING: HTML report generation will become a non-default optional feature in Criterion.rs 0.4.0.
This feature is being moved to cargo-criterion (https://github.com/bheisler/cargo-criterion) and will be optional in a future version of Criterion.rs. To silence this warning, eithe
Symbol mulassign_scalar()/
time: [24.522 ns 24.531 ns 24.540 ns]
thrpt: [19.431 GiB/s 19.438 GiB/s 19.445 GiB/s]
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high severe
Symbol +=/ time: [23.690 ns 23.825 ns 24.021 ns]
thrpt: [19.851 GiB/s 20.014 GiB/s 20.129 GiB/s]
Found 3 outliers among 100 measurements (3.00%)
3 (3.00%) high severe
Symbol FMA/ time: [26.912 ns 26.916 ns 26.920 ns]
thrpt: [17.713 GiB/s 17.716 GiB/s 17.718 GiB/s]
Found 8 outliers among 100 measurements (8.00%)
5 (5.00%) high mild
3 (3.00%) high severe
encode 10KB/ time: [42.279 µs 42.336 µs 42.451 µs]
thrpt: [230.04 MiB/s 230.67 MiB/s 230.98 MiB/s]
Found 5 outliers among 100 measurements (5.00%)
1 (1.00%) high mild
4 (4.00%) high severe
roundtrip 10KB/ time: [44.066 µs 44.078 µs 44.091 µs]
thrpt: [221.49 MiB/s 221.55 MiB/s 221.61 MiB/s]
Found 9 outliers among 100 measurements (9.00%)
1 (1.00%) low mild
6 (6.00%) high mild
2 (2.00%) high severe
roundtrip repair 10KB/ time: [97.345 µs 97.376 µs 97.408 µs]
thrpt: [100.25 MiB/s 100.29 MiB/s 100.32 MiB/s]
Found 5 outliers among 100 measurements (5.00%)
2 (2.00%) high mild
3 (3.00%) high severe
Running benches/decode_benchmark.rs (target/release/deps/decode_benchmark-adb935813f7dae43)
Symbol size: 1280 bytes
symbol count = 10, decoded 127 MB in 0.440secs using 0.0% overhead, throughput: 2327.1Mbit/s
symbol count = 100, decoded 127 MB in 0.343secs using 0.0% overhead, throughput: 2983.8Mbit/s
symbol count = 250, decoded 127 MB in 0.318secs using 0.0% overhead, throughput: 3216.8Mbit/s
symbol count = 500, decoded 127 MB in 0.307secs using 0.0% overhead, throughput: 3324.1Mbit/s
symbol count = 1000, decoded 126 MB in 0.317secs using 0.0% overhead, throughput: 3203.9Mbit/s
symbol count = 2000, decoded 126 MB in 0.337secs using 0.0% overhead, throughput: 3013.7Mbit/s
symbol count = 5000, decoded 122 MB in 0.369secs using 0.0% overhead, throughput: 2646.5Mbit/s
symbol count = 10000, decoded 122 MB in 0.426secs using 0.0% overhead, throughput: 2292.4Mbit/s
symbol count = 20000, decoded 122 MB in 0.576secs using 0.0% overhead, throughput: 1695.4Mbit/s
symbol count = 50000, decoded 122 MB in 0.823secs using 0.0% overhead, throughput: 1186.6Mbit/s
symbol count = 10, decoded 127 MB in 0.437secs using 5.0% overhead, throughput: 2343.1Mbit/s
symbol count = 100, decoded 127 MB in 0.347secs using 5.0% overhead, throughput: 2949.4Mbit/s
symbol count = 250, decoded 127 MB in 0.323secs using 5.0% overhead, throughput: 3167.0Mbit/s
symbol count = 500, decoded 127 MB in 0.308secs using 5.0% overhead, throughput: 3313.3Mbit/s
symbol count = 1000, decoded 126 MB in 0.325secs using 5.0% overhead, throughput: 3125.0Mbit/s
symbol count = 2000, decoded 126 MB in 0.336secs using 5.0% overhead, throughput: 3022.7Mbit/s
symbol count = 5000, decoded 122 MB in 0.372secs using 5.0% overhead, throughput: 2625.2Mbit/s
symbol count = 10000, decoded 122 MB in 0.453secs using 5.0% overhead, throughput: 2155.8Mbit/s
symbol count = 20000, decoded 122 MB in 0.610secs using 5.0% overhead, throughput: 1600.9Mbit/s
symbol count = 50000, decoded 122 MB in 0.955secs using 5.0% overhead, throughput: 1022.6Mbit/s
Running benches/encode_benchmark.rs (target/release/deps/encode_benchmark-91c1f7fa85ada316)
Symbol size: 1280 bytes (without pre-built plan)
symbol count = 10, encoded 127 MB in 0.307secs, throughput: 3335.3Mbit/s
symbol count = 100, encoded 127 MB in 0.262secs, throughput: 3906.2Mbit/s
symbol count = 250, encoded 127 MB in 0.261secs, throughput: 3919.3Mbit/s
symbol count = 500, encoded 127 MB in 0.259secs, throughput: 3940.2Mbit/s
symbol count = 1000, encoded 126 MB in 0.266secs, throughput: 3818.1Mbit/s
symbol count = 2000, encoded 126 MB in 0.282secs, throughput: 3601.5Mbit/s
symbol count = 5000, encoded 122 MB in 0.300secs, throughput: 3255.2Mbit/s
symbol count = 10000, encoded 122 MB in 0.358secs, throughput: 2727.8Mbit/s
symbol count = 20000, encoded 122 MB in 0.494secs, throughput: 1976.8Mbit/s
symbol count = 50000, encoded 122 MB in 0.657secs, throughput: 1486.4Mbit/s
Symbol size: 1280 bytes (with pre-built plan)
symbol count = 10, encoded 127 MB in 0.160secs, throughput: 6399.5Mbit/s
symbol count = 100, encoded 127 MB in 0.111secs, throughput: 9220.2Mbit/s
symbol count = 250, encoded 127 MB in 0.120secs, throughput: 8524.6Mbit/s
symbol count = 500, encoded 127 MB in 0.122secs, throughput: 8364.8Mbit/s
symbol count = 1000, encoded 126 MB in 0.126secs, throughput: 8060.5Mbit/s
symbol count = 2000, encoded 126 MB in 0.134secs, throughput: 7579.3Mbit/s
symbol count = 5000, encoded 122 MB in 0.142secs, throughput: 6877.2Mbit/s
symbol count = 10000, encoded 122 MB in 0.172secs, throughput: 5677.7Mbit/s
symbol count = 20000, encoded 122 MB in 0.224secs, throughput: 4359.7Mbit/s
symbol count = 50000, encoded 122 MB in 0.362secs, throughput: 2697.7Mbit/s
Running benches/matrix_sparsity.rs (target/release/deps/matrix_sparsity-410ec823ba326d21)
Row density for 27x27: min=0 max=18 p50=7 p80=18 p90=18 p95=18 p99=18
Original density for 27x27: 231 of 729 (31.687%)
Initial memory usage: 3KB
Optimized decoder mul ops: 269 (26.9 per symbol), add ops: 414 (41.4 per symbol)
By phase mul ops: [129, 140, 0, 0, 0], add ops: [175, 152, 21, 45, 21]
Row density for 128x128: min=0 max=119 p50=5 p80=20 p90=20 p95=119 p99=119
Original density for 128x128: 2077 of 16384 (12.677%)
Initial memory usage: 14KB
Optimized decoder mul ops: 1253 (12.4 per symbol), add ops: 2918 (28.9 per symbol)
By phase mul ops: [1016, 237, 0, 0, 0], add ops: [1585, 356, 350, 277, 350]
Row density for 1071x1071: min=0 max=1062 p50=5 p80=9 p90=29 p95=52 p99=53
Original density for 1071x1071: 20933 of 1147041 (1.825%)
Initial memory usage: 125KB
Optimized decoder mul ops: 10600 (10.6 per symbol), add ops: 30302 (30.2 per symbol)
By phase mul ops: [9937, 663, 0, 0, 0], add ops: [16799, 2435, 4305, 2458, 4305]
Row density for 10269x10269: min=0 max=10251 p50=5 p80=8 p90=15 p95=32 p99=126
Original density for 10269x10269: 214641 of 105452361 (0.204%)
Initial memory usage: 1373KB
Optimized decoder mul ops: 112049 (11.2 per symbol), add ops: 328540 (32.8 per symbol)
By phase mul ops: [109861, 2188, 0, 0, 0], add ops: [178357, 19887, 53259, 23778, 53259]
Row density for 41104x41104: min=0 max=41019 p50=5 p80=8 p90=14 p95=29 p99=177
Original density for 41104x41104: 1025396 of 1689538816 (0.061%)
Initial memory usage: 6305KB
Optimized decoder mul ops: 612014 (15.1 per symbol), add ops: 1554467 (38.5 per symbol)
By phase mul ops: [605623, 6391, 0, 0, 0], add ops: [881919, 87897, 244445, 95761, 244445]
Row density for 57326x57326: min=0 max=57163 p50=5 p80=8 p90=13 p95=27 p99=189
Original density for 57326x57326: 1484776 of 3286270276 (0.045%)
Initial memory usage: 9285KB
Optimized decoder mul ops: 910038 (16.1 per symbol), add ops: 2223234 (39.4 per symbol)
By phase mul ops: [901621, 8417, 0, 0, 0], add ops: [1285203, 137388, 333568, 133507, 333568]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment