Richard Geldreich's Blog: One test showing the performance of miniz vs. zlib

Thursday, December 3, 2015

One test showing the performance of miniz vs. zlib

miniz (was here, now migrating to github here) is my single source file zlib-alternative. It's a complete from scratch reimplementation, and my 5th Deflate/Inflate implementation so far. It has an extremely fast, real-time Deflate-compatible compressor, and for fun the entire decompressor lives in only a single C function. From this post by Tom Alexander:

miniz vs zlib

For this final test, we will use the code from the above test which is using read and only a single thread. This should be enough to compare the raw performance of miniz vs zlib by comparing our binary vs zcat.

Type	Time
fzcat (modified for test)	64.25435829162598
zcat	109.0133900642395

Conclusions

So it seems that the benefit of mmap vs read isn't as significant as I expected. THe benefit theoretically could be more significant on a machine with multiple processes reading the same file but I'll leave that as an excercise for the reader.

miniz turned out to be significantly faster than zlib even when both are used in the same fashion (single threaded and read). Additionally, using the copious amounts of ram available to machines today allowed us to speed everything up even more with threading.

8 comments:

Arseny KapoulkineDecember 3, 2015 at 8:38 PM
"For this final test, we will use the code from the above test which is using read and only a single thread. This should be enough to compare the raw performance of miniz vs zlib by comparing our binary vs zcat."

Uh. Apples to oranges?.. I thought miniz should be slower than zlib.
ReplyDelete
Replies
Evan NemersonDecember 3, 2015 at 8:47 PM
I recently had some performance issues in the Squash benchmark when mmap was used, which have disappeared since I started using MAP_HUGETLB… it might be worth looking into that before abandoning mmap. FWIW, if you're on Linux the `perf` can be quite helpful here; it certainly was for me.

That said, in Squash I currently only use mmap for codecs which don't support streaming so I can avoid allocating enough room for the entire input and output in RAM. I haven't looked into using it for streaming I/O (though I do have a bug open about the idea).

If you do abandon mmap, there are two things I've been meaning to investigate for Squash: the first idea (which I got from Lasse Collin) is to use posix_fadvise with POSIX_FADV_SEQUENTIAL, and possibly POSIX_FADV_WILLNEED. The second is using the POSIX AIO API (see `man 7 aio`) to ask the OS to read another block of data before you start processing the current one.
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.