PARQUET-1866: Replace Hadoop ZSTD with JNI-ZSTD#793
PARQUET-1866: Replace Hadoop ZSTD with JNI-ZSTD#793gszadovszky merged 7 commits intoapache:masterfrom
Conversation
e5ebcac to
4c264fc
Compare
…stdCodec.java Co-authored-by: Gabor Szadovszky <gabor@apache.org>
…g, and fix the codec name
|
@luben, Do you have time to review the code? |
|
LGTM |
|
@gszadovszky Do you have time for another look? |
|
Thank you, @shangxinli and all! |
|
+1 @shangxinli and thank you for this contribution. This will allow users who are on order versions of hadoop that don't support native ZSTD to use ZSTD compression in Parquet, and also, users don't have to go through the very complicated hadoop native installation. For developers, we will be able to easily test this out in different local envs. cc @rdblue |
|
@shangxinli do we have benchmark comparing to native hadoop codec both in size and speed? Thanks. |
Hi @dbtsai, I didn't because I don't have Hadoop host installed with ZSTD. @luben, did you ever compare it with Hadoop ZSTD? |
|
@shangxinli : I haven't benchmarked |
Original pull request: apache#793 Change-Id: Iccb6a643f06664a4626b0f5e219de6c907743b94
Original pull request: apache#793 Change-Id: Iccb6a643f06664a4626b0f5e219de6c907743b94
To add zstd-jni and support zstd-jni compression easily in connectors. cf: apache/parquet-java#793
Make sure you have checked all steps below.
Jira
Tests
Commits
Documentation