hadoop - What is the difference between partitioning and bucketing in hive? -


this question has answer here:

i not clear difference between partitioning , bucketing in hive , appreciate if can provide details example.

here nice difference between buckets , partitioning.

basically both partitioning , bucketing slice data executing query more efficiently on non-sliced data. major difference number of slices keep on changing in case of partitioning data modified, bucketing number of slices fixed specified while creating table.

bucketing happen using hash algorithm , modulo on number of buckets. so, row might inserted of bucket. bucketing can used sampling of data, joining 2 data sets more , more.


Comments

Popular posts from this blog

java.util.scanner - How to read and add only numbers to array from a text file -

rewrite - Trouble with Wordpress multiple custom querystrings -