Page 1 of 2

Compression Ratio varies with number of nodes

Posted: Tue Nov 12, 2013 7:12 am
by banurajadurai
HI

How Exactly data is compressed in Multinode Vertica set up ? Is there any difference in Compression ratio with number of nodes ?

Re: Compression Ratio varies with number of nodes

Posted: Tue Nov 12, 2013 10:07 am
by banurajadurai
Reply soon !!!!!!

Re: Compression Ratio varies with number of nodes

Posted: Tue Nov 12, 2013 1:52 pm
by JimKnicely
Hi,

I don't think the compression ratio would change significantly based on the number of nodes. That is, the sum of the compressed data on all nodes should have a similar size to the compressed data on a single node. But that's assuming you have optimized projections. Default super projections might not optimize database performance, resulting in slow query performance and low data compression.

Re: Compression Ratio varies with number of nodes

Posted: Wed Nov 13, 2013 6:23 am
by banurajadurai
i have loaded 39 GB of raw data
In single node it is compressed to 5.1 GB and in 3 node it is compressed to 12.01 GB with K safety 1 (Original data + 1 backup copy)


1 GB differece .....


All Table structure , Optimized Query , Db designs are same in both the single and 3 node Vertica ..

why this large difference ?

Re: Compression Ratio varies with number of nodes

Posted: Wed Nov 13, 2013 3:19 pm
by JimKnicely
Are you sure the physical designs are the same?

What are the projection counts from PROJECTIONS table in each solution? What are the projection counts where the is_super_projection = t?

On the single node I bet all projections are super projections that can be poorly compressed as expressed by the documentation.

Re: Compression Ratio varies with number of nodes

Posted: Thu Nov 14, 2013 5:52 am
by banurajadurai
IN single node vertica , For super projection Compression ratio was 12.71 GB .

Re: Compression Ratio varies with number of nodes

Posted: Thu Nov 14, 2013 10:41 am
by nnani
So now the compression size is matching in your single node and your 3 node cluster.