Administration

Added Column Encoding Types

Vertica 7.1 introduces two new encoding types: BZIP_COMP and GZIP_COMP. These types allow for bzip2 and gzip compression on table columns. While these encodings give high compression, they also require more resources to process.

BZIP_COMP

This encoding type applies the bzip2 compression algorithm to the block contents. See the bzip2 website for more information. It achieves higher compression than both the automatic LZO compression and the gzip encoding, but it requires more CPU time to compress than either. It works best on large string columns such as VARCHAR, VARBINARY, CHAR, and BINARY. Choose this encoding type when you are willing to trade slower load speeds for higher data compression.

GZIP_COMP

This encoding type applies the gzip compression algorithm. See the gzip website for more information. It achieves better compression than the automatic LZO compression, but lower compression than BZIP_COMP. It requires more CPU time to compress than LZO, but less than BZIP_COMP. It works best on large string columns such as VARCHAR, VARBINARY, CHAR, and BINARY. Use this encoding when you want better compression than LZO at a lower CPU cost than bzip2.
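To get a feel for the size versus CPU trade-off described above, here is a minimal sketch that compresses the same block of text with gzip and bzip2. It uses Python's standard library implementations, not Vertica's storage layer, and the sample rows are made up for illustration, so treat the numbers it prints as indicative only.

```python
import bz2
import gzip
import time


def measure(name, compress, decompress, payload):
    """Compress payload once and report the compressed size and elapsed time."""
    start = time.perf_counter()
    packed = compress(payload)
    elapsed = time.perf_counter() - start
    # Round-trip check: decompression must return the original bytes.
    assert decompress(packed) == payload
    return {"name": name, "bytes": len(packed), "seconds": elapsed}


# Hypothetical sample standing in for the values of a large VARCHAR column.
rows = [f"customer-{i % 500}: order shipped to warehouse {i % 37}" for i in range(20000)]
payload = "\n".join(rows).encode("utf-8")

results = [
    measure("gzip", gzip.compress, gzip.decompress, payload),
    measure("bzip2", bz2.compress, bz2.decompress, payload),
]

for r in results:
    ratio = len(payload) / r["bytes"]
    print(f'{r["name"]:>5}: {r["bytes"]} bytes, ratio {ratio:.1f}x, {r["seconds"]:.4f} s')
```

Results depend heavily on the data, so it is worth running a comparison like this on a sample of your own column values before picking an encoding.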

The supported encoding and compression types are available in the documentation.

Below is an overview of encoding types across previous versions:

Encoding Type      7.1.x   7.0.x   6.1.x   6.0.x   5.1.x   5.0.x   4.1.x
AUTO (default)     Yes     Yes     Yes     Yes     Yes     Yes     Yes
BLOCK_DICT         Yes     Yes     Yes     Yes     Yes     Yes     Yes
BLOCKDICT_COMP     Yes     Yes     Yes     Yes     Yes     Yes     Yes
BZIP_COMP          Yes     -       -       -       -       -       -
COMMONDELTA_COMP   Yes     Yes     Yes     Yes     Yes     Yes     Yes
DELTARANGE_COMP    Yes     Yes     Yes     Yes     Yes     Yes     Yes
DELTAVAL           Yes     Yes     Yes     Yes     Yes     Yes     Yes
GCDDELTA           Yes     Yes     Yes     -       -       -       -
GZIP_COMP          Yes     -       -       -       -       -       -
RLE                Yes     Yes     Yes     Yes     Yes     Yes     Yes
NONE               Yes *   Yes *   Yes *   Yes *   Yes *   Yes *   Yes *

* Encoding NONE should not be used. It is deprecated and only exists for backwards compatibility.
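The table can also be turned into a small lookup, which is handy if you generate DDL for clusters on different releases. The sketch below is only an illustration: `is_supported` and `column_definition` are hypothetical helper names, the data is copied from the table above, and the `ENCODING` clause in a column definition is assumed to follow the usual Vertica form, so check it against the documentation for your version.

```python
import warnings

# Release columns in the same order as the table, newest first.
VERSIONS = ("7.1.x", "7.0.x", "6.1.x", "6.0.x", "5.1.x", "5.0.x", "4.1.x")

# Oldest release in the table that lists each encoding as supported.
OLDEST_LISTED = {
    "AUTO": "4.1.x",
    "BLOCK_DICT": "4.1.x",
    "BLOCKDICT_COMP": "4.1.x",
    "BZIP_COMP": "7.1.x",
    "COMMONDELTA_COMP": "4.1.x",
    "DELTARANGE_COMP": "4.1.x",
    "DELTAVAL": "4.1.x",
    "GCDDELTA": "6.1.x",
    "GZIP_COMP": "7.1.x",
    "RLE": "4.1.x",
    "NONE": "4.1.x",
}


def is_supported(encoding, version):
    """Return True if the table lists `encoding` as available in `version`."""
    oldest = OLDEST_LISTED[encoding.upper()]
    # A smaller index means a newer release, so newer-or-equal is supported.
    return VERSIONS.index(version) <= VERSIONS.index(oldest)


def column_definition(name, sql_type, encoding, version):
    """Build a column definition with an ENCODING clause (hypothetical helper)."""
    encoding = encoding.upper()
    if not is_supported(encoding, version):
        raise ValueError(f"{encoding} is not listed for Vertica {version}")
    if encoding == "NONE":
        # The footnote above marks NONE as deprecated.
        warnings.warn("Encoding NONE is deprecated", DeprecationWarning)
    return f"{name} {sql_type} ENCODING {encoding}"


print(column_definition("comments", "VARCHAR(4000)", "BZIP_COMP", "7.1.x"))
# prints: comments VARCHAR(4000) ENCODING BZIP_COMP
print(is_supported("GZIP_COMP", "7.0.x"))
# prints: False
```

The returned string is meant to be dropped into the column list of a CREATE TABLE statement; asking for BZIP_COMP or GZIP_COMP on any release before 7.1.x raises an error instead of producing DDL that the server would reject.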

About the author

Norbert Krupa

Norbert is the founder of vertica.tips and a Solutions Engineer at Talend. He is an HP Accredited Solutions Expert for Vertica Big Data Solutions. He has written the Vertica Diagnostic Queries, which aim to cover monitoring, diagnostics, and performance tuning. The views, opinions, and thoughts expressed here do not represent those of the author's employer.

Notice

This site is not affiliated, endorsed or associated with HPE Vertica. This site makes no claims on ownership of trademark rights. The author contributions on this site are licensed under CC BY-SA 3.0 with attribution required.