Generic Field seprator for ASCII/UNICODE

Moderator: NorbertKrupa

Post Reply
Minister
Newbie
Newbie
Posts: 2
Joined: Thu Oct 04, 2012 3:55 pm

Generic Field seprator for ASCII/UNICODE

Post by Minister » Thu Oct 04, 2012 3:59 pm

I have multiple tables to be copied over from Oracle database to Vertica database, some of them have the UNICODE chars.
When I do it by dumping a data file (csv like file) and use LCOPY command later on to load this file I do get issues when some UNICODE chars are in the data file. Data file is not fixed length.
Do you have any suggestions on a generic field and record separator which will be applicable in all of the cases, even when UNICODE data is loaded?

User avatar
JimKnicely
Site Admin
Site Admin
Posts: 1825
Joined: Sat Jan 21, 2012 4:58 am
Contact:

Re: Generic Field seprator for ASCII/UNICODE

Post by JimKnicely » Thu Oct 11, 2012 2:14 pm

Hi Minister,

When you say that you are having "issues" when there is UNICODE in your data what exactly are the issues? Do you get errors? The "Checking Data Format Before or After Loading" section of the Administrator's Guide might be of assistance to you.

It states:
Vertica expects all loaded data files to be in the Unicode UTF-8 format. You can load ASCII data which is UTF-8 compatible. Character sets like ISO 8859 (Latin 1) are incompatible with UTF-8 and are not supported.
It goes on to explain how to convert the your files.

Also, check out the "StandardConformingStrings" parameter if you are having issues with the back slash.

Thanks
Jim Knicely

Image

Note: I work for Vertica. My views, opinions, and thoughts expressed here do not represent those of my employer.

Post Reply

Return to “Vertica Data Load”