trevor1940

Perl: How does Perl handle diacritics and non-Roman characters?

I'm trying to ingest a large text file into PostgreSQL. Full details here.

I may need to run a script against the file, either to move the problem rows into a separate file or to insert the data into the database via DBI, and I was wondering how Perl handles diacritics.

I'm sure Perl's native I/O file handles are UTF-8, but I'm not certain about diacritics or non-Roman characters.
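One caveat on the UTF-8 assumption: as far as I know, Perl file handles read and write raw bytes unless an encoding layer such as `:encoding(UTF-8)` is requested (or a switch like `-C` or the `open` pragma changes the default). The sketch below shows one way to move problem rows into a separate file, assuming "problem rows" means lines that are not well-formed UTF-8. The helper names `is_valid_utf8` and `split_rows` and the file paths are made up for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode FB_CROAK LEAVE_SRC);

# Hypothetical helper: true if $bytes is well-formed UTF-8.
# FB_CROAK makes decode() die on malformed input; eval catches that.
sub is_valid_utf8 {
    my ($bytes) = @_;
    return eval { decode('UTF-8', $bytes, FB_CROAK | LEAVE_SRC); 1 } ? 1 : 0;
}

# Hypothetical helper: copy valid rows to $good_path and the rest to
# $bad_path. All handles use :raw so the bytes pass through untouched
# (no CRLF translation on Windows, no implicit re-encoding).
sub split_rows {
    my ($in_path, $good_path, $bad_path) = @_;
    open my $in,   '<:raw', $in_path   or die "Cannot read $in_path: $!";
    open my $good, '>:raw', $good_path or die "Cannot write $good_path: $!";
    open my $bad,  '>:raw', $bad_path  or die "Cannot write $bad_path: $!";

    my ($ok, $rejected) = (0, 0);
    while (my $line = <$in>) {
        if (is_valid_utf8($line)) {
            print {$good} $line;
            $ok++;
        }
        else {
            print {$bad} $line;
            $rejected++;
        }
    }
    close $_ for $in, $good, $bad;
    return ($ok, $rejected);
}
```

A call such as `my ($ok, $rejected) = split_rows('input.txt', 'good.txt', 'bad.txt');` would then report how many rows went to each file. If the problem rows turn out to be something other than malformed UTF-8, only `is_valid_utf8` would need to change.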
ASKER CERTIFIED SOLUTION
wilcoxon
trevor1940

ASKER

Assuming the data is Unicode

Yes, the docs state that it is Unicode (UTF-8).


I'm using Strawberry Perl:

perl -v

perl 5 version 20 (v5.20.2) for WIN32 -x64-multi-thread



There is so much information about Perl and character sets that I'm now confused.

Any suggestions on how to test whether a script is corrupting the text, bearing in mind that I'm running in a Windows command prompt, which is rubbish at displaying extended characters?
Thanx
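One way to check for corruption without trusting what the console displays is to avoid printing the extended characters at all: compare bytes before and after a decode/encode round trip, and print code points as plain ASCII. This is only a minimal sketch, assuming the data really is UTF-8 as the docs state; `ascii_dump` and `round_trips` are hypothetical helper names, not part of any module.

```perl
use strict;
use warnings;
use Encode qw(decode encode FB_CROAK LEAVE_SRC);

# Hypothetical helper: render a decoded string using ASCII only,
# showing each non-ASCII character as <U+XXXX> so that the command
# prompt's font and code page no longer matter.
sub ascii_dump {
    my ($text) = @_;
    return join '', map {
        my $cp = ord;
        $cp < 0x80 ? chr($cp) : sprintf('<U+%04X>', $cp);
    } split //, $text;
}

# Hypothetical helper: true if decoding the bytes as UTF-8 and then
# re-encoding them gives back exactly the same bytes.
sub round_trips {
    my ($bytes) = @_;
    my $text = eval { decode('UTF-8', $bytes, FB_CROAK | LEAVE_SRC) };
    return 0 unless defined $text;
    return encode('UTF-8', $text) eq $bytes ? 1 : 0;
}
```

For example, after `my $text = decode('UTF-8', "caf\xC3\xA9");`, calling `ascii_dump($text)` gives `caf<U+00E9>`, which can be checked against a Unicode chart. Running the script's output file back through `round_trips` line by line, or comparing it byte for byte with the input where no changes are expected, should show whether anything was mangled along the way.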