alpha-lemming
asked on
Mac OSX Lion: Problems with Umlauts and ISO8859
Hello Experts,
I have a tar archive created on an old Unixware machine using ISO8859-1 encoding. When I try to extract it under Macosx Lion, I experience a bit of weirdness with German umlauts. For instance:
I open a terminal.app window with the encoding set to "ISO8859-1" and set my locale like this:
note that the "ä" is displayed correctly
I then unpack the archive with
the "ä" is also displayed correctly in the output from tar, but when I list the directory contents, I see:
Thanks!
I have a tar archive created on an old Unixware machine using ISO8859-1 encoding. When I try to extract it under Macosx Lion, I experience a bit of weirdness with German umlauts. For instance:
I open a terminal.app window with the encoding set to "ISO8859-1" and set my locale like this:
export LANG=de_DE.ISO8859-1
export LC_ALL=de_DE.ISO8859-1
then take a peek at the archive:dhcp202:Downloads frank$ tar tvf backup.tar home/frank
x home/frank/
x home/frank/Präferenzen/
note that the "ä" is displayed correctly
I then unpack the archive with
tar xvf backup.tar home/frank
the "ä" is also displayed correctly in the output from tar, but when I list the directory contents, I see:
dhcp202:Downloads frank$ ls home/frank
Pr%E4ferenzen
I've unpacked the archive on Linux and Unixware machines - no problems. Could it be something with HFS+?Thanks!
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
LANG=C pax -x file.tar
Foreword says it all (probably)
https://secure.wikimedia.org/wikipedia/en/wiki/ISO/IEC_8859-1
https://secure.wikimedia.org/wikipedia/en/wiki/ISO/IEC_8859-1
ASKER
hmm, when pax encounters one of the files i question, it spits out
pax: Invalid header, starting valid header search
and skips extracting the file.
pax: Invalid header, starting valid header search
and skips extracting the file.
Does setting encoding to win1252 help?
You might need gnu tar and/or pax to extract non-posix TAR files (with non-ascii characters)
You might need gnu tar and/or pax to extract non-posix TAR files (with non-ascii characters)
ASKER