Solved

Mac OSX Lion: Problems with Umlauts and ISO8859

Posted on 2011-09-12
6
571 Views
Last Modified: 2012-05-12
Hello Experts,

I have a tar archive created on an old Unixware machine using ISO8859-1 encoding. When I try to extract it under Macosx Lion, I experience a bit of weirdness with German umlauts. For instance:

I open a terminal.app window with the encoding set to "ISO8859-1" and set my locale like this:
export LANG=de_DE.ISO8859-1
export LC_ALL=de_DE.ISO8859-1

Open in new window

then take a peek at the archive:

dhcp202:Downloads frank$ tar tvf backup.tar home/frank
x home/frank/
x home/frank/Präferenzen/

Open in new window


note that the "ä" is displayed correctly

I then unpack the archive with

tar xvf backup.tar home/frank

Open in new window


the "ä" is also displayed correctly in the output from tar, but when I list the directory contents, I see:

dhcp202:Downloads frank$ ls home/frank
Pr%E4ferenzen 

Open in new window

I've unpacked the archive on Linux and Unixware machines - no problems. Could it be something with HFS+?

Thanks!
0
Comment
Question by:alpha-lemming
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 4
  • 2
6 Comments
 
LVL 62

Accepted Solution

by:
gheist earned 500 total points
ID: 36525482
Not all programs are compiled against same language libraries
http://hintsforums.macworld.com/showthread.php?t=81848

You can try pax and 7z as alternatives to tar. they may do better or worse, but try.
0
 

Author Closing Comment

by:alpha-lemming
ID: 36920509
Tried several other utilities, 7zip, build gnu tar from source, didn't help.
0
 
LVL 62

Expert Comment

by:gheist
ID: 36921192
LANG=C pax -x file.tar
0
What Is Transaction Monitoring and who needs it?

Synthetic Transaction Monitoring that you need for the day to day, which ensures your business website keeps running optimally, and that there is no downtime to impact your customer experience.

 
LVL 62

Expert Comment

by:gheist
ID: 36921200
0
 

Author Comment

by:alpha-lemming
ID: 36922743
hmm, when pax encounters one of the files i question, it spits out

pax: Invalid header, starting valid header search

and skips extracting the file.
0
 
LVL 62

Expert Comment

by:gheist
ID: 36922767
Does setting encoding to win1252 help?

You might need gnu tar and/or pax to extract non-posix TAR files (with non-ascii characters)
0

Featured Post

Don't Miss ATEN at InfoComm 2017!

Visit booth #2167 to see the  new ATEN VM3200 32 x 32 Modular Matrix Switch. Other highlights include the VE8950 4K HDMI Over IP Extender, VS1912 12-Port DP Video Wall Media Player  and VK2100 ATEN Control System. Register now with Free Pass Code ATEN288!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

It’s 2016. Password authentication should be dead — or at least close to dying. But, unfortunately, it has not traversed Quagga stage yet. Using password authentication is like laundering hotel guest linens with a washboard — it’s Passé.
Fine Tune your automatic Updates for Ubuntu / Debian
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…
Connecting to an Amazon Linux EC2 Instance from Windows Using PuTTY.

726 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question