Solved

Mac OSX Lion: Problems with Umlauts and ISO8859

Posted on 2011-09-12
Last Modified: 2012-05-12
Hello Experts,

I have a tar archive created on an old Unixware machine using ISO8859-1 encoding. When I try to extract it under Mac OS X Lion, I get odd behavior with German umlauts. For instance:

I open a Terminal.app window with the encoding set to "ISO8859-1" and set my locale like this:
export LANG=de_DE.ISO8859-1
export LC_ALL=de_DE.ISO8859-1

Then I take a peek at the archive:

dhcp202:Downloads frank$ tar tvf backup.tar home/frank
x home/frank/
x home/frank/Präferenzen/

Note that the "ä" is displayed correctly.

I then unpack the archive with

tar xvf backup.tar home/frank

The "ä" is also displayed correctly in the output from tar, but when I list the directory contents, I see:

dhcp202:Downloads frank$ ls home/frank
Pr%E4ferenzen 

I've unpacked the archive on Linux and Unixware machines - no problems. Could it be something with HFS+?

Thanks!
Question by:alpha-lemming
6 Comments
 
Accepted Solution

by:
gheist earned 1500 total points
ID: 36525482
Not all programs are compiled against the same language libraries:
http://hintsforums.macworld.com/showthread.php?t=81848

You can try pax and 7z as alternatives to tar. They may do better or worse, but they are worth a try.

Author Closing Comment

by:alpha-lemming
ID: 36920509
I tried several other utilities, including 7zip and GNU tar built from source. None of them helped.

Expert Comment

by:gheist
ID: 36921192
LANG=C pax -r -f file.tar

Expert Comment

by:gheist
ID: 36921200

Author Comment

by:alpha-lemming
ID: 36922743
Hmm, when pax encounters one of the files in question, it spits out

pax: Invalid header, starting valid header search

and skips extracting the file.

Expert Comment

by:gheist
ID: 36922767
Does setting the encoding to Windows-1252 (win1252) help?

You might need GNU tar and/or pax to extract non-POSIX tar files (with non-ASCII characters).
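
One possible cleanup after extraction, offered as a minimal sketch rather than a verified fix. It assumes that the `%E4` in the `ls` output above is the raw ISO8859-1 byte 0xE4 (ä), written out as a `%XX` escape because it is not valid UTF-8, and that every other character in the name is plain ASCII. The function name `unescape_latin1` is made up for this example. It turns each `%XX` escape back into its byte and converts the result to UTF-8 with `iconv`:

```shell
# Hypothetical helper: decode %XX escapes in a file name and print it as UTF-8.
# Assumption: each %XX stands for one ISO8859-1 byte; the rest is ASCII.
unescape_latin1() {
    rest=$1
    while [ -n "$rest" ]; do
        case $rest in
            %[0-9A-Fa-f][0-9A-Fa-f]*)
                hex=${rest#%}              # drop the leading %
                hex=${hex%"${hex#??}"}     # keep only the two hex digits
                # emit the raw byte via an octal escape (portable printf)
                printf "\\$(printf '%03o' "0x$hex")"
                rest=${rest#%??}
                ;;
            *)
                # copy one ordinary character through unchanged
                printf '%s' "${rest%"${rest#?}"}"
                rest=${rest#?}
                ;;
        esac
    done | iconv -f ISO-8859-1 -t UTF-8
}
```

For the directory above, `mv home/frank/Pr%E4ferenzen "home/frank/$(unescape_latin1 'Pr%E4ferenzen')"` would rename it to `Präferenzen`; the Terminal encoding would need to be UTF-8 for the new name to display correctly. A name that contains a genuine `%` followed by two hex digits would be rewritten as well, so it is worth checking the output before renaming anything.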