Solved

Mac OSX Lion: Problems with Umlauts and ISO8859

Posted on 2011-09-12
6
567 Views
Last Modified: 2012-05-12
Hello Experts,

I have a tar archive created on an old Unixware machine using ISO8859-1 encoding. When I try to extract it under Macosx Lion, I experience a bit of weirdness with German umlauts. For instance:

I open a terminal.app window with the encoding set to "ISO8859-1" and set my locale like this:
export LANG=de_DE.ISO8859-1
export LC_ALL=de_DE.ISO8859-1

Open in new window

then take a peek at the archive:

dhcp202:Downloads frank$ tar tvf backup.tar home/frank
x home/frank/
x home/frank/Präferenzen/

Open in new window


note that the "ä" is displayed correctly

I then unpack the archive with

tar xvf backup.tar home/frank

Open in new window


the "ä" is also displayed correctly in the output from tar, but when I list the directory contents, I see:

dhcp202:Downloads frank$ ls home/frank
Pr%E4ferenzen 

Open in new window

I've unpacked the archive on Linux and Unixware machines - no problems. Could it be something with HFS+?

Thanks!
0
Comment
Question by:alpha-lemming
  • 4
  • 2
6 Comments
 
LVL 62

Accepted Solution

by:
gheist earned 500 total points
ID: 36525482
Not all programs are compiled against same language libraries
http://hintsforums.macworld.com/showthread.php?t=81848

You can try pax and 7z as alternatives to tar. they may do better or worse, but try.
0
 

Author Closing Comment

by:alpha-lemming
ID: 36920509
Tried several other utilities, 7zip, build gnu tar from source, didn't help.
0
 
LVL 62

Expert Comment

by:gheist
ID: 36921192
LANG=C pax -x file.tar
0
Master Your Team's Linux and Cloud Stack!

The average business loses $13.5M per year to ineffective training (per 1,000 employees). Keep ahead of the competition and combine in-person quality with online cost and flexibility by training with Linux Academy.

 
LVL 62

Expert Comment

by:gheist
ID: 36921200
0
 

Author Comment

by:alpha-lemming
ID: 36922743
hmm, when pax encounters one of the files i question, it spits out

pax: Invalid header, starting valid header search

and skips extracting the file.
0
 
LVL 62

Expert Comment

by:gheist
ID: 36922767
Does setting encoding to win1252 help?

You might need gnu tar and/or pax to extract non-posix TAR files (with non-ascii characters)
0

Featured Post

Free Tool: Postgres Monitoring System

A PHP and Perl based system to collect and display usage statistics from PostgreSQL databases.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Why Shell Scripting? Shell scripting is a powerful method of accessing UNIX systems and it is very flexible. Shell scripts are required when we want to execute a sequence of commands in Unix flavored operating systems. “Shell” is the command line i…
Every server (virtual or physical) needs a console: and the console can be provided through hardware directly connected, software for remote connections, local connections, through a KVM, etc. This document explains the different types of consol…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.

820 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question