Advertisement

03.21.2008 at 07:02PM PDT, ID: 23260989
[x]
Attachment Details
[x]
The Solution Rating System

With so many solutions, how can you tell which solutions are most likely to help you and which ones are not? To provide you with a tool to use, we rate our solutions based on various elements that most accurately determine if a solution is a quality solution. To explain what factors affect the solution rating, here are the elements we take into consideration when formulating our solution rating.

  • The Grade of the Solution
  • The Zone Rank of the Expert Providing the Solution
  • The Number of Author and Expert Comments
  • The Number of Experts Contributing
  • The Feedback of the Community

Your Input Matters
Because of the way the system is set up, the most important variable in this equation is you. As a member of Experts Exchange, you are able to cast your vote on the quality of the solutions in regard to how complete, accurate, helpful and easy to understand each solution is. When you provide your feedback, each rating is adjusted accordingly. So, if you see a solution that has a poor rating that you think is a good solution, let us know by rating it. As you do, the rating will be adjusted and will become more accurate for other members of our site.

If you have any suggestions that you would like to make for our rating system, please ask a question in the Suggestions Zone of Community Support.

Thank you!

Why can you not compress compressed files?

Tags: compression, zip
Why is there a limit to compressed files?

If I ZIP a file, and I ZIP it again, it doesn't get smaller... why not?

If I use different compression methods, shouldn't it still get smaller?
Start your free trial to view this solution
Question Stats
Zone: Software
Question Asked By: rcoast
Solution Provided By: MrHusy
Participating Experts: 4
Solution Grade: A
Views: 9
Translate:
Loading Advertisement...
03.21.2008 at 07:33PM PDT, ID: 21184528

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 07:36PM PDT, ID: 21184536

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 07:58PM PDT, ID: 21184593

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:01PM PDT, ID: 21184597

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:05PM PDT, ID: 21184607

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:07PM PDT, ID: 21184608

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:11PM PDT, ID: 21184620

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:13PM PDT, ID: 21184627

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:16PM PDT, ID: 21184636

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:19PM PDT, ID: 21184639

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.21.2008 at 08:22PM PDT, ID: 21184645

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
03.23.2008 at 03:58AM PDT, ID: 21189206

All comments and solutions are available to Premium Service Members only.

Start your 7 day free trial and see for yourself why Experts Exchange is the easiest and most proven technology resource in the world. Get Started

Already a member? Login to view this solution.

 
 
Loading Advertisement...
Microsoft
  • Internet Protocols
  • Applications
  • Development
  • OS
  • Hardware
  • Windows Security
Apple
  • Operating Systems
  • Hardware
  • Programming
  • Networking
  • Software
Internet
  • Search Engines
  • File Sharing
  • WebTrends / Stats
  • Spy / Ad Blockers
  • Web Browsers
  • New Net Users
  • Web Development
  • Chat / IM
  • Anti Spam
  • Web Servers
  • Anti-Virus
  • Email Clients
Gamers
  • Tips
  • Online / MMORPG
  • Puzzle
  • Emulators
  • Action / Adventure
  • Role Playing
  • Consoles
  • Game Programming
  • Strategy
  • Sports
  • Misc
  • Computer Games
Digital Living
  • Hardware
  • New Net Users
  • New Users
  • Software
  • Digital Music
  • Gaming World
  • Home Security
  • Apple
  • Networking Hardware
Virus & Spyware
  • Vulnerabilities
  • IDS
  • Encryption
  • Anti-Virus
  • Operating Systems Security
  • Software Firewalls
  • WebApplications
  • Cell Phones
  • Operating Systems
  • Internet
  • Hardware Firewalls
Hardware
  • Handhelds / PDAs
  • Displays / Monitors
  • Components
  • Networking Hardware
  • Peripherals
  • Laptops/Notebooks
  • Storage
  • Servers
  • Desktops
  • New Users
  • Misc
  • Apple
Software
  • System Utilities
  • Industry Specific
  • Network Management
  • Photos / Graphics
  • Page Layout
  • VMWare
  • Misc
  • Web Development
  • OS
  • CYGWIN
  • Voice Recognition
  • Message Queue
  • Quality Assurance
  • Security
  • Firewalls
  • MultiMedia Applications
  • Development
  • Database
  • Office / Productivity
  • Business Management
  • OS/2 Apps
  • Server Software
  • Internet / Email
ITPro
  • OS
  • Storage
  • Encryption
  • Operating Systems Security
  • Apple Hardware
  • Laptops & Notebooks
  • Servers
  • Networking Hardware
  • Peripherals
  • Devices
  • Displays / Monitors
  • WebTrends / Stats
  • Search Engines
  • Firewalls
  • WebApplications
  • IDS
  • Vulnerabilities
  • Email Clients
  • File Sharing
  • Spy / Ad Blockers
  • Web Browsers
  • Web Servers
  • Networking
  • Anti-Virus
  • Chat / IM
  • Anti Spam
Developer
  • Web Servers
  • Web Browsers
  • Game Programming
  • Dev Tools
  • Industry Specific
  • Office / Productivity
  • Database
  • CYGWIN
  • Web Development
  • Search Engines
  • File Sharing
  • WebTrends / Stats
  • Programming
  • Content Management
  • Application Servers
  • Protocols
Storage
  • Removable Backup Media
  • Storage Technology
  • Servers
  • Grid
  • Remote Access
  • Backup / Restore
  • Misc
  • Hard Drives
OS
  • Miscellaneous
  • Security
  • Development
  • Linux
  • VMWare
  • MainFrame OS
  • Unix
  • Apple
  • OS / 2
  • AS / 400
  • BeOS
  • Microsoft
  • VMS / OpenVMS
Database
  • Oracle
  • Miscellaneous
  • MySQL
  • Software
  • Sybase
  • Contact Management
  • PostgreSQL
  • Data Manipulation
  • Clarion
  • InterSystems Cache
  • Siebel
  • MUMPS
  • OLAP
  • SQLBase
  • SAS
  • GIS & GPS
  • 4GL
  • Berkeley DB
  • DB2
  • Informix
  • Interbase / Firebird
  • FoxPro
  • Reporting
  • LDAP
  • Filemaker Pro
  • MS SQL Server
  • dBase
  • MS Access
Security
  • Misc
  • Web Browsers
  • Software Firewalls
  • Operating Systems Security
  • File Sharing
  • Spy / Ad Blockers
  • Vulnerabilities
  • WebApplications
  • IDS
  • Anti-Virus
  • Encryption
  • Anti Spam
  • Email Clients
  • VPN
  • Chat / IM
Programming
  • Editors IDEs
  • Installation
  • Handhelds / PDAs
  • Multimedia Programming
  • System / Kernel
  • Algorithms
  • Game
  • Signal Processing
  • Project Management
  • Open Source
  • Database
  • Misc
  • Languages
  • Processor Platforms
  • Theory
Web Development
  • Scripting
  • Blogs
  • Web Servers
  • Software
  • Search Engines
  • Web Graphics
  • Images
  • Internet Marketing
  • Images and Photos
  • Components
  • Document Imaging
  • Web Languages/Standards
  • Illustration
  • WebApplications
  • Fonts
  • WebTrends / Stats
  • Authoring
  • Digital Camera Software
  • Miscellaneous
Networking
  • Protocols
  • Apple Networking
  • Network Management
  • Message Queue
  • Application Servers
  • Content Management
  • File Servers
  • Email Servers
  • Misc
  • Java Editors & IDEs
  • Wireless
  • Networking Hardware
  • Backup / Restore
  • System Utilities
  • ISPs & Hosting
  • Web Servers
  • Storage Technology
  • Removable Backup Media
  • Servers
  • Broadband
  • Grid
  • OS / 2
  • Novell Netware
  • Unix Networking
  • Windows Networking
  • Security
  • Telecommunications
  • Operating Systems
  • Linux Networking
Other
  • Community Advisor
  • Lounge
  • Community Support
  • New Net Users
  • Philosophy / Religion
  • Math / Science
  • Miscellaneous
  • URLs
  • Expert Lounge
  • Politics
  • Puzzles / Riddles
Community Support
  • Suggestions
  • New to EE
  • New Topics
  • Community Advisor
  • CleanUp
  • Announcements
  • General
  • Feedback
  • Input
  • EE Bugs
 
03.21.2008 at 07:33PM PDT, ID: 21184528
Hi rcoast
     To get the answer, you should first have a basic understanding of compression
     http://www.2brightsparks.com/resources/articles/understanding-compression.html
     And here are file type based compression percentages
     http://www.bleepingcomputer.com/tutorials/tutorial107.html

Regards
 
03.21.2008 at 07:36PM PDT, ID: 21184536
Thanks... but this information doesn't answer my question.

I understand how compression schemes work independently, but why can't I apply them over and over again, to the same file?
 
03.21.2008 at 07:58PM PDT, ID: 21184593
Compression works because it finds repeating sequences of data and reduces them. So, for a simplistic explanation, if I have the data 0000000100000, I could represent it as 715 (7 zeros, a 1, and 5 more zeros)

After compression, the algorithm has gotten rid of all the patterns. The same algorithm won't find anything to simplify, hence no space savings.

In theory, you could RAR-compress a ZIP file, but the compressed file is much closer to randomness (a lack of patterns) than a non-compressed file.

In short, don't bother. Find a good compression algorithm (GZIP, ZIP, RAR are pretty good), set it to the highest level compression, and be done with it.
 
03.21.2008 at 08:01PM PDT, ID: 21184597
I understand what you are saying... but there are always patterns...

For instance,

If you have your compressed data: 7' 0s and 1' 5... as in your example... that is a pattern. Why can't another ZIP tool recognize this and compress it?
 
03.21.2008 at 08:05PM PDT, ID: 21184607
Compression tools may use various techniques to reduce the size of files, but ultimately they are only removing unneeded space/contents.  Once these files are reduced to their smallest size, it simply isn't possible to reduce them to a smaller size simply by the fact that the compression software has already done this.  There is nothing left to remove without damaging the contents.
 
03.21.2008 at 08:07PM PDT, ID: 21184608
Well it actually does. Let me explain according to the example in above article.
Lets say that you have a text file containing the following text

"One swallow does not make a summer, neither does one fine day; similarly one day or brief time of happiness does not make a person entirely happy"

Each character consumes a space of 1 byte. And there are 146 characters above, therefore the file is a total of 146 bytes.

   As you can see, there are words repeated like "one" (repeated for 3 times), "does" (repeated for 3 times), "not" (2 times),  "day" (2 times).
   Compressing creates a chart and assign these words to characters that consumes less space (while "does" consumes 4 bytes, compressing assigns this word to "1" for example. When it sees "1"
while decompressing, it reads the chart and knows that it is in fact the word "does")
Lets assign the words to chars
1=one
2=does
3=not
4=make
5=day

   After the compression, text looks like following
"1 swallow 2 3 4 a summer, neither 2 1 fine 5; similarly 1 5 or brief time of happiness 2 3 4 a person entirely happy"    

   We are ok untill now. Lets take that compressed file and try to compress it again. Now tell me can you see any repetetive words to assign to a char? For example the "2" which is the product of previous compression is a 1 byte char and used 3 times. Can you assign this to soething which consumes less space? NO because byte is the least usable thing. Thats why you can not compress it again.
Accepted Solution
 
03.21.2008 at 08:11PM PDT, ID: 21184620
When a file is compressed the patterns are removed, then the compression method adds information to the file to allow decompression software to understand what is necessary to return the file to its original state.

Files are not like air where you can first compress 10 psi, then compress again to 20 psi, then to 30 or 40 psi - as long as the compressor and container can handle it, you can keep compressing.  Files are more like water; compression is limited.
 
03.21.2008 at 08:13PM PDT, ID: 21184627
Excellent explanation MrHusy!
 
03.21.2008 at 08:16PM PDT, ID: 21184636
Exactly what I was trying to say. We're fast approaching data theory here, but I'll continue for a second.

Let's say I look for long sequences of data and replace them. This is what normal compression does, and they tend to look for lots of unused space, padding (empty data in a file can be very well compressed), and raw text (which compresses very nicely).

Now, I spit out my compressed format. This is a lot more random and a lot less ordered and predictable than the original. It contains, basically, notations of the sequences that were removed during the compression, interspersed with data. If I were to try and compress that, there'd be a lot less order and a lot less sequences to abstract away.

So, you say, make longer and more random sequences, and try and recompress using those.

Unfortunately, you eventually take more space to identify what you removed than you are removing.

Modern compression algorithms are very good, and have evolved for nearly twenty years. You're going to save a fraction of the total file size if you compress a ZIP file using, say, WinRAR.

The theory is interesting, but in practice it's not helpful. You may save a few kilobytes off a 200 megabyte file, but that's not worth the significant time it takes.

P.S. This is why video and music files don't compress well (that is, using a general purpose compression like ZIP) - they're already compressed and have no (or few) patterns.
Assisted Solution
 
03.21.2008 at 08:19PM PDT, ID: 21184639
I concur - I wrote that novel you see above in the interval... marvelous explanation MrHusy. Exactly what I was trying to say. Couldn't do better (or comparably) myself.
 
03.21.2008 at 08:22PM PDT, ID: 21184645
Thanks m8s :)
 
03.23.2008 at 03:58AM PDT, ID: 21189206
The above excellently covers the algorithm methods but doesn't cover the mathematical impossibility of decompression.  Assume that you have finally managed to figure out how to compress something down to just a single byte: the decompression algorithm then only has 256 possible ways to de-compression that byte.  Any one of those 256 possibilities only has one method of de-compression or is the extracted file.  It is possible to proove from this basis that most files cannot be compressed.  Thankfully, as humans, we don't deal in the kind of random data required to find this out on a daily basis & the very small subset of data we do use has significant redundancy.
 
 
20080236-EE-VQP-29 / EE_QW_EXPERT_20070906