Solved

.doc, .xls, .ppt Magic Bytes?

Posted on 2006-10-27
3
1,174 Views
Last Modified: 2008-01-09
Does anyone know the true "magic bytes" for .doc, .xls and .ppt.

I have come across the following:

.ppt: d0cf11e0a1b11ae100000000
.xls:  d0cf11e0a1b11ae100
.doc: d0cf11e0a1b11ae1

The problem with those magic bytes is that both xls and ppt's have the extra zeros on the end too.

I have also seen:

fdff at offset 512

this is also common to all office documents.


How can I tell the difference between .doc, .xls and .ppt?

And soon coming with office 2k7 .docx, .xlsx, .pptx?

Thanks

Matt
0
Comment
Question by:essentialtalk
3 Comments
 
LVL 6

Accepted Solution

by:
SCDMETA earned 500 total points
ID: 17824502
1) Could you use the extension?

2) http://www.wotsit.org/default.asp 
Might be a good resource for you.

3) http://www.google.com/codesearch?hl=en&lr=&q=d0cf11e0&btnG=Search

This is a google code search for the OLE signature.  You might be able to find some more magic bytes (or references) by looking at existing code.

Good luck
0

Featured Post

Announcing the Most Valuable Experts of 2016

MVEs are more concerned with the satisfaction of those they help than with the considerable points they can earn. They are the types of people you feel privileged to call colleagues. Join us in honoring this amazing group of Experts.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This is about my first experience with programming Arduino.
Computer science students often experience many of the same frustrations when going through their engineering courses. This article presents seven tips I found useful when completing a bachelors and masters degree in computing which I believe may he…
An introduction to basic programming syntax in Java by creating a simple program. Viewers can follow the tutorial as they create their first class in Java. Definitions and explanations about each element are given to help prepare viewers for future …
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …

808 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question