• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 496
  • Last Modified:

*.DOC header Q.

Out of interest, I would like to know how a Word Document header is arranged, (and if it is substantially different v6 to 32-bit).

But my question is, "How can I reliably determine where the body of a document begins?

Brian
0
BrianWren
Asked:
BrianWren
1 Solution
 
DassaCommented:
Hex D9 appears to be the marker indicating the end of the header area in a Word Document.  At least for Word 2000.  There are two spaces after the D9 and then the text body starts.
0
 
vboukharCommented:
Look at http://wwwwbs.cs.tu-berlin.de/~schwartz/pmh/laola.html - it's the only site in the Web with OLE compound file structure description. Microsoft doesn't publish it!
So you have to read header (first big block - 512b) and reconstruct BBD to know, where text body really starts.
Hope it helps.
0
 
BrianWrenAuthor Commented:
A truly helpful site.  (Actually, it  _LED_  me to a truly helpful site,

http://www.wotsit.org/

Thanks
0

Featured Post

Veeam Disaster Recovery in Microsoft Azure

Veeam PN for Microsoft Azure is a FREE solution designed to simplify and automate the setup of a DR site in Microsoft Azure using lightweight software-defined networking. It reduces the complexity of VPN deployments and is designed for businesses of ALL sizes.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now