Solved

Extracting text from Word and Excel?

Posted on 2002-04-09
Last Modified: 2010-04-02
I am building a program that extracts metadata and text from Word and Excel documents.

OLE automation is out of the question, since it would be too slow in this case.

What library or SDK do I need? Preferably something that is free, or at least cheap.
Question by:kbb2
5 Comments
 
LVL 32

Expert Comment

by:jhance
ID: 6928058
I'm not aware of any other way, short of hacking the Word and Excel file formats.

What is it about the Word and Excel COM/Automation interfaces that you believe is slow? There is nothing inherently slow about COM.

Author Comment

by:kbb2
ID: 6928070
COM itself is not slow, I agree, but invoking Word or Excel through COM is indeed slow (I've tried).

The only way is to get a specification of the format, but I was hoping that someone had already made a small library that I could use!
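For readers taking the file-format route: the binary .doc and .xls files written by Word 97-2003 and Excel 97-2003 are OLE2 compound files, and every compound file starts with the same 8-byte signature. The sketch below only checks that signature as a cheap first filter before any real parsing. It is not from this thread; the names `hasCompoundFileMagic` and `looksLikeOfficeBinary` are made up for illustration, and a match does not tell you whether the file is Word, Excel, or some other compound document.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>
#include <cstdint>
#include <fstream>
#include <ios>
#include <string>

// Signature found at offset 0 of every OLE2 compound file, the container
// used by binary Word 97-2003 (.doc) and Excel 97-2003 (.xls) documents.
constexpr std::array<std::uint8_t, 8> kCompoundFileMagic = {
    0xD0, 0xCF, 0x11, 0xE0, 0xA1, 0xB1, 0x1A, 0xE1};

// Returns true if the buffer begins with the compound-file signature.
// (Illustrative name; not part of any existing library.)
bool hasCompoundFileMagic(const std::uint8_t* data, std::size_t size) {
    if (data == nullptr || size < kCompoundFileMagic.size()) {
        return false;
    }
    return std::equal(kCompoundFileMagic.begin(), kCompoundFileMagic.end(), data);
}

// Hypothetical convenience wrapper: reads the first 8 bytes of a file
// and applies the check above. Returns false if the file cannot be read.
bool looksLikeOfficeBinary(const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    std::array<char, 8> head{};
    if (!in.read(head.data(), static_cast<std::streamsize>(head.size()))) {
        return false;
    }
    return hasCompoundFileMagic(
        reinterpret_cast<const std::uint8_t*>(head.data()), head.size());
}
```

A file that passes this check still has to be opened as a compound file to reach the streams inside it, which is where the actual Word or Excel data lives.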
LVL 86

Accepted Solution

by:
jkr earned 100 total points
ID: 6928089
>>The only way is to get a specification of the format

http://www.wotsit.org/download.asp?f=wword8
http://www.wotsit.org/download.asp?f=xls
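To give an idea of what working from the Excel specification looks like: the workbook data is a sequence of BIFF records, each starting with a 2-byte record type and a 2-byte payload length, both little-endian. The sketch below walks such a sequence and lists the records it finds. It assumes you have already pulled the workbook stream out of the compound file into memory (that step is not shown), and `BiffRecord`, `readLe16`, and `listBiffRecords` are illustrative names, not an existing library. Treat the linked specification as the authority on record types and their contents.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// One BIFF record header plus the position of its payload in the stream.
struct BiffRecord {
    std::uint16_t type;      // record identifier, e.g. 0x0809 for BOF
    std::uint16_t length;    // payload size in bytes
    std::size_t dataOffset;  // index of the first payload byte
};

// Reads a 16-bit little-endian value from two consecutive bytes.
std::uint16_t readLe16(const std::uint8_t* p) {
    return static_cast<std::uint16_t>(p[0] | (p[1] << 8));
}

// Walks an in-memory workbook stream and returns the records found.
// Stops quietly at the first truncated record instead of reading past
// the end of the buffer.
std::vector<BiffRecord> listBiffRecords(const std::vector<std::uint8_t>& stream) {
    std::vector<BiffRecord> records;
    std::size_t pos = 0;
    while (stream.size() - pos >= 4) {
        const std::uint16_t type = readLe16(&stream[pos]);
        const std::uint16_t length = readLe16(&stream[pos + 2]);
        pos += 4;
        if (length > stream.size() - pos) {
            break;  // payload runs past the end of the data
        }
        records.push_back({type, length, pos});
        pos += length;
    }
    return records;
}
```

Extracting text then comes down to recognising the record types that carry strings and decoding their payloads as the specification describes; this walker only gets you to the payload boundaries.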
LVL 11

Expert Comment

by:griessh
ID: 7012003
Dear kbb2

I think you forgot this question. I will ask Community Support to close it unless you finalize it within 7 days. You can always request to keep this question open. But remember, experts can only help you if you provide feedback to their questions.
Unless there is an objection or further activity, I will suggest accepting

     "jkr"

comment(s) as an answer.

If you think your question was not answered at all, you can post a request in Community support (please include this link) to refund your points. The link to the Community Support area is: http://www.experts-exchange.com/commspt/

=========================================================
You have 13 open questions out of 45 that need your attention! Please take
some time and accept an answer if an expert was able to help you or
provide feedback if needed.
==========================================================

PLEASE DO NOT ACCEPT THIS COMMENT AS AN ANSWER!
======
Werner

Author Comment

by:kbb2
ID: 7013225
I was hoping for more leads, but this will do. Thanks!