Solved

Extracting text from Word and Excell?

Posted on 2002-04-09
5
283 Views
Last Modified: 2010-04-02
I building a program that can extract metadata and text from Word and Excell documents.

OLE automation is out of the question since it would be to slow in this case.

What library or SDK do I need? Preferably something that is free or at least cheap??
0
Comment
Question by:kbb2
5 Comments
 
LVL 32

Expert Comment

by:jhance
Comment Utility
I'm not aware of any other way, short of hacking the Word and Excel file formats.  

What is it about the Word and Excel COM/Automation interfaces that you believe are slow?  There is nothing inherently slow about COM.
0
 

Author Comment

by:kbb2
Comment Utility
COM itself is now slow I agree, but invoking Word or Excell as COM is indeed slow (I've tried).

The only way is to get a specification of the format, but I was hoping that someone already had made a small library that I could utilize!
0
 
LVL 86

Accepted Solution

by:
jkr earned 100 total points
Comment Utility
>>The only way is to get a specification of the format

http://www.wotsit.org/download.asp?f=wword8
http://www.wotsit.org/download.asp?f=xls
0
 
LVL 11

Expert Comment

by:griessh
Comment Utility
Dear kbb2

I think you forgot this question. I will ask Community Support to close it unless you finalize it within 7 days. You can always request to keep this question open. But remember, experts can only help you if you provide feedback to their questions.
Unless there is objection or further activity,  I will suggest to accept

     "jkr"

comment(s) as an answer.

If you think your question was not answered at all, you can post a request in Community support (please include this link) to refund your points. The link to the Community Support area is: http://www.experts-exchange.com/commspt/

=========================================================
You have 13 open questions out of 45 that need your attention! Please take
some time and accept an answer if an expert was able to help you or
provide feedback if needed.
==========================================================

PLEASE DO NOT ACCEPT THIS COMMENT AS AN ANSWER!
======
Werner
0
 

Author Comment

by:kbb2
Comment Utility
I was hoping for more leads, but this will do! thanks!
0

Featured Post

What Security Threats Are You Missing?

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

Join & Write a Comment

When writing generic code, using template meta-programming techniques, it is sometimes useful to know if a type is convertible to another type. A good example of when this might be is if you are writing diagnostic instrumentation for code to generat…
Go is an acronym of golang, is a programming language developed Google in 2007. Go is a new language that is mostly in the C family, with significant input from Pascal/Modula/Oberon family. Hence Go arisen as low-level language with fast compilation…
The viewer will learn how to user default arguments when defining functions. This method of defining functions will be contrasted with the non-default-argument of defining functions.
The viewer will be introduced to the member functions push_back and pop_back of the vector class. The video will teach the difference between the two as well as how to use each one along with its functionality.

762 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

6 Experts available now in Live!

Get 1:1 Help Now