[Last Call] Learn about multicloud storage options and how to improve your company's cloud strategy. Register Now

x
?
Solved

Text documents (grabbing the first few sentences)

Posted on 2004-04-02
8
Medium Priority
?
191 Views
Last Modified: 2010-04-15
Aright, I'm going to hell for asking this question (wait for it): how can I grab the first xxx about of characters from a text (I know this one), doc, or rtf file?
0
Comment
Question by:rossryan
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 3
  • 2
8 Comments
 
LVL 12

Expert Comment

by:dfiala13
ID: 10746285
You doing this on a server?  Or can you use Word automation?
0
 
LVL 2

Expert Comment

by:coltrane2003
ID: 10746395
Once you get the file contents read into a string you can use Substring. No offense if you already knew this much. Substring takes two arguments - the index of the first character and the index of the last character. You could do something like

aStringFormofsomeWorddoc.Substring(0,3);

Hope that helps
0
 

Author Comment

by:rossryan
ID: 10746417
It's the getting the file contents (particulary word documents) into the string that I am interested in ;).

What is word automation (api?).

I need to programmatically grab the first few sentences...i.e. I need an API or code that grabs it (for use in a program).
0
VIDEO: THE CONCERTO CLOUD FOR HEALTHCARE

Modern healthcare requires a modern cloud. View this brief video to understand how the Concerto Cloud for Healthcare can help your organization.

 
LVL 2

Expert Comment

by:coltrane2003
ID: 10746430
Word along with many other MS office products has a means of writing scripts. This is called VBA Visual Basic for applications. It's under tools/macro/visual basic editor. Perhaps this is what is meant by API?

Have you tried reading a .doc file with standard IO?

I've got to run now. If you haven't figured this out by the time I get back, I will try to help. Good luck.
0
 
LVL 12

Expert Comment

by:dfiala13
ID: 10746452
>>It's the getting the file contents (particulary word documents) into the string that I am interested in ;).
That's why I asked.  If you are running a Winforms app, you can use the Word automation objects to open and manipulate a document.  Using the Word objects it is relatively trivial to grab the the first X characters.
http://msdn.microsoft.com/vstudio/office/default.aspx?pull=/library/en-us/odc_vsto2003_ta/html/wordobject.asp

If however you are ripping through lots of documents being uploaded to a server, this might not be the best option,
0
 
LVL 2

Expert Comment

by:coltrane2003
ID: 10749079
Yes I agree with the above. It depends on what kind of solution you are trying to provide. A couple years back I worked with a content managment system that published web templates from Word. It worked with a VBA script and validated documents to be published against a DTD. I didn't build it so I'm afraid I don't know much about it.
0
 

Author Comment

by:rossryan
ID: 10749309
Right. Now, does this work with Office XP, or shall I install 2003? (I've been hoping to hold off on that one).
0
 
LVL 12

Accepted Solution

by:
dfiala13 earned 2000 total points
ID: 10750642
Automation works bach as far as Word 6, so yes yuo can do it with Office XP.
0

Featured Post

VIDEO: THE CONCERTO CLOUD FOR HEALTHCARE

Modern healthcare requires a modern cloud. View this brief video to understand how the Concerto Cloud for Healthcare can help your organization.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

We all know that functional code is the leg that any good program stands on when it comes right down to it, however, if your program lacks a good user interface your product may not have the appeal needed to keep your customers happy. This issue can…
Performance in games development is paramount: every microsecond counts to be able to do everything in less than 33ms (aiming at 16ms). C# foreach statement is one of the worst performance killers, and here I explain why.
In this video, Percona Director of Solution Engineering Jon Tobin discusses the function and features of Percona Server for MongoDB. How Percona can help Percona can help you determine if Percona Server for MongoDB is the right solution for …
In this video, Percona Solutions Engineer Barrett Chambers discusses some of the basic syntax differences between MySQL and MongoDB. To learn more check out our webinar on MongoDB administration for MySQL DBA: https://www.percona.com/resources/we…
Suggested Courses

650 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question