?
Solved

Doc(RTF) -> XML with Java

Posted on 2003-03-07
13
Medium Priority
?
410 Views
Last Modified: 2008-03-17
I need to convert a word document into xml using Java...any ideas?  I want to have total control over what in the word doc is converted...primarily i will want to ignore most of the data expect that which has been interested into fields by the user.  I will specify a special style for these fields in word, such as style = Company Name...and then i need something to convert this field into an xml tag company name.  Any one know of any open source projects in java that would help me with this task?

0
Comment
Question by:menreeq
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 5
  • 3
  • 2
  • +3
13 Comments
 
LVL 86

Expert Comment

by:CEHJ
ID: 8090407
Listening...
0
 

Author Comment

by:menreeq
ID: 8090472
What does "Listening..." mean?
0
 
LVL 86

Expert Comment

by:CEHJ
ID: 8090507
Waiting for comments
0
Get real performance insights from real users

Key features:
- Total Pages Views and Load times
- Top Pages Viewed and Load Times
- Real Time Site Page Build Performance
- Users’ Browser and Platform Performance
- Geographic User Breakdown
- And more

 

Author Comment

by:menreeq
ID: 8090560
sweet
0
 
LVL 1

Expert Comment

by:shji1
ID: 8091365
Some source I found for reading RTF files. MAybe you will find it useful...

http://www.cs.umb.edu/~bill/java/jdk1.3/src/javax/swing/text/rtf/

Strangly enough, these classes are in the language, but I could not find them in the API....
I wonder
0
 
LVL 35

Expert Comment

by:girionis
ID: 8091686
 You can use Majix by IBM: http://www-106.ibm.com/developerworks/xml/library/x-tiprtf/ and you can download it from here: http://tetrasys.dhs.org/

  Hope it helps.
0
 
LVL 35

Expert Comment

by:girionis
ID: 8091691
> You can use Majix by IBM

  Sorry, it is by TetraSix and not by IBM. It's just the review of Majix on IBM's site.
0
 

Author Comment

by:menreeq
ID: 8091855
girionis, i have tried to use that software but it does not provide me with the flexibility that i require.  

What do you guys think about using macros in the word doc to some how capture the data in the form in a fashion that would allow me to easily recapture it when i need to process the form?  i have never ever used macros...nor do i know much about it.

shji1, i am going to look into your suggestion now.
0
 
LVL 35

Expert Comment

by:girionis
ID: 8091917
 Sorry menreeg I have never actually used ms word macros with java so I am unable to offer you anyhelp.

  Some more links that *might* be of interest to you though:

http://www.idealliance.org/papers/xmle02/dx_xmle02/papers/02-06-03/02-06-03.html
http://www.jguru.com/faq/view.jsp?EID=213149
http://www.rtf2fo.com/
0
 

Author Comment

by:menreeq
ID: 8116039
i don't think there is a good solution to my question, i am going to ask the admin to del it
0
 

Expert Comment

by:simonletter
ID: 8125523
I haven't work with OpenOffice before but it might help. http://www.openoffice.org. Download the ODK.

Have a look at OOo_1.0.2beta_Win32Intel_odk\OpenOffice.org1.0.2_Beta_SDK\examples\java\DocumentConverter\DocumentConverter.java

I think you'll need to start soffice or something that works as a UNO server whatever that is. So you'll write the client to get soffice to open the Word doc. I'm not sure what the API allows you to do.
0
 

Author Comment

by:menreeq
ID: 8125731
Well i found a good solution...I am using c# to access the word file where in less then 10 lines i was able to read each field and write my txt file...thanks for trying to help
0
 
LVL 5

Accepted Solution

by:
Netminder earned 0 total points
ID: 8125899
User resolved; points refunded and question closed.

Netminder
EE Admin
0

Featured Post

VIDEO: THE CONCERTO CLOUD FOR HEALTHCARE

Modern healthcare requires a modern cloud. View this brief video to understand how the Concerto Cloud for Healthcare can help your organization.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

By the end of 1980s, object oriented programming using languages like C++, Simula69 and ObjectPascal gained momentum. It looked like programmers finally found the perfect language. C++ successfully combined the object oriented principles of Simula w…
Introduction This article is the second of three articles that explain why and how the Experts Exchange QA Team does test automation for our web site. This article covers the basic installation and configuration of the test automation tools used by…
Video by: Michael
Viewers learn about how to reduce the potential repetitiveness of coding in main by developing methods to perform specific tasks for their program. Additionally, objects are introduced for the purpose of learning how to call methods in Java. Define …
This tutorial covers a practical example of lazy loading technique and early loading technique in a Singleton Design Pattern.
Suggested Courses
Course of the Month11 days, 17 hours left to enroll

752 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question