Solved

Need to convert HTML tables to CSV file

Posted on 2004-08-04
Last Modified: 2008-01-09
I need a program to convert HTML tables into a CSV file for import into a SQL Server table. I've seen one program, but it was written in VB 3 or something. Has anyone come across anything like that?

Dinesh
Question by:dprasad
13 Comments
 
LVL 55

Accepted Solution

by:
Jaime Olivares earned 1000 total points
ID: 11723002
Which language do you prefer?
Here is an example in Perl:
http://www.experts-exchange.com/Programming/Programming_Languages/Perl/Q_20287078.html
And here is another:
http://www.coscorrosa.com/programs/cgi/html2csv/html2csv.pl

I guess it will be easy to translate to PHP or ASP.
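For readers who would rather not use Perl, here is a minimal sketch of the same idea in Python, using only the standard library. It is not taken from either linked script; the names `TableExtractor` and `html_tables_to_csv` are made up for illustration. It assumes reasonably well-formed HTML where every cell has a closing tag, and it does not handle nested tables.

```python
import csv
import io
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by row and table."""

    def __init__(self):
        super().__init__()
        self.tables = []   # list of tables; each table is a list of rows
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)


def html_tables_to_csv(html, table_index=0):
    """Return the chosen table as CSV text, one line per table row."""
    parser = TableExtractor()
    parser.feed(html)
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.tables[table_index])
    return out.getvalue()
```

To produce a file, write the returned string to a path of your choice, for example with `open("out.csv", "w", newline="")`. The `csv` module takes care of quoting cells that contain commas.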
 
LVL 3

Expert Comment

by:TILL
ID: 11723384
You can use Excel with its Web Import feature. It is the easiest way.
 

Author Comment

by:dprasad
ID: 11723932
TILL, I'm using Excel 2000 and I don't see a web import feature. I tried opening the file directly, and also tried pasting it as HTML and as text. When I do that, it puts everything in one column in the CSV file. How should I do this?
 
LVL 3

Assisted Solution

by:TILL
TILL earned 1000 total points
ID: 11723968
From the Excel menu, go to Data -> Import External Data -> New Web Query.
 

Author Comment

by:dprasad
ID: 11723998
I can find a newer copy; I'll try it with that.
 
LVL 3

Expert Comment

by:TILL
ID: 11724013
When you go to Web Query you have to enter a URL, which can be an internet address or a local file. If the HTML file is stored locally, you can enter the URL like file:///c:\test.html
After that, the Web Query dialog will show you all the tables that you have in that HTML file. All you have to do is select the right table, click OK and choose the drop point for your table. And voila... the document is ready. After that you save it as CSV and you're ready to go. ;)

Best of luck with that.
TILL
 

Author Comment

by:dprasad
ID: 11729048
OK, I got Office XP. It works, but because of the way the data is set up in the tables, I don't get it in the right format.
The problem is that, on the web page, the data is set up like this:
Name:  1999 Cayuse Walla Walla Valley Syrah

Vintage: 1999
Style: Syrah
Vineyard:
Region: Walla Walla, Washington
Description:  blah blah
blah
Size: .750/ml  Qty Available: 3
Store: A Wine Store Washington

with a price in a new column on the right. Also, the description field is variable length. So in Excel, everything shows up in the first column. Do you know how I could put each data element in its own column without copying and pasting each one? I have about 1,000 of these to put in.
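One way to approach the layout described above is to treat each "Label: value" pair as a field and start a new row whenever a "Name:" label appears. The following is a rough Python sketch under several assumptions: the first column has already been saved as plain text, the labels are exactly the ones shown in the sample, and the description never contains one of those labels followed by a colon. The function names are hypothetical, and the separate price column is not handled.

```python
import csv
import io
import re

# Labels taken from the sample record in the post.
FIELDS = ["Name", "Vintage", "Style", "Vineyard", "Region",
          "Description", "Size", "Qty Available", "Store"]

# Matches any known label followed by a colon, e.g. "Qty Available:".
LABEL = re.compile(r"\b(" + "|".join(re.escape(f) for f in FIELDS) + r"):")


def parse_records(text):
    """Turn blocks of 'Label: value' lines into a list of dicts."""
    records = []
    current = None  # record being filled
    last = None     # most recent label, used for continuation lines
    for line in text.splitlines():
        # With a capturing group, split yields [lead, label, value, label, value, ...].
        parts = LABEL.split(line.strip())
        lead, pairs = parts[0].strip(), parts[1:]
        if lead and current is not None and last is not None:
            # Text before any label continues the previous field (long descriptions).
            current[last] = (current[last] + " " + lead).strip()
        for label, value in zip(pairs[0::2], pairs[1::2]):
            if label == "Name":
                current = dict.fromkeys(FIELDS, "")
                records.append(current)
            if current is None:
                continue  # ignore labels that appear before the first Name
            current[label] = value.strip()
            last = label
    return records


def records_to_csv(records):
    """Write the records as CSV text with one column per label."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return out.getvalue()
```

Because the split is driven by the label list, a line such as "Size: .750/ml  Qty Available: 3" ends up in two separate columns, and an empty "Vineyard:" simply yields an empty cell.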
 

Author Comment

by:dprasad
ID: 11729230
Jaime: I tried running the second Perl script you listed. I have very limited knowledge of Perl, but I can get the scripts to work. I'm seeing the program run and display the information on the screen. But how do I get the converted tables into a CSV file? What's the command-line syntax? Thanks!

Dinesh
 
LVL 55

Expert Comment

by:Jaime Olivares
ID: 11729299
Just use "Save as..."
 

Author Comment

by:dprasad
ID: 11729410
I don't understand. How do I do "Save as..." when I run the script?
 
LVL 55

Expert Comment

by:Jaime Olivares
ID: 11729453
I can't run the script right now, but if you see the results in a web browser, just save the page as a .txt file (then change the extension to .csv).
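If the script is run from a command line instead and prints its results to the screen, the usual approach is to send its standard output to a file (what the shell's `>` redirection does). Here is a small Python sketch of that idea. The thread does not show the script's actual command-line arguments, so the example uses a stand-in command, and `save_output` is a hypothetical helper name.

```python
import subprocess
import sys


def save_output(command, out_path):
    """Run command and write whatever it prints to standard output into out_path."""
    with open(out_path, "w", newline="") as out_file:
        # check=True raises an error if the command exits with a failure status.
        subprocess.run(command, stdout=out_file, check=True)


if __name__ == "__main__":
    # Stand-in command that prints one CSV line; replace it with the real
    # script invocation once its arguments are known.
    save_output([sys.executable, "-c", "print('Name,Price')"], "output.csv")
```

This only works if the script writes the CSV data to standard output rather than directly into a web page.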