Solved

Reading in the Contents of a directory on another WWW server with Perl?

Posted on 1998-12-26
Last Modified: 2010-03-05
I am looking for a way to retrieve a directory listing (the files) of a user-specified directory on a remote WWW server. I know I need to use a socket, and I can already retrieve an entire page or document. The directory listing will not be printed to the user's browser; my server will use the directory contents to build a database. I am making a search engine...

I'd really appreciate the help! Thanx!
Question by:capsite
2 Comments
 

Author Comment

by:capsite
ID: 1207162
Please provide any code or knowledge you may have! Thanks again!
 
LVL 7

Accepted Solution

by:
yoren earned 200 total points
ID: 1207163
Directory listings on the Web come in the form of HTML. In fact, you could handle any HTML page this way: retrieve the page, then parse the HTML to extract the links. Once you have the list of links, you can insert it into your database.

You can find almost everything you need already written on CPAN (www.cpan.org). Get the LWP (libwww-perl) package. That gives you the code to retrieve a Web page into a variable (LWP::UserAgent, together with HTTP::Request). To extract the links from the page, use HTML::LinkExtor, which ships in the HTML-Parser distribution.
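
A minimal sketch of this approach, assuming a reasonably recent LWP and HTML-Parser are installed and that the remote server actually returns an HTML index page for the directory. The subroutine names extract_links and list_remote_directory are made up for illustration, as is the 30-second timeout.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use LWP::UserAgent;
use HTML::LinkExtor;

# Parse HTML text and return the absolute URLs of all <a href="..."> links.
# $base is the URL the page came from; relative links are resolved against it.
# (extract_links is a hypothetical helper name, not part of LWP.)
sub extract_links {
    my ($html, $base) = @_;
    my @links;
    my $parser = HTML::LinkExtor->new(
        sub {
            my ($tag, %attr) = @_;
            # Skip <img>, <link>, etc.; keep only anchors with an href.
            return unless $tag eq 'a' && defined $attr{href};
            push @links, "$attr{href}";    # stringify the URI object
        },
        $base,    # passing a base makes HTML::LinkExtor absolutize each link
    );
    $parser->parse($html);
    $parser->eof;
    return @links;
}

# Fetch a page and return the links found in it; dies if the request fails.
# (list_remote_directory is likewise a hypothetical name.)
sub list_remote_directory {
    my ($url) = @_;
    my $ua = LWP::UserAgent->new(timeout => 30);    # timeout is an assumption
    my $response = $ua->get($url);
    die "Cannot fetch $url: " . $response->status_line . "\n"
        unless $response->is_success;
    return extract_links($response->decoded_content, $response->base);
}

# When a URL is given on the command line, print one link per line.
if (@ARGV) {
    print "$_\n" for list_remote_directory($ARGV[0]);
}
```

Saved as, say, listdir.pl, this could be run as "perl listdir.pl http://www.example.com/some/dir/" (a placeholder URL). Server-generated index pages often include extra links, such as a parent-directory entry or column-sorting links, so the list will probably need filtering before it goes into the database.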

Yuval