Link to home
Start Free TrialLog in
Avatar of Mlungisi Ndlela
Mlungisi NdlelaFlag for South Africa

asked on

Get internet domains/urls

I've tried to dig more into this and there was a kind of a smaller solution I figured out but after looking into it very close I see that it will be a problem in a long run so I'm trying to find the best way of getting let me say the list or all if possible internet web addresses, like if a new domain is registered I should also get it, my question is how can I find these, I know we may think crawling sites, but the reality now is that crawling sites will be something that wont be supported in a near future, that because I've also tried to check some sites that I wanted to crawl so that I can get more sites addresses to visit and crawl but people are now preventing crawling by hiding the addresses or links, like for instance a site that people post business ads for free, they hide the link that shows the full detailed ad details of the post by using some ad id which there script understands and dynamically loads the ad to the user, so crawling such sites require each custom site crawler after you have figured out how are they hiding this.

Bing, Google do find these information, how can I also find this information. I'm more interested in fining domains/sites urls.

Any one who can help on this?
Avatar of David Favor
David Favor
Flag of United States of America image

Ask your question in a single sentence which fits on one line.
Avatar of Mlungisi Ndlela

ASKER

What I want is to get if possible all web address(domains) for every website on the internet.
not every domain has a website and not every website has a unique domain. I've used tools like mxtoolbox to find domains hosted on a particular IP address.
Thanks that is helpful, I will have a look at this tool and see how it work and if it will do what I want, and I should also report back here on how it went. Thanks.
Ok just had a look at this tool and also tried it, but it just perform some lookup for that web address/domain I supplied and return information for that domain it doesn't return more domains as I thought it would based on what you said. How do you find domains hosted on that particular domain using this tool, maybe I was doing something wrong but I tried almost every option that is available on this tool.

This made me realize of domain trees and forest but I'm not sure if I can be able to hunt a forest that is not mine or not hosted by me/my server.
there are many, many new root domains i.e. .com, .net, .org, .info, .org, .online, .co, .store, .club, .app, .info, .life, .cologne, .ca, .co.uk, .co.au, .site, .io, .in, .pro, .me, .world, .website, .net, .tech, .tv, .uk, .mobi, .work, .today, .solutions
Even more added every year. For Example, here is 2018
https://iwantmyname.com/domains/new-gtld-domain-extensions

you'd have to contact each tld (top level domain) registrar and get a list of the issued domains and then start crawling those domains.  you'd basically be a scaled down version of google, bing, yahoo, yandex.. the bandwidth charges alone would make it a non-trivial dream.
Thanks, that's the exact
kind of a smaller solution I figured out
that I was referring to. I guess this is the only way to get the list. I will continue digging more into this and see other solutions i can have on this.
This question needs an answer!
Become an EE member today
7 DAY FREE TRIAL
Members can start a 7-Day Free trial then enjoy unlimited access to the platform.
View membership options
or
Learn why we charge membership fees
We get it - no one likes a content blocker. Take one extra minute and find out why we block content.