Thanks to KermM for convincing me to make a topic for this :)

Over the last semester, I started a project with two friends of mine whose purpose was to scan the entire web recursively, following the links found on each webpage we scanned. Starting from a seed website, such as cemetech.net, the program would find every link on that page, then scan each linked page for its links, and so on, until, theoretically, we had scanned every single page on the Internet and established how it connects to every other page. A completed setup would look very similar to a computer's folder hierarchy, with each webpage represented by its own directory or "node" containing links to every other webpage/node it linked to.
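
For a concrete picture, here is a minimal sketch (in Python, purely illustrative and not the project's actual code) of the kind of node/link structure described above:

Code:
# Illustrative only: one way to represent the "folder hierarchy" of pages.
# Each node is a webpage; its value is the set of pages it links out to.
web_graph = {}  # url -> set of urls linked from that page

def add_page(url, outgoing_links):
    """Create a node for url (if needed) and record its outgoing links."""
    node = web_graph.setdefault(url, set())
    for link in outgoing_links:
        node.add(link)
        web_graph.setdefault(link, set())  # every linked page gets a node too

add_page("https://www.cemetech.net", {"https://www.google.com"})
add_page("https://www.google.com", {"https://www.apple.com"})
print(web_graph)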

I first created a proof-of-concept running on a single computer that started at www.google.com and logged the webpages it found until I told it to stop. Within a couple of minutes of running, the program had traversed to www.apple.com and had discovered various things such as the iTunes installer and the eBooks section of the iTunes Store.
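
For readers who want to picture that proof-of-concept, here is a hedged, minimal sketch of a single-machine crawler in Python. It is not the project's code; the seed URL and page limit are arbitrary, and a real crawler would also need rate limiting and robots.txt checks (more on those below).

Code:
# Minimal illustrative crawler (not the project's code): breadth-first,
# follows <a href="..."> links, stops after a fixed number of pages.
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

class LinkParser(HTMLParser):
    def __init__(self):
        super().__init__()
        self.links = []
    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

def crawl(seed, max_pages=50):
    seen, queue = {seed}, deque([seed])
    while queue and len(seen) <= max_pages:
        url = queue.popleft()
        try:
            html = urlopen(url, timeout=10).read().decode("utf-8", errors="replace")
        except Exception:
            continue  # skip pages that fail to load or decode
        parser = LinkParser()
        parser.feed(html)
        for link in parser.links:
            absolute = urljoin(url, link)
            if absolute.startswith("http") and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
                print(url, "->", absolute)

crawl("https://www.google.com")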

However, I knew that to have any chance of fully scanning the web, a single computer and its Internet connection would not be sufficient. Thus, the project grew into a crowdsourced program, designed so that tens, hundreds, or thousands of people could scan the web and relay the results back to a central server (or servers). A connection would look like this:

Code:
Client -> Server: Client Hello: Query
Server -> Client: Webpage to scan
Client: scans webpage and finds all links
Client -> Server: Sends links
Server -> Client: Server ACK, end connection
Server: Finds all non-duplicate links and creates nodes for said links
Server: Adds links to the newly created nodes in the scanned webpage's node.
Server: Adds non-duplicate links to central database of webpages to be scanned.
Server: Repeat!
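
To make the exchange above a bit more tangible, here is a rough sketch of what one client-side round trip could look like. The JSON-over-HTTP transport and the endpoint names are assumptions made for the sake of the sketch; the post does not specify the actual wire format.

Code:
# Illustrative client-side round trip (hypothetical endpoints and JSON format,
# assumed for the sake of the sketch; not the project's actual protocol).
import json
from urllib.request import Request, urlopen

SERVER = "http://example.com/api"  # placeholder server address

def request_work():
    """Client Hello / Query: ask the server which webpage to scan next."""
    with urlopen(SERVER + "/next") as response:
        return json.load(response)["url"]

def submit_links(url, links):
    """Send the links found on url back to the server, then end the connection."""
    payload = json.dumps({"url": url, "links": links}).encode("utf-8")
    request = Request(SERVER + "/submit", data=payload,
                      headers={"Content-Type": "application/json"})
    with urlopen(request) as response:
        return response.status == 200  # server ACK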


The server was built with a verification procedure. Essentially, each link had to be scanned by two separate clients to make sure the entries matched. If they didn't, both results were thrown out and the webpage was rescanned. The program also had built-in robots.txt support so as to avoid websites that didn't want to be scanned.
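
As a rough sketch (illustrative Python, not the project's implementation) of how that double-scan check could work on the server side:

Code:
# Illustrative double-scan verification (not the project's code):
# each URL must be scanned by two different clients, and the two
# result sets must match exactly or both are discarded.
pending = {}  # url -> (client_id, frozenset of links) from the first scan

def record_scan(url, client_id, links):
    """Return the verified link set, or None if verification is pending or failed."""
    links = frozenset(links)
    if url not in pending:
        pending[url] = (client_id, links)
        return None                      # wait for a second, independent scan
    first_client, first_links = pending.pop(url)
    if first_client == client_id:
        pending[url] = (first_client, first_links)
        return None                      # same client twice doesn't count
    if first_links == links:
        return links                     # entries match: accept the result
    return None                          # mismatch: throw both out, rescan later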

The hope was to eventually create a visualization of the web, showing all the connections and intersections between the nodes in one enormous web.

The framework is essentially complete on both the server and the client; the final step is to finish debugging both and get the scanning part of the client up and working.

While I have been very busy with school recently, I hope to finish both programs by the end of June (earlier if help comes along) and to have the Internet scanned by the end of next summer.

Thoughts or offers of help appreciated!!
This sounds rather cool and quite ambitious. Do you have any concept pictures of what the end result would be if it only did, say, 50 links?
Does your crawler respect robots.txt at all?
merthsoft wrote:
Does your crawler respect robots.txt at all?

This, and how are you planning on handling redirects?
I'm glad to hear you're exploring this! About eight years ago, I worked on a project I called WordNet, which tried to spider the internet using linked URLs and then associate pages based on a combination of shared keywords and links. I described the very early experiments with the project in a news article, and I showed off one of the rendered graphs here. Each point represents a domain, the lines indicate linked URLs, the distances represent conceptual distance based on keywords, and the colors are the type of TLD (com, net, gov, edu, org, etc). As Merth asked, what does your spider look like? Does it rate-limit itself and obey robots.txt?
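
For anyone curious what a simple rendering along those lines might look like, here is a small sketch using networkx and matplotlib. The domains, edges, and color scheme are invented for illustration and are not the original WordNet data or code.

Code:
# Illustrative graph rendering (invented sample data, not the WordNet results):
# nodes are domains, edges are links between them, node color encodes the TLD.
import matplotlib.pyplot as plt
import networkx as nx

edges = [("cemetech.net", "google.com"), ("google.com", "apple.com"),
         ("cemetech.net", "mit.edu"), ("mit.edu", "nasa.gov")]
tld_colors = {"net": "tab:blue", "com": "tab:orange", "edu": "tab:green", "gov": "tab:red"}

graph = nx.Graph(edges)
colors = [tld_colors[domain.rsplit(".", 1)[-1]] for domain in graph.nodes]
layout = nx.spring_layout(graph, seed=42)  # spring layout stands in for "conceptual distance"
nx.draw_networkx(graph, pos=layout, node_color=colors, with_labels=True)
plt.show()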

I will answer the other questions later when I have more time, but as to whether it supports robots.txt, yes :)
Quote:
The program also had built in robots.txt support so as to avoid websites who didn't want to be scanned.
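
For reference, a minimal example of honoring robots.txt with Python's standard library (illustrative only; the post doesn't say how the project's check is actually implemented):

Code:
# Minimal robots.txt check with the standard library (illustrative; not the
# project's actual implementation).
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

def allowed_to_scan(url, user_agent="*"):
    """Return True if the site's robots.txt permits fetching url."""
    parts = urlsplit(url)
    robots = RobotFileParser(f"{parts.scheme}://{parts.netloc}/robots.txt")
    robots.read()
    return robots.can_fetch(user_agent, url)

print(allowed_to_scan("https://www.cemetech.net/forum/"))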
  