Comments on php for fun: My Web Spider
Blog author: da404lewzer. Feed updated 2009-04-29 02:42 -07:00. 5 comments.

b0r1s, 2009-04-29 02:42 -07:00:
cool!
I'm developing a web spider too, in PHP/MySQL. The engine gets all the URLs, scans them, adds new URLs along with new words, and reprograms itself for the next domain scan based on the "visit after" meta tag. Domains are divided by starting letter, so there is one spider per letter; that way everything gets scanned and renewed in a short time. So far everything seems OK. The only thing needed is space, I really do need tons of terabytes :)

Anonymous, 2008-04-29 18:48 -07:00:
for titus:
$web_page = file_get_contents('starting URL here');
Parse $web_page using a regular expression, such as
preg_match_all("/href=[\'\"](.*)[\'\"]/iU", $web_page, $results);
Look up preg_match_all() for the return information, and just add the data to a list to search in turn.

Unknown, 2008-04-07 17:35 -07:00:
Sounds cool! Any tips on how to create this in PHP? :)

da404lewzer, 2007-11-22 15:18 -08:00:
I just like watching the randomness of what it finds. Makes me feel complete lol

Pak Behl, 2007-11-22 07:36 -08:00:
Today, the interwebs, tomorrow THE WORLD!
What will u do with all zee data?
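
For anyone following the reply to titus, here is a rough sketch of that fetch-and-parse loop. It is not the spider from the post, only one way the idea could be put together. The function names extract_links() and crawl(), the $maxPages limit, and the rule of following only absolute http(s) links are all assumptions made for illustration; the regular expression is the one given in the comment.

```php
<?php
// Sketch only: extract_links() and crawl() are hypothetical helper names.

// Return the unique href values found in a chunk of HTML.
function extract_links(string $html): array
{
    // Pattern from the comment: "i" = case-insensitive, "U" = ungreedy,
    // so (.*) stops at the first closing quote.
    preg_match_all("/href=['\"](.*)['\"]/iU", $html, $results);
    // $results[1] holds the text captured by the first group.
    return array_values(array_unique($results[1]));
}

// Breadth-first crawl from $startUrl, stopping after $maxPages fetched pages.
function crawl(string $startUrl, int $maxPages = 10): array
{
    $queue   = [$startUrl];          // the "list to search in turn"
    $seen    = [$startUrl => true];  // URLs already queued
    $visited = [];                   // URLs fetched successfully

    while ($queue && count($visited) < $maxPages) {
        $url  = array_shift($queue);
        $html = @file_get_contents($url);
        if ($html === false) {
            continue; // fetch failed, move on to the next URL
        }
        $visited[] = $url;

        foreach (extract_links($html) as $link) {
            // Assumption: only absolute http(s) links are followed;
            // relative links are skipped rather than resolved.
            if (preg_match('#^https?://#i', $link) && !isset($seen[$link])) {
                $seen[$link] = true;
                $queue[]     = $link;
            }
        }
    }
    return $visited;
}
```

Calling crawl('starting URL here') with a real address returns the list of pages it managed to fetch. A regular expression is a quick way to pull links but misses unquoted attributes and can be fooled by mixed quotes; PHP's DOMDocument is a sturdier option if that matters.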