I applaud the DDG spirit but without their own crawling data their future is doo...

ithkuil · on Feb 20, 2014

ck2 · on Feb 20, 2014

They aren't collecting much if anything.

hueving · on Feb 20, 2014

>They need to build their own crawlers like gigablast.

Unfortunately the opportunity for that is pretty meek. Many webmasters block crawlers that aren't the top search engines. :-(

_euvw · on Feb 20, 2014

Can't duckduckbot just present itself to the webserver as GoogleBot?

Aldo_MX · on Feb 21, 2014

In our company we do a forward-confirmed DNS to verify whenever a bot is who it claims to be.

Oddly enough we have blocked legit googlebot/bing/baidu servers, because they fail to properly configure their servers...

nly · on Feb 21, 2014

Their servers are probably configured fine. They likely have a pool of servers with no reverse DNS to try and catch servers issuing different content to Googlebot

Aldo_MX · on March 5, 2014

I'm pretty confident that the server 1.2.3.4 which returns crawl-5-6-7-8-googlebot.com it's a badly configured one.