Yan Pritzker photographer, entrepreneur, software engineer, musician, skier

skwpspace is Yan Pritzker's home on the web


hello, i'm yan

I am a photographer, entrepreneur, software engineer, guitarist, and telemark skier

This blog is about startups, blogging, Ruby on Rails, virtualization and cloud computing, photography, customer service, marketing, UX and design, git, and lots more.

planypus

I'm the founder of Planypus, the place to share your plans!

cohesiveft

Virtualize your application for download or deploy to the cloud in minutes!


Contact

Reach me at yan at pritzker.ws

Posted
13 March 2008 @ 7pm

Tagged
rails, ruby

webcrawler bot detection

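  # Substrings that identify known crawlers, feed readers, and HTTP client
  # libraries when they appear in a User-Agent string.
  def self.bot_agent_list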
    [ "panscient", "larbin", "dummy", "Teoma", "alexa",
      "froogle", "inktomi", "looksmart", "URL_Spider_SQL",
      "Firefly", "NationalDirectory", "Ask Jeeves", "TECNOSEEK",
      "InfoSeek", "WebFindBot", "crawler", "girafobot", "Scooter",
      "Baidu", "bot", "Google", "SiteUptime", "Slurp",
      "WordPress", "ZIBB", "ZyBorg", "msnbot", "check_http",
      "libwww-perl", "lwp-trivial", "wget", "curl", "SimplePie",
      "Python", "Feed", "HTTPClient", "Tumblr", "Spider", "sanszbot"]
  end

Full source at http://pastie.org/191922
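
For anyone who just wants the gist without clicking through, here is a minimal sketch of how a list like this might be used. It is not the code from the pastie: the `BotDetection` module, the `bot?` method, and the shortened `BOT_AGENTS` sample are hypothetical names made up for illustration, and the case-insensitive substring match is an assumption about how the matching could work.

```ruby
# Hypothetical helper, not the original source. Names are illustrative only.
module BotDetection
  # Shortened sample of the agent list above.
  BOT_AGENTS = ["bot", "crawler", "Spider", "Slurp", "wget", "curl"].freeze

  # One case-insensitive pattern built from the escaped substrings
  # (assumption: matching ignores case and looks anywhere in the string).
  BOT_PATTERN = Regexp.new(
    BOT_AGENTS.map { |agent| Regexp.escape(agent) }.join("|"),
    Regexp::IGNORECASE
  )

  # Returns true when the User-Agent string contains any listed substring.
  # A missing or empty User-Agent is treated as "not a bot" here; that is a
  # design choice, and some sites may prefer the opposite.
  def self.bot?(user_agent)
    return false if user_agent.nil? || user_agent.empty?
    BOT_PATTERN.match?(user_agent)
  end
end
```

In a Rails controller this could be called as `BotDetection.bot?(request.user_agent)`. Keep in mind that short entries such as "bot" or "Feed" match very broadly, so a list like this trades some false positives for simplicity.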


2 Comments

Posted by
igor
16 March 2008 @ 5am

robots don’t smell.


Posted by
yan
17 March 2008 @ 3pm

but they can sure cause a stink

(cymbal crash)

