{"id":13343,"date":"2010-10-13T13:03:20","date_gmt":"2010-10-13T20:03:20","guid":{"rendered":"http:\/\/www.kith.org\/journals\/jed\/2010\/10\/13\/13343.html"},"modified":"2010-10-13T13:03:20","modified_gmt":"2010-10-13T20:03:20","slug":"building-a-better-spambot","status":"publish","type":"post","link":"https:\/\/www.kith.org\/jed\/2010\/10\/13\/building-a-better-spambot\/","title":{"rendered":"Building a better spambot"},"content":{"rendered":"\n<p>I've been remiss in deleting spam from my moderated-comments queue in the past month-plus; as of yesterday, there were over 13,000 comments in the queue. I've been plowing through them&mdash;only 6,500 or so to go!&mdash;and rescuing the few that aren't spam; apologies to those of you whose comments have been languishing.<\/p>\n<p>Anyway, as usual when I'm deleting comment spam, I can't help but think that I could write a way better spam-comment generator than the spambots are using.<\/p>\n<p>I guess it depends on one's goal. If I were writing a spambot, I would want its comments to look as much like real comments as possible, to make comment moderators have to spend a lot of time trying to figure out whether they were real comments or not.<\/p>\n<p>When there are, say, two thousand comments in a five-day period that all say variations on &ldquo;Lindsay Lohan Goes to Rehab Today&rdquo; (in a blog that has never once mentioned Lindsay Lohan until now), it takes me less than a second per comment to skim through a hundred at a time and zap them all.<\/p>\n<p>I suppose the people operating the spambots don't care about this kind of thing. 
They want to make it hard for automated spam-recognition systems to detect the spam, so they have systems that replace words and phrases in a given piece of spam with synonyms, but I guess they don't really care whether the spam comments look like reasonable comments to a human.<\/p>\n<p>But I've heard it suggested that we'll get to strong AI via the struggle between spammers (to get their spam posted) and spam-detection systems. And I don't think the spammers are really doing their part here.<\/p>\n<p>Anyway, I'm not gonna post my ideas about how to make comment spam read more like real comments, on the unlikely chance that a spambot writer might (a) not have thought of these ideas, and (b) decide to implement them. But my ideas aren't anything revolutionary; they're based partly on stuff I've known about since college.<\/p>\n<p>So as soon as some spambot writer does start caring about realistic comments, my workload in fighting spam is going to go way up.<\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;ve been remiss in deleting spam from my moderated-comments queue in the past month-plus; as of yesterday, there were 
over&#8230;<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[67],"tags":[],"class_list":["post-13343","post","type-post","status-publish","format-standard","hentry","category-spam"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/posts\/13343","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/comments?post=13343"}],"version-history":[{"count":0,"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/posts\/13343\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/media?parent=13343"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/categories?post=13343"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.kith.org\/jed\/wp-json\/wp\/v2\/tags?post=13343"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}