How I block comment spam
By Pete Freitag
You would think that by having custom written blogging software (only two other blogs out there are using this code), and not allowing HTML in comments, that comment spammers would not waste their time on me. But they do.
Why do they bother with me?
Even though their URL will not be hyperlinked on my blog giving them pagerank, they still spam because they are hoping for the following:
- They want me to click on the link before I delete it.
- They are hoping that people subscribed to the comment thread will click on the link.
- They are planting keywords on my pages so that someone searching for the term in google, may find my page, and copy and paste the url.
When I was at the bloggers BOF at cfunited, people mentioned that when using Ray Camden's blog CFC software they didn't get much if any comment spam. I think that is because the comment form is located in a popup window launched by javascript. So it's more of a hassle for spammers to spam them. I however would rather keep my comment form on my entry page, so it's easier for readers to post comments.
What I do to block comment spam
Here's what I do to block comment spam on this blog:
- Check HTTP Referrer to make sure it's coming from my site. I know some people like to turn this off in their browser, but they won't be able to post comments unless they turn it on.
- If the comment contains a HTML link I reject it, giving the user a detailed message that tells them to just post the url.
- Check for a set of bad words - my list is very small only about 10 words currently.
- Check for [url] - a lot of comment spammers try to pass the links as [url]http://foo[/url]
- Look for more than 5 url's in the comment. Comment spammers often try to post 10-20 urls at a time, so I just reject them. I use this regular expression
REFindNoCase("(http:.*){5,}", form.comment)
How I block comment spam was first published on July 19, 2005.
If you like reading about blog, spam, or comment spam then you might also like:
- Battling Comment Spam
- Over 90% of trackbacks were spam
- Google Blog Search - Not Impressed
- Trackback Salt
Discuss / Follow me on Twitter ↯
Tweet Follow @pfreitagComments
REFindNoCase("(https?://.*?){5,}", form.comment)
While I was getting a handful of spam messages a week, I've dropped down to zero.