Trackback Spam Gateway
It’s over. My referrer experiment is over… at least, in its current form. Today, I roll out firsttube.com referrer gateway version 1.0. That makes it sound fancy, but it’s not. Basically, it’s PHP to prevent trackback spam.
Traffic at firsttube.com has grown steadily, for some reason, and the logs reveal why: we get a TON of traffic from search engines, and the most popular terms are surprising – sensitive readers beware – here are the terms that most frequently drive people here:
cumtube, red-tube, uporn, adult youtube, milf, gay tube, tube 8 and many more equally odd terms.
You know why? Because, in a shrewd move that search engines seem to love, I display links back to my referrers, treating them as if they were trackbacks. But when a referrer isn’t Google, Yahoo, Live.com, or OSNews, it’s most often spam. Why? Because not only do we have “tube” in our title, but with each erroneous entry, we tell the search engine the match was a good one by linking back to that search. In short, I’m perpetuating the problem. As a result, dozens of spammers have begun issuing basic GET requests by the hundreds to place their sites in my referrer lists.
Some time ago, I began the battle by adding rel="nofollow" to all outgoing links not added via the admin section. But alas, that wasn’t good enough; the spammers didn’t care. So I implemented a pre-check, whereby referrers are matched, via regular expressions, against a list of known crap. As of today, there are 36 terms that I actively filter. In time, this will become performance intensive, if it isn’t already.
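For the curious, the pre-check idea can be sketched roughly like this. The real thing is PHP and isn't reproduced here; this is a Python stand-in, and the terms in `BLOCKED_TERMS` and the name `is_blocked` are made up for illustration (the actual 36 terms aren't listed in this post). Compiling all the terms into one pattern is one possible way to keep the cost down as the list grows, not necessarily what the site does.

```python
import re

# Hypothetical blocklist for illustration only; the real list of
# 36 filtered terms is not given in the post.
BLOCKED_TERMS = ["casino", "lyrics", "pills"]

# Join the terms into a single case-insensitive alternation so each
# referrer is checked with one search instead of one search per term.
# re.escape keeps any punctuation in a term from being read as regex syntax.
BLOCKED_RE = re.compile(
    "|".join(re.escape(term) for term in BLOCKED_TERMS),
    re.IGNORECASE,
)


def is_blocked(referrer: str) -> bool:
    """Return True if the referrer URL contains any blocked term."""
    return BLOCKED_RE.search(referrer) is not None
```

A referrer that fails the check would simply never be written to the referrer list.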
Thus, a gateway. Now, *all* referring traffic goes into a temp table, and each entry must be approved. I wrote a nice tool to batch import, batch delete, or even approve based on certain filters, such as domain or term. As it matures and I get an idea of how much time the approvals take, I will “whitelist” certain domains that can immediately post to the referrer table. In the meantime, I need to decide if I want to filter referrers with obscene unrelated terms or just leave them and let the magic run its course; after all, these are not “spam,” they are simply organic mistakes. An argument could be made that they’re interesting, and that seeing what terms and sites around the internet drive traffic to a site is most of the reason to post referrers in the first place.
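Here is a minimal sketch of how a gateway like that could hang together, again in Python with an in-memory SQLite database rather than the site's actual PHP. Everything named here is an assumption for illustration: the `pending` and `referrers` table names, the `WHITELIST` contents, and the helper functions are not the real schema or tool.

```python
import sqlite3
from urllib.parse import urlparse

# Hypothetical whitelist; trusted domains skip the approval queue.
WHITELIST = {"osnews.com"}


def domain_of(url: str) -> str:
    """Extract the host from a URL, dropping a leading 'www.'."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def make_db() -> sqlite3.Connection:
    """Create the holding table and the public referrer table."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE pending (id INTEGER PRIMARY KEY, url TEXT, domain TEXT)")
    db.execute("CREATE TABLE referrers (id INTEGER PRIMARY KEY, url TEXT, domain TEXT)")
    return db


def record_referrer(db: sqlite3.Connection, url: str) -> str:
    """Store a referrer; return the name of the table it landed in."""
    domain = domain_of(url)
    # The table name comes from this fixed pair, never from user input.
    table = "referrers" if domain in WHITELIST else "pending"
    db.execute(f"INSERT INTO {table} (url, domain) VALUES (?, ?)", (url, domain))
    return table


def approve_domain(db: sqlite3.Connection, domain: str) -> int:
    """Batch-approve: move every pending entry for a domain to referrers."""
    db.execute(
        "INSERT INTO referrers (url, domain) "
        "SELECT url, domain FROM pending WHERE domain = ?",
        (domain,),
    )
    cur = db.execute("DELETE FROM pending WHERE domain = ?", (domain,))
    return cur.rowcount  # number of entries moved out of the queue


def reject_domain(db: sqlite3.Connection, domain: str) -> int:
    """Batch-delete: drop every pending entry for a domain."""
    cur = db.execute("DELETE FROM pending WHERE domain = ?", (domain,))
    return cur.rowcount
```

The point of the design is that nothing unapproved ever reaches the public table, so a spammer's GET request earns no link until a human (or the whitelist) says so.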
Anyway, spammers, take note: I gotcher number! Stop referrer spamming me! That means you, you stupid lyrics sites!