stargeek
PHP news website logo.
home    PHP scripts    articles    seo tools    links    search    contact    shop    realtors


Fighting comment spam, linking without google juice and other reasons to use the new href attribute







Fighting comment spam, linking without
google juice and other reasons to use
the new href attribute

Fighting comment spam, linking without
google juice and other reasons to use
the new href attribute
02/01/2005 08:40 PM

Fighting blog spam, allowing linking to people you don't like without giving them Googlejuice. The new link attribute has many applications. Most like it, but some think its a bad idea.




This is a GrokNews Entry: (what is grok?)





Similar Items

Fighting comment spam, linking without google juice and other reasons to use the new href attribute

Grok Headline matches for Fighting comment spam, linking without google juice and other reasons to use the new href attribute

Fighting comment spam


Fighting comment spam 02/01/2005 08:40 PM
Jay Allen has written a very nice document on how to fight comment spam Jay Allen should know a thing or two about comment spam. Before joing Six Apart he wrote the now famous MT-Blacklist plugin for Movable Type. If...

Guide for Fighting Comment Spam


Guide for Fighting Comment Spam 01/04/2005 08:15 PM
Call it a late holiday gift or a great way to start the new year. In either case, we are...

Step one in comment spam fighting


Step one in comment spam fighting 11/14/2003 10:54 PM
One of the drawbacks to rolling your own weblog software is that any time you want to add a feature you have to do it yourself.

For instance, I’ve wished a few times that I could use Jay Allen’s MT-Blacklist plugin. It would make it easier to fight comment spam.

(I have very good reasons for sticking with my own weblog software. I’m not going to change; please don’t suggest it.)

However, my software has a feature that would be cool to see in other weblog software, so I wanted to mention it: there’s an RSS feed that shows the last n comments, no matter which post they’re in reply to.

This means that no comment spam appears, even in very old posts, that I don’t see. I still have to go to the trouble of deleting it—but it’s much better than not knowing about it.

It may be that some other weblog software packages already have this feature. If so—cool. If yours doesn’t have this feature, you might want to consider it. I totally rely on it myself (and not just for fighting comment spam).

You might say—well, my weblog software does email notifications of comments, so an RSS feed of recent comments isn’t needed.

And I’d reply—well, my software has email notifications too. I found that I hardly ever looked at them. In amongst all the other email noise, comments notifications don’t work that well.

But an RSS feed for recent comments works wonderfully.

Fractured Realities: Fighting Against
Comment Spam


Fractured Realities: Fighting Against
Comment Spam
02/01/2005 09:14 PM
In a new posting over on Fractured Realities (Davey Shafik's weblog) today, he talks more about the eternal struggle these days with that menace of weblogs everywhere - comment spam.

New Google Spam Fighting


New Google Spam Fighting 03/11/2003 01:22 AM
WebmasterWorld: Good news about expired domains. ``We've also put more of a focus on algorithmic improvements for spam issues. One resulting improvement with this index is better handling of expired domains--the authority for a domain will be reset when a domain expires, even though dangling links to the expired domain are still out on the web. We'll be rolling this change in over the next few months starting with this index. [...] I think you'll see more emphasis at Google on scalable algorithms rather than responding to individual spam reports.'' Basically, when you buy a domain that previously expired, you won't get the PageRank the old domain's owner accrued. Neat idea....

Symantec Brightmail Anti-Spam v6.0
delivers next-generation spam-fighting
power, control to enterprises


Symantec Brightmail Anti-Spam v6.0
delivers next-generation spam-fighting
power, control to enterprises
07/01/2004 10:20 PM
Sunday Times South Africa Jul 2 2004 2:42AM GMT

Google Ignoring ALT Attribute Text


Google Ignoring ALT Attribute Text 01/30/2003 01:33 PM
Word from the trenches is that Google has started to ignore image ALT attributes.

Google preps new tool to juice revenue


Google preps new tool to juice revenue 05/10/2004 06:01 AM
The search engine giant plans a keyword service that will let advertisers automatically tap obscure terms and phrases, CNET News.com has learned.

Linking for Traffic: The Shift from Link
Directories to Hyper-Targeted Linking


Linking for Traffic: The Shift from Link
Directories to Hyper-Targeted Linking
07/04/2004 06:57 PM
WebDevInfo Jul 4 2004 11:27PM GMT

I put <a
href="https://www.google.com/adsense/hom
e">Google Adsense</a>


I put <a
href="https://www.google.com/adsense/hom
e">Google Adsense</a>
10/30/2003 08:17 PM
I put <a href="https://www.google.com/adsense/home">Google Adsense</a> on <a href="http://easytopicmaps.com/">Easy Topic Maps</a> two weeks ago. There have been 1,324 impressions and 40 clickthroughs (about 3 by myself), which gives an impressive average clickthrough rate of 3.0% (I'm not <a href="http://unicast.org/archives/000866.html">supposed</a> to say...

The Web We Weave, Linking for Google


The Web We Weave, Linking for Google 04/14/2005 07:24 PM

Comment spam


Comment spam 12/09/2003 05:05 PM
I started to come under comment spam fire again today. It didn’t last long. (It could be that they’re just taking a break.)

What happens to people that they grow up to be so unethical? Just wondering.

Comment Spam Changes


Comment Spam Changes 03/06/2004 02:03 AM
After giving up on the fight against Comment Spam on my blog, I have resorted to opening up comments on my most recent entries, and monitoring them for spam, and closing comments after they have had plenty of time to...

No more comment spam


No more comment spam 02/01/2005 09:33 PM
I've been waiting forever for someone tp pick up doing a particular project to fight comment spam. A couple of days ago I got tired of waiting and put it together in an evening. It's no silver bullet, but it did cut the amount of comment spam I get down to a fraction of what it was before. And it should scale in a way so the spammers can't easily program their way around it if many people start using...

MT Comment Spam


MT Comment Spam 01/16/2004 11:05 AM
So let's say you run a reasonably popular weblog that's open to comments from anyone and everyone. Let's also say in the same breath that you don't necessarily believe that turning off comments on older entries is a good...

Comment spam again


Comment spam again 10/29/2003 01:15 AM
I was recently hit by a mass comment spammer, leaving 21 comments on old entries in my blog, and so...

MT Comment Spam Fix


MT Comment Spam Fix 06/22/2004 12:20 PM

After the spam problem of a few weeks ago, I took a single step that Adam Kalsey talked about a long time ago that has fairly well fixed my spam problem: I renamed the Movable Type comments script. I have had exactly one spam in the last three weeks.

Spam bots, it seems, are designed to go after the default script name. You can rename the script (view the source sometime to see what I called it) and change the value in the mt.cfg file. I'm sure my log file is full of 404 requests to mt-comments.cgi.

It doesn't seem that spam bots are parsing the comment forms to find the name of the target script. It's probably just a matter of time.

Click here to comment on this entry


Comment Spam (Again)


Comment Spam (Again) 12/19/2004 02:58 PM
To continue my own post about running MT-Blacklist: Comment spams blocked: 2735 Comment spams moderated: 238 Duplicates blocked: 1 Blacklist...

"comment spam"


"comment spam" 07/05/2004 09:37 AM

Comment Spam


Comment Spam 10/28/2003 11:06 PM

Now that I am back home and rested, it is time to share an amusing story... as Randy noticed, I got some comment spam on Monday, all referencing an online gambling site.

32 comments in the course of 65 minutes.  The last 9 of which were not seen by anybody as I had blocked the ip address by then.

65 minutes to create.  Carefully crafted to appear to be on topic.  10 seconds to wipe out.


Businesses sue Google for linking to
rivals' ads


Businesses sue Google for linking to
rivals' ads
05/28/2004 06:14 AM
Seattle Times May 28 2004 9:48AM GMT

United against comment spam


United against comment spam 02/01/2005 08:43 PM

This is already being blogged all over the place, but I have to shout about it, too: several major search and blogging organizations (including Google, Yahoo! and Six Apart) have agreed upon a simple method to significantly reduce comment spam.


Comment spam update


Comment spam update 12/17/2004 06:44 PM
I've taken additional steps to prevent comment spam which will no longer affect normal site operation, so full posting features are once again avaialable.

[[ Visit http://www.macmegasite.com for full article ]]

new trends in comment spam


new trends in comment spam 08/02/2004 05:25 PM
kalsey has some good info as well. the hard part is getting people to upgrade.

New comment spam technique


New comment spam technique 07/30/2004 01:38 AM

The arms race against comment spammers has been stepped up a notch. I received a flurry of spam that linked to entries on other blogs. Curious to see what that was all about, I clicked on one of the links, fully expecting to be redirected to porn or an online casino. I was surprised to see a discussion of patent law; this comment spam linked to a legitimate site.

The comment that I received was certainly spam — other than the odd link, it was the typical formula: the name was “online casinos,” fake generic email address, and a vapid comment. Certainly a Stanford law professor hadn’t actually sent the spam. There was another reason this spammer was promoting someone else’s blog entry.The blog entry in question was full of comment spam. In the last 3 months, this entry had accumulated thousands of spam links in the comments.

It appears the spammers have a new tactic in increasing their PageRank. They find a site that doesn’t delete comment spam and fill it with links. Then they boost the PR of that site by spamming it in blog comments. Once the spam-friendly’s site has in increased Google ranking, all those spammed links in their comments will get a boost in rank as well.

It’s rather clever, actually.

I’m leaving out a link to the spam-ridden blog entry on purpose. I don’t want to give the spammers the link they want. If you want to see the page in question, find Elizabeth Rader’s March 1, 2004 entry called “All rights reserved in Birth Control for Flatworms” on cyberlaw.stanford.edu.

If you are a site that is apathetic toward link spam, it is now time to choose a side. If you continue your apathy and allow comment spam links to linger on your site you are helping the spammers. Spam friendly sites will now be placed on the list of blacklisted domains that are not allowed to post comments on this site.

In the war on spam if you are not for us; if you choose to look the other way and allow spammers to use your site; if you feel that keeping your site free from spam is too much trouble — you are against us.


comment spam eliminated?


comment spam eliminated? 05/29/2004 07:36 PM
Vive Le Canada,Canada-1 hour ago ... If you're interested, these comment spammers are trying to improve their ranking in google by dropping thousands of links to their shady websites all over the ...

Comment and Trackback Spam


Comment and Trackback Spam 03/14/2005 05:05 PM
Comment spam has increased to the point where I've reluctantly had to disable allowing unregistered readers to post comments for submission. I'm spending too much time cleaning it up as well as trackback spam. I am also disabling trackbacks. This is a shame as it undermines the connectivity that...

Solving comment spam


Solving comment spam 01/27/2004 10:57 PM

There are two main schools of thought concerning comment spam: the optimists and the defeatists. Optimists believe that comment spam can be beaten with technology; defeatists (maybe I should call them pessimists) believe that comments are as doomed as email and we're all going to hell in a hand basket.

The story so far

I fall squarely in to the techno-optimist category. Back in September I started blacklisting domains linked to from spam comments, defending against return visits from spammers and allowing others to syndicate my block list to run on their own site. Then in October I tweaked my comment system to eliminate PageRank from links in comments, making spamming for search engine optimisation a futile exercise. Of course, this measure only works if spammers realise it's there (I know at least one has) which is why I'm personally very happy to see that the latest release of Moveable Type has adopted the technique - to mixed reviews from the MT community.

There have been a whole bunch of other technological innovations over the past few months. Sam Ruby has implemented throttling to ban people who post three consecutive comments, and has some great ideas about guarding against strangers. Jay Allen's MT-Blacklist makes the blacklisting concept available to a wide audience. Meanwhile, James Seng's MT-Bayesian introduces trainable spam filters adapted from the fight against email spam.

The challenges ahead

So those are the solutions so far; the critical question is whether they work. The amount of spam I've been getting has definitely decreased, but as I run a completely custom blogging system I'm safe from the automated scripts that target more widespread systems - other sites make easier targets. Now that the less ethical search engine optimisers have started to catch on to the potential of comment spam to improve their PageRank the amount of spam can only increase. Some bloggers have already started to disable comments entirely (thankfully Dan turned them back on again shortly afterwards), setting a worrying precedent for the elimination two way interactions comments allow between bloggers and non-bloggers.

I'll put it in writing now: I will never disable comments on this blog. In the past few months the comments here have proved far more interesting and valuable than my actual posts, and I really appreciate the quality of the discussions that have arisen here. I will take whatever steps are necessary to keep this a useful environment for discussion.

Many people have hailed user registration as the ultimate solution to spam. It isn't, because the value of PageRank is just too high - and writing a script to automatically create accounts (even with email confirmation required) is child's play to anyone who is competent in an internet-aware scripting language. Even accessibility-impeding captchas are no defence against spammers who can afford to employ cheap labour to defeat them - and with search engine rankings as critical as they are there's no shortage of spam dollars.

With those ruled out, let's look at the remaining solutions:

The killer

Without links, comment spam has no purpose. To eliminate spam, eliminate links. Redirecting them through a PageRank killer already achieves this, but proves too subtle for spammers intent on spreading their links as widely as they can. Too truly eliminate spam, strip out links and anything that even looks like a URL and force the spammer to preview their carefully crafted advertisement before hitting submit. Seeing as hyperlinks are the single most important feature of the web this may seem draconian - and indeed it is. But on a site that serves more as a discussion forum than a farm and where the alternative to killing links is killing comments entirely this could be the saving factor.

For most blogs however links are an essential part of the discourse - I certainly wouldn't want to disable them here. Now only do they add huge value to the discussions, but more importantly they act as a "signature" for many commenters - knowing a comment is by "Dan" is far less useful than knowing that it's by Dan from www.simplebits.com.

Finding a compromise

Draconian measures such as the above wouldn't be necessary if spammers would wise up to the fact that their carefully crafted missives were having no effect on their precious PageRank. The real challenge then is to make anti-PageRank measures obvious to even the most brain-addled viagra peddlers. I've taken the first step towards this by turning on compulsory previewing for comments, which should have the added benefit of reminding legitimate commenters to use paragraph tags. I'll be working on ways of making the anti PageRank measures more obvious over the next few days, as and when work permits.

I've seen people argue that depriving legitimate commenters of PageRank is a poor compromise. I disagree: if the only cost of eliminating the incentive to spam is the loss of some Google ego then I see it as a price well worth paying. Of course, I say that as someone who's already built up their Google ego but at the end of the day it's my blog, my rules. One solution I've considered is creating a whitelist of sites that frequent commenters use in their signatures, causing them to be displayed without a redirect.

Comment spam is a solvable problem. Furthermore, blogging about comment spamming is almost as dull as blogging about blogging. Let's hurry up and solve it so we can go back to blogging about cats.


Comment Spam Attack


Comment Spam Attack 02/05/2005 09:12 PM

So, apparently I'm not the only one that was hit by some bleepity-bleep-bleep spammer trying to post 400+ comment spams to my blogs. MT-B blocked about 300 of them, moderated 80, and let 4 through. That's pretty decent. The other 80 all had the same base domain so future attacks will fail for that one domain. There are also regular expressions in place now that should moderate the more ... interesting ones.

Your comments may get moderated if you include any terms relating to animal sex or incest. If so, I'll notice when I check my mail next and approve/reject it, so don't worry. A little delay is all. Keep those illegal-in-Alabama discussions going! Woo! Eye-wink

That said, I'm wondering if going TypeKey-only is the way to go. Yes, it makes you make an account (boo-hoo) but it keeps things a little more sane on the management end. If I get two more of these full-on assaults I'll do it, but not until then. It will alienate the more lazy amongst you.


Comment Spam Flood


Comment Spam Flood 01/16/2004 11:33 AM
I just got 500+ comment spams (mainly for zoo sex, apparently) from someone who changes IP addresses every 3 msgs and changes the offensive link in every message. This defeats the MT Blacklist program I've been relying on. Help! I don't have time to manually strip out 500 spams. I will have to close comments (if I can figure out how to do so for all previous entries)....

Linking Schemes for Achieving Google
Rankings


Linking Schemes for Achieving Google
Rankings
12/30/2004 07:54 PM
Search Engine Journal Dec 30 2004 10:19PM GMT

"MT-Blacklist/Comment Spam
Clearinghouse"


"MT-Blacklist/Comment Spam
Clearinghouse"
11/10/2003 11:14 PM

The Comment Spam Arms Race


The Comment Spam Arms Race 09/15/2004 03:32 AM
  • Mark Glaser (Online Journalism Review): Bloggers Declare War on Comment Spam, but Can They Win? Spammers find a way to game Google search results by posting links in comments sections of popular blogs. Now the makers of Movable Type and bloggers are banding together to try to keep real-time interactivity alive in the blogosphere.

  • Nigerian Scam as comment spam


    Nigerian Scam as comment spam 02/01/2005 08:40 PM
    Nigeria scammers using comment spamming to fish for fools is certanly a new and "novel" approach. Hopefully anyone reading this post will understand exactly what kind of a scam this is.

    Comment spam load issue


    Comment spam load issue 12/17/2004 06:37 PM
    Hi everyone, my name is Jay Allen and I am the Product Manager for Movable Type. I'm writing today to...

    WordPress comment & trackback spam


    WordPress comment & trackback spam 02/06/2005 03:07 AM
    spampop

    candygenius.com/spampop
    track this site | 2 links


    Six Apart Guide to Combatting Comment
    Spam


    Six Apart Guide to Combatting Comment
    Spam
    01/05/2005 11:33 AM
    Six Apart Guide to Combatting Comment Spam

    sixapart.com/pronet/comment_spam.html
    track this site | 4 links


    MT-Blacklist/Comment Spam Clearinghouse


    MT-Blacklist/Comment Spam Clearinghouse 11/11/2003 07:06 AM
    The MT-Blacklist/Comment Spam Clearinghouse .. Jay Allen's MT-Blacklist

    jayallen.org/comment_spam
    track this site | 6 links


    "Mena on comment spam and the new
    version of MT"


    "Mena on comment spam and the new
    version of MT"
    12/24/2004 01:00 PM

    Grok Description matches for Fighting comment spam, linking without google juice and other reasons to use the new href attribute
    GrokA matches for Fighting comment spam, linking without google juice and other reasons to use the new href attribute

    Fighting comment spam, linking without google juice and other reasons to use the new href attribute

    The following phrases have been identified by the grok system as matching this entry:

















    Also check out:


    Grok

    Ipod Porn on the
    Rise

    Brief Abstract of
    Wikipedia's
    Mesothelioma Cancer
    page

    Get first aid
    instructions in your
    cell phone

    IE is crap
    JSPWiki gains
    podcasting support

    How to run a program
    as another user in
    Windows

    Bush's second
    inaugural speech

    Jon Udells
    screencast of the
    life of a Wikipedia
    article

    Search TV with
    Google

    Nigerian Scam as
    comment spam

    Developing sites for
    users with Cognitive
    disabilities and
    learning
    difficulties

    The Register
    interviews a link
    spammer

    Blairs doodles turns
    out to be Gates
    doodles

    Guest Writer Simon
    Waldman: The
    Importance of Being
    Permanent

    After Trust Me
    Journalism Comes
    Openness: Rather
    Report Released

    A Short Letter to
    Dan Rather

    Bloggers Are Missing
    in Action as Ketchum
    Tests the Conscience
    of PR

    Ketchum and
    Bloggers: Who Said
    What? What Remains?
    Lisa Stone Reports.

    Bloggers vs.
    Journalists is Over

    Keystone Crisis
    Management at
    Ketchum: Check the
    Home Page

    A Few Key Moments at
    the Blogging,
    Journalism &
    Credibility
    Conference

    Big Wigs Confer,
    Part Three

    Big Wigs Confer,
    Part Two

    Big Wigs From the
    Blogging &
    Journalism
    Conference Say What
    They Found

    Will the Greensboro
    Newspaper Open Its
    Archive?

    A Jack Shafer
    Problem

    David Akin of CTV in
    Canada: It's Not
    the Blog. It's the
    Net.

    Creative Commons at
    Australia Launch

    Prix Ars Electronica
    Pictures 2004

    Wired on Open Source
    Biology

    CC Italy Launch
    Report

    Australia Launch and
    River Boat Cruise

    CC on the charts
    Featured Commoner:
    Magnatune

    A Great Scene From
    iCommons

    At Sundance
    Ready to Share (Pret
    a Partager)

    CBC Radio 3 Chooses
    CC as a Top Issue

    Rip. Sample. Mash.
    Countdown.

    Read. Listen. Watch.
    Buckman.

    World Live (and
    Licensed) Web

    O'Reilly Licensing
    Book, Licensed

    Float Nightmares
    High schoolers on
    free speech

    Eyes on the Screen
    torrent mirror

    Reviews for Make
    Last five tickets to
    E Coast hacker con
    auctioned for EFF

    Soccer Mom Metadata
    VW car-bomber ad:
    dispute about more
    than copyright?

    Everything's coming
    up Gandhi

    Spanish-speaking
    bloggers blogging in
    English: an
    aggregator

    Web Zen: Rock Star
    Zen

    Services that insert
    sounds into mobile
    phone conversations

    Claire Robertson's
    stuffed animals

    Pramulator: Bloblike
    baby carriage

    what is grok?