The best part about WordPress is plugins. Theme designers also have a tendency of using WordPress plugins for adding cool features. When working on a custom WordPress theme project for a client, one thing you want to make sure is follow the best practice. Yesterday, we got an inquiry from one of our users whose site was broken. He had a custom design done by someone. The theme worked great until he changed hosts. His single posts would not load after post content. The comments area, sidebar, footer, nothing will load. It will simply output the error. We went in and fixed the issue for them. The issue was that his theme designer did not follow the best practice. He had a very popular plugin “User Photo” which allows you to add user’s photo at the bottom of the post, except he was just adding the function directly. Now this would work unless ofcourse the plugin gets disabled for some reason. When this client switched hosts, apparently the GD Library was not installed in the new server. This caused the plugin to deactivate. You also could not re-activate the plugin because it relies on the GD Library. This caused the site to be broken for all users. This obviously gets that theme designer a bad reputation because when the client asked what was the issue, we explained to them. If you are a theme designer adding plugin outputs, you should always follow the best practice. Here are some examples of how we do things with our clients:

For the popular User Photo plugin, we have it like this:

<?php
if( function_exists('userphoto') && userphoto_exists($author->ID))
    userphoto($author->ID);
else
    echo get_avatar($author->ID);
?>

The code above checks for two things. It checks if the ‘userphoto’ function exists (basically if the plugin is active). The second check is to see if the user photo exists for the specific author. If both checks return true, then we display the userphoto. Otherwise we simply have it displaying the user’s Gravatar.

For other plugins, which we do not have a substitute for, we always add a little note. For example when we add OIO Publisher output:

<?php if(function_exists('oiopub_banner_zone')) {
oiopub_banner_zone(1, 'center');
} else {
echo 'OIO Zone 1 does not exist. Check to see if this plugin is active.';
}
 ?>

The code above basically see if OIO Publisher Banner Zone function exists (which it will unless the plugin is deactivated). If it does exist, then it will output the banner. If it doesn’t exist, then it shows the text to let the site owner know that this plugin has been deactivated.

If you do not add the function_exists, then your site will return an error where the function fails. Theme designers please start doing this.

Best Practice: Check if Function Exists When Adding in WordPress Theme is a post from: WPBeginner which is not allowed to be copied on other sites.

I’m happy to present episode number two of the WPCandy Roundtable Podcast, this time with the BuddyPress Core Team of John James Jacoby, Boone Gorges, and Paul Gibbs. They spoke for just over an hour about issues of interest to them within the BuddyPress community, and where things are going in the near future.

This episode is sponsored by the upcoming WordPress service Raft.io and the Typecase plugin by UpThemes.

The gentleman also wanted me to say that if there were further questions you had about BuddyPress after listening, you’re welcome to leave them here and they will stop by and have a look at them.

You just finished reading WPCandy Roundtable #2: JJJ, Boone Gorges, Paul Gibbs on BuddyPress on WPCandy. Please consider leaving a comment!

For the third year now I’m over in Memphis for the World Championship of BBQ, joined by Otto, Nacin, Scott, and Rose. Last year due to flooding the festival was moved to a fairgrounds inland, but there’s nothing quite like being right on the Mississippi with the sweet aroma of pork all around you. (An aroma that, incidentally, follows you home in your clothes. :) ) The team we sponsor, the Moody Ques, put together an impressive booth this year, which you can see coming up in the below timelapse:

The video doesn’t do justice to the delicious food being cooked inside, though, which you have to experience in person.

Mark Jaquith writes How I built “Have Baby. Need Stuff!” — a nice overview of the latest and greatest in modern WP development.

One of the things that Zain Balkhi complained about when we were designing our current WPBeginner design was that the WordPress admin menu doesn’t float. Which is why in our current design, we made sure that our menu scrolled with the user. Recently we got an email from Till Kruess notifying us about a plugin that he has created which allows you to float your WordPress admin menu.

First thing you need to do is install and activate the Floating Admin Menu plugin.

The plugin will activate itself. If the viewport (window size) is taller than the admin menu itself, then the WordPress admin menu will float. This will save you time from scrolling up to access the menu when you are on a long admin page. The floating can be disabled on a per user basis under profile settings.

Floating WordPress Admin Menu

It currently does not work on mobile devices and tablets.

To see the preview of this plugin and download it, visit the official repository page of Floating Admin Menu plugin.

How to Float Your WordPress Admin Menu to Save Time is a post from: WPBeginner which is not allowed to be copied on other sites.

Joe Flood pitches the advantages of using WordPress and assorted plugins as your Intranet, in place of more bloated systems:

By using WordPress, you turn “add this information to the Intranet” from a frustrating task into something as simple as blogging. And just think how good your Intranet could be if people actually wanted to contribute to it.

You just finished reading WordPress in place of Sharepoint on WPCandy. Please consider leaving a comment!

This episode of Aftertaste followed WP Late Night #9 and saw the crew discussing potential future shows (particularly for the WPCandy Roundtable Podcast), and how we want to see questions sent in to the WP Late Night show.

Oh, and we chose the “Pewed a J” episode title, of course.

You just finished reading Aftertaste #19: After WP Late Night 9 on WPCandy. Please consider leaving a comment!

If you write original content day in and day out, you already are aware of the fact that your posts will end up on bunch of SPAM sites within a few days sometimes even few minutes. Some users even noted that the site with stolen content outranked the original post. It is very frustrating as a website owner to see that someone is stealing your content without permission, monetizing it, outranking you in SERPs, and stealing your audience. Content Scraping is a huge problem these days considering that it is so easy for someone to steal your content. In this article, we will cover what is blog content scraping, how to catch content scrapers, how to deal with content scrapers, how you can reduce and prevent content scraping, how to take advantage of content scraping, how to make money from content scrapers, and is content scraping ever good?

What is Blog Content Scraping?

Blog content scraping is an act usually performed with scripts that extract content from numerous sources and pulls it into one site. It is so easy now that anyone can install a WordPress site, put a free or commercial theme, and install a few plugins that will go and scrape content from selected blogs, so it can be published on their site.

Why are they Stealing my Content?

Some of our users have asked us why are they stealing my content? The simple answer is because you are AWESOME. The truth is that these content scrapers have ulterior motives. Below are just few reasons why someone would scrape your content:

  • Affiliate commission – There are some dirty affiliate marketers out there that just wants to exploit the system to make few extra bucks. They will use your content and other’s content to bring traffic to their site through search engine. These sites are usually targeted towards a specific niche, so they have related products that they are promoting.
  • Lead Generation – Often we see lawyers and realtors doing this. They want to seem like industry leaders in their small communities. They do not have the bandwidth to produce quality content, so they go out and scrape content from other sources. Sometimes, they are not even aware of this because they are paying some scumbag $30/month to add content and help them get better SEO. We have encountered quite a few of these in the past.
  • Advertising Revenue – Some folks just want to create a “hub” of knowledge. A one-stop-shop for users in a specific niche. If I had a penny for every time someone has done this with our content, then we would have a few hundred pennies. Often we notice that our site content is being scraped. The scraper always replies, I was doing this for the good of the community. Except the site is plastered with ads.

These are just a few reasons why someone would steal your content.

How to Catch Content Scrapers?

Catching content scrapers is a tedious task and can take up a lot of time. The are few ways that you can utilize to catch content scrapers.

Search Google with Your Post Titles

Yup that is as painful as it sounds. This method is probably not worth it specially if you are writing about a very popular topic.

Trackbacks

If you add internal links in your posts, you will notice a trackback if a site steals your content. This way is pretty much the scraper telling you that they are scraping your content. If you are using Akismet, then a lot of these trackbacks will show up in the SPAM folder. Again, this will only work if you have internal links in your posts.

Webmaster Tools

If you use google webmaster tools, then you are probably aware of the Links to your site page. If you look under “Traffic”, you will see a page that says Links to your site. Chances are your scrapers will be among the top ones there. They will have hundreds if not thousands of links to your pages (considering that you have internal links).

Links to Your Site - Google Webmaster Tools

FeedBurner Uncommon Uses

If you have setup Feedburner for your WordPress blog, then you can see some uncommon uses. In the Analyze Tab under Feed Stats, you will see “Uncommon Uses”. There you will see a list of sites.

FeedBurner Uncommon Uses

How to Deal with Content Scrapers

There are few approaches that people take when dealing with content scrapers. The Do Nothing Approach, Kill them all approach, Take Advantage of them approach.

The Do Nothing Approach

This is by far the easiest approach you can take. Usually the most popular bloggers would recommend this because it takes A LOT of time fighting the scrapers. This approach simply recommends that “instead of fighting them, spend your time producing even more quality content and having fun”. Now obviously if it is a well-known blog like Smashing Magazine, CSS-Tricks, Problogger, or others, then they do not have to worry about it. They are authority sites in Google’s eyes.

However during the Panda Update, we know some good sites got flagged as scrapers because google thought their scrapers were original content. So this approach is not always the best in our opinion.

Kill them all Approach

The exact opposite of the “Do Nothing Approach”. In this approach, you simply contact the scraper and ask them to take the content down. If they refuse to do so or simply do not reply to your requests, then you file a DMCA (Digital Millennium Copyright Act) with their host. In our experience, majority of the scraping websites do not have a contact form available. If they do, then utilize it. If they do not have the contact form, then you need to do a Whois Lookup.

Whois Lookup

You can see the contact info on the administrative contact. Usually the administrative, and technical contact is the same. The whois also shows the domain registrar. Most well-known web hosting companies and domain registrars have DMCA forms or emails. You can see that this specific person is with Hostgator because of their nameservers. HostGator has a form for DMCA complaints. If the nameserver is something like ns1.theirdomain.com, then you have to dig deeper by doing reverse IP lookups and searching for IPs.

You can also use a third party service for DMCA.com for takedowns.

Jeff Starr in his article suggest that you should block the bad guy’s IPs. Access your logs for their IP address, and then block it with something like this in your root .htaccess file:

Deny from 123.456.789

You can also redirect them to a dummy feed by doing something like this:

RewriteCond %{REMOTE_ADDR} 123\.456\.789\.
RewriteRule .* http://dummyfeed.com/feed [R,L]

You can get really creative here as Jeff suggests. Send them to really large text feeds full with Lorem Ipsum. You can send them some disgusting images of bad things. You can also send them right back to their own server causing an infinite loop which will crash their site.

The last approach that we take is to take Advantage of them.

How to Take Advantage of Content Scrapers

This is our approach of dealing with content scrapers, and it turns out quite well. It helps our SEO as well as help us make extra bucks. Majority of the scrapers use your RSS Feed to steal your content. So these are some of the things that you can do:

  • Internal Linking – You need to interlink the CRAP out of your posts. With the Internal Linking Feature in WordPress 3.1, it is now easier than ever. When you have internal links in your article, it helps you increase pageviews and reduce bounce rate on your own site. Secondly, it gets you backlinks from the people who are stealing your content. Lastly, it allows you to steal their audience. If you are a talented blogger, then you understand the art of internal linking. You have to place your links on interesting keywords. Make it tempting for the user to click it. If you do that, then the scraper’s audience will too click on it. Just like that, you took a visitor from their site and brought them back to where they should have been in the first place.
  • Auto Link Keywords with Affiliate Links – There are few plugins like Ninja Affiliate and SEO Smart Links that will automatically replace assigned keywords with affiliate links. For example: HostGator, StudioPress, MaxCDN, Gravity Forms << These all will be auto-replaced with affiliate links when this post goes live.
  • Get Creative with RSS Footer – You can either use the RSS Footer or WordPress SEO by Yoast Plugin to add custom items to your RSS Footer. You can add just about anything you want here. We know some people who like to promote their own products to their RSS readers. So they will add banners. Guess what, now those banners will appear on these scraper’s website as well. In our case, we always add a little disclaimer at the bottom of our posts in our RSS feeds. It simply reads like “How to Put Your WordPress Site in Read Only State for Site Migrations and Maintenance is a post from: WPBeginner which is not allowed to be copied on other sites.” By doing this, we get a backlink to the original article from scraper’s site which lets google and other search engines know we are authority. It also lets their users know that the site is stealing our content. If you are good with codes, then you can totally get nuts. Such as adding related posts just for your RSS readers, and bunch of other stuff. Check out our guide to completely manipulating your WordPress RSS feed.

How You Can Reduce Blog Content Scraping and Possibly Prevent It

Considering if you take our approach of lots of internal linking, adding affiliate links, rss banners and such chances are that you will reduce content scraping to good measure. If you take Jeff Starr’s suggestion of redirecting content scrapers, that too will stop those scrapers. Aside from what we have shared above, there are a few other tricks that you can use.

Full vs. Summary RSS Feed

There has been a debate in the blogging community whether to have full RSS feed or summary RSS feed. We are not going to go into much details about that debate, however one of the PROS of having a Summary Only RSS feed is that you prevent content scraping. You can change the settings by going to your WordPress admin panel and going under Settings » Reading. Then change the setting For each article in a feed show: Summary.

Note: We have full feed because we care more about our RSS readers than the spammers.

Trackback SPAM

Trackbacks and Pingbacks definitely had great uses however, they are now constantly being abused. Often themes display trackbacks and pingbacks under or among the comments. This gives the spammer an incentive to scrape your site and send trackbacks. If you mistakenly approves it, then they get a backlink and mention from your site. Here is how you can disable Trackbacks on all future posts. Here is an article that will show you how to disable trackbacks and pings on existing WordPress posts as well.

Is Content Scraping Ever Good?

It can be. If you see that you are making money from the scraper’s site, then sure it can be. If you see a lot of traffic from a scraper’s site, then it can be. In most cases however, it is not. You should always try to get your content taken off. But you will realize as your blog gets larger, it is almost impossible to keep track of all content scrapers. We still send out DMCA complaints, however we know that there are tons of other sites that are stealing our content that we just cannot keep up with.

What are your thoughts? Do you use any other mechanics to prevent content scraping? Would love to hear your thoughts.

Beginner’s Guide to Preventing Blog Content Scraping in WordPress is a post from: WPBeginner which is not allowed to be copied on other sites.

On this week’s episode of WP Late Night we discussed the WordPress Community Summit, security issues and WordPress plugins, and the newly launched WP App Store. And you know you want to listen in to find out where that episode title comes from, right?

This episode is sponsored by the Typecase plugin by UpThemes and the upcoming WordPress service Raft.io.

Episode #9 Show Notes

You just finished reading WP Late Night #9: “Pewed a J” on WPCandy. Please consider leaving a comment!

Chris Wallace at UpThemes has generously offered up 10 copies of his Typecase Pro plugin (usually $19) for giveaway, in addition to one grand prize of a year’s developer club membership at UpThemes (usually $99) and an UpThemes t-shirt.

To enter the giveaway, just leave a comment on this post or tweet at @wpcandy and @upthemes on Twitter and tell us your favorite font — bonus points for linking to an example of the font in use in a cool way. We’ll randomly select the winners from those who comment and tweet.

We’ll be collecting entries for the next week, and announcing the winner next Wednesday on the blog.

It’s been a while since we did a giveaway here, so this should be fun! Have at it in the comments.

You just finished reading Giveaway: Win a copy of Typecase Pro and a year’s dev club at UpThemes on WPCandy. Please consider leaving a comment!