<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Google Content &#8220;Theft&#8221; Follow Up</title>
	<atom:link href="http://feedblog.org/2006/12/11/google-content-theft-follow-up/feed/" rel="self" type="application/rss+xml" />
	<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/</link>
	<description>You may say I'm a dreamer, but I'm not the only one.</description>
	<lastBuildDate>Thu, 04 Mar 2010 00:34:59 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Marius</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1507</link>
		<dc:creator>Marius</dc:creator>
		<pubDate>Fri, 16 Feb 2007 03:12:37 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1507</guid>
		<description>Hi Kevin,

Check this out and let me know what you think:
http://blog.myfeedz.com/2007/02/16/myfeedz-released-on-adobe-labs/

However, it will take a while before search engines revisit the website and reindex content.

All the best,
Marius
</description>
		<content:encoded><![CDATA[<p>Hi Kevin,</p>
<p>Check this out and let me know what you think:<br />
<a href="http://blog.myfeedz.com/2007/02/16/myfeedz-released-on-adobe-labs/" rel="nofollow">http://blog.myfeedz.com/2007/02/16/myfeedz-released-on-adobe-labs/</a></p>
<p>However, it will take a while before search engines revisit the website and reindex content.</p>
<p>All the best,<br />
Marius</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fred Zelders</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1506</link>
		<dc:creator>Fred Zelders</dc:creator>
		<pubDate>Fri, 09 Feb 2007 11:52:35 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1506</guid>
		<description>Hi Kevin,

Feeds4all aggregates content from feeds (RSS, ATOM etc) and makes this content available through a directory or through search. So content is never &#039;grabbed&#039;  from the blog or web site it self. 

Feeds4all does more than replicating that feed content. It adds value to each feed and each article by serving links to related feeds and articles thus guiding visitors to more/similar information they are looking for.

This works both ways: Once YOUR feed/article is found, Feeds4all serves not only a lead to your article and site but also related feeds and articles. When other feeds/articles are found that are related to the content of your site, Feeds4all serves the feed and articles on your site.

By the way, served articles are always accompanied with information of the source (1 - the Article URL and a hyperlink leading to the full article on the actual blog/site. 2 - The name of the blog/site and a hyperlink to the home page of the blog/site) and lists of articles from a particular feed are completed with name of and hyperlink to the originating site. The feed url is available too.
Each feed page contains the statement: &quot;The content of this feed is property of the original publisher.&quot; and each article/item page contains the statement: &quot;The content of this article is property of the original publisher&quot;.

I hope you will reconsider your qualification for Feeds4all as &#039;a troublemaker&#039;.

Regards,

Fred</description>
		<content:encoded><![CDATA[<p>Hi Kevin,</p>
<p>Feeds4all aggregates content from feeds (RSS, ATOM etc) and makes this content available through a directory or through search. So content is never &#8216;grabbed&#8217;  from the blog or web site it self. </p>
<p>Feeds4all does more than replicating that feed content. It adds value to each feed and each article by serving links to related feeds and articles thus guiding visitors to more/similar information they are looking for.</p>
<p>This works both ways: Once YOUR feed/article is found, Feeds4all serves not only a lead to your article and site but also related feeds and articles. When other feeds/articles are found that are related to the content of your site, Feeds4all serves the feed and articles on your site.</p>
<p>By the way, served articles are always accompanied with information of the source (1 &#8211; the Article URL and a hyperlink leading to the full article on the actual blog/site. 2 &#8211; The name of the blog/site and a hyperlink to the home page of the blog/site) and lists of articles from a particular feed are completed with name of and hyperlink to the originating site. The feed url is available too.<br />
Each feed page contains the statement: &#8220;The content of this feed is property of the original publisher.&#8221; and each article/item page contains the statement: &#8220;The content of this article is property of the original publisher&#8221;.</p>
<p>I hope you will reconsider your qualification for Feeds4all as &#8216;a troublemaker&#8217;.</p>
<p>Regards,</p>
<p>Fred</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stefan Svartling</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1505</link>
		<dc:creator>Stefan Svartling</dc:creator>
		<pubDate>Sat, 16 Dec 2006 08:55:15 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1505</guid>
		<description>Aggregating full content is wrong, but aggregating a snippet of text, just like Google and all other search engines do is not.
It&#039;s like quoting someone on your own blog.
What&#039;s wrong with that?
Even if you get better rank in search results?

As I said, if you get linked, you&#039;ll get better pagerank in the end.</description>
		<content:encoded><![CDATA[<p>Aggregating full content is wrong, but aggregating a snippet of text, just like Google and all other search engines do is not.<br />
It&#8217;s like quoting someone on your own blog.<br />
What&#8217;s wrong with that?<br />
Even if you get better rank in search results?</p>
<p>As I said, if you get linked, you&#8217;ll get better pagerank in the end.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Muskie</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1504</link>
		<dc:creator>Muskie</dc:creator>
		<pubDate>Fri, 15 Dec 2006 16:50:06 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1504</guid>
		<description>I forget where, possibly Jakob Nielsen possibly some other &quot;expert&quot; said you should allow people to have your entire post via RSS.  As a blog reader I like that better.  I can read them offline, they can download in the background automagically.

However the author should have ownership and control over his work.  Before this whole blogosphere, people stole content from my site without permission and put it up elsewhere.  But now with RSS feeds and scraper sites my words are ending up all over the internet.

I catch some of the scrapers because I have non-blog content I link to and they keep those links in.  I catch them by having a Google Alert for link:www.mydomain.whatever  You could possibly do something similar with a Technorati alert that would just target the blogosphere, Google added an option to just search blogs with Google Alerts I believe.

Knowing that people are scraping my content and perverting it, they usually chomp just around the keyword they are trying to optimize for, hasn&#039;t led to me doing much to stop them though.  I have done some SEO to my site, I&#039;ve even gone a little overboard for some stupid keywords and phrases which hasn&#039;t helped in all likelihood.

Creating a Google Alert for each permalink would be a bit much but you could make one for just the feedblog.org portion.  You could put in the year and even the month and that would limit the number of results returned, but honestly it shouldn&#039;t be a big problem.

If you subscribe to the Google PR, the algorithm will eventually sort itself out, but in the mean time it is stupid not to rank first for something you wrote, but it happens...  I&#039;ve seen where a story has been reblogged and the later blogger has more PR and his quoted version outranks the original.  So this isn&#039;t just RSS aggregators perhaps, but a sympton of Google&#039;s overall algorithm.</description>
		<content:encoded><![CDATA[<p>I forget where, possibly Jakob Nielsen possibly some other &#8220;expert&#8221; said you should allow people to have your entire post via RSS.  As a blog reader I like that better.  I can read them offline, they can download in the background automagically.</p>
<p>However the author should have ownership and control over his work.  Before this whole blogosphere, people stole content from my site without permission and put it up elsewhere.  But now with RSS feeds and scraper sites my words are ending up all over the internet.</p>
<p>I catch some of the scrapers because I have non-blog content I link to and they keep those links in.  I catch them by having a Google Alert for link:www.mydomain.whatever  You could possibly do something similar with a Technorati alert that would just target the blogosphere, Google added an option to just search blogs with Google Alerts I believe.</p>
<p>Knowing that people are scraping my content and perverting it, they usually chomp just around the keyword they are trying to optimize for, hasn&#8217;t led to me doing much to stop them though.  I have done some SEO to my site, I&#8217;ve even gone a little overboard for some stupid keywords and phrases which hasn&#8217;t helped in all likelihood.</p>
<p>Creating a Google Alert for each permalink would be a bit much but you could make one for just the feedblog.org portion.  You could put in the year and even the month and that would limit the number of results returned, but honestly it shouldn&#8217;t be a big problem.</p>
<p>If you subscribe to the Google PR, the algorithm will eventually sort itself out, but in the mean time it is stupid not to rank first for something you wrote, but it happens&#8230;  I&#8217;ve seen where a story has been reblogged and the later blogger has more PR and his quoted version outranks the original.  So this isn&#8217;t just RSS aggregators perhaps, but a sympton of Google&#8217;s overall algorithm.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stefan svartling</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1503</link>
		<dc:creator>stefan svartling</dc:creator>
		<pubDate>Fri, 15 Dec 2006 14:17:03 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1503</guid>
		<description>I think informatory help you get a better pagerank, because I link back to BOTH your startpage and the articles permalink.
That way you get better pagerank.</description>
		<content:encoded><![CDATA[<p>I think informatory help you get a better pagerank, because I link back to BOTH your startpage and the articles permalink.<br />
That way you get better pagerank.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Burton</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1502</link>
		<dc:creator>Kevin Burton</dc:creator>
		<pubDate>Wed, 13 Dec 2006 12:34:50 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1502</guid>
		<description>Hey Marius.  Thanks for the response.

&gt; We surely don&#039;t want to be seen as troublemakers in the blogosphere. I should
&gt; make it clear that we aggregate whatever content we get from people&#039;s RSS
&gt; feeds. (some choose to publish the entire posts, while others just publish a
&gt; short description of the article). Also, at all times, we do keep the original
&gt; URL of the post and the source feed and make them visible on our website.

Great.... yeah.  I really want to publish my full-text RSS feed and this problem
really started to only now become obvious.

&gt; However, if I correctly understood the full argument you&#039;re trying to make, we
&gt; are hurting your page rank. Is this right?

Yup.

&gt; But isn&#039;t this something all aggregators do? 

Rojo didn&#039;t earlier on.  We had a robots.txt which blocked most of the
site. Half of this was to protect publishers from having their content indexed.

I left the company more than a year ago and I think the robots.txt went up since
then.  I still need to contact them about this but it might be redundant (see
below).

&gt; I mean, once an author decides to syndicate his articles, isn&#039;t this one of
&gt; the risks he must cope with? I don&#039;t think it can even be seen as a
&gt; risk. Rather, it&#039;s something intrinsic to the whole concept of syndication.

Maybe.... but I&#039;d like to avoid it altogether if possible.

What I&#039;m going to play with is tuning my blog for better SEO so that it can
index the permalinks better.  Then I&#039;m going to make sure that each full-content
feed includes a link back to the original permalink.  I think it would then be
possible for Google to figure out that my blog is the authoritative source.

Might fix the whole problem but right now I&#039;m not sure.

&gt; Don&#039;t get me wrong, I&#039;m not advocating Google scams or copyright infringements
&gt; here. Are you saying we shouldn&#039;t be publishing the entire content we get from
&gt; the feed? Or just that we shouldn&#039;t let Google re-index this content?

I think ideally that the content should be published by your site but either
blocked by robots.txt or a noindex meta tag.  This way you can build your
aggregator and I can blog without having to worry about people finding my
content on Google.

I think the issue is that EVERY post is penalized so it just pushes my blog
farther down the food chain. :-/

Kevin</description>
		<content:encoded><![CDATA[<p>Hey Marius.  Thanks for the response.</p>
<p>&gt; We surely don&#8217;t want to be seen as troublemakers in the blogosphere. I should<br />
&gt; make it clear that we aggregate whatever content we get from people&#8217;s RSS<br />
&gt; feeds. (some choose to publish the entire posts, while others just publish a<br />
&gt; short description of the article). Also, at all times, we do keep the original<br />
&gt; URL of the post and the source feed and make them visible on our website.</p>
<p>Great&#8230;. yeah.  I really want to publish my full-text RSS feed and this problem<br />
really started to only now become obvious.</p>
<p>&gt; However, if I correctly understood the full argument you&#8217;re trying to make, we<br />
&gt; are hurting your page rank. Is this right?</p>
<p>Yup.</p>
<p>&gt; But isn&#8217;t this something all aggregators do? </p>
<p>Rojo didn&#8217;t earlier on.  We had a robots.txt which blocked most of the<br />
site. Half of this was to protect publishers from having their content indexed.</p>
<p>I left the company more than a year ago and I think the robots.txt went up since<br />
then.  I still need to contact them about this but it might be redundant (see<br />
below).</p>
<p>&gt; I mean, once an author decides to syndicate his articles, isn&#8217;t this one of<br />
&gt; the risks he must cope with? I don&#8217;t think it can even be seen as a<br />
&gt; risk. Rather, it&#8217;s something intrinsic to the whole concept of syndication.</p>
<p>Maybe&#8230;. but I&#8217;d like to avoid it altogether if possible.</p>
<p>What I&#8217;m going to play with is tuning my blog for better SEO so that it can<br />
index the permalinks better.  Then I&#8217;m going to make sure that each full-content<br />
feed includes a link back to the original permalink.  I think it would then be<br />
possible for Google to figure out that my blog is the authoritative source.</p>
<p>Might fix the whole problem but right now I&#8217;m not sure.</p>
<p>&gt; Don&#8217;t get me wrong, I&#8217;m not advocating Google scams or copyright infringements<br />
&gt; here. Are you saying we shouldn&#8217;t be publishing the entire content we get from<br />
&gt; the feed? Or just that we shouldn&#8217;t let Google re-index this content?</p>
<p>I think ideally that the content should be published by your site but either<br />
blocked by robots.txt or a noindex meta tag.  This way you can build your<br />
aggregator and I can blog without having to worry about people finding my<br />
content on Google.</p>
<p>I think the issue is that EVERY post is penalized so it just pushes my blog<br />
farther down the food chain. :-/</p>
<p>Kevin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marius</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1501</link>
		<dc:creator>Marius</dc:creator>
		<pubDate>Wed, 13 Dec 2006 01:05:36 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1501</guid>
		<description>Hi Kevin,

We surely don&#039;t want to be seen as troublemakers in the blogosphere. I should make it clear that we aggregate whatever content we get from people&#039;s RSS feeds. (some choose to publish the entire posts, while others just publish a short description of the article). Also, at all times, we do keep the original URL of the post and the source feed and make them visible on our website.

However, if I correctly understood the full argument you&#039;re trying to make, we are hurting your page rank. Is this right? But isn&#039;t this something all aggregators do? I mean, once an author decides to syndicate his articles, isn&#039;t this one of the risks he must cope with? I don&#039;t think it can even be seen as a risk. Rather, it&#039;s something intrinsic to the whole concept of syndication.

Don&#039;t get me wrong, I&#039;m not advocating Google scams or copyright infringements here. Are you saying we shouldn&#039;t be publishing the entire content we get from the feed? Or just that we shouldn&#039;t let Google re-index this content?

All the best,
Marius Zaharia, http://www.myFeedz.com</description>
		<content:encoded><![CDATA[<p>Hi Kevin,</p>
<p>We surely don&#8217;t want to be seen as troublemakers in the blogosphere. I should make it clear that we aggregate whatever content we get from people&#8217;s RSS feeds. (some choose to publish the entire posts, while others just publish a short description of the article). Also, at all times, we do keep the original URL of the post and the source feed and make them visible on our website.</p>
<p>However, if I correctly understood the full argument you&#8217;re trying to make, we are hurting your page rank. Is this right? But isn&#8217;t this something all aggregators do? I mean, once an author decides to syndicate his articles, isn&#8217;t this one of the risks he must cope with? I don&#8217;t think it can even be seen as a risk. Rather, it&#8217;s something intrinsic to the whole concept of syndication.</p>
<p>Don&#8217;t get me wrong, I&#8217;m not advocating Google scams or copyright infringements here. Are you saying we shouldn&#8217;t be publishing the entire content we get from the feed? Or just that we shouldn&#8217;t let Google re-index this content?</p>
<p>All the best,<br />
Marius Zaharia, <a href="http://www.myFeedz.com" rel="nofollow">http://www.myFeedz.com</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Greg G.</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1500</link>
		<dc:creator>Greg G.</dc:creator>
		<pubDate>Mon, 11 Dec 2006 12:07:34 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1500</guid>
		<description>Very interesting.  It looks like Yahoo and MSN both do a better job of identifying your blog as the original source.

Regarding noindex for feeds, there has been some attempts at standardization for this in the past; see http://www.rassoc.com/gregr/weblog/archive.aspx?post=791 and http://www.bloglines.com/about/specs/fac-1.0 .  I dont think there&#039;s been much in the way of adoption though.</description>
		<content:encoded><![CDATA[<p>Very interesting.  It looks like Yahoo and MSN both do a better job of identifying your blog as the original source.</p>
<p>Regarding noindex for feeds, there has been some attempts at standardization for this in the past; see <a href="http://www.rassoc.com/gregr/weblog/archive.aspx?post=791" rel="nofollow">http://www.rassoc.com/gregr/weblog/archive.aspx?post=791</a> and <a href="http://www.bloglines.com/about/specs/fac-1.0" rel="nofollow">http://www.bloglines.com/about/specs/fac-1.0</a> .  I dont think there&#8217;s been much in the way of adoption though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stefan svartling</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1499</link>
		<dc:creator>stefan svartling</dc:creator>
		<pubDate>Mon, 11 Dec 2006 11:33:25 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1499</guid>
		<description>Hi.
informatory only aggregates upto 300 characters of text and always publish BOTH the source link and the link to the homepage of the source.
That must be Ok i think.

Stefan Svartling / informatory</description>
		<content:encoded><![CDATA[<p>Hi.<br />
informatory only aggregates upto 300 characters of text and always publish BOTH the source link and the link to the homepage of the source.<br />
That must be Ok i think.</p>
<p>Stefan Svartling / informatory</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stefan svartling</title>
		<link>http://feedblog.org/2006/12/11/google-content-theft-follow-up/#comment-1498</link>
		<dc:creator>stefan svartling</dc:creator>
		<pubDate>Mon, 11 Dec 2006 11:32:42 +0000</pubDate>
		<guid isPermaLink="false">http://burtonator.wordpress.com/2006/12/11/google-content-theft-follow-up/#comment-1498</guid>
		<description>Hi.
informatory only aggregates upto 300 characters of text and always publish BOTH the source link and the link to the homepage of the source.
That must be Ok i think.

Stefan Svartling / informatory</description>
		<content:encoded><![CDATA[<p>Hi.<br />
informatory only aggregates upto 300 characters of text and always publish BOTH the source link and the link to the homepage of the source.<br />
That must be Ok i think.</p>
<p>Stefan Svartling / informatory</p>
]]></content:encoded>
	</item>
</channel>
</rss>
