<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>SearchEngineFan</title>
	<atom:link href="http://www.searchenginefan.com/feed" rel="self" type="application/rss+xml" />
	<link>http://www.searchenginefan.com</link>
	<description>The Search Engines Technology Blog</description>
	<pubDate>Sat, 14 Jan 2012 10:42:43 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
	<language>en</language>
			<item>
		<title>What&#8217;s the matter with America? Stop SOPA, Stop PIPA</title>
		<link>http://www.searchenginefan.com/editorial/whats-the-matter-with-america-stop-sopa-pipa-14</link>
		<comments>http://www.searchenginefan.com/editorial/whats-the-matter-with-america-stop-sopa-pipa-14#comments</comments>
		<pubDate>Sat, 14 Jan 2012 10:36:34 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[editorial]]></category>

		<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=14</guid>
		<description><![CDATA[Today I was alerted while browsing wordpress.org. I actually wanted to check out the latest WordPress plugins and developments when I noticed a button &#8220;HELP STOP SOPA/PIPA&#8221; together with a new blog post on this matter. SOPA stands euphemistically for &#8220;Stop Online Piracy Act&#8221;, PIPA for &#8220;Protect IP Act&#8221;. Both proposed acts are in my [...]]]></description>
			<content:encoded><![CDATA[<p>Today I was alerted while browsing wordpress.org. I actually wanted to check out the latest WordPress plugins and developments when I noticed a button &#8220;<a href="http://wordpress.org/news/2012/01/help-stop-sopa-pipa/">HELP STOP SOPA/PIPA</a>&#8221; together with a new blog post on this matter. SOPA stands euphemistically for &#8220;Stop Online Piracy Act&#8221;, PIPA for &#8220;Protect IP Act&#8221;. Both proposed acts are in my opinion attempts to regulate online freedom of speech in a way that seems absolutely un-American, even to me as a guy from Europe.</p>
<p>Still, the bills haven&#8217;t passed Congress yet. You can find more information about them and calls to action at the site <a href="http://americancensorship.org/">AmericanCensorship.org</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/editorial/whats-the-matter-with-america-stop-sopa-pipa-14/feed</wfw:commentRss>
		</item>
		<item>
		<title>Duplicate Linking (DL)</title>
		<link>http://www.searchenginefan.com/fun/duplicate-linking-dl-12</link>
		<comments>http://www.searchenginefan.com/fun/duplicate-linking-dl-12#comments</comments>
		<pubDate>Thu, 16 Apr 2009 11:06:42 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[DC]]></category>

		<category><![CDATA[DL]]></category>

		<category><![CDATA[Duplicate Content]]></category>

		<category><![CDATA[Duplicate Links]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=12</guid>
		<description><![CDATA[Duplicate Linking
Fun stuff from a German discussion board about search engine optimization (SEO)&#8230;
The case: a webmaster, say Joe, complained loudly about a competitor, say Alice, who would blatantly copy his key to success. Joe claims a lot of ingenuity, constantly thinking up new ingredients for his secret recipe, only to find it copied a few [...]]]></description>
			<content:encoded><![CDATA[<h4>Duplicate Linking</h4>
<p>Fun stuff from a German discussion board about search engine optimization (SEO)&#8230;</p>
<p>The case: a webmaster, say <em>Joe</em>, complained loudly about a competitor, say <em>Alice</em>, who would blatantly copy his key to success. Joe claims a lot of ingenuity, constantly thinking up new ingredients for his secret recipe, only to find it copied a few days later by Alice. That&#8217;s why Joe went public. He describes a general code of honor among SEO webmasters and explains in detail that Alice isn&#8217;t acting according to it. In his eyes, Alice should be punished, ideally by Google. So, you might wonder, what is it actually that Alice is copying?</p>
<p><span id="more-12"></span></p>
<h4>Copyright for backlink structures?</h4>
<p>What Joe is so concerned about is his <em>backlink structure</em>. He regards his backlinks as a kind of intellectual property that must not be duplicated. As you probably know, one of the key factors for success in the SEO business is acquiring quality web links that point to your projects. That&#8217;s why competitors watch these links closely. And if one of them finds a new source of links, the others will follow in an instant.</p>
<p>Poor Joe was not aware of this, which made a lot of webmasters chuckle. One of them, David, coined the new term &#8220;Duplicate Linking&#8221; as a corollary to the major problem known as Duplicate Content.</p>
<p>So you had better be careful. If you are trying to get the same backlinks as your competition, you might get angry mails from Joe for Duplicate Links. <img src='http://www.searchenginefan.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/fun/duplicate-linking-dl-12/feed</wfw:commentRss>
		</item>
		<item>
		<title>German SEO firms are courting&#8230;</title>
		<link>http://www.searchenginefan.com/search-engine-optimization/german-seo-firms-are-courting-11</link>
		<comments>http://www.searchenginefan.com/search-engine-optimization/german-seo-firms-are-courting-11#comments</comments>
		<pubDate>Sun, 12 Apr 2009 22:22:32 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[fairrank]]></category>

		<category><![CDATA[german seo]]></category>

		<category><![CDATA[seoline]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=11</guid>
		<description><![CDATA[&#8230; their customers. Kind of. Well, that&#8217;s what you&#8217;d expect. SEOs convincing their customers by rendering superior services. But it seems some of the companies have found a new leisure activity. Going to court, against blogs and online communities.
The SEOLINE Case
The first remarkable incident happened just a few weeks ago. A German SEO firm [...]]]></description>
			<content:encoded><![CDATA[<p>&#8230; their customers. Kind of. Well, that&#8217;s what you&#8217;d expect. SEOs convincing their customers by rendering superior services. But it seems some of the companies have found a new leisure activity. Going to court, against blogs and online communities.</p>
<h3>The SEOLINE Case</h3>
<p>The first remarkable incident happened just a few weeks ago. A German SEO firm seems to have been penalized by Google for presumably black hat activities. Too many Russian backlinks, as the rumors go. Some of the company&#8217;s customers might have been affected as well. At least somebody dropped a blog comment claiming to be a customer of the SEO company, saying that she had cancelled her business relationship because of inappropriate SEO conduct. The SEO company disapproved of the bad coverage and sent its lawyers to get the blog post removed. This action provoked a <em>tremendous</em> echo in the German SEO community. In the end&#8230;</p>
<p><span id="more-11"></span></p>
<p>&#8230;the SEO company called its lawyers back and is still paying for AdWords to be found in the SERPs when someone searches for the company&#8217;s name. The whole <a title="Seoline vs. Sistrx" href="http://www.sistrix.de/news/860-seoline-mahnt-sistrix-ab.html">Seoline Case</a> is still documented on the Sistrix blog.</p>
<h3>The FAIRRANK Case</h3>
<p>Now, a similar story seems to be on its way: the <a title="FairRank vs. Omtalk" href="http://www.thomasbindl.com/blog/index.php/fairrank-verklagt-thomas-bindl">Fairrank Case</a>. I&#8217;ll sum it up as it is reported on <em>Thomas Bindl&#8217;s</em> blog. The story reads like this:</p>
<p>Somebody posted on the online marketing bulletin board <a title="Online Marketing Talk" href="http://www.omtalk.com/">omtalk.com</a>, to the displeasure of the SEO and SEM firm Fairrank. You&#8217;d expect Fairrank to contact the board administrator to have the posts removed. And the company did. The board administrator kindly removed the posts and says he informed Fairrank about further technical measures to prevent similar bad posts in the future.</p>
<p>End of story? No. Instead of a friendly word of thanks, Fairrank sent lawyers. Thomas is supposed to sign a declaration that his board will never be a source of bad information about the company. But a bulletin board owner can&#8217;t possibly sign this. He can&#8217;t rule out malicious posts, just as he can&#8217;t rule out the board being abused by hackers or spammers. Signing the declaration would have meant closing his board. As Thomas isn&#8217;t willing to shut his community down, he is forced to have the matter settled in court.</p>
<p>What I&#8217;m asking myself is: why would a company fight a rather cooperative webmaster? If Thomas&#8217; description of the events is correct, a company that specializes in online marketing is endangering its own online reputation. And it hasn&#8217;t learned a lot from the Seoline case&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/search-engine-optimization/german-seo-firms-are-courting-11/feed</wfw:commentRss>
		</item>
		<item>
		<title>Fun Tribute To Matt Cutts</title>
		<link>http://www.searchenginefan.com/fun/fun-tribute-to-matt-cutts-10</link>
		<comments>http://www.searchenginefan.com/fun/fun-tribute-to-matt-cutts-10#comments</comments>
		<pubDate>Wed, 08 Apr 2009 00:58:16 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[matt cutts]]></category>

		<category><![CDATA[seo]]></category>

		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=10</guid>
		<description><![CDATA[A webmaster who hasn&#8217;t heard of Matt Cutts? That sounds like a case for Urban Myth Busters. Well, really, most webmasters will have heard of the nice Googler from the Search Quality Team. Matt blogs regularly about private and search engine issues. Very soon he became something like Google&#8217;s single [...]]]></description>
			<content:encoded><![CDATA[<p>A webmaster who hasn&#8217;t heard of Matt Cutts? That sounds like a case for Urban Myth Busters. Well, really, most webmasters will have heard of the nice Googler from the Search Quality Team. Matt blogs regularly about private and search engine issues. Very soon he became something like Google&#8217;s <em>single face to the customer</em>, with the webmaster community reading his lips. Some of them probably literally, since a) some webmasters or webmistresses will be able to lip-read and b) Matt started a series of video messages a while ago. These videos became so popular among webmasters and SEOs (search engine optimizers) that they even made it to top positions on YouTube.</p>
<p><span id="more-10"></span></p>
<p>So it&#8217;s high time to parody these videos, thought Martin Mißfeld (German artist and SEO), and created a humorous version of Matt&#8217;s video statements. Martin focuses on Matt&#8217;s dilemma: as a Googler he may not be too specific about internals, lest these be exploited, yet he wants to help webmasters with tips for better quality sites. Well, enough words, just enjoy:</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="344" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="src" value="http://www.youtube.com/v/QR1r2x8rcio&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;feature=player_embedded&amp;fs=1" /><embed type="application/x-shockwave-flash" width="425" height="344" src="http://www.youtube.com/v/QR1r2x8rcio&amp;color1=0xb1b1b1&amp;color2=0xcfcfcf&amp;feature=player_embedded&amp;fs=1" allowfullscreen="true"></embed></object></p>
<h3>Weblinks:</h3>
<ul>
<li><a href="http://www.tagseoblog.de/pagerank-backlinks-serps-matt-cutts-video-tribute-trash-fun">Tribute to Matt Cutts</a></li>
<li><a href="http://www.mattcutts.com/blog/type/movies/">Matt&#8217;s real videos</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/fun/fun-tribute-to-matt-cutts-10/feed</wfw:commentRss>
		</item>
		<item>
		<title>Download the Internet Index for free!</title>
		<link>http://www.searchenginefan.com/experimental-engines/download-the-internet-index-for-free-8</link>
		<comments>http://www.searchenginefan.com/experimental-engines/download-the-internet-index-for-free-8#comments</comments>
		<pubDate>Sun, 08 Mar 2009 07:10:44 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[Experimental Engines]]></category>

		<category><![CDATA[Dotbot]]></category>

		<category><![CDATA[dotnetdotcom.org]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=8</guid>
		<description><![CDATA[Well, kind of&#8230; dotnetdotcom.org aka Dotbot
You might have noticed a new little critter gnawing at your web server:
Mozilla/5.0 (compatible; DotBot/1.1; http://www.dotnetdotcom.org/, crawler@dotnetdotcom.org)
The few Seattle based guys (pseudonym on their web site) promise an index of the web available to everybody. They intend to release as much information about the web&#8217;s structure (linking) and content as possible. [...]]]></description>
			<content:encoded><![CDATA[<h4>Well, kind of&#8230; dotnetdotcom.org aka <em>Dotbot</em></h4>
<p>You might have noticed a new little critter gnawing at your web server:</p>
<p><code>Mozilla/5.0 (compatible; DotBot/1.1; http://www.dotnetdotcom.org/, crawler@dotnetdotcom.org)</code></p>
<p>The <em>few Seattle based guys</em> (the pseudonym on their web site) promise an index of the web available to everybody. They intend to release as much information about the web&#8217;s structure (linking) and content as possible. For a fee to cover their costs, though.</p>
<p><span id="more-8"></span></p>
<h4>Seomoz Linkscape</h4>
<p>The entity behind the pseudonym is actually the SEO company <em>Seomoz</em>. One of their best known products is their domain evaluation service <a href="http://www.seomoz.org/linkscape/help/sources" title="Linkscape's Data Sources">Linkscape</a>. Linkscape is building its own version of a web link graph, collecting and computing the relation of all web sites to each other. It is a nice service for webmasters but will provide quite a bit of information to your competition.</p>
<p>The best way to lock the Linkscape bot out of your site is an entry in your robots.txt control file. The file should start with the bots you want to lock out completely. For Linkscape you would have to include the following lines:</p>
<pre>User-agent: dotbot
Disallow: /</pre>
<p>Note that Linkscape promotes a different way of locking out their bot, namely including a &#8220;noindex&#8221; meta tag in the header of a web page. Alas, this variant will not prevent the dotbot from crawling your site and extracting its links. It will only prevent the robot from storing the content of your site for its search engine.</p>
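<p>If you want to double-check that the robots.txt rule above does what it should, here is a minimal Python sketch using the standard library module <code>urllib.robotparser</code>. The helper name <code>is_blocked</code>, the example URL and the second bot name are made up for illustration. The short token <code>DotBot/1.1</code> is used instead of the full user agent string because Python's parser only compares the part before the first slash. Keep in mind that this only tests the rule itself, not how the real bot behaves.</p>

```python
from urllib.robotparser import RobotFileParser

# The two lines from the robots.txt example in this post.
ROBOTS_TXT = """\
User-agent: dotbot
Disallow: /
"""


def is_blocked(robots_txt, user_agent, url):
    """Return True if robots_txt forbids user_agent to fetch url.

    is_blocked is a hypothetical helper name, not part of any library.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


# Hypothetical page URL on a site that uses the rule above.
PAGE = "http://www.example.com/page.html"

print(is_blocked(ROBOTS_TXT, "DotBot/1.1", PAGE))        # True
print(is_blocked(ROBOTS_TXT, "SomeOtherBot/1.0", PAGE))  # False
```

<p>To test your live file instead of a string, <code>RobotFileParser</code> also offers <code>set_url()</code> and <code>read()</code>, which fetch the robots.txt over the network.</p>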
<h4>The Dotbot Technology</h4>
<p>The guys joke about their tools: C and Python as programming languages, flat disk files instead of a database system, and some open source software. That&#8217;s an elaborate way of saying nothing, of course.</p>
<h4>Downloadable Index</h4>
<p>The current <em>dotbot</em> index is available for everybody to download. The index file consists of records with the structure <code>"URL-Without-Protocol NULL Optional-String-Not-Used NULL Complete-HTTP-Response NULL"</code>, with NULL as the zero byte. As one can see, this is a web dump rather than a searchable web index. The sorting, filtering, and indexing still have to follow. I wonder a bit why the protocol is omitted while the complete HTTP response is kept.</p>
<p>As of the end of January 2009, the index covers only a tiny fraction of the web. It comprises about 9 million pages, adding up to an index file size of 68 GB. You can find a download link on their site (weblink below).</p>
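<p>Reading the record layout described above could look roughly like the following Python sketch. It rests on a few assumptions: no zero byte occurs inside an HTTP response (binary bodies would break the splitting), every response starts with a regular status line, and the data fits into memory, so the full 68 GB file would need a streaming reader instead. The names <code>parse_dump</code>, <code>status_code</code> and <code>SAMPLE</code> are illustrative and not part of any Dotbot tooling.</p>

```python
def parse_dump(data):
    """Yield (url, unused, response) for each record in a dump.

    Assumed layout per record: URL NUL unused-string NUL response NUL.
    """
    fields = data.split(b"\x00")
    # A well-formed dump ends with NUL, so split() leaves one empty tail.
    if fields and fields[-1] == b"":
        fields.pop()
    if len(fields) % 3 != 0:
        raise ValueError("field count is not a multiple of 3")
    for i in range(0, len(fields), 3):
        url, unused, response = fields[i:i + 3]
        yield url.decode("ascii", "replace"), unused, response


def status_code(response):
    """Extract the numeric status from a raw HTTP response."""
    status_line = response.split(b"\r\n", 1)[0]
    # A status line looks like: HTTP/1.1 200 OK
    return int(status_line.split()[1].decode("ascii"))


# Two tiny hand-made records, not real data from the Dotbot index.
SAMPLE = (
    b"www.example.com/\x00\x00"
    b"HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n\r\nhello\x00"
    b"www.example2.com/\x00\x00"
    b"HTTP/1.1 404 Not Found\r\n\r\n\x00"
)

for url, _unused, response in parse_dump(SAMPLE):
    print(url, status_code(response))
# www.example.com/ 200
# www.example2.com/ 404
```

<p>Since the protocol is not stored, a consumer of the dump has to guess it when turning the URLs back into links.</p>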
<h4>Sample Dump</h4>
<p>The example consists of two URLs:</p>
<pre><code>	www.example.com/  HTTP/1.1 200 OK
	Date: Sat, 20 Sep 2008 15:43:15 GMT
	Server: Apache/2.0.52 (CentOS)
	X-Powered-By: PHP/4.3.9
	Content-Length: 557
	Connection: close
	Content-Type: text/html; charset=UTF-8			

	&lt;!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"&gt;
	&lt;html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"&gt;
	&lt;head&gt;
	&lt;meta http-equiv="Content-Type" content="text/html; charset=utf-8" /&gt;
	&lt;title&gt;I am an example.&lt;/title&gt;
	&lt;/head&gt;
	&lt;body&gt;
	...
	&lt;/body&gt;
	&lt;/html&gt; www.example2.com/  HTTP/1.1 200 OK
	Date: Sat, 20 Sep 2008 15:43:15 GMT
	Server: Apache/2.0.52 (CentOS)
	X-Powered-By: PHP/4.3.9
	Content-Length: 557
	Connection: close
	Content-Type: text/html; charset=UTF-8			

	&lt;!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"&gt;
	&lt;html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"&gt;
	&lt;head&gt;
	&lt;meta http-equiv="Content-Type" content="text/html; charset=utf-8" /&gt;
	&lt;title&gt;I am a different example.&lt;/title&gt;
	&lt;/head&gt;
	&lt;body&gt;
	...
	&lt;/body&gt;
	&lt;/html&gt;
</code></pre>
<h4>Weblinks</h4>
<ul>
<li><a href="http://www.dotnetdotcom.org">Dotbot</a>, including a link to their Index (66 GB, torrent)</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/experimental-engines/download-the-internet-index-for-free-8/feed</wfw:commentRss>
		</item>
		<item>
		<title>Berlin&#8217;s crime - A google maps mashup</title>
		<link>http://www.searchenginefan.com/mashup/berlins-crime-a-google-maps-mashup-7</link>
		<comments>http://www.searchenginefan.com/mashup/berlins-crime-a-google-maps-mashup-7#comments</comments>
		<pubDate>Fri, 06 Mar 2009 20:54:28 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[Mashup]]></category>

		<category><![CDATA[google]]></category>

		<category><![CDATA[google maps]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=7</guid>
		<description><![CDATA[The main occupations of tabloid newspapers around the world are quite likely reports on gossip and reports on crime. So the tabloid reporters are experts on these matters. The German tabloid &#8220;Berliner Kurier&#8221; (Berlin&#8217;s courier) is demonstrating its skills with a brand new Google Maps mashup, the &#8220;blaulicht-kurier&#8221; (flashing blue light courier).

5000 crime incidents of [...]]]></description>
			<content:encoded><![CDATA[<p>The main occupations of tabloid newspapers around the world are quite likely reports on gossip and reports on crime. So the tabloid reporters are experts on these matters. The German tabloid <i>&#8220;Berliner Kurier&#8221;</i> (Berlin&#8217;s courier) is demonstrating its skills with a brand new Google Maps mashup, the <i>&#8220;blaulicht-kurier&#8221;</i> (flashing blue light courier).</p>
<h4>5000 crime incidents from recent months and years</h4>
<p>The <i>blaulicht-kurier</i> shows the locations of crimes in Berlin and the region of Brandenburg. By default you will see descriptive icons over the crime locations of the last two weeks. If you click on an icon, you get the crime report.</p>
<p>If you want to see older crimes or if you are looking for crimes at a particular location, you can specify a time period or a location. The database consists of about 5000 incidents from recent years. The latest entries are already a few weeks old, though.</p>
<h4>Icon legend</h4>
<ul>
<li>Black icons: crime
<ul>
<li><i>fist</i> for violence</li>
<li><i>hammer</i> for vandalism</li>
<li><i>mask</i> for robbery</li>
<li><i>thief with flashlight</i> for theft</li>
</ul>
</li>
<li>Red icons: fire</li>
<li>Blue icons: traffic</li>
<li>Green icons: other</li>
<li>Target icon: more details available when clicking on it</li>
</ul>
<h3>WebLinks</h3>
<ul>
<li><a href="http://www.blaulicht-kurier.de">Blaulicht-Kurier</a> (the mashup)</li>
<li><a target="_blank" href="http://www.berlinonline.de/berliner-kurier/">Berliner Kurier</a> (one of Berlin&#8217;s two major local tabloid papers)</li>
<li><a href="http://www.searchcowboys.com/google/391">Article from the Search Cowboys</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/mashup/berlins-crime-a-google-maps-mashup-7/feed</wfw:commentRss>
		</item>
		<item>
		<title>Search Engines Are Simple Things</title>
		<link>http://www.searchenginefan.com/editorial/search-engines-are-simple-things-5</link>
		<comments>http://www.searchenginefan.com/editorial/search-engines-are-simple-things-5#comments</comments>
		<pubDate>Tue, 27 May 2008 13:58:31 +0000</pubDate>
		<dc:creator>martin</dc:creator>
		
		<category><![CDATA[editorial]]></category>

		<guid isPermaLink="false">http://www.searchenginefan.com/?p=5</guid>
		<description><![CDATA[What is the search engine fan blog about?]]></description>
			<content:encoded><![CDATA[<p>Well, in fact, search engines are rather far from being simple. What makes them appear simple is clever engineering, market research and stunning amounts of technology. The core tasks of a web search engine are tough enough:</p>
<ol>
<li>knowing what a user really is looking for</li>
<li>knowing which of the tens of billions of web pages or services are most relevant</li>
<li>knowing how to distinguish web spam, porn or malicious services from regular services</li>
<li>making these huge amounts of data accessible to all users instantaneously</li>
</ol>
<p>This blog will look into the engines and will try to explain them. How do they pay off commercially? What comes next? What is already available (but nobody knows about it)?</p>
<p>I hope you will enjoy this blog. Have fun!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.searchenginefan.com/editorial/search-engines-are-simple-things-5/feed</wfw:commentRss>
		</item>
	</channel>
</rss>

