<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Portable Consultant &#187; Search Engines</title>
	<atom:link href="http://theportableconsultant.com/blog/taxonomy/categorias/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://theportableconsultant.com/blog</link>
	<description>ECM infrastructure architecture... and unrelated matters.</description>
	<lastBuildDate>Wed, 14 Mar 2012 17:06:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>Oracle, Link Rot, and the Library of Alexandria</title>
		<link>http://theportableconsultant.com/blog/2008/02/13/oracle_link_rot_and_library_of_alexandria/</link>
		<comments>http://theportableconsultant.com/blog/2008/02/13/oracle_link_rot_and_library_of_alexandria/#comments</comments>
		<pubDate>Wed, 13 Feb 2008 21:08:04 +0000</pubDate>
		<dc:creator>pmh</dc:creator>
				<category><![CDATA[Grumbles]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://theportableconsultant.com/blog/2008/02/13/oracle_link_rot_and_library_of_alexandria/</guid>
		<description><![CDATA[The Internet isn&#8217;t our modern version of the Library of Alexandria, Brewster Kahle&#8217;s Internet Archive is closer in spirit to that ancient centre of knowledge. link rot Over time, especially Internet time, web links break. It&#8217;s a part of the natural changing order of things. But the Portable Consultant is particularly sorry to see knowledge, [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in">The Internet isn&#8217;t our modern version of the <a href="http://en.wikipedia.org/wiki/Library_of_Alexandria" title="Wikipedia on The Library of Alexandria" target="_blank">Library of Alexandria</a>, Brewster Kahle&#8217;s <a href="http://www.archive.org/about/about.php" title="About the Internet Archive" target="_blank">Internet Archive</a> is closer in spirit to that ancient centre of knowledge.</p>
<p><strong>link rot</strong><br />
Over time, especially Internet time, web links break. It&#8217;s a part of the natural changing order of things. But the Portable Consultant is particularly sorry to see knowledge, freely offered by a company as part of its marketing, removed completely from the web after a merger or acquisition.</p>
<p>The Oracle Corporation is good at this &#8211; or bad, depending on your point of view.</p>
<p>It happened when Oracle acquired <a href="http://en.wikipedia.org/wiki/Stellent" title="Wikipedia article on Stellent" target="_blank">Stellent</a>, a well-known ECM company.</p>
<p>The Stellent site remained for a time, but links to free resources like white papers broke quickly and those to training programs followed soon after. It was reasonable, perhaps, in the case of training as that function came under the control of Oracle University. It was sad in the case of the other online ECM resources and white papers that disappeared. Some of these still exist but have been buried without the Stellent name, no doubt because the branding was inconsistent. If the Google search that lists any of these has a broken link you might try looking on Oracle.com under <em>Fusion Middleware</em> (say, What?).</p>
<p><strong>re-branding trumps Search Engine Optimization</strong><br />
Such re-branding seems foolish in one important sense: a Google search on &#8220;Stellent training&#8221; does not offer any canonical links to Oracle training services on the first page of hits, but if you search the Oracle site you may find it buried in Oracle University under the incomprehensible name of <em>Fusion Middleware</em>. Right, as if all those who read about Stellent in (pre-acquisition) articles and reviews will recognize that Stellent is today &#8220;Fusion Middleware&#8221;.</p>
<p>No doubt this made sense to someone Oracle management.</p>
<p><strong>Hyperion articles lost</strong><br />
It happened again when Oracle acquired Hyperion.</p>
<p>This time it was more personal. The Portable Consultant <a href="http://theportableconsultant.com/blog/2006/11/08/women-fire-and-dangerous-things/" title="Women, Fire, and Dangerous Things posting" target="_blank">had written here</a> about an article on the (former) Hyperion site, <em>Unstructured Text and Structured Data</em>. It took its theme from the George Lakoff book <em><a href="http://www.amazon.com/Women-Dangerous-Things-George-Lakoff/dp/0226468046" title="Amazon listing" target="_blank">Women, Fire, and Dangerous Things</a> (what categories reveal about the mind)</em>. The Hyperion article links categorization with data mining and Enterprise Content Management, describing several data mining approaches from various companies active in the space.</p>
<p>Sadly, that article is nowhere to be found on the Net. If it does still exist it is buried so deeply within the Oracle site that none of the unique keywords can retrieve it.</p>
<p>However, with some patience, I was able to <a href="http://web.archive.org/web/20061104043206/dev.hyperion.com/resource_library/articles/unstructured_text_data_article.cfm" title="Unstructured Text and Structured Data via the WayBack Machine" target="_blank">retrieve a copy</a> through <a href="http://www.archive.org/web/web.php" title="The WayBack Machine at the Internet Archive" target="_blank"><em>The WayBack Machine</em> at the <em>Internet Archive</em></a>.</p>
<p style="margin-bottom: 0in">What a shame that such articles should be lost for no apparent reason or for the sake of branding or a change in marketing approach. Thank goodness someone recognized the fragility of our Internet and is attempting to build our new Library of Alexandria.</p>
<p style="margin-bottom: 0in"><strong>The Open Text Index and the Internet Archive</strong><br />
I first heard of <a href="http://en.wikipedia.org/wiki/Brewster_Kahle" title="Wikipedia article on Brewster Kahle" target="_blank">Brewster Kahle</a> in connection with the intriguing <a href="http://en.wikipedia.org/wiki/Wide_area_information_server" title="Wikipedia article on WAIS" target="_blank">WAIS</a> (Wide Area Information Servers) distributed data search/retrieval system. WAIS indexed databases on Internet sites (not web sites, for this was before Berners-Lee created the world-wide web). As I recall, a central index led searchers to indexes, often at universities and research establishments – foreshadowing today&#8217;s Internet search engines.
</p>
<p style="margin-bottom: 0in">Later, when Open Text Corporation&#8217;s <a href="http://en.wikipedia.org/wiki/Tim_Bray" title="Wikipedia article on Tim Bray" target="_blank">Tim Bray</a> had established the Open Text Index search engine, it became for a short time the back-end search engine for <em>Yahoo!</em> (circa 1995). As technical administrator of the OTI I regularly created a small pile of backup tapes for Open Text&#8217;s CEO, Tom Jenkins, to hold up at presentations as: “all the information on the world wide web”. For a time this was probably a fair approximation. Later, these tapes were passed on to Brewster Kahle to incorporate into the Internet Archive.</p>
<p style="margin-bottom: 0in">Today the WayBack Machine provides one of the few links (pun intended) to our Internet past. Whole sites are harvested at regular intervals to provide access, if sometimes spotty, to past web site versions and the information they held. I was able to find my missing article in a collection of pages from the old Hyperion site of 2006.</p>
<p style="margin-bottom: 0in">That I can retrieve <em>Unstructured Text and Structured Data</em> from the Internet Archive is a testament to the foresight of Internet pioneers like Kahle. That I cannot  find it through an Internet search engine or retrieve it from an Oracle archive is an example of (understandable) short term commercial interests that do not practice knowledge management&#8230; including those companies that would be happy to tell you all about their KM software offerings! <img src='http://theportableconsultant.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>Cheers,<br />
-pmh</p>
]]></content:encoded>
			<wfw:commentRss>http://theportableconsultant.com/blog/2008/02/13/oracle_link_rot_and_library_of_alexandria/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>You&#8217;re more famous than you think when you check the sponsored link!</title>
		<link>http://theportableconsultant.com/blog/2007/11/14/you-are-more-famous-than-you-think/</link>
		<comments>http://theportableconsultant.com/blog/2007/11/14/you-are-more-famous-than-you-think/#comments</comments>
		<pubDate>Thu, 15 Nov 2007 02:01:45 +0000</pubDate>
		<dc:creator>pmh</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[YukYuks]]></category>

		<guid isPermaLink="false">http://theportableconsultant.com/blog/2007/11/14/you-are-more-famous-than-you-think/</guid>
		<description><![CDATA[Feeling unappreciated? &#8230;or just unknown? Try searching on your name and take note of the sponsored links! Sponsored links based on your search keywords can turn anyone, or any thing, into a hot web commodity. Cheers, -pmh ps: These are from SurfWax.com, a useful meta search engine. Sponsored Links Consultants Looking For Consultants? Review And [...]]]></description>
			<content:encoded><![CDATA[<p>Feeling unappreciated? &#8230;or just unknown?</p>
<p>Try searching on your name and take note of the sponsored links!</p>
<p>Sponsored links based on your search keywords can turn anyone, or any thing, into a hot web commodity.</p>
<p>Cheers,<br />
-pmh</p>
<p>ps: These are from SurfWax.com, a useful meta search engine.</p>
<p><strong>Sponsored Links</strong></p>
<p>Consultants<br />
<em>Looking For Consultants? Review And Compare Online.      </em></p>
<p>Find The Portable Consultant<br />
<em>The Most Popular Sites for The Portable Consultant.      </em></p>
<p>Bored? Check Out the Sumo!<br />
<em>Viral video mayhem. Games Galore. Sucker free music. Bangin&#8217; Hotties. Animation for your fascination. Go to the Sumo, live large and never be disappointed by a weak video website again.  </em></p>
<p>Find The Portable Consultant At eBay<br />
<em>Looking for The Portable Consultant? eBay has great deals on new &amp; used electronics, cars, apparel, collectibles, sporting goods and more. If you can&#8217;t find it on eBay, it probably doesn&#8217;t exist. Sign up today for FREE!    </em></p>
<p>Get Different Info On the portable consultant Now<br />
<em>The Web&#8217;s Most Complete the portable consultant resources all in one place &#8211; just a click away.  </em></p>
<p>Top the portable consultant Results for Your Search<br />
<em>Find all the latest and best the portable consultant info you need. Find it fast. Find it here!</em></p>
<p>the portable consultant<br />
<em>The most popular sites for the portable consultant!    </em></p>
<p>Find the portable consultant<br />
<em>The most popular sites for the portable consultant! </em></p>
<p>The best site for the portable consultant<br />
<em>The most popular sites for the portable consultant! </em></p>
<p>Find the portable consultant<br />
<em>Looking for the portable consultant? View the most popular sites.  </em></p>
<p>Deals on the portable consultant<br />
<em>Find Bargain Prices on the portable consultant. Shop at Trusted Online Stores.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://theportableconsultant.com/blog/2007/11/14/you-are-more-famous-than-you-think/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crawl all over me now, Baby!</title>
		<link>http://theportableconsultant.com/blog/2006/11/22/xml-sitemap-protocol-agreed/</link>
		<comments>http://theportableconsultant.com/blog/2006/11/22/xml-sitemap-protocol-agreed/#comments</comments>
		<pubDate>Wed, 22 Nov 2006 17:06:40 +0000</pubDate>
		<dc:creator>pmh</dc:creator>
				<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://theportableconsultant.com/blog/2006/11/22/xml-sitemap-protocol-agreed/</guid>
		<description><![CDATA[XML sitemaps protocol adopted by major search engines XML sitemaps (as opposed to the graphical or people-readable list sitemaps) are a way for search engines to index web pages more easily/efficiently&#8230; Google has been using them for a while and now Yahoo and MSN have agreed to come on board. The assumption is that if [...]]]></description>
			<content:encoded><![CDATA[<p><strong>XML sitemaps protocol adopted by major search engines</strong><br />
XML <a title="XML sitemaps" target="_blank" href="http://www.sitemaps.org/">sitemaps</a> (as opposed to the graphical or people-readable list sitemaps) are a way for search engines to index web pages more easily/efficiently&#8230; Google has been using them for a while and now <a title="Yahoo and MSN join standard" target="_blank" href="http://www.techcrunch.com/2006/11/15/google-yahoo-and-microsoft-agree-to-standard-sitemaps-protocol/">Yahoo and MSN have agreed to come on board</a>.</p>
<p>The assumption is that if a search engine can index you more easily then your ranking will be more &#8220;accurate.&#8221; Internet ecologists might also hope this will reduce network traffic (thereby freeing up valuable bandwidth for spam!).</p>
<p>The Portable Consultant is not very concerned about his blog traffic at this point, but now that some big engines are using the same protocol it seemed like a good time to look into how this might be done in WordPress.</p>
<p>Of course <a title="sitemap plugin site" target="_blank" href="http://www.arnebrachhold.de/cat/seo/googlemap/">a WordPress sitemap plugin</a> was immediately located. Written by Arne Brachhold, it provides a simple setup screen and connects to Google by default. After a quick test the plugin was installed and it immediately pinged Google to let them know important data was ready for them and, in turn, you.</p>
<p>Yahoo seems to require a more complex process and I had trouble finding the instructions (they&#8217;re <a title="Yahoo's sitemap registration page" target="_blank" href="https://siteexplorer.search.yahoo.com/submit">here</a>). MSN will, of course, be the last to implement sitemap registration.</p>
<p>Now some major search engines will know that this posting is ready to index so that I can tell you, dear blogger, how to increase the ease with which your blog postings will discovered by readers in London, New York, Tokyo, and One Tree Hill.</p>
<p>Cheers,<br />
-pmh</p>
]]></content:encoded>
			<wfw:commentRss>http://theportableconsultant.com/blog/2006/11/22/xml-sitemap-protocol-agreed/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CMS can cause SEO problems</title>
		<link>http://theportableconsultant.com/blog/2006/08/28/cms-causes-seo-problems/</link>
		<comments>http://theportableconsultant.com/blog/2006/08/28/cms-causes-seo-problems/#comments</comments>
		<pubDate>Mon, 28 Aug 2006 16:35:04 +0000</pubDate>
		<dc:creator>pmh</dc:creator>
				<category><![CDATA[ECM]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://theportableconsultant.com/blog/2006/08/28/cms-can-cause-seo-problems/</guid>
		<description><![CDATA[The PortableÂ Consultant&#8217;sÂ current client&#8217;s web sites are under the management of the Sales &#38; Marketing group. They would like to ensure that any new CMS won&#8217;t create complex URLs that would upset the search engines that come calling. Today&#8217;s CMS&#8217;s tend to create URLs with long strings of unintelligible parameters. They work better this [...]]]></description>
			<content:encoded><![CDATA[<p>The PortableÂ Consultant&#8217;sÂ current client&#8217;s web sites are under the management of the Sales &amp; Marketing group. They would like to ensure that any new CMS won&#8217;t create complex URLs that would upset the search engines that come calling.</p>
<p>Today&#8217;s CMS&#8217;s tend to create URLs with long strings of unintelligible parameters. They work better this way, but people do not. More importantly, it turns out, neither do search engines. These â€œugly URLsâ€ are not only hard for users to remember and enter into their browser, but are also awkward for search engines to use when crawling links within a site. They may well bypass many or all of the internal pages of a site for this reason.</p>
<p>While looking for a good explanation of these issues to show my client I came across <a href="http://www.nonlinear.ca" title="Non-Linear Creations web site" target="_blank">Non-Linear Creations&#8217;</a> whitepaper <em>SEO and CMS: Implementing Best Practices</em>. A version of this paper by Randy Woods and Julie Batten is available on <a href="http://www.cmswatch.com" title="CMS Watch web site" target="_blank">CMS Watch</a> with the title <em><a href="http://www.cmswatch.com/Feature/150-SEO-and-Your-CMS" title="Embedding SEO Best Practices article" target="_blank">Embedding SEO Best Practices in CMS Implementations</a></em>.</p>
<p>Cheers,<br />
-pmh</p>
<p>Â</p>
]]></content:encoded>
			<wfw:commentRss>http://theportableconsultant.com/blog/2006/08/28/cms-causes-seo-problems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

