<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Lee Romero</title>
	<atom:link href="http://blog.leeromero.org/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.leeromero.org</link>
	<description>On Content, Collaboration and Findability</description>
	<lastBuildDate>Wed, 12 Oct 2011 12:15:16 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Language change over time in your search log by Lee Romero</title>
		<link>http://blog.leeromero.org/2011/10/10/language-change-over-time-in-your-search-log/comment-page-1/#comment-23374</link>
		<dc:creator>Lee Romero</dc:creator>
		<pubDate>Wed, 12 Oct 2011 12:15:16 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=447#comment-23374</guid>
		<description>Thanks, Hoss - I intuitively agree with your points but I have found no data or reports actually supporting that assertion.  That&#039;s part of why I&#039;ve been posting in this on this topic - I think there is insight to be gained to actually prove these assumed truths are actually true (or to find out that they&#039;re not true or to be able to more precisely identify what are the factors that influence these metrics).

Are you aware of any journal articles (or even blog posts like mine) that support your points?  I&#039;d be very interested!</description>
		<content:encoded><![CDATA[<p>Thanks, Hoss &#8211; I intuitively agree with your points but I have found no data or reports actually supporting that assertion.  That&#8217;s part of why I&#8217;ve been posting in this on this topic &#8211; I think there is insight to be gained to actually prove these assumed truths are actually true (or to find out that they&#8217;re not true or to be able to more precisely identify what are the factors that influence these metrics).</p>
<p>Are you aware of any journal articles (or even blog posts like mine) that support your points?  I&#8217;d be very interested!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Language change over time in your search log by Hoss</title>
		<link>http://blog.leeromero.org/2011/10/10/language-change-over-time-in-your-search-log/comment-page-1/#comment-23369</link>
		<dc:creator>Hoss</dc:creator>
		<pubDate>Tue, 11 Oct 2011 19:38:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=447#comment-23369</guid>
		<description>The specific domain of your data/site and the use case stories of your user base are going to be the primary factor in this sort of metric

If you run a news site, or an ecommerce portal selling new products, then numbers like these shouldn&#039;t surprise anyone. 

If you host an encyclopedia, or a database of medical ailments+symptoms, or a parts catalog for antique repair, or a site on stamp collecting, etc... the results are all going to be massively different.</description>
		<content:encoded><![CDATA[<p>The specific domain of your data/site and the use case stories of your user base are going to be the primary factor in this sort of metric</p>
<p>If you run a news site, or an ecommerce portal selling new products, then numbers like these shouldn&#8217;t surprise anyone. </p>
<p>If you host an encyclopedia, or a database of medical ailments+symptoms, or a parts catalog for antique repair, or a site on stamp collecting, etc&#8230; the results are all going to be massively different.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Language change over time in your search log by Lee Romero</title>
		<link>http://blog.leeromero.org/2011/10/10/language-change-over-time-in-your-search-log/comment-page-1/#comment-23368</link>
		<dc:creator>Lee Romero</dc:creator>
		<pubDate>Tue, 11 Oct 2011 18:23:58 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=447#comment-23368</guid>
		<description>Otis - I did not do that with this data set but I did apply quite a few normalizations to the data in my prior post (the 80-20 ... post).  

What I found was that it did not make that huge of a difference.  I was pretty aggressive with the normalization - removing stop words, stemming, ignoring word order, case, many kinds of punctuation, etc. - and I found that it &quot;raised&quot; that curve by about 1% (on the Y axis, i.e., count of searches) along the X axis (i.e., how deep into the list of terms you go) from about 1% on the X axis until about 20% - after that, the lines just grew closer together. 

It&#039;s hard to compare the two because of the difference in what the curves mean, but I would guess it might push overlap from about 13% to maybe the upper teens?  I&#039;ll give it a try for a few months and report back here, though.

Thanks!</description>
		<content:encoded><![CDATA[<p>Otis &#8211; I did not do that with this data set but I did apply quite a few normalizations to the data in my prior post (the 80-20 &#8230; post).  </p>
<p>What I found was that it did not make that huge of a difference.  I was pretty aggressive with the normalization &#8211; removing stop words, stemming, ignoring word order, case, many kinds of punctuation, etc. &#8211; and I found that it &#8220;raised&#8221; that curve by about 1% (on the Y axis, i.e., count of searches) along the X axis (i.e., how deep into the list of terms you go) from about 1% on the X axis until about 20% &#8211; after that, the lines just grew closer together. </p>
<p>It&#8217;s hard to compare the two because of the difference in what the curves mean, but I would guess it might push overlap from about 13% to maybe the upper teens?  I&#8217;ll give it a try for a few months and report back here, though.</p>
<p>Thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Language change over time in your search log by Otis Gospodnetic</title>
		<link>http://blog.leeromero.org/2011/10/10/language-change-over-time-in-your-search-log/comment-page-1/#comment-23367</link>
		<dc:creator>Otis Gospodnetic</dc:creator>
		<pubDate>Tue, 11 Oct 2011 18:00:20 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=447#comment-23367</guid>
		<description>Lee - I wonder how different your results would be if you were to normalize queries, say using http://code.google.com/p/google-refine/ .  Or maybe you already did that?</description>
		<content:encoded><![CDATA[<p>Lee &#8211; I wonder how different your results would be if you were to normalize queries, say using <a href="http://code.google.com/p/google-refine/" rel="nofollow">http://code.google.com/p/google-refine/</a> .  Or maybe you already did that?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 80-20: The lie in your search log? by Lee Romero &#187; Blog Archive &#187; Language change over time in your search log</title>
		<link>http://blog.leeromero.org/2010/11/13/80-20-the-lie-in-your-search-log/comment-page-1/#comment-23365</link>
		<dc:creator>Lee Romero &#187; Blog Archive &#187; Language change over time in your search log</dc:creator>
		<pubDate>Tue, 11 Oct 2011 03:05:01 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=391#comment-23365</guid>
		<description>[...] previous post, &#8220;80-20: The lie in your search log?&#8220;, highlighted how the slope of &#8220;short head&#8221; of your search terms may not be as [...]</description>
		<content:encoded><![CDATA[<p>[...] previous post, &#8220;80-20: The lie in your search log?&#8220;, highlighted how the slope of &#8220;short head&#8221; of your search terms may not be as [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 80-20: The lie in your search log? by Lee Romero &#187; Blog Archive &#187; The Findability Gap by Lou Rosenfeld</title>
		<link>http://blog.leeromero.org/2010/11/13/80-20-the-lie-in-your-search-log/comment-page-1/#comment-23300</link>
		<dc:creator>Lee Romero &#187; Blog Archive &#187; The Findability Gap by Lou Rosenfeld</dc:creator>
		<pubDate>Fri, 23 Sep 2011 12:28:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=391#comment-23300</guid>
		<description>[...] Slide 24 &#8211; As I&#8217;ve written about before, I would say that the 80/20 rule is more than just &#8220;not quite accurate&#8221;.  But [...]</description>
		<content:encoded><![CDATA[<p>[...] Slide 24 &#8211; As I&#8217;ve written about before, I would say that the 80/20 rule is more than just &#8220;not quite accurate&#8221;.  But [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Enterprise Taxonomy &#8211; The Structure in Detail by Sandra Fidler</title>
		<link>http://blog.leeromero.org/2009/01/13/enterprise-taxonomy-the-structure-in-detail/comment-page-1/#comment-23235</link>
		<dc:creator>Sandra Fidler</dc:creator>
		<pubDate>Mon, 22 Aug 2011 16:24:50 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=240#comment-23235</guid>
		<description>Generally I don&#039;t read post on blogs, however I wish to say that this write-up very forced me to try and do it! Your writing taste has been amazed me. Thanks, quite nice article.</description>
		<content:encoded><![CDATA[<p>Generally I don&#8217;t read post on blogs, however I wish to say that this write-up very forced me to try and do it! Your writing taste has been amazed me. Thanks, quite nice article.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 80-20: The lie in your search log? by Lee Romero</title>
		<link>http://blog.leeromero.org/2010/11/13/80-20-the-lie-in-your-search-log/comment-page-1/#comment-23213</link>
		<dc:creator>Lee Romero</dc:creator>
		<pubDate>Tue, 02 Aug 2011 11:52:31 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=391#comment-23213</guid>
		<description>Otis - I do think this kind of report is useful, but currently I&#039;m still not entirely sure what one might do with the insight.

I have a few data points myself (i.e., I understand what this curve looks like for a couple of search solutions) but I have not yet been able to determine if there is really an underlying interpretation of the shape.

If there were a means to understand what this curve looks like for a good-sized set of solutions and then be able to compare traits of those search solutions, I think it could be more insightful.

I also wish I knew what the &quot;something important&quot; is - I&#039;ve tried engendering interest in &quot;comparing notes&quot; across solutions via the SearchCoP community on Yahoo! groups without any luck so far.</description>
		<content:encoded><![CDATA[<p>Otis &#8211; I do think this kind of report is useful, but currently I&#8217;m still not entirely sure what one might do with the insight.</p>
<p>I have a few data points myself (i.e., I understand what this curve looks like for a couple of search solutions) but I have not yet been able to determine if there is really an underlying interpretation of the shape.</p>
<p>If there were a means to understand what this curve looks like for a good-sized set of solutions and then be able to compare traits of those search solutions, I think it could be more insightful.</p>
<p>I also wish I knew what the &#8220;something important&#8221; is &#8211; I&#8217;ve tried engendering interest in &#8220;comparing notes&#8221; across solutions via the SearchCoP community on Yahoo! groups without any luck so far.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 80-20: The lie in your search log? by Otis Gospodnetic</title>
		<link>http://blog.leeromero.org/2010/11/13/80-20-the-lie-in-your-search-log/comment-page-1/#comment-23212</link>
		<dc:creator>Otis Gospodnetic</dc:creator>
		<pubDate>Mon, 01 Aug 2011 22:11:25 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=391#comment-23212</guid>
		<description>I&#039;m wondering if you think a report like this is something every search analytics tool should have?

I&#039;m asking because we have a Search Analytics service (see http://sematext.com/search-analytics/index.html ) but don&#039;t have such a report.  And your point 2) above (&quot;the shape of this curve tells you something important about how you can manage your search solution&quot;) sounds intriguing, although I also don&#039;t yet know what that &quot;something important&quot; is.</description>
		<content:encoded><![CDATA[<p>I&#8217;m wondering if you think a report like this is something every search analytics tool should have?</p>
<p>I&#8217;m asking because we have a Search Analytics service (see <a href="http://sematext.com/search-analytics/index.html" rel="nofollow">http://sematext.com/search-analytics/index.html</a> ) but don&#8217;t have such a report.  And your point 2) above (&#8221;the shape of this curve tells you something important about how you can manage your search solution&#8221;) sounds intriguing, although I also don&#8217;t yet know what that &#8220;something important&#8221; is.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on 80-20: The lie in your search log? by Ed Dale</title>
		<link>http://blog.leeromero.org/2010/11/13/80-20-the-lie-in-your-search-log/comment-page-1/#comment-23090</link>
		<dc:creator>Ed Dale</dc:creator>
		<pubDate>Tue, 07 Jun 2011 16:37:37 +0000</pubDate>
		<guid isPermaLink="false">http://blog.leeromero.org/?p=391#comment-23090</guid>
		<description>My search terms per month follow simialr curves, it takes 50% or higher to acheive 80% of the queries. Very different from external ecommerce search.</description>
		<content:encoded><![CDATA[<p>My search terms per month follow simialr curves, it takes 50% or higher to acheive 80% of the queries. Very different from external ecommerce search.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
