<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Lectures on the Google Technology Stack 1: Introduction to PageRank</title>
	<atom:link href="http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/feed/" rel="self" type="application/rss+xml" />
	<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/</link>
	<description></description>
	<lastBuildDate>Tue, 09 Mar 2010 22:14:18 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Three Minutes on PageRank &#171;</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-22262</link>
		<dc:creator>Three Minutes on PageRank &#171;</dc:creator>
		<pubDate>Wed, 13 May 2009 13:08:26 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-22262</guid>
		<description>[...] bad for three minutes! PageRank a very complicated subject, as you can see from here, a college level course that spends a semester discussing it! Making sense out of PageRank in just [...]</description>
		<content:encoded><![CDATA[<p>[...] bad for three minutes! PageRank a very complicated subject, as you can see from here, a college level course that spends a semester discussing it! Making sense out of PageRank in just [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: argv</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-19120</link>
		<dc:creator>argv</dc:creator>
		<pubDate>Fri, 27 Mar 2009 02:08:31 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-19120</guid>
		<description>@JohnS

The IP angle is interesting.

But consider another angle.

Anyone can crawl the web and create and index.  Even a cache.  But some questions begin to emerge:

- are they holding the IP (e.g. copyright) of others&#039; sites as URI&#039;s or even cahed pages?

- are they operating as middlemen, as gateways to others&#039; content, and letting advertisers intrude into the pathway between user and the content she intends to retrieve?

- will content owners agree to anyone crawling and indexing and even caching their content, or only to certain parties (e.g. Google, Yahoo, MSN, etc.)

- do we need middlemen?

- are &quot;browsers&quot; even necessary, if we are merely searching and selecting known content from an index?  consider a tool that can handle regex searches, a tool that can generate an index and a tool that can retrieve a file when given a URI

is the internet of today (cf. to that of say 1993) too big to &quot;browse&quot;?  compare the &quot;portal&quot; concept (e.g. Yahoo! etc.) of the early web versus today&#039;s point of entry: the Google search</description>
		<content:encoded><![CDATA[<p>@JohnS</p>
<p>The IP angle is interesting.</p>
<p>But consider another angle.</p>
<p>Anyone can crawl the web and create and index.  Even a cache.  But some questions begin to emerge:</p>
<p>- are they holding the IP (e.g. copyright) of others&#8217; sites as URI&#8217;s or even cahed pages?</p>
<p>- are they operating as middlemen, as gateways to others&#8217; content, and letting advertisers intrude into the pathway between user and the content she intends to retrieve?</p>
<p>- will content owners agree to anyone crawling and indexing and even caching their content, or only to certain parties (e.g. Google, Yahoo, MSN, etc.)</p>
<p>- do we need middlemen?</p>
<p>- are &#8220;browsers&#8221; even necessary, if we are merely searching and selecting known content from an index?  consider a tool that can handle regex searches, a tool that can generate an index and a tool that can retrieve a file when given a URI</p>
<p>is the internet of today (cf. to that of say 1993) too big to &#8220;browse&#8221;?  compare the &#8220;portal&#8221; concept (e.g. Yahoo! etc.) of the early web versus today&#8217;s point of entry: the Google search</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: links for 2008-12-24 &#171; Bloggitation</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16557</link>
		<dc:creator>links for 2008-12-24 &#171; Bloggitation</dc:creator>
		<pubDate>Thu, 25 Dec 2008 07:02:09 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16557</guid>
		<description>[...] Lectures on the Google Technology Stack 1: Introduction to PageRank (tags: python programming web google seo) [...]</description>
		<content:encoded><![CDATA[<p>[...] Lectures on the Google Technology Stack 1: Introduction to PageRank (tags: python programming web google seo) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Nielsen &#187; Using your laptop to compute PageRank for millions of webpages</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16508</link>
		<dc:creator>Michael Nielsen &#187; Using your laptop to compute PageRank for millions of webpages</dc:creator>
		<pubDate>Tue, 23 Dec 2008 18:13:33 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16508</guid>
		<description>[...] difficult to apply to the web as a whole, simply because the web contains so many webpages. While just a few lines of code can be used to implement PageRank on collections of a few thousand webpages, it&#8217;s trickier to [...]</description>
		<content:encoded><![CDATA[<p>[...] difficult to apply to the web as a whole, simply because the web contains so many webpages. While just a few lines of code can be used to implement PageRank on collections of a few thousand webpages, it&#8217;s trickier to [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Nielsen &#187; The PageRank distribution for the web</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16204</link>
		<dc:creator>Michael Nielsen &#187; The PageRank distribution for the web</dc:creator>
		<pubDate>Fri, 12 Dec 2008 21:54:37 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16204</guid>
		<description>[...] PageRank for a webpage is a probability between 0 and 1. The general idea is that PageRank quantifies the importance of the page: the bigger the probability the more [...]</description>
		<content:encoded><![CDATA[<p>[...] PageRank for a webpage is a probability between 0 and 1. The general idea is that PageRank quantifies the importance of the page: the bigger the probability the more [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Nielsen &#187; Video for &#8220;Lectures on the Google Technology Stack 1: Introduction to PageRank&#8221;</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16147</link>
		<dc:creator>Michael Nielsen &#187; Video for &#8220;Lectures on the Google Technology Stack 1: Introduction to PageRank&#8221;</dc:creator>
		<pubDate>Sun, 07 Dec 2008 13:50:48 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16147</guid>
		<description>[...] video for the first lecture (notes are here, syllabus is here):   There&#8217;s a brief break in the middle, due to limitations of our [...]</description>
		<content:encoded><![CDATA[<p>[...] video for the first lecture (notes are here, syllabus is here):   There&#8217;s a brief break in the middle, due to limitations of our [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Nielsen - Lectures on the Google Technology Stack</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16102</link>
		<dc:creator>Michael Nielsen - Lectures on the Google Technology Stack</dc:creator>
		<pubDate>Wed, 03 Dec 2008 18:23:28 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16102</guid>
		<description>[...] Read about it on Michaels blog. [...]</description>
		<content:encoded><![CDATA[<p>[...] Read about it on Michaels blog. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Sidles</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16101</link>
		<dc:creator>John Sidles</dc:creator>
		<pubDate>Wed, 03 Dec 2008 15:29:07 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16101</guid>
		<description>This will be a wonderful series of lectures!

To the extent that the subject &quot;Google Technology&quot; is broadly conceived, there are several subjects of broad interest that I hope will be covered.

For example, what is (or should be) the intellectual property status of algorithms like PageRank?  

According to Wikipedia, Stanford University holds a patent of PageRank algorithm, which they exclusively license to Google ... for the princely sum of $350 million dollars.

We can contrast this the charter of UC Berkeley&#039;s CHESS program (Center for Hybrid and Embedded Software Systems) which explicitly states &quot;The Center publicly disseminates all research results as rapidly as possible ... software is made available in open-source form ... center patents are expected to be rare.&quot;

A wonderful question is: which of these two models is a more plausible/more viable/morally superior model for the future of science?

On a mathematical note, if we take &quot;n&quot; to be the number of vertices of the web, then every aspect of Google&#039;s page-rank algorithms has to be O(n) ... even matrix-vector multiplication.  So every matrix arising in PageRank has to be structured in such a way that (e.g., sparsity and/or factorization) that practical O(n) computation is feasible.

The property of &quot;O(n)-ness&quot; is becoming ubiquitous in every aspect of modern simulation science (which itself would be a good theme for a lecture).  Because nowadays it seems that almost every large-scale system that we humans care about--whether natural or human-made, whether classical or quantum---possesses sufficient algorithmic compressibility that &quot;O(n)-ness&quot; can be found.

In this sense, aren&#039;t Google and CHESS (and many other public and private enterprises) very much in the same line of business ... mathematically *and* morally *and* legally? 

These are deep waters ... and for this reason I look forward immensely to further lectures in this series.</description>
		<content:encoded><![CDATA[<p>This will be a wonderful series of lectures!</p>
<p>To the extent that the subject &#8220;Google Technology&#8221; is broadly conceived, there are several subjects of broad interest that I hope will be covered.</p>
<p>For example, what is (or should be) the intellectual property status of algorithms like PageRank?  </p>
<p>According to Wikipedia, Stanford University holds a patent of PageRank algorithm, which they exclusively license to Google &#8230; for the princely sum of $350 million dollars.</p>
<p>We can contrast this the charter of UC Berkeley&#8217;s CHESS program (Center for Hybrid and Embedded Software Systems) which explicitly states &#8220;The Center publicly disseminates all research results as rapidly as possible &#8230; software is made available in open-source form &#8230; center patents are expected to be rare.&#8221;</p>
<p>A wonderful question is: which of these two models is a more plausible/more viable/morally superior model for the future of science?</p>
<p>On a mathematical note, if we take &#8220;n&#8221; to be the number of vertices of the web, then every aspect of Google&#8217;s page-rank algorithms has to be O(n) &#8230; even matrix-vector multiplication.  So every matrix arising in PageRank has to be structured in such a way that (e.g., sparsity and/or factorization) that practical O(n) computation is feasible.</p>
<p>The property of &#8220;O(n)-ness&#8221; is becoming ubiquitous in every aspect of modern simulation science (which itself would be a good theme for a lecture).  Because nowadays it seems that almost every large-scale system that we humans care about&#8211;whether natural or human-made, whether classical or quantum&#8212;possesses sufficient algorithmic compressibility that &#8220;O(n)-ness&#8221; can be found.</p>
<p>In this sense, aren&#8217;t Google and CHESS (and many other public and private enterprises) very much in the same line of business &#8230; mathematically *and* morally *and* legally? </p>
<p>These are deep waters &#8230; and for this reason I look forward immensely to further lectures in this series.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: La technologie Google par Michael Nielsen</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16092</link>
		<dc:creator>La technologie Google par Michael Nielsen</dc:creator>
		<pubDate>Tue, 02 Dec 2008 22:36:54 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16092</guid>
		<description>[...] Nielsen vient de publier la première partie d&#8217;un cours sur la technologie Google. Michael présente une analyse de PageRank similaire à [...]</description>
		<content:encoded><![CDATA[<p>[...] Nielsen vient de publier la première partie d&#8217;un cours sur la technologie Google. Michael présente une analyse de PageRank similaire à [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Lemire</title>
		<link>http://michaelnielsen.org/blog/lectures-on-the-google-technology-stack-1-introduction-to-pagerank/comment-page-1/#comment-16091</link>
		<dc:creator>Daniel Lemire</dc:creator>
		<pubDate>Tue, 02 Dec 2008 22:29:19 +0000</pubDate>
		<guid isPermaLink="false">http://michaelnielsen.org/blog/?p=506#comment-16091</guid>
		<description>For a French version of similar ideas, see week 11 of my online course on Information Retrieval:

http://benhur.teluq.uqam.ca/SPIP/inf6460/rubrique.php3?id_rubrique=13&amp;id_article=52&amp;sem=Semaine%2011</description>
		<content:encoded><![CDATA[<p>For a French version of similar ideas, see week 11 of my online course on Information Retrieval:</p>
<p><a href="http://benhur.teluq.uqam.ca/SPIP/inf6460/rubrique.php3?id_rubrique=13&amp;id_article=52&amp;sem=Semaine%2011" rel="nofollow">http://benhur.teluq.uqam.ca/SPIP/inf6460/rubrique.php3?id_rubrique=13&amp;id_article=52&amp;sem=Semaine%2011</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>
