{"id":546,"date":"2009-01-26T06:53:07","date_gmt":"2009-01-26T10:53:07","guid":{"rendered":"http:\/\/michaelnielsen.org\/blog\/?p=546"},"modified":"2009-01-26T06:53:07","modified_gmt":"2009-01-26T10:53:07","slug":"biweekly-links-for-01262009","status":"publish","type":"post","link":"https:\/\/michaelnielsen.org\/blog\/biweekly-links-for-01262009\/","title":{"rendered":"Biweekly links for 01\/26\/2009"},"content":{"rendered":"<ul>\n<li><a href=\"http:\/\/squarecog.wordpress.com\/2009\/01\/17\/building-an-inverted-index-with-hadoop-and-pig\/\">Building an Inverted Index with Hadoop and Pig \u00c2\u00ab SquareCog\u00e2\u20ac\u2122s SquareBlog<\/a>\n<ul>\n<li>&#8220;In this post, I present a (very) brief description of the Pig project and demonstrate how one can construct an inverted index from a collection of text files using just a few lines of PigLatin.\n<p>Pig offers SQL-like data processing instructions (select, project, filter, group), while being both more flexible by allowing simple integration of user-defined functions, and more straightforward by allowing users to issue command proceduraly, rather than declaratively, as in SQL.  &#8220;<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"http:\/\/public.yahoo.com\/gogate\/hadoop-tutorial\/start-tutorial.html\">Yahoo! Hadoop Tutorial<\/a>\n<ul>\n<li><\/li>\n<\/ul>\n<\/li>\n<li><a href=\"http:\/\/friendfeed.com\/e\/4ad4515b-fd30-4f9f-bf5c-6a1381692fdc\/Comparison-of-biological-wikis\/\">Comparison of biological wikis<\/a>\n<ul>\n<li>Andrew Su&#8217;s survey of biological wikis (if you click again it links through to a spreadsheet).   Lots of very interesting data about number of edits, number of editors, etc.<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"http:\/\/anand.typepad.com\/datawocky\/2008\/07\/the-real-long-tail-why-both-chris-anderson-and-anita-elberse-are-wrong.html\">Datawocky: The Real Long Tail: Why both Chris Anderson and Anita Elberse are Wrong<\/a>\n<ul>\n<li><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>Click <a href=\"http:\/\/delicious.com\/nielsen\/\">here<\/a> for all of my del.icio.us bookmarks.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Building an Inverted Index with Hadoop and Pig \u00c2\u00ab SquareCog\u00e2\u20ac\u2122s SquareBlog &#8220;In this post, I present a (very) brief description of the Pig project and demonstrate how one can construct an inverted index from a collection of text files using just a few lines of PigLatin. Pig offers SQL-like data processing instructions (select, project, filter,&hellip; <a class=\"more-link\" href=\"https:\/\/michaelnielsen.org\/blog\/biweekly-links-for-01262009\/\">Continue reading <span class=\"screen-reader-text\">Biweekly links for 01\/26\/2009<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[],"tags":[],"class_list":["post-546","post","type-post","status-publish","format-standard","hentry","entry"],"_links":{"self":[{"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/posts\/546","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/comments?post=546"}],"version-history":[{"count":0,"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/posts\/546\/revisions"}],"wp:attachment":[{"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/media?parent=546"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/categories?post=546"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michaelnielsen.org\/blog\/wp-json\/wp\/v2\/tags?post=546"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}