Interrupting Google search

How can Google search be beaten? Google’s edge is that they do search better than other companies: they have knowledge about search that those companies don’t, in part because they place a high premium on developing such knowledge in-house.

What happens if Google’s understanding of search starts to saturate, and further research produces only small gains in user experience? The knowledge gap to their competitors will start to close. Other companies will be able to replicate the search experience Google offers. The advantage will then shift to whichever company can manage the operations side of search (e.g., maintaining large teams, large data centers and so on) better. Google’s culture – all those clever people improving search – will then become a liability, not an asset.

This is the classic path to commoditization. A new industry opens up. In the early days, the race goes to those who develop know-how quickly, since that know-how provides an edge in service. As know-how saturates, everyone can provide the same service, and the edge moves to whoever can manage operations the best. The old innovators are actually at a disadvantage at this point, since they have a culture strongly invested in innovation.

In Google’s case, there’s another interesting possibility. Maybe search just keeps getting better and better. It’s certainly an interesting enough problem that this may well be possible. But if our knowledge of search ever starts to saturate, Google may find itself needing another source of support for its major business (advertising).


How institutions change

Does anyone know of a good discussion of how institutions change? I’ve looked around a fair bit, online, in catalogues, and in bookstores. Nothing I’ve found has quite fit the bill.

Update: Shortly after posting this, I thought of Cosma’s notebooks, which do indeed contain several promising leads.


Biweekly links for 04/25/2008

Click here for all of my del.icio.us bookmarks.


Info, bio, nano, or thermo? Turing’s revenge

People sometimes claim that we’re moving from the information age into the biotech age, or the nanotech age, or the age of energy. Will we really see such a shift, or is this just hype?

My recent thinking about the idea that everything should be code convinces me that the people claiming that such shifts will occur are wrong, at least in the case of biotech and nanotech.

It’s not that biotech and nanotech won’t make enormous, world-changing strides in the near future. They will. But the effect of many of those strides will be to bring biotech and nanotech effectively into the realm of information technology. Expressing biology and nanotechnology in the language of information allows you to set loose all the powerful ideas of computation. This is too much to pass up. So what we’ll see is not a shift, but rather a gradual convergence between the info, bio and nano worlds. Which of the three will have the upper hand, commercially, seems to me to be difficult to predict.

What about energy? Here the situation is different. Like information, energy has a fundamental, irreducible quality. Because of this, I expect we’ll see a complementary relationship between information and energy technologies, but one will never subsume the other.


Money, markets, and evolution

Aside from human beings, can anyone think of biological systems which have evolved money or a market?


A moment of creative genius

I’ve been feeling quite pleased with myself for getting the weblogger emacs mode working, giving me a simple way to post directly from emacs, without logging into my blogging software (WordPress).

That is, I was feeling pleased until this morning, when a cut-and-paste error made in weblogger mode resulted in me posting my blog password to the front page of my blog. It was only online for a few seconds, and I changed the password immediately, but it’s not exactly a shining moment…


How much power is used when you do a Google search?

The web is a great way of outsourcing tasks to specialized parallel supercomputers. Here’s a crude order-of-magnitude estimate of the amount of computing power used in a single Google search.

The size of the Google server cluster is no longer public, but online estimates typically describe it as containing about half a million commodity machines, each comparable in power to the personal computers widely used by consumers. As a rough estimate, let’s say about 200,000 of those are involved in serving search results.

I don’t know how many searches are done using Google. But online estimates put it in the range of hundreds of millions per day. At peak times this means perhaps 10,000 searches per second.

In usability studies, Google has found that users are less happy with their search experience when it takes longer than 0.4 seconds to serve pages. So they aim to serve most of their pages in 0.4 seconds or less. In practice, this means they’ve got to process queries even faster, since network communication can easily chew up much of that 0.4 seconds. For simplicity we’ll assume that all that time is available.

What this means is that at peak times, roughly 10,000 searches per second × 0.4 seconds per search ≈ 4,000 searches are in flight at any given instant, and the cluster’s effort is being distributed across those searches.

Put another way, each time you search, you’re making use of the equivalent of (very) roughly 200,000 / 4,000 = 50 machines.
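For concreteness, here’s the back-of-envelope arithmetic as a small Python sketch. The inputs (200,000 machines serving search, 10,000 peak searches per second, a 0.4 second budget per query) are just the rough guesses above, not published figures:

    # Order-of-magnitude estimate of machines used per Google search.
    # All inputs are rough guesses from the text, not published figures.
    search_machines = 200_000        # machines assumed to serve search results
    peak_searches_per_sec = 10_000   # guessed peak query rate
    time_per_search_sec = 0.4        # assumed time budget per query

    # Searches in flight at any instant at peak:
    concurrent_searches = peak_searches_per_sec * time_per_search_sec   # ~4,000

    # Machines effectively devoted to each search:
    machines_per_search = search_machines / concurrent_searches         # ~50

    print(f"concurrent searches at peak: {concurrent_searches:,.0f}")
    print(f"machines per search: {machines_per_search:,.0f}")

The first step is just Little’s law: the number of queries in flight equals the arrival rate times the time spent on each query.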

I’d be very interested to know what fraction of the world’s computing power is contained in such specialized supercomputers, versus the fraction in personal computers. Even more interesting would be to see a graph of how this fraction is changing over time. My guess is that at some point in the not-too-distant future most computing power will be in specialized services like Google, not personal computers.


Biweekly links for 04/18/2008

Click here for all of my del.icio.us bookmarks.


Biweekly links for 04/14/2008

Click here for all of my del.icio.us bookmarks.
