My Photo

Peeps

« Wanted: Studs | Main | I'm Cool De La. I'm Cool De La!!! »

Fisher

(The following post is cut-n-pasted from an email conversation I had recently about a Personal Search Engine with Jeff Barr. I'm posting it on my typepad so that I can use Google to find these notes again. I note the irony that if I had a Fisher, I wouldn't need to blog it publicly to be able to find it with certainty later when I need to find it.)

I've been thinking long and hard lately about finding things in my (20 Gigabytes representing 15 years of) email archives (or, as Rohit calls it, The "Search My Mail D*mnit" Problem) so I can properly deal with Ham. In the email must evolve context, I think ZOË-as-Personal-Server points in the right direction.

I find myself wondering if there is still an opportunity to launch a desktop search product that fits the classic definition of platform. The equivalent of a "Browser" for the next decade that brings together existing disparate tools by mixing SMTP and HTTP and throws in a healthy dose of instant messaging and RSS -- except that instead of browsing for information it lets you go (for lack of a better word) "Fish" for information. It's got a simple browser interface and query language (like Google), is lightning fast (due to regular re-indexing), and offers search results of your personal stuff in that simple UI.

Rohit is three steps ahead of me here -- that any good "Fisher" of all your emails, IM's, desktop files, web history, and RSS feeds needs a great algorithm to rank the results of your "Go Fish" queries. (Ranking is something ZOË doesn't do, and therefore it cannot handle the volumes of email I receive daily.)

A "Fisher" isn't a replacement for our existing PIM's and browsers and IM clients and RSS readers, in the same way that Google doesn't replace the Web. Actually, Google is a good analogy since it provides a set of ranked results for any given query of the Public Web. But Google is an anonymous search of the Public Web of information. Fisher, by contrast, provides a set of ranked results for any given query with a personal search of your Private Web of information. It's a customized search of your personal stuff.

I don't have a better sketch of the opportunity yet, but if it works for your personal stuff anything close to the way Google works for public stuff, I think it's a killerApp just waiting for an auteur to get around to writing it. The platform play of course is that it is scriptable -- not just the query engine but the "tap on your ethernet" which can watch HTTP and SMTP traffic crossing your machine and do things on your behalf based on inspecting all the data that crosses through.

I have to think about it. Like I said, I'm sure Rohit is three steps ahead of me on this one... as was jwz six years ago in his write-up of Intertwingle (and short discussion), "a potential project to make it easier to deal with a massive volume of personal messages: excavating, traversing, relating, reporting, annotating. Intertwingle can be seen as a unification of a search tool and an address book. It is not, however, a mail reader. The presentation of query results could be done through a mail reader, but the intention is that ones choice of mail reader should be orthogonal to the use of this tool. The two kinds of tools just happen to operate on the same data." Might this be what the Twingle effort of Kasei is all about?

Update, 3/4/2004 at 5am. Rohit emailed me a description of Fisher in his own words:

Personally, I'm getting very aggravated by the irony that it's easier to find stuff on the Internet than on my own PC. The vast majority of this problem is email, specifically. And while email is impoverished in hyperlinks, ruling out the sorts of PageRank algorithms web search engines use, it is very rich in social network information. The correspondence information can help us choose which bits of text are likely to be the most relevant hit for a query, because it does matter who said what.

Admittedly, this may have something to contribute about searching multiple-agent discourses in general -- anywhere you can clearly identify authorship of a snippet, and thence calibrate which authors a user reads most.

Of course, if we simply ranked people by frequency of interaction, it would be kind of boring. Frame it as a simple principal-eigenvector problem -- count who reads your readers, and so on -- and interesting patterns emerge. It could well be as useful as PageRank itself was, by comparison to text search alone.

Imagine the Google UI for your own PC. You aim your web browser at localhost and get back a results page that looks eerily familiar, but the hits are actually documents, mails, photos, and cached web pages from your own personal archive. This means nailing the challenges of grouping similar results, generating short excerpts, converting file formats for indexing, and so on. There's more magic to how it installs -- you don't change your email client at all! -- but that's just more technology.

I believe Intertwingle is a good early manifesto about Fishers as a product category, and I think that ZOË and X1 are good first-generation instantiations of Fisher-as-a-product. Looking around SearchTools.Com I don't see any other Fishers... yet.

See also: Fisher as a product category, Lookout, Software!, Fluffy Bunny (aka Google Desktop).

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d834559b4c69e200e5505e9f918833

Listed below are links to weblogs that reference Fisher:

» Personal information assistant from Notes from Classy's Kitchen
As previously mentioned, our personal information space is a shambles compared to the published information space of the web. A... [Read More]

» "Fluffy Bunny" is a WinSock-puppet from The Now Economy
We love Google's new Desktop Search. We've been arguing about something like this for a year or more. The idea of searching everything you've seen — not just your hard drive, but everything hyperlinked to it (such as your surfing... [Read More]

Comments

"Imagine the Google UI for your own PC."

You don't have to image, you just need to buy a Google Appliance. See http://www.google.com/appliance/ for details.

For the money it would cost me to buy a Google Appliance, I could buy a car. A really nice car. :)

Plus, I'm not convinced the Google Appliance is useful for spidering and indexing a single desktop PC. I'm willing to be convinced that I'm wrong, though. :) :)

I'm sure somewhere in Google Labs there is a desktop indexer being worked on that will eventually be integrated into Google Toolbar/Deskbar. It would be the perfect addition for those tools, and throw AdWords into the mix and they have a whole new source of revenue.

Somewhere in Google Labs they're also curing world hunger, drafting plans for world peace, and prototyping a new delicious type of curry that is code-named blue curry (red, green, and yellow curries already exist, so once blue is perfected, Google's marketing team will be able to spell their entire logo in curry!).

I believe there's nothing the Google folks can't and won't do. It's all just a matter of time.

Not that that will stop Microsoft, who just demo'd hard-drive searching:

MSN vice president Yusuf Mehdie made it clear that Microsoft will be entering the local hard-drive search arena, which analysts see as the next big battleground among search-engine providers. "The Web today is roughly 20 percent to 30 percent of all crawlable material," Mehdie said. "If you can crawl it, you can find it. But you can't find things on intranets, you can't find [things on] deep Web databases, you can't find stuff on your local PC very easily. It's certainly not an integrated experience."

"We have put together a working version of Local PC File Search," Mehdie told analysts and journalists on hand for the demonstration of the latest Microsoft search technology. "Watch how fast it works."

Mehdie then proceeded to demonstrate a desktop search tool that is designed to enable users to find e-mail messages, Word documents and other types of files on their computers. In the demonstration, the search results were returned almost instantaneously. Mehdie did not specify a release date for the new technology.

The race to build a great Fisher has started, but like the race to build a great Browser, this will be a 10-20 year war...

The most promising fisher I use:

http://www.lookoutsoft.com/

Was just bought by M$.

Marcel, thanks for your note.

I believe Microsoft's purchase of Lookout speaks to big investments in Fishers by not just Microsoft but also Google and Yahoo in coming years...

Also, TidBITS explains how Apple's Tiger OS in 2005 will have a Fisher called Spotlight:

"With Spotlight, Apple aims to make it significantly easier to find data already on your hard disk. Spotlight won't just search filenames and content, as Mac OS X can do now; it will also be able to gather and search through metadata, much as iTunes and iPhoto can do with Smart Playlists and Smart Albums. Spotlight will power additional smarts: Smart Folders in the Finder (which could let you overlay different organizational structures on top of the basic hierarchical file system we have now), Smart Mailboxes in Mail (letting you group the same set of messages in different ways), and Smart Groups in Address Book."

http://www.apple.com/macosx/tiger/spotlight.html

"It's good to see Apple acknowledging the need for more access to metadata about files and other data objects in the system, since as the amount of data we all accumulate increases, the more difficult it becomes to manage. Apple's metadata search engine will be able to extract some metadata from files automatically, and developers will be able to add their own metadata as well, making it possible to extend Spotlight's capabilities easily."

http://www.apple.com/macosx/tiger/searchtechnology.html

This will be a long race, like the browser race, spanning a decade, and the entrants have only begun to think about the problems associated with making a fast, useful Fisher.

I know, and I'm thrilled... so far the Desktop Google has not disappointed me...

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Music

Reading

  • John Battelle: The Search

    John Battelle: The Search
    My favorite book of 2005. Period.


    (*****)

  • Steven D. Levitt: Freakonomics : A Rogue Economist Explores the Hidden Side of Everything

    Steven D. Levitt: Freakonomics : A Rogue Economist Explores the Hidden Side of Everything
    "Just because two things are correlated does not mean that one causes the other. A correlation simply means that a relationship exists between two factors -- let's call them X and Y -- but it tells you nothing about the direction of that relationship. It's possible that X causes Y; it's also possible that Y causes X; and it may be that X and Y are both being caused by some other factor, Z.

    Economics is, at root, the study of incentives: how people get what they want, or need, especially when other people want or need the same thing.

    Incentives are the cornerstone of modern life. The conventional wisdom is often wrong. Dramatic effects often have distant, even subtle, causes. Experts use their informational advantage to serve their own agenda. Knowing what to measure and how to measure it makes a complicated world much less so." (*****)

  • Malcolm Gladwell: Blink

    Malcolm Gladwell: Blink
    A book of anecdotes about the power of thinking without thinking; this book is a more interesting read than Gladwell's previous, The Tipping Point.

    New York Times: "Gottman believes that each relationship has a DNA, or an essential nature. It's possible to take a very thin slice of that relationship, grasp its fundamental pattern and make a decent prediction of its destiny. Gladwell says we are thin-slicing all the time -- when we go on a date, meet a prospective employee, judge any situation. We take a small portion of a person or problem and extrapolate amazingly well about the whole."

    David Brooks, who wrote that review, adds: "Isn't it as possible that the backstage part of the brain might be more like a personality, some unique and nontechnological essence that cannot be adequately generalized about by scientists in white coats with clipboards?" (*****)

  • Paul Graham: Hackers and Painters

    Paul Graham: Hackers and Painters
    I don't agree with some parts of this book, but I truly loved reading it, and it really made me think. I referenced it in my weblications and superhacker and phoneboy posts. Favorite chapter is How to Make Wealth. (Thanks, Ev.) (*****)

  • Joel Spolsky: Joel on Software

    Joel Spolsky: Joel on Software
    Joel is really good at wielding "diverse and occasionally related matters of interest to software developers, designers, and managers, and those who, whether by good fotune or ill luck, work with them in some capacity."

    Joel on Software embodies the principle of "Welcome to management! Guess what? Managing software projects has nothing at all to do with programming." This book, a compendium of the website's wisdom, is useful for everyone from team leads estimating schedules to software CEOs developing competitive strategy. (*****)

  • Bruce Sterling: Tomorrow Now: Envisioning The Next Fifty Years

    Bruce Sterling: Tomorrow Now: Envisioning The Next Fifty Years
    Bruce wrote this book to come to terms with seven novel aspects of the twenty-first century, situations that are novel to that epoch and no other. It's about future possibilities.

    "This is the future as it is felt and understood: via human experience... The years to come are not merely imaginary. They are history that hasn't happened yet. People will be born into these coming years, grow to maturity in them, struggle with their issues, personify those years, and bear them in their flesh. The future will be lived." Here here, well-spoken, Bruce. (*****)

  • The World's 20 Greatest Unsolved Problems: John Vacca

    The World's 20 Greatest Unsolved Problems: John Vacca
    "Science has extended life, conquered disease, and offered new sexual and commercial freedoms through its rituals of discovery, but many unsolved problems remain...

    If support for science falters and if the American public loses interest in it, such apathy may foster an age in which scientific elites ignore the public will and global imperatives." (*****)

  • Paul Hawken, Amory Lovins, L. Hunter Lovins : Natural Capitalism: Creating the Next Industrial Revolution

    Paul Hawken, Amory Lovins, L. Hunter Lovins : Natural Capitalism: Creating the Next Industrial Revolution
    I had the pleasure recently of meeting Amory Lovins and hearing him talk about Twenty Hydrogen Myths and the design of hypercar. (He also talked about Bonobos... wow.) I'm a convert to the way of thinking espoused in Natural Capitalism. I used to be cynical about the future, but Amory's work has made me a believer that many great things are about to come. The best way to predict the future is to invent it. (*****)

  • Merrill R. Chapman: In Search of Stupidity: Over 20 Years of High-Tech Marketing Disasters

    Merrill R. Chapman: In Search of Stupidity: Over 20 Years of High-Tech Marketing Disasters
    In hilarious prose, this book catalogs lots of stoopid high-tech marketing decisions. It offers clear, detailed analysis of many a marketing mishap, with what happened, why, and how to avoid such stupidity. Might just be the best. book. ever... (*****)

  • Paul Krugman: The Great Unraveling: Losing Our Way in the New Century

    Paul Krugman: The Great Unraveling: Losing Our Way in the New Century
    A book exposing the pitfalls of crony capitalism, from corrupt corporations straight up to the executive branch of our government. Krugman is nonpartisan -- what he exposes is foolish short-term thinking on the part of recent United States policies. The patriotic thing to do, he advises, is to fix these economic problems now before they become much harder to solve.

  • Henry Petroski: Small Things Considered: Why There Is No Perfect Design

    Henry Petroski: Small Things Considered: Why There Is No Perfect Design
    "Design can be easy and difficult at the same time, but in the end, it is mostly difficult." (*****)

  • Alexander Blakely: Siberia Bound

    Alexander Blakely: Siberia Bound
    One of my favorite books of the past few years. Xander is a master storyteller. (*****)

  • Susan Scott: Fierce Conversations

    Susan Scott: Fierce Conversations
    How to make every conversation count. One of my favorite books of the last decade. (*****)

Blog powered by TypePad
Member since 08/2003