« Things I know are true | Main | Can motion induced blindness be used to target for the logical possibility of zombies? »

March 24, 2005

So Yahoo buys flickr with which to beat Google

Cnet article on Yahoo's acquisition of flickr

Metafilter thread on same topic

Cory Doctorow's notes from the Folksonomy forum at the 2005 Emergent Technology Conference (Etech) (Doctorow writes for boing boing)

Many 2 Many's notes on the Folksonomy forum at Etech

I went looking for the older thread on folksonomies at Many 2 Many where Clay Shirky and Lou Rosenfeld duke it out, but heck, every freaking single post at Many 2 Many these days covers this topic, so just go read them all. Especially look for the articles on "Emergent Semantics" and "One World, Two Maps".

I think the thing that annoys me the most about "folksonomies" is the confusion of lay people (and professionals!) around what users are actually doing when they "tag" images at flickr. They are not, as the Cnet article suggests, "classifying". The whole notion of the community of flickr users regulating a common semantic vocabulary ala Wikipedia is absurd. There is no more wisdom in the flickr crowd than in us collective google users and pagerank spammers, only a deep abiding interest in porn and other lowest common denominators. Even flickr's peer pressure technology threatens to get as ugly as that high school popularity contest you've been repressing all these years.

If you talk to the flickr guys, especially the President of Ludacorp Stewart Butterfield, they will describe for you a very different, very personal "labelling" effort, much closer to what you see at Google's gmail. People are not collaboratively creating and defining buckets into which they deposit their content so that the unknown, average user can later find images via the shared semantic vocabulary. When you spell out this misconception it really takes on its full ridiculousness. By that I mean, who but librarians actually takes an interest in the information retrieval needs of their fellow web denizens?

What people are mostly doing at flickr is attempting to attach labels to their content that will most closely match search terms they themselves will use later to find their own stuff. Butterfield describes it as (from Cory Doctorow's ETECH notes):

Stewart: It's not really categorization on Flickr -- it's about letting users remember. If I add the "Norma" tag to pix of my mom, whose name is Norma, I don't think it goes into the Norma category. The unfortunate thing about the term "folksonomy" is that it implies that it's a replacement for categorization. People categorize things by noting what they do or don't have: mammals have hair and live babies; does it have property a? then it's a whatever.

As Butterfield says, its a very personal effort. What gets created out of this rather large, unappreciated effort is a large number of personal external memory systems. Individual users know that they'll forget the specifics of the image, but they'll want to be able to retrieve it, so they try to anticipate what words they might use later to search for the item. These words may have absolutely nothing to do with any kind of description of the content, format, or any of the other traditional cataloging facets. They are likely not to be the same words that anyone else would use. And yet, many of the pundits, experts and gurus are convinced that a "semantics" will "emerge." Y'know, just like it did at Google.

The average uninformed user has a snowballs chance in hell of conducting decent research in flickr, or at Google (this may be the only point on which Michael Gorman and I agree). The most they can do is enjoy serendipity. Folks like Clay Shirky who claim folksonomies to be the wave of the future make me almost as upset as Google itself, which is guilty for turning the world wide web into a high school popularity contest and convincing people to settle for "good enough." I can't decide who is worse, Google for convincing people to settle for the least because it takes zero effort, or Shirky for telling librarians to quit because the Google driven economics won't support their effort.

An example of "good enough" thinking, again from Butterfield (from Cory Doctorow's ETECH notes):

Stewart: The objective of tags shouldn't be to exhaustively cover the field -- we'll have a million photos of Tokyo, and if the TOKYO tag only gets you 400k of them, it's OK. You're only going to look at 20 of them anyway.

This is why academics have problems with Wikipedia. Their students, the ones using Wikipedia to conduct research, get the wrong idea about how you go about the Scientific Method, and how you conduct exhaustive research in any given subject area.

And I wish I could have gone to the Etech and SXSW conferences, seems like everybody in the technology biz is preoccupied by metadata these days. I wonder when the cool kids will find my weblog.

Posted by MetaMetadata at March 24, 2005 06:03 PM | TrackBack
Comments

Ah, the wonder of trackbacks!

Good post, and the distinction between tagging and classifying is important. I have to disagree with you on this point, though...

People are not collaboratively creating and defining buckets into which they deposit their content so that the unknown, average user can later find images via the shared semantic vocabulary. When you spell out this misconception it really takes on its full ridiculousness. By that I mean, who but librarians actually takes an interest in the information retrieval needs of their fellow web denizens?

Actually, while I think this is (mostly) true for del.icio.us bookmarks, it's less true for Flickr. There's a great deal of "bucket creation" in the tagging there. For example, the "me" tag, or the "whatsinmybag" tag...both are designed not for personal information retrieval, but for facilitating access by others. Color tags, too. I went back and tagged a lot of my photos with their dominant colors when I saw how cool it was that such tagging could facilitate things like this.

Posted by: Liz Lawley at March 24, 2005 07:03 PM

Apparently I can't post a comment with a URL in the box; I get a "Your comment could not be posted due to questionable content" message...which seems to indicate that the colon is the problem. Might want to check your blacklist.

Posted by: Liz Lawley at March 24, 2005 07:07 PM

look also at squaredcircle tag on flickr another "bucket"

Posted by: waltzzz at March 25, 2005 03:01 AM

I wonder about the users motive for applying tags that can be used by others to retrieve useful information. I find it hard to imagine the bulk of flickr users applying the "me" tag to photos of themselves with the idea that others will want to find portraits of random human beings. If they wanted to provide access to others, I imagine that they would use "portrait." "Me" doesn't mean "random other people" to me and I wouldn't have discovered that it did without assistance. That it does in flickr represents a whole new semantic vocabulary to be discovered and disseminated. If flickr users are using "me" and intend others to find their self portraits then the professionals (that's us) need to rethink the way we organize information. "Me" doesn't follow the rules of semantic description we've been trained as catalogers to apply in indexing this sort of content.

I've heard of squaredcircle as a community of flickr users and intend to explore their tagging conventions. I want to look for evidence in the squaredcircle group of "emergent semantics." The Many 2 Many blog contains a good academic introduction to this question of whether these folksonomies can actually produce shared vocabularies from the bottom up (like "me" for self portraits) or whether groups like squaredcircle actually represent the introduction of rules for information organization from the top down (like "Portrait photography" or "Self-portraits" for self portraits). I'm guessing that squaredcircle represents the latter, but the discovering evidence of the former and negotiating the right mix of the two organizational approaches is the next step.

Posted by: metametadata at March 30, 2005 02:01 AM