Posts tagged: keyword search

Dec 26 2007

Monopolies, Libraries, and Challenges


A somewhat rambling essay, but one that is important nonetheless:

Joe Wilcox has posted an interesting essay at Microsoft Watch regarding Google’s merger with DoubleClick, the internet advertising company.  I strongly disagree with some of his interpretations (he tries to have it both ways, and by defending Microsoft and chastising Google, he simply muddies the water), but the essay has me thinking about the good and bad of monopolies in libraryland.

First, is the love-hate relationship I have with “monopolies”.  Oftentimes a monopoly reduces choices for the user/consumer, and oftentimes the litmus test for this is whether the company/organization channels its energy towards preventing competition, rather than out-performing competition.  Efforts towards providing a better product/service than one’s competitor are rarely in vain.  Even if a company fails, the level of product/service is usually improved across the board.

Next, the concept of open standards is, for better or worse, tied up with monopolies.  A group with a monopoly is able to set standards much more effectively.  If the standards are set in a fair manner, i.e. not simply to prevent competition against one’s own product/service, then the monopoly can actually be more efficient.  If not, it isn’t truly an open standard, as much as it is a proprietary standard.

Libraries, then… we are swimming in a sea of standards, and companies that create them.  We are living with standards that work only for us, such as MARC, and aren’t of much (if any) benefit outside libraries.  The bibliographic information contained within them is of great benefit and value, but the standard is not very useful.

However, so much of our energies are tied up in this standard (and others, if we think about it), and it is dragging us down.  It is important to understand that the information is what has value; the value in how we store and access it is reflected in the ease of use, and the interest in using that storage/access method.

MARC has lost it’s luster, and we should move forward.  The information, however, is more valuable than ever, and we need to figure out how to maximize this value.  Making it easy for everyone to use, not only libraries, should be our top priority.  When Amazon or Google (or companies/groups like them) really want to access our bibliographic records, and use their structure, this will be when we know we have fixed the worst of our problems.  Is FRBR/RDA the answer?  I suspect not, simply because a new way needs to be much easier to describe and apply.

Google is, and has been for a while, the 800 pound gorilla in the search business.  This came about because their search tools were, and are, simply better than their competitors.  I don’t think this will last forever, but there are many benefits to their dominance.  They are able to set “standards” for web design that encourage compliant web site design and discourage  link farms and spam sites.  They have mastered, to a large extent, the art of interpreting the keyword search.  People now think in keywords when they search (which is why the natural language search engines are languishing in obscurity).

In libraryland, OCLC is our 800 pound gorilla.  When they come out with something new (and the last couple of years have been fantastic, with WorldCat leading the way), libraries pay attention.  If they set a particular course, it makes a great deal of sense to follow that same path.

Is this the best way, though?  Should the 800 pounders lead the way in information discovery?  How might they prevent innovation from happening, or are we doing that to ourselves already?  Is the slow pace of FRBR/RDA a reflection of the size of the beast as it slouches towards Bethlehem to be born, or simply the complexity of the solution?

One thing I have noticed on many blogs and listservs is that we love to talk about what is wrong and right about libraries and technology and search, but it is usually individuals and small groups taking the lead and deciding to blaze a new trail.  Open-ILS and LibraryThing are but two examples of dozens where people saw a need and decided to take charge of fulfilling it.

Why haven’t we come up with a new way to deal with bibliographic information?  Does one person, or a group, need to simply decide to do it?  The library community seems to be spinning its wheels on the issue, so perhaps this is the case.

Who wants to take on the challenge?

  • Share/Bookmark
Aug 29 2007

Resignation (not mine, though)


Resignation is a very thought provoking, albeit somewhat depressing, post by Alexander Johannesen on the Shelter It blog.  I have been also reading posts by him, very well presented, on the Next Generation Catalog for Libraries (NGC4Lib) listserv (where I found the link for this topic).

In the post, he discusses how the library world isn’t doing enough with what we have, and what we are doing isn’t being applied in the right way.  His points are well made, and worth passing along, but I don’t feel that he is hitting the nail quite on the head.

There is a lot we can be doing better.  We are not in enough control of our future, and it is costing us time, money, and people.  We cannot afford much within any of these categories.

However, we have gone through a great deal of change in the past 30 years (just look at the effect of computers alone), much more than could have been predicted.  Change is stressful; we are a stressed profession.  Change is necessary, though, and we must focus on changing our world to gain control, independence, and flexibility.

The next 30 years will not be forgiving ones, and I would hate to think of us becoming even less relevant in a world that is increasingly becoming enamored of the Google-type keyword search as being the end-all in retrieving knowledge.  This is a real possibility.

Read his essay; take it to heart.  Don’t resign yourself, though.  Become determined to direct change to everyone’s benefit and to make libraries better.

  • Share/Bookmark
Aug 28 2006

Metadata


In the 15 July 2006 issue of Library Journal, Jeffrey Beall writes a passioned defense of metadata against the forces of keyword searching. Much of what he says is valid, and I agree that metadata is necessary for effective storage and retrieval in the electronic age.

However, near the end of his essay he states:

There is also the problem of synonymy. For example, if a searcher needs information about plant science, but the best resources call it botany, then the searcher will likely be unsuccessful in his search. Our language is rich, and we often use many precise terms to represent a single concept. Full-text searching, however, is inherently imprecise in its execution.

This, to me, actually strikes me as one of the greatest challenges with the use of metadata: the need to know a controlled vocabulary. The average library user doesn’t necessarily know that botany, or cookery, or numismatics are the proper terms for subject searches, as opposed to more commonplace words.

Modern OPACs have plenty of “see” and “see also” examples, but this is only truly useful if the effort has been made to make the connections as complete as possible. I tend to use subjects only through the links available through results… results that I usually have reached by a keyword search. I like to tell patrons that, once you find a good result, track the subject headings to find other items, then check the shelves in each of the call number areas in which you found results.

The essay is well worth reading; we have a tendancy to forget the power of a controlled vocabulary and metadata, and it would be a shame to toss them aside in favor of the broad stroke of the keyword.

article discovered through Catalogablog

  • Share/Bookmark
FireStats icon Powered by FireStats