WebSearch


Think of Middlespot as a Zotero-like organization tool for web searches.  Perform a search, and you have several ways to explore the search results, as well as saving collections of results in what they call a “Workpad”.

The fastest way to learn what this can do is to view their slideshow tutorial.  If you think it has potential, then don’t forget to check out their tools (including a way to incorporate Google search results into your workpad).

from MetaFilter

A Useful Amplification of Records That Are Unavoidably Needed Anyway is an essay by Brett Bonfield which, dare I phrase it this way, usefully amplifies several of the major web-based entities which are intertwined with libraries.  These include (but aren’t limited to) OCLC’s WorldCat, Amazon, and LibraryThing.  Brett clearly understands libraries, and does a great job detailing the interrelationships between all involved.

Not directly related to the essay, LibraryThing has posted an expansion of their Common Knowledge fields for Authors and Events.  This is an interesting read, as it addresses in a real-world way the need for authorities and relationships.

About 2 million images from the LIFE Photo Archive have been digitized and are available via Google Image Search.  What is even better is that, over the next few months, the remaining 8 million images from the collection will be added.  Many of these have never been published, so there ought to be some historically interesting and significant gems in this collection.

You can search the collection by using the above link, or add “source:life” (without the quotes) to your image search query.

via the Official Google Blog

The 2008 Presidential Election Search Engine is a Google Custom Search Engine created by the University of Kansas Library.

It is a great resource in that one can find the needles of information in the haystack of campaign rhetoric.

found via “Organizing the 2008 Presidential Election”, from College and Resource Libraries, October 2008, p. 540.

APML stands for Attention Profiling Markup Language.  Its purpose is to permit a standardized way to gather and transfer your interests from site to site across the web.  It is built using XML, and is definitely a new technology that may or may not catch on.  It could become an important element of using the web, because of the benefits it offers to users.

For example, Amazon.com tracks what you purchase, what is on your wish list, and even what you view (while logged in).  If you incorporated this into your APML file (which could be stored with an online service, on your computer or flash drive, etc.) your interests could be applied to other APML-compatable websites.

This has the potential, as with much web 2.0 technology, of creating privacy issues and complications.  It also could save users an incredible amount of time and effort, as resources relating to their interests would gain relevance and become much easier to find.  Think of the potential for searching news sites, link services such as Digg, and many other specialized discovery tools.

And yes, as John Fudrow suggests, it has a lot of potential for libraries.  Imagine how our users could benefit from this, especially when you add “suggested by others who liked this” and other social connectivity tools.

UNdata is a search tool for the many informational databases that the United Nations maintains. It is straightforward, easy to use, and effective in attaining what you need.

If only the UN as a whole worked so well ;-)
via OSDir

Intelways is an interesting search site.  It isn’t a meta search tool, but it does harness multiple search engines to create an improved experience.

As with many things on the web, the best way to understand it is to try it.  Enter a search term, then select a category above the search bar, or a specific search tool below the search bar.  This tool makes it easy to perform a search one one engine, then switch to another to see a different set of results without having to retype the query or load the search pages.

One note to add, however: this site has been around for a while, but has changed its name a couple of times.  Follow the link below if you are curious about the history of the site.

from ResourceShelf

Wikia, the search engine designed around the concept of wikis, has been launched in alpha.  This means that a) it isn’t going to work as well as anyone wants, and b) no fair comparing it to Google et. al., except in the theoretical.  In fact, we usually wouldn’t see software until the beta stage of development; this shows how much Wikia will rely on users.

There isn’t all that much to see yet, but the idea of an annotated search engine where the users provide feedback about the search results, which will then be used to improve future results, is tantalizing.  Search 2.0 anyone?

via Open Access News

USA.gov Mobile is a great little search tool for government web pages and toll-free numbers (and it is well designed for your PC, even though it is made for mobile browsers).

found via ResourceShelf

I have encountered a few interesting items relating to online, full-text books during the past few days, and thought they would make a good snapshot of where things stand at this time:

A somewhat rambling essay, but one that is important nonetheless:

Joe Wilcox has posted an interesting essay at Microsoft Watch regarding Google’s merger with DoubleClick, the internet advertising company.  I strongly disagree with some of his interpretations (he tries to have it both ways, and by defending Microsoft and chastising Google, he simply muddies the water), but the essay has me thinking about the good and bad of monopolies in libraryland.

First, is the love-hate relationship I have with “monopolies”.  Oftentimes a monopoly reduces choices for the user/consumer, and oftentimes the litmus test for this is whether the company/organization channels its energy towards preventing competition, rather than out-performing competition.  Efforts towards providing a better product/service than one’s competitor are rarely in vain.  Even if a company fails, the level of product/service is usually improved across the board.

Next, the concept of open standards is, for better or worse, tied up with monopolies.  A group with a monopoly is able to set standards much more effectively.  If the standards are set in a fair manner, i.e. not simply to prevent competition against one’s own product/service, then the monopoly can actually be more efficient.  If not, it isn’t truly an open standard, as much as it is a proprietary standard.

Libraries, then… we are swimming in a sea of standards, and companies that create them.  We are living with standards that work only for us, such as MARC, and aren’t of much (if any) benefit outside libraries.  The bibliographic information contained within them is of great benefit and value, but the standard is not very useful.

However, so much of our energies are tied up in this standard (and others, if we think about it), and it is dragging us down.  It is important to understand that the information is what has value; the value in how we store and access it is reflected in the ease of use, and the interest in using that storage/access method.

MARC has lost it’s luster, and we should move forward.  The information, however, is more valuable than ever, and we need to figure out how to maximize this value.  Making it easy for everyone to use, not only libraries, should be our top priority.  When Amazon or Google (or companies/groups like them) really want to access our bibliographic records, and use their structure, this will be when we know we have fixed the worst of our problems.  Is FRBR/RDA the answer?  I suspect not, simply because a new way needs to be much easier to describe and apply.

Google is, and has been for a while, the 800 pound gorilla in the search business.  This came about because their search tools were, and are, simply better than their competitors.  I don’t think this will last forever, but there are many benefits to their dominance.  They are able to set “standards” for web design that encourage compliant web site design and discourage  link farms and spam sites.  They have mastered, to a large extent, the art of interpreting the keyword search.  People now think in keywords when they search (which is why the natural language search engines are languishing in obscurity).

In libraryland, OCLC is our 800 pound gorilla.  When they come out with something new (and the last couple of years have been fantastic, with WorldCat leading the way), libraries pay attention.  If they set a particular course, it makes a great deal of sense to follow that same path.

Is this the best way, though?  Should the 800 pounders lead the way in information discovery?  How might they prevent innovation from happening, or are we doing that to ourselves already?  Is the slow pace of FRBR/RDA a reflection of the size of the beast as it slouches towards Bethlehem to be born, or simply the complexity of the solution?

One thing I have noticed on many blogs and listservs is that we love to talk about what is wrong and right about libraries and technology and search, but it is usually individuals and small groups taking the lead and deciding to blaze a new trail.  Open-ILS and LibraryThing are but two examples of dozens where people saw a need and decided to take charge of fulfilling it.

Why haven’t we come up with a new way to deal with bibliographic information?  Does one person, or a group, need to simply decide to do it?  The library community seems to be spinning its wheels on the issue, so perhaps this is the case.

Who wants to take on the challenge?

Bernhard Eversberg of the Braunschweig University Library (or Universitätsbibliothek Braunschweig ) has created a very neat, and very useful online tool : a Library of Congress Subject Headings Browser.

Try it out; put in some of your favorite gnarly subject terms. Try various sub-headings. Enter a LC Classes to see the subject headings in shelf order. Enjoy!

found via the ngc4lib listserv

GRDDL, a new recommendation from the W3C, is short for “Gleaning Resource Descriptions from Dialects of Languages”, a means by which software can extract semantic information from a variety of web pages.

In other words, rather than having to rely on the code (table structure, metadata, microformats, etc.) used by whoever created the page, GRDDL will attempt to align the content it finds with other existing data structures, and turns this into RDF. The content can then be compared and used with content from other sources.

For perhaps the best means of understanding how this can change the way we find and use information, check out the Scenarios that the W3C give as examples for how this could be used (scroll down to the various “Use Cases”).

I am a big fan of Microformats, and I see this a means to create a microformat checker service, similar to how web page creators can check their html/xhtml/css code. It would be neat to be able to run one’s pages against this in order to find out how best to structure and mark up the information contained within.

found on Catalogablog

LibWorm is a fantastic resource for those who are interested in libraries and blogs.  It is a search engine for library-related blog content, drawn from 1400 RSS feeds.

The Google Custom Search Engine (CSE) has just been made easier to set up and use.

The brief summary: create a page of links to web resources you feel are useful and appropriate for a given topic. Via Google, you generate a short bit of code that is added to the page, and you have a search tool that will let people do a full-text Google search on all of those resources — without you having to maintain anything other than the page of links. If you add or remove links, the CSE automatically adjusts the search to match. Techie description and details are here.

So if your library has lists of web resources, this may be a way to effectively let your patrons search them for the information they want.

from Google Librarian Central

They’ve been around for a while, but I haven’t blogged about them yet, and they keep adding great features, so here is another great site for finding info related to location:

Melissa DATA has links to resources that give you information based on zip code, street address, and more.  Want to find out if an address is valid?  Know the address number and zip code, but don’t remember the street?  Mail delivery routes?  School districts?  These searches and more are at your fingertips.

found at ResourceShelf

Jonathan Rochkind has an interesting commentary on this New York Times article about Google’s Algorithm titled Google Keeps Tweaking Its Search Engine.

I really don’t have much to add, except for the thought that we are too mired in the “algorithm” we currently have, which was built around small index cards in drawers, and need to recognize that there are many parts, facets if you will, to how we store and provide information. We need to be open to each of these facets and the ways we can use them.

Storage, Search, Presentation, Categorization… these are all parts of what we do; they do not have to all reflect the same structure, they just need to be able to interact.

Pipl is a search engine geared towards finding people, but it casts a wider net than other people search tools. It searches social networking sites, job sites, and blogs.

As you might guess, there are usually a lot of results to wade through, especially with common names.  That aspect aside, this site can be helpful for tracking down someone, or for seeing what your own web presence is.

from Search Engine Land

PlantFacts is a resource from the Ohio State University which provides a wealth of information about plants and horticulture through a plant web search, an image database, how-to videos, a collection of over 800 FAQs, and an illustrated glossary.

from ResourceShelf

Google Translate has a new feature:  it will allow you to search foreign language pages using your language’s search terms, and translates the results on the fly.

As with all machine translation, the results will vary; the interface is well worth trying out.

from Google Librarian Central

Next Page »