Online Databases


As an accompaniment to this and this, I am including the text of OCLC’s proposed policy which was posted briefly yesterday before being removed and replaced with “We are reconsidering some aspects of the policy. More information will be available in the near future.”

Debate and discussion are healthy, especially in world of libraries.  Understanding why this policy was proposed, and why it bothers some people, is critical to the understanding of who we are and where we are going.

The text located within this post has been copied from http://marc.coffeecode.net/oclc_2008_11_02/.  I did not see OCLC’s page, and cannot guarantee that it is unchanged.  I have no reason to believe that it isn’t what existed on OCLC’s site on Sunday.

Much of what is contained here is fair and reasonable; at issue, I believe, is the concept of controlling the data contained within OCLC records through licensing.  I feel that the removal of the policy from their website is a signal that OCLC is truly attempting to do what is right for all involved, and that they can craft a policy that will express that.  This is not the 800 pound gorilla that will get it’s way no matter what, but a tiger whose instinct is to protect its territory.  We should respect that territory, while at the same time respecting our own rights to the data at issue.

The text of the since-removed policy:

———————————————————————————————-

Policy for Use and Transfer of WorldCat® Records

The Guidelines for the Use and Transfer of OCLC-Derived Records have been updated to become the Policy for Use and Transfer of WorldCat Records. The policy is scheduled to become effective mid-February 2009, to give OCLC member libraries and other organizations time to implement any changes resulting from the update. Until that time, the Guidelines will remain in effect.
Policy for Use and Transfer of WorldCat® Records
A. Intent of the Policy

OCLC® encourages and supports the widespread, non-commercial use of WorldCat Records (as defined below) for scholarship and research in furtherance of innovation that complements OCLC’s products and services for the benefit of libraries, museums, archives and other cultural heritage institutions and their respective patrons by

1. promoting the evolution of libraries, archives, museums and other cultural heritage institutions, the use of their collections and the advancement of their professions;
2. increasing availability of library, archive, museum and other cultural heritage institution resources to individual users; and/or
3. furthering ease of access to and use of world-wide scientific, literary and educational knowledge and information.

This “Policy for Use and Transfer of WorldCat® Records” is designed to foster such use while protecting the rights of OCLC’s membership and its investment in WorldCat, and ensuring that the use of WorldCat Records hereunder provides a benefit to the OCLC membership. Benefit to the OCLC membership may take the form of reciprocal linking, metadata, remuneration, services-in-kind or other negotiated value. This Policy governs all Use and Transfer of WorldCat Records unless a separate written agreement is made with OCLC. Please read the entire Policy and the documents incorporated herein by reference to ensure full understanding of the Policy.

This Policy covers WorldCat Records as defined below. This Policy may also govern the Use and Transfer of WorldCat bibliographic data available through other services to the extent determined by OCLC. Please check the FAQ regularly for the applicability of this Policy to other services.
B. Definitions

1. “Policy” means this “Policy for Use and Transfer of WorldCat Records”, as modified by OCLC from time to time.
2. “WorldCat” is the OCLC online union catalog, an electronic database of bibliographic records and other information maintained by OCLC;
3. A “WorldCat Record” is a bibliographic record meeting one or more of the following criteria unless otherwise specified by this Policy:
1. a bibliographic record obtained directly from WorldCat through the use of an OCLC product or service; or
2. a bibliographic record (i) identified as Derived from WorldCat by the party from which the WorldCat Record is received; or (ii) which is otherwise known by the recipient to have been Derived from WorldCat at any time prior to receipt. Please see the FAQ for information on fields, subfields and other factors that can indicate whether a bibliographic record has been Derived from WorldCat.

The Use and Transfer of data extracted from a WorldCat Record is subject to this Policy whether or not the extracted data itself meets one or more of the criteria described in this Section B.3 and shall be included within the term “WorldCat Record”.
An OCLC Member or Non-OCLC Member may Use or Transfer the following without complying with this Policy: (i) a WorldCat Record designated in WorldCat as the Original Cataloging of the OCLC Member or Non-OCLC Member; or (ii) a bibliographic record which is not Derived from WorldCat whether or not the OCLC Member or Non-OCLC Member adds the OCLC control number to the record.
4. A bibliographic record in WorldCat is designated as the “Original Cataloging” of the agency represented in the OCLC MARC 040 field, subfield a (original cataloging agency).
5. The “FAQ” is the Frequently Asked Questions document providing more information regarding this Policy, as updated by OCLC from time to time, which is located at: http://purl.org/oclc/wcrup-faq.
6. An “OCLC Member” is an organization that is listed on OCLC’s membership roster as a Governing Member or Member of OCLC.
7. A “Non-OCLC Member” is any party (including an individual) who is not an OCLC Member.
8. A bibliographic record is “Derived” from WorldCat if it was copied or otherwise obtained from WorldCat at any time prior to receipt.
9. “Use” means use in accordance with the requirements and intent of this Policy without making a Transfer to another person or organization.
10. “Non-Commercial Use” means Use for the purposes of research, teaching, scholarship or private study provided such use is not Commercial Use.
11. “Commercial Use” means Use in any manner that supports, is intended for or directed toward or results in commercial advantage or monetary compensation, including, without limitation, any sale of WorldCat Records.
12. “Reasonable Use” means Use of WorldCat Records that is reasonable for the intended Non-Commercial Use and consistent with the intent of this Policy. Without limiting the foregoing, the term “Reasonable Use” does not include any Use of WorldCat Records that:
1. discourages the contribution of bibliographic and holdings data to WorldCat, thus damaging OCLC Members’ investment in WorldCat, and/or
2. substantially replicates the function, purpose, and/or size of WorldCat, for example for the purpose of providing cataloging services to libraries or other organizations. Please see the FAQ for a discussion of Z39.50 for cataloging using WorldCat-derived bibliographic records.
13. “Transfer” means conveyance to another OCLC Member or Non-OCLC Member by exchange, merger, sharing, gift, providing the capability to download or otherwise electronically copy or any other means.
14. “Commercial Transfer” means Transfer of WorldCat Records for Commercial Use by the recipient.
15. “Holdings” means OCLC institution symbols attached to a record, indicating libraries that own, license or otherwise provide access to the item described by the record. An OCLC institution symbol is a unique identifier assigned by OCLC to Members.
16. “WorldCat Record Use Form” means the form prescribed by OCLC for purposes of contacting OCLC with proposals for Use and/or Transfer of WorldCat Records which are not authorized by this Policy. The WorldCat Record Use Form is located at: http://purl.org/oclc/wcrup-form.

C. Use of WorldCat Records

1. Each OCLC Member and Non-OCLC Member may Use WorldCat Records in accordance with this Policy.
2. Subject to the restrictions set forth in this Policy, OCLC Members and Non-OCLC Members are granted the non-exclusive, world-wide, royalty-free right to:
1. reproduce WorldCat Records, incorporate WorldCat Records into works and base works on WorldCat Records; and
2. display WorldCat Records, works incorporating WorldCat Records and works based on WorldCat Records;
for Non-Commercial Use.
3. The following restrictions apply to Use of WorldCat Records:
1. Reasonable Use. Use of WorldCat Records authorized by this Policy is limited to Reasonable Use except as otherwise authorized by written agreement with OCLC.
2. Commercial Use Prohibited. Commercial Use of WorldCat Records is prohibited unless OCLC has entered into a separate written agreement with the party wishing to make such Use. [OCLC has a standard agreement available which authorizes the processing (for a fee) of WorldCat Records received from OCLC Members and Non-OCLC Members.] The WorldCat Record Use Form should be used to initiate discussions with OCLC regarding an agreement authorizing Commercial Use of WorldCat Records. OCLC may refuse to authorize Commercial Use of WorldCat Records, without liability, in its sole discretion.

D. Transfer of WorldCat Records

1.
1. Subject to the restrictions set forth in this Policy, an OCLC Member may Transfer WorldCat Records of its own Holdings to other OCLC Members and Non-OCLC Members for Use in accordance with this Policy.
2. A Non-OCLC Member may Transfer WorldCat Records of its own Holdings to OCLC Members and Non-OCLC Members under separate agreement with OCLC. The Non-OCLC Member wishing to make such a Transfer must submit a WorldCat Record Use Form to OCLC for approval of the proposed Transfer. OCLC’s approval of the Transfer described in the WorldCat Record Use Form: (i) must be in writing; (ii) may be conditioned upon agreement to additional terms and conditions determined by OCLC; and (iii) may be withheld by OCLC, without liability, within its sole discretion.
2. The following restrictions apply to Transfer of WorldCat Records:
1. Attribution. WorldCat and OCLC Online Computer Library Center, Inc. must be clearly identified as the source of WorldCat Records Transferred. Please see the FAQ for information about appropriate attribution of WorldCat and OCLC.
2. Modification Restriction. The OCLC number (if any), the link to this Policy and any additional means of attribution (besides the link to this Policy) may not be removed from any WorldCat Record.
3. Commercial Transfer Prohibited. Commercial Transfer of WorldCat Records is prohibited unless OCLC has entered into a separate written agreement with the party that will receive the WorldCat Records. The WorldCat Record Use Form should be used to initiate discussions with OCLC regarding an agreement authorizing Commercial Transfer of WorldCat Records. OCLC may refuse to authorize Commercial Transfer of WorldCat Records, without liability, in its sole discretion.
4. Copy of Policy. If an OCLC Member Transfers WorldCat Records of its own Holdings under Section D.1.a above, the OCLC Member will provide the party receiving the WorldCat Records with a copy of this Policy (or a link by which this Policy may be accessed) and indicate that Use and Transfer of WorldCat Records is permitted only in accordance with the Policy.
3. The Transfer of WorldCat Records by an OCLC Member or Non-OCLC Member which do not represent its own Holdings requires submission of a WorldCat Record Use Form to OCLC for approval of the proposed Transfer. OCLC’s approval of the Transfer described in the WorldCat Record Use Form: (i) must be in writing; (ii) may be conditioned upon agreement to additional terms and conditions determined by OCLC; and (iii) may be withheld by OCLC, without liability, within its sole discretion.
4. WorldCat Records, a work incorporating WorldCat Records or a work based on WorldCat Records which are Transferred may be Used by the recipient only under the terms of this Policy. Additional or different terms may not be imposed. Any link to this Policy in a WorldCat Record that is Transferred may not be removed, hidden, deactivated or obscured.

E. Additional Provisions

1. The rights to Use and Transfer WorldCat Records afforded by this Policy shall automatically terminate upon any breach of the terms of this Policy. The OCLC Member(s) and/or Non-OCLC Member(s) involved in any breach of this Policy shall provide such assistance as is reasonably requested by OCLC to remedy the breach.
2. WORLDCAT RECORDS ARE PROVIDED AND/OR MADE AVAILABLE “AS IS”. NEITHER THE ORIGINAL CATALOGING LIBRARY NOR OCLC WARRANTS THE COMPLETENESS OF WORLDCAT RECORDS.
3. Neither this Policy nor the transactions contemplated herein serve to transfer any ownership right or interest in or to WorldCat or WorldCat Records, including, without limitation, the intellectual property rights therein.
4. When an organization makes bibliographic information available to OCLC which is subject to Use or Transfer restrictions which are different than those set forth in this Policy, and OCLC nevertheless elects to accept the information for addition to WorldCat, OCLC will inform OCLC Members and Non-OCLC Members of the restrictions, and the rights to Use and Transfer such information will be subject to such restrictions.
5. Regardless of the source from which WorldCat Records are received, Use and Transfer of WorldCat Records is authorized solely by OCLC pursuant to this Policy. Accordingly, this Policy constitutes a direct contractual relationship between OCLC and the party obtaining WorldCat Records, and may be enforced by OCLC directly against such party.
6. OCLC has the sole discretion to determine whether any Use and/or Transfer of WorldCat Records complies with this Policy. In the event OCLC identifies a Use and/or Transfer which does not comply with this Policy, OCLC shall notify the relevant OCLC Member(s) and/or Non-OCLC Member(s) and such parties agree to work with OCLC to resolve the noncompliance.
7. This Policy is the final, complete and exclusive statement of the agreement of the parties with respect to the subject matter hereof. Once applicable to a given Use or Transfer of WorldCat Records, no provision of this Policy may be changed, modified or supplemented except in a written document signed by the parties. OCLC may issue a modified version of this Policy or a substitute for this Policy at any time and the modified or substitute version will apply to any Use and Transfer of WorldCat Records after the date of issuance (or other effective date specified by OCLC).
8. If any provision of this Policy is invalid or unenforceable under applicable law, it shall not affect the validity or enforceability of the remainder of the terms of this Policy, and without further action by the parties to this agreement, such provision shall be reformed to the minimum extent necessary to make such provision valid and enforceable. No term or provision of this Policy shall be deemed waived and no breach consented to unless such waiver or consent shall be in writing and signed by the party to be charged with such waiver or consent.
9. This Policy shall be governed by and interpreted in accordance with the laws of the State of Ohio and the United States of America, without regard to principles of conflict of laws, except (i) as otherwise provided in a separate agreement with OCLC which incorporates this Policy; or (ii) as otherwise required by applicable law.

OCLC released their updated Policy for Use and Transfer of WorldCat Records yesterday, with implementation scheduled for mid-February.  If you see the phrase

We are reconsidering some aspects of the policy. More information will be available in the near future.

then you might suspect that it created quite a fuss.  It did, and OCLC responded by removing the policy from their web site.

Someone saved a copy of the web page; I will include the text of the policy in the next post.

The core criticism of the policy changes seem to revolve around the licensing of the records.  OCLC planned to include a license statement in a 996 field (from Terry’s Worklog):

996 $aOCLCWCRUP $iUse and transfer of this record is governed by the OCLC® Policy for Use and Transfer of WorldCat® Records.
$uhttp://purl.org/oclc/wcrup

Limits would include use of the records in anything that “substantially replicates the function, purpose, and/or size of WorldCat, for example for the purpose of providing cataloging services to libraries or other organizations.”  Cataloging services aside, how large does a union catalog have to be before it replicates the function of WorldCat, namely finding a library that owns a particular book?  Ohio’s Ohiolink sized?  Illinois’ I-Share sized?  Georgia’s Pines sized?

What about the OpenLibrary project?  Or LibraryThing?  Using information derived from an OCLC record without including the OCLC number and other OCLC references (like the above statement) would violate their terms, as I read them.  However, the last time I checked, the data itself is not covered under copyright and should be able to be extracted and expressed in creative ways (as long as OCLC’s creative way of expressing the data is not infringed).

We need to recognize the difference between the data held in these records and the expression of that data.  To enter into license agreements that suggest that we cannot recognize the data separate from the expression goes against core library principles.  This feels, in some ways, like the Major League Baseball statistics copyright battle from a few years back, in which the idea of the data itself being public domain was upheld.

One final note:  keep in mind the who and how behind the creation of these records.  We create them.  OCLC gathers them, disseminates them, and puts them to mostly good use.  If OCLC didn’t exist, there would be a need for some organization with a similar purpose.  They are not evil, they are not out to get us; they are, however, overreaching at times.

originally found via Thingology

In an example of how much open access may change scholarship, it was announced the Dead Sea Scrolls will be scanned in high resolution and made available online and open access.

As more primary sources are made available for study, scholarship will become more democratic.  This will have both positive and negative aspects, but as with the greater availability of scientific studies, the primary result will be beneficial.

found via Open Access News

This is one of those resources which you hope will not have to be used, but when it helps someone find the information they need, it can be invaluable:

The Unidentified Decedent Reporting System (Warning:  site is definitely not for the squeamish) is the National Association of Medical Examiners online tool for posting information about unidentified bodies across the U.S.  It is a distinctly depressing web site, with hints of sadness and loss in dozens of lives, and the realization that for many of these people there are families and friends without answers.

Along with the Doe Network, this site can help to bring closure to tragedy.

from ResourceShelf

The Internet Search Environment Number (ISEN) is a fantastic idea.  On-line databases would be assigned their own, unique number, similar to an ISBN or ISSN, which would identify them as a resource.  This would aid tremendously in organizing and ultimately accessing these troves of information that, for the most part, belong to the deep web.

found via Catalogablog

100 Unbelievably Useful Reference Sites You’ve Never Heard Of is the title of a post on the TeachingTips blog. And yes, there are a lot of interesting and useful links to be found (though I suspect you may recall hearing about a few of them before…)

found via LISNews

Think of Infochimps.org as not only a wikipedia of data sets, but as potentially the greatest data mash-up tool yet.

Imagine having loads of census, weather, sports, and other statistical data available in one big database.  Then standardize the fields so that you can interconnect the data sets with each other.  From the Infochimps site:

“A central, community-driven repository solves these problems and presents amazing possibilities. Once we interconnect the datasets along concepts they share, instead of 100,000 datasets, there’s just one. Study the physics of baseball by comparing the hourly weather during every single baseball game to game outcomes. Uncover political campaign irregularities by comparing neighborhood per-capita income, historical voter trends, and public campaign finance records. Plan real-estate decisions based on what news-and-other-media keywords rank highly in each area.”

Still don’t see the possibilities?  Browse through the datasets that are already loaded.  Then check out what is coming in the near future.  This will likely be the first place we will want to go for statistical information, as long as it is fast and easy to extract what we need.  I am looking forward to seeing what they (or some other enterprising web designers) come up with to work with data on the web.

(does this sound vaguely familiar?  you might be thinking of Freebase.com, previously discussed here)

found via Open Access News

Go to WorldCat and type in a search.  The results contain books, video, audio, you name it.  One of the more recent additions is the Digital Images Collections, which can be wonderful to view.  How to find them, though?

Type in your search terms, then add cntnt (”Content” without the vowels) as an additional term (which acts as a limit).  Viola!

Try chicago cubs cntnt

Try airplanes cntnt

Try carnegie libraries cntnt

You get the idea…

found via the WorldCat blog

BookLamp.org is a web 2.0 application that does something new with book recommendations. Their approach is to avoid any book selling sites and focus only on responses from readers. This provides benefits when one thinks about libraries — people often don’t buy the books they are reading.

The other new approach is how they break down the book information : they create a chart showing, chapter by chapter, the Pacing, Density, Action, Description and Dialog within the book. Here is a chart for one of my favorite reads, To Say Nothing of the Dog (click the image for the larger version):

BookLamp.org Chart

In the end, this will show similarities beneath the surface, as well as justifying why (in my own opinion) there is a natural progression for readers to begin with Harry Potter, move on to Lord of the Rings (with a brief visit to Narnia), and then onto The Dark Tower series.

If this seems like an interesting approach, sign up for an account. There isn’t much there yet, but what is there looks really promising. I hope they will consider OPAC integration sometime in the future…

found via LISNews

Open Bibliographic Data : The State of Play is a post by Rufus Pollock at the Open Knowledge Foundation which examines a variety of sources for cataloging and database information.  He makes the observation, which I think is on target, that when it comes to bibliographic data

You might even think, given the public-spiritedness of librarians, that this is the kind of area where not only could it be openly available but it would be openly available….

There is a movement out there working to create an open repository.  It would make everyone’s job easier, especially if this repository could incorporate some type of authority checking by the users.  Imagine leveraging the same type of error checking that Wikipedia uses, but on our catalog information.

One can dream…

found via Open Access News

UNdata is a search tool for the many informational databases that the United Nations maintains. It is straightforward, easy to use, and effective in attaining what you need.

If only the UN as a whole worked so well ;-)
via OSDir

Kind of an interesting post on Open Access News over the weekend.  It seems that the license requirements for Elsevier’s electronic journals include printing a copy of the article, then scanning the article, before a library can provide that article as an interlibrary loan (ILL).

Part of my job is the scanning of articles for ILL, a task that is simultaneously interesting and monotonous.  I cannot think of any reason for their policy, except perhaps to make it inconvenient to provide their articles via ILL.  I will be watching to see if there is further developments….

Kind of an interesting post on Open Access News over the weekend.  It seems that the license requirements for Elsevier’s electronic journals include printing a copy of the article, then scanning the article, before a library can provide that article as an interlibrary loan (ILL).

Part of my job is the scanning of articles for ILL, a task that is simultaneously interesting and monotonous.  I cannot think of any reason for their policy, except perhaps to make it inconvenient to provide their articles via ILL.  I will be watching to see if there is further developments….

Data on Demand is a collection of data sets put together by the Milwaukee Journal Sentinel.  It contains data for a variety of topics, such as campaign fundraising, home foreclosures, popular baby names, and school instructional hours.

This is a great ready-reference list for libraries in Wisconsin… it would be even greater to compile a database of available database sets like these across the country (or even around the world)!

from ResourceShelf

In a long interview on the state of the semantic web, Tim Berners-Lee (if you don’t recognize the name, you should really read this) briefly discussed DataPortability, the ability to move information freely and easily from one site to another:

“So, first of all, are they going to let people use the data? I think, the push now, as we’ve seen during the last year, has been unbearable pressure from users to say, ‘Look, I have told you who my friends are. You are the third site I’ve told who my friends are. Now, I’m going to a travel site and now I’m going to a photo site and now I’m going to a t-shirt site. Hello? You guys should all know who my friends are.’ Or, ‘You should all know who my colleagues are. I shouldn’t have to tell you again.’”

“So, the users are saying, ‘Give me my data back. That’s my data.’ That was one of the cries originally behind XML, it was a desktop application. Don’t store it in a format which I can’t reuse. So, now it’s, ‘Give it to me using the idea of standards. If you do that, then I can do things with it.’” (around 42 minutes into the interview)

Libraries are still playing catch-up in the social data area. We are starting to implement tagging and book recommendations, but we are not all that far along with implementing things. What this quote reminds me is that we should also be keeping an eye towards making it easy to export data out of our systems. Easy to use formats (like xml) and open standards and interfaces should become the norm for libraries.

This isn’t suggesting that we open all our data… it is not our place to provide patron reading or personal information. We shouldn’t make it difficult for patrons to do that themselves, if they so choose (although I feel we should make an effort to let them know the potential negative effects of placing information on the web).

We should, however, be using that data to generate social links between books (people who checked out “A” also tended to check out “B”, with A and B being books, authors, videos, etc.) and make it easy to access the patterns that result from any informational web site use.

Just something to keep in mind when we select our online tools and software…

interview found via TechCrunch

LCCN Permalink is a service provided by the Library of Congress that allows one to generate links to items in the LC collection by the Library of Congress Control Number (LCCN). The format is:

http://lccn.loc.gov/99011685

This is similar to the following examples of direct links used in other online catalogs such as ISBN Linking in WorldCat and ISSN linking in my own workplace’s catalog. These can be used to create stable, static links to items in catalogs, and are therefore very helpful for citing items online.

found via Web4lib

SERU: A Shared Electronic Resource Understanding is an attempt by libraries and publishers alike to break out of the licensing process that many in both camps find problematic.

Simply put, it is a straightforward agreement that a particular library subscribes to a given resource, and that the “shared expectations” of each side will be respected.  All this with no licensing contract.  An invoice and payment, and their presence on a registry maintained by the National Information Standards Organization (NISO), serves as the legal basis for the transaction.

This reminds me of those business deals where you know that the person you are dealing with is thinking long-term, and that a handshake means more than a legal contract would with someone else.  It will be interesting to see how well this works in practice… it would be great to streamline access to information in a way that benefits everyone involved.

found via the SERIALST list

Foreign Book Dealers Directory is just as described : a database of book dealers around the world that lets you search for book dealers by company, region, and country.  It is hosted by ALCTS, and seems extremely useful for those challenging acquisitions.

posted to ACQNET by Angie Cope of the American Geographical Society Library

Review : Directory of Open Access Journals (Pdf), by Heather Morrison is a very good introduction to what is becoming a top-tier collection of journals.

Although some of the statistics are a bit dated (due to a strength of the DOAJ - they are adding journals fairly rapidly, and are currently past the 3,000 mark) the review hits the mark, making a great case for libraries to connect their patrons with this excellent resource.

from ResourceShelf

I have encountered a few interesting items relating to online, full-text books during the past few days, and thought they would make a good snapshot of where things stand at this time:

Next Page »