Property Deals Hut

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Friday, 8 June 2012

A New Declaration of Rights: Open Content Mining

Posted on 05:59 by Unknown

In a recent investment report, analyst Claudio Aspesiconcluded that a new front had opened up in the Open Access (OA) debate. Writing in April, Aspesi noted that academics are “increasingly protesting the limitations to the usage of the information and data contained in the articles published through subscription models, and — in particular — to the practice of text mining articles.” Aspesi is right, and a central figure in this battleground is University of Cambridge chemist Peter Murray-Rust. A long-time advocate for open data, Murray-Rust is now spearheading an initiative to draft a “Content Mining Declaration”. What is the background to this?
Peter Murray-Rust
When I interviewed Peter Murray-Rust in 2008, he expressed considerable frustration at the difficulties he was experiencing in trying to extract and reuse the data published in scholarly journals — even where his university had paid an electronic licence to access the content. 

What Murray-Rust wanted to do, he explained, was to capture the “embedded data” contained in the tables, charts, and images published in science papers, along with the “supplemental information” that often accompanies papers. To do this, he had developed a variety of software tools to mine large quantities of digital text. Having extracted the data he then wanted to aggregate them, compare them, input them into programs, use them to create predictive models, and reuse them in a variety of other ways.

However, he was having huge problems achieving this, not because of any technical issue, but because of uncertainty over copyright and publishers’ insistence that a licence to read journals does not encompass the right to mine them with software.

To add to Murray-Rust’s frustration, many of his colleagues were either unsympathetic or uncomprehending. Even more galling, the Open Access movement — which should have been a natural ally — was more interested in making papers freely available to eyeballs, than to software. Even papers published in OA journals, he noted, are often released under licences that do not come with reuse rights.

In pursuit of his dream, Murray-Rust became a formative voice in the creation of the open data movement. Open data, Murray-Rust explained to me in 2008, is data “free of any restraint on access and on reuse.”  Recently, however, governments have tended to lead the way in urging for open data, spawning a generation of data wranglers; open scientific information has often lagged behind, but is now beginning to be seen as a central issue.

Four years later Murray-Rust is still frustrated. He is not, however, a man to give up, and he continues his advocacy today under the rubric of “open content mining”. Essentially, this is text mining plus. As Murray-Rust explains today, he views the mining of scholarly journals as a hierarchical activity, with content mining encompassing not just the mining of text and data, but other types of content too, including images, tables, graphs, audio, and video.

Simply using the term “text mining”, he adds, “might imply that anything other than text should be protected by the ‘content provider’. However, I and others can extract factual information from a wide range of material.”

The good news is that the research community is finally beginning to understand what Murray-Rust has been “banging on about” for all these years, as are research funders and governments, and Murray-Rust believes the door to what he wants is finally beginning to open.

However, he says, it is imperative that text mining advocates push hard at that open door if they want to achieve their objectives. To this end, Murray-Rust recently convened an ad hoc group of interested parties to draft what he calls a “Content Mining Declaration” (disclosure: I am a member of the group).

 ####

If you wish to read the rest of the article, and a short Q&A with Murray-Rust, please click on the link below. 

I am publishing the interview under a Creative Commons licence, so you are free to copy and distribute it as you wish, so long as you credit me as the author, do not alter or transform the text, and do not use it for any commercial purpose. 

To read the rest of the text (as a PDF file) click HERE.


Email ThisBlogThis!Share to XShare to Facebook
Posted in Content Mining, Data Mining, Murray-Rust, Text Mining | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Ann Okerson on the state of Open Access: Where are we, what still needs to be done?
    One of a series exploring the current state of Open Access ( OA ), the Q&A below is with Ann Okerson , Senior Advisor on Electronic Stra...
  • Open Humanities Press to publish OA books
    The Open Humanities Press ( OHP ) announced recently that it is entering the Open Access (OA) book publishing market, launching five new OA ...
  • Open Access: Profile of Eberhard Hilf
    Eberhard (Ebs) Hilf is a true veteran of the Open Access ( OA ) movement. A theoretical physicist based in Oldenburg , Hilf began his advo...
  • Open Access in 2009: The Good, the Bad and the Ugly
    As 2009 draws to a close advocates of Open Access ( OA ) will doubtless be looking back and weighing up the year's events. So what has b...
  • Open Access mandates: Judging success
    As Alma Swan has graphically demonstrated ( here and here ), mandates have begun to propagate nicely. It is worth noting that many of the...
  • Open Access given Papal Blessing?
    In his latest encyclical letter Pope Benedict XVI argues that rich countries are asserting their intellectual property with "excessiv...
  • Open Access: Whom would you back?
    Open Access ( OA ) advocates will tell you that there are two roads to OA. Green OA consists of researchers continuing to publis...
  • Open Access: Rethinking Harvard
    Last week the architect of Harvard’s Open Access ( OA ) policy, Stuart Shieber stated : “the Harvard open-access policy could not be, shoul...
  • Open Access: A publisher's perspective
    In an article I posted on 10th March I discussed the issue of whether the Green and Gold roads to Open Access ( OA ) should be vi...
  • Open Access: Who pays? How much?
    Last month the Scholarly Publishing & Academic Resources Coalition ( SPARC ) launched a new guide called Who pays for Open Access? Th...

Categories

  • ARC
  • Aspesi
  • Australia
  • Big Deal
  • BioOne
  • BMC
  • BOAI
  • Content Mining
  • COPE
  • CUP
  • Data Mining
  • eBooks
  • Elsevier
  • Free Software
  • FRPAA
  • Gold OA
  • Green OA
  • Harnad
  • India
  • InTech
  • ITHAKA
  • Jayakanth
  • John Wilbanks
  • Journal Prices
  • Library of Congress
  • Mandates
  • Michael Eisen
  • Michael Hart
  • MIT Press
  • Murray-Rust
  • Nature
  • NHMRC
  • NIH
  • OA Advantage
  • OASPA
  • OMICS
  • Open Access
  • Open Society Institute
  • Open Source
  • OSTP
  • Peer Review
  • Peter Suber
  • PLoS
  • PLoS ONE
  • Project Gutenberg
  • Repositories
  • Research
  • Research Works Act
  • Robert Kiley
  • Rockefeller University Press
  • RWA
  • Scholarly Publishing
  • Sciyo
  • Select Committee
  • Serials Crisis
  • SPARC
  • Springer
  • Text Mining
  • UC Press
  • UCL
  • Velterop
  • Wellcome Trust
  • Wiley
  • World Bank

Blog Archive

  • ►  2013 (31)
    • ►  November (1)
    • ►  October (4)
    • ►  September (5)
    • ►  August (2)
    • ►  July (9)
    • ►  June (2)
    • ►  May (2)
    • ►  April (1)
    • ►  March (2)
    • ►  February (2)
    • ►  January (1)
  • ▼  2012 (43)
    • ►  December (1)
    • ►  November (1)
    • ►  October (2)
    • ►  September (2)
    • ►  July (6)
    • ▼  June (4)
      • The UK Publishers Association comments on the Finc...
      • The Finch Report in a global Open Access landscape
      • The Finch Report: UCL’s David Price Responds
      • A New Declaration of Rights: Open Content Mining
    • ►  May (2)
    • ►  April (2)
    • ►  March (3)
    • ►  February (7)
    • ►  January (13)
  • ►  2011 (22)
    • ►  December (1)
    • ►  October (2)
    • ►  September (2)
    • ►  August (2)
    • ►  July (1)
    • ►  June (5)
    • ►  May (2)
    • ►  March (4)
    • ►  February (1)
    • ►  January (2)
  • ►  2010 (20)
    • ►  October (3)
    • ►  September (1)
    • ►  August (3)
    • ►  June (3)
    • ►  May (4)
    • ►  March (2)
    • ►  February (3)
    • ►  January (1)
  • ►  2009 (22)
    • ►  December (1)
    • ►  November (3)
    • ►  October (2)
    • ►  September (2)
    • ►  August (1)
    • ►  July (2)
    • ►  June (3)
    • ►  May (1)
    • ►  April (1)
    • ►  March (3)
    • ►  February (2)
    • ►  January (1)
  • ►  2008 (14)
    • ►  December (1)
    • ►  November (4)
    • ►  September (1)
    • ►  June (2)
    • ►  April (2)
    • ►  February (2)
    • ►  January (2)
  • ►  2007 (9)
    • ►  October (3)
    • ►  July (1)
    • ►  May (1)
    • ►  April (1)
    • ►  March (1)
    • ►  February (1)
    • ►  January (1)
  • ►  2006 (27)
    • ►  December (1)
    • ►  November (2)
    • ►  October (1)
    • ►  September (6)
    • ►  June (2)
    • ►  May (3)
    • ►  April (2)
    • ►  March (7)
    • ►  January (3)
  • ►  2005 (31)
    • ►  December (3)
    • ►  November (2)
    • ►  October (3)
    • ►  September (7)
    • ►  August (4)
    • ►  June (2)
    • ►  May (1)
    • ►  April (3)
    • ►  March (6)
  • ►  2004 (2)
    • ►  August (2)
Powered by Blogger.

About Me

Unknown
View my complete profile