Digital Curation Blog: IDCC3

Showing posts with label IDCC3. Show all posts

Saturday, 5 January 2008

More posts on the 3rd Digital Curation Conference

Somewhat belatedly, I found a series of posts blogged during the 3rd International Digital Curation Conference in Washington last month by K-State Libraries Conference Reports (more than one individual). The posts included (with some highly selective personal choice extracts):

Digital Curation Conference: a general comment

"I just spent four days hearing over and over again, however, how thousands of scientists in many countries make use of the very same technologies used by your average teenager--namely, blogs, wikis, Facebooky things, etc.--to do essential work and collaborate with their peers. The head of Web publishing at the Nature Publishing Group, certainly no lightweight or frivolous organization, proudly states that they're creating social networking tools built directly on the Facebook model, and sees in blogging part of the future of scientific communication. Scientist after scientist stood up and showed how wikis and blogs are used for essential communication and collaboration, how some labs have gone nearly entirely over to a social collaborative model."

Digital Curation Conference: Surveying Bloggers' Perspectives
Digital Curation Conference: Sustaining Engineering Informatics

"[Joshua Lubell] then outlined the three access scenarios (the three Rs): reference, reuse, and rationale. The first is simply the ability to view and visualize the engineering data. Reuse is what it sounds like (STEP [ISO 10303] supports this), namely, taking the data and modifying or reengineering it. Rationale is the ability to display information such as construction history, design intent, etc., which go beyond the design itself. He compared this to the chess column in a newspaper, where, in addition to a snapshot of the board at some point in the game, you get a description of the moves required to reach that state. Loosely put, it's the set of 'why' questions that can arise from a design. STEP does not include this rationale piece, which is the point behind his work."

Digital Curation Conference: Moving Archival Practices Upstream
Digital Curation Conference: Day Two Keynote

"Carol Goble, U of Manchester

"[n.b.- This talk was the highlight of DCC for me. Rather than highlighting, again, only the challenges without really offering solutions, she showed concrete examples and tools, in pretty good detail.]"

Digital Curation Conference: Towards Distributed Infrastructures
Digital Curation Conference: Day One Closing Plenary

"Could it be that we are well enough funded to be too comfortable with our traditional roles? Rick Luce asked this question near the end of his first day closing keynote. Otherwise, his talk was a fairly standard review of what's going on and what needs to be done to solve some of the pressing issues, but this question struck me as unique. He's right, I think. Our funding is sufficient to continue operating much as we have for a long time; sure, making incremental changes, but never really taking the great leap forward to stop doing most of what we do now and really take on some major new challenges."

Digital Curation Conference: Sustainable Access to the Records of Science

"... liked to say Fedora, too, as did many people at the CNI meeting. Clearly, it's the flavor of the week, and I sensed a lot of uncertainty among library leaders at CNI who did not yet have a Fedora installation at their library. "We need to move to Fedora now" appears to be the current mantra. That's a bit amusing. Sure, Fedora is great, and can do many wonderful things, but so can a lot of other platforms and solutions, who were either popular way back when (2005, gasp) or have yet to gain traction but are on the horizon. What I've seen and heard in the last three days convinces me that my gut feeling about Fedora is not incorrect: if you have a crew of developers, it might work well for you, but if you lack the commitment to hire and hold such a crew, Fedora is not for you."

Digital Curation Conference: National Perspectives

"In something of an aside, [Rhys Francis] pointed out that computer science began 60 years ago, communications (in the network sense) about 15-20 years ago, and he thinks that something significant and as yet unnamed and vague happened about three years ago. He said we're now living in the data deluge, and in future decades we'll look back and have a name for what is happening now. For me, that's both an exciting and somewhat unsettling notion, since those working in these areas during periods of inception and definition tend to look rather silly to their successors (don't we laugh at the notion of typing catalog cards, after all?), only much later to be recognized for their efforts and innovations.

"In his opinion, there are four facets for collaboration: data, compute, interoperation, access. One must do all four, not one or two or three, which he noted is a key message to an audience largely consisting of data managers of one sort or another.

"One of his closing questions was whether data is actually infrastructure."

David Rosenthal was also at the conference, and was also taken with Carole Goble's day 2 keynote, as he noted in his blog:

"She's a great speaker, with a vivid turn of phrase, and you have to like a talk about science on the web in which a major example is VivaLaDiva.com, a shoe shopping site."

David was interested in Carole's myexperiment implementation, and his blog is worth reading for other insights into that. But I also liked this

"The emerging world of web services is the big challenge facing digital preservation. Her talk was a wonderful illustration both of why this is an important problem, in that much of reader's experience of the Web is already mediated by services, and why the barriers to doing so are almost insurmountable."

(I think "doing so" here means preserving the reader's experience in this environment.)

Friday, 14 December 2007

Murray-Rust on Digital Curation Conference day 2

Peter Murray-Rust has written two blog posts (here and here; I'm not sure if those are permanent URL's...) about day 2 of the International Digital Curation Conference in Washington DC. Thanks, Peter.

In the first post, he began:

"There is a definitely an air of optimism in the conference - we know the tasks are hard and very very diverse but it’s clear that many of them are understood."

He then picked up on Carole Goble's presentation on workflows. Here are a few random extracts from Peter's random jottings (his description):

"The great thing about Carole is she’s honest. Workflows are HARD. They are expensive. There are lots of them. Not of them does exactly what you want. And so on. [PMR: We did a lot of work - by our standards - on Taverna but found it wasn’t cost-effective at that stage. Currently we script things and use Java. Someday we shall return.]"

"myExperiment.org. A collaborative site for workflows. You can go there and find what you want (maybe) and find people to talk to. “- bazaar for workflows, encapsulated objects (EMO) single WFs or collections, chemistry data with blogged log book, encapsulatd experimental objects Open Linked Data linked initiative…"

"Scientists do not collaborate - scientists would rather share a toothbrush [...] than gene names (Mike Ashburner)
who gets the credit? - who is allowed to update?. Changing metadata rather than data. Versioning. Have to get credit and reputation managed. Scientitsts are driven by money, fame, reputation, fear of being left behind"

"Annotations are first class citizens"

His second post covers Jane Hunter and Kwok Cheung's presentation on compound document objects (CDOs):

"Increasing pressure to share and publish data while maintaining competitiveness.
Main problem lack of simple tools for recording, publishing, standards.
What is the incentive to curate and deposit? What granularity? concern for IP and ownership"

"Current problems with traditional systems - little semantic relationship, little provenance, little selectivity, interactivity , flexibility and often fixed rendering and interfaces. No multilevel access. either all open or all restricted
usually hardwired presentation"

"Capture scientific provenance through RDF (and can capture events in physical and digital domain)
Compound Digital Objects - variable semantics, media, etc.
Typed relationships within the CDOs. (this is critical)"

"SCOPE [the tool Jane & Kwok have developed is] a simplified tool for authoring these objects. Can create provenance graphs. Infer types as much as possible. RSS notification. Comes with a graphical provenance explorer."

Thanks again, Peter!

Thursday, 13 December 2007

Murray-Rust on Digital Curation Conference day 1

Peter Murray-Rust has blogged on our conference in Washington DC:

Overall impressions - optimistic spirit with some speakers being very adventurous about what we can and should do.

He paid particular attention to the national perspective from Australia:

"…Rhys Francis (Australia) . One of the most engaging presentations. Why support ICT? It will change the world. Systems can make decisions, electronic supply chains. humans cannot keep pace. We do not need people to process information.

Who owns the data, and the copyright = physics says, who cares? If it’s good, copy it. Else discard it. Storage is free. [PMR: let’s try this in chemistry…]

What to do about data is a harder question than how to build experiements

FOUR components to infrastructure. I’ve seen this from OZ before… data, compute, interoperation, access. Must do all 4. -

And he also highlighted the importance of domain knowledge in preference to institutional repositories (I go along with that).

And I thought this was interesting, relating to Tim Hubbard's presentation:

"particular snippet: Rfam (RNA family) is now in Wikipedia as primary resource with some enhancement by community annotation. So we are seeing Wikipedia as the first choice of deposition for areas of scholarship."

So it looks like things are going well!

Wednesday, 12 December 2007

DCC Forum: Comment on Conference best paper

There is a Conference theme over on the DCC Forum. However, despite its RSS feed, it doesn't appear to be quite in the blogosphere, so I thought I would post some quotes from it here. Bridget Robinson announced the best paper selection (it gets a star spot in the programme):

"The title of best peer-reviewed paper has been awarded to "Digital Data Practises and the Long Term Ecological Research Programme" by Helena Karasti (Department of Information Processing, University of Oulu in Finland), Karen S Baker (Scripps Institution of Oceanography, University of California, USA) and Katharina Schleidt (Department of Data and Diagnoses, Umweltbundesant, Austria)."

Angus Whyte commented:

"Can't make it to Washington, but looking forward to seeing this paper. I'm familiar with the 'Enriching the Notion of Data Curation in e-Science' paper by the same authors (plus Eija Halkola) in the journal Computer Supported Cooperative Work last year [*}. (doi:10.1007/s10606-006-9023-2); and some of Helena Karasti's previous work to integrate ethnographic and participatory design methods in CSCW. Recently colleagues and I at DCC began case studies for the SCARP project, which I hope can draw lessons from their work to document "the inter-related and continuously changing nature of technology, data care, and science conduct" as they put it.

* Volume 15, Number 4 / August, 2006

If there are any coments here I will re-post a summary back on the Forum!

Anticipating day 1 of the International Digital Curation Conference

Starting in 3 hours or so in Washington DC. I'm quite interested in the various different national approaches being discussed in the morning. Two questions I would like to ask:

There is a clash between institutional and subject-oriented approaches (I believe the former have potentially better sustainability, and the latter have potentially better domain scientist data curation skills). The report "Towards the Australian Data Commons: A proposal for an Australian National Data Service" seems to suggest a federated approach, which on the face of it could combine the strengths of both approaches. However, given the clashes of value and approach that so often occur in distributed organisations, how well do the speakers believe such approaches would fare in their culture?
At the moment it is important to begin establishing a more comprehensive data infrastructure. But how do the speakers envisage sufficient community value to be created for the ongoing bills to be payable (ie how will sustainability be assured in the longer term)? Note in this context the decision by AHRC to cease funding AHDS...

These questions could also perhaps be on the table in the penultimate session of the day, a discussion on Global Infrastructure development...

Tuesday, 11 December 2007

3rd International Digital Curation Conference

The 3rd International Digital Curation Conference is getting underway about now with a pre-conference reception at the National Museum of the American Indian in Washington DC. We are very pleased that it is fully booked, and the Programme Committee have put together an excellent programme. I'm hoping that the presentations will be mounted on the web site very quickly[*], and also that some of those present will be blogging the event (if you do, folks, please tag with IDCC3 so that I can find them).

Yes, as you may have guessed, unfortunately I can't be there either this year, which right now makes me hopping mad (madder than a cut snake, as my erstwhile compatriots used to say).
So have a good one, guys!!!

[*Added later: perhaps slides could be added to Slideshare? Again with tag IDCC3...]

Digital Curation Blog