The location of the workshop is the 'Bonatz Saal' of the Hannover Congress Centrum (HCC). Coffee Break are served in the level 1 Corridor, lunch is not provided.
Full papers have 25 minutes for presentation, while short papers and posters have 15 and 10, respectively.
Program
- 9.00 - 10.30, Keynote by Peter Buneman, University of Edinburgh
Abstract: Nearly all the data we rely on -master data, curated data, reference data- is now held in some kind of database, be it a traditional database or some form of ontology. Databases, unlike traditional publications and records, change over time in both content and structure. This leads to some interesting questions for the preservation of this important data. Why is preservation important? If so how much of the history of data should we preserve: should we record every state or every transaction of the database? Also how and in what form should we preserve it: can we rely on the survival of database management systems and other software for this purpose?
These questions lead to some interesting challenges not only for computer scientists but also for the economics and practice of data publication. For example, the traditional distinctions between those who create (or author) data, those who publish data, and those who archive data are fast disappearing. - 11.00 - 12.30, Research Session 1: Data Integration and Visualization
- Classification Algorithms for Web Text Filtering
Christoph Boden (TU Berlin), Thomas Häfele (TU Berlin), Alexander Löser (TU Berlin) - Data Lineage in the MOMIS Data Fusion System
Antonio Sala (University of Modena), Domenico Beneventano (University of Modena and Reggio Emilia), Abdul Rahman Dannaoui (University of Modena and Reggio Emilia) - A Framework for Semi-Automatic Identification, Disambiguation and Storage of Protein-related Abbreviations in Scientific Literature
Daniele Toti (Università Roma Tre), Fabio Polticelli (Università Roma Tre), Paolo Atzeni (Università Roma Tre) - RADAR: Radial Applications Depiction Around Relations For Data-Centric Ecosystems
Panos Vassiliadis (Universtity of Ioannina)
- Classification Algorithms for Web Text Filtering
- 14.00 - 15.30, Keynote by David South, Deutsches Elektronen Synchrotron (DESY) at Hamburg
Abstract: Data from high-energy physics experiments are collected with significant financial and human effort and are mostly unique. At the same time, there is no coherent strategy for data preservation and re-use, and many important and complex data sets are simply lost. While the current focus is on the LHC at CERN, in a period of a few years, several important and unique experimental programs at other facilities will come to an end, including those at HERA, b-factories and the Tevatron. To address this issue, an inter-experimental study group on HEP data preservation and long-term analysis (DPHEP) was convened at the end of 2008.
This talk describes the problems of data preservation in high energy physics and the first conclusions and recommendations of the study group. The physics motivation for data preservation, generic computing and preservation models, technological expectations and governance aspects at local and international levels are examined. The group now aims to publish a full and detailed review of data preservation in high energy physics in 2011. - 16.00 - 17.30, Research Session 2: Schema and Data Evolution
- Using Structured Changes for Elucidating Data Evolution
Yannis Stavrakas (IMIS), George Papastefanatos (IMIS) - On the Minimization of Sequences of XML Document and Schema Update Operations
Federico Cavalieri (DISI - University of Genova), Giovanna Guerrini (Università di Genova), Marco Mesiti (DICo - University of Milano), Barbara Oliboni (University of Verona) - Concluding remarks and discussion
- Using Structured Changes for Elucidating Data Evolution