DM4(B)T: Data Management for (Build)TEDDI(NET) using Semantic Technologies
Find Similar History 34 Claim Ownership Request Data Change Add FavouriteTitle
CoPED ID
Status
Value
Start Date
End Date
Description
CONTEXT
EPSRC funded 22 projects over two calls in 2010 and 2012 to investigate `Transforming Energy Demand through Digital Innovation' (TEDDI) as a means to find how and how people use energy in homes and what can be done reduce energy consumption. As a result a lot of data is being collected at different levels of detail in a variety of housing up and down the UK, but the mode, detail and quantity are largely defined by the needs of each individual project. At the same time, the research councils (RCUK) are defining guidelines for what happens to data generated by projects they fund, for which universities are then defining policies and finally researchers are then taking concrete actions to store, preserve and document data for future reference.
The problem at this current time is that there is relatively little awareness, limited experience and only emerging practice of how to incorporate data management into much of (physical) science research. This is in stark contrast to established procedures for data formats and sharing in the biosciences, stemming from international collaboration on the Human Genome Project, and in the social sciences, where data from national surveys, including census data, have been centrally archived for many years. Consequently, current solutions adopted by (Build)TEDDI projects may be able to meet a minimal interpretation of the requirements, but not effectively deliver the desired data legacy, such as (for example) the means to execute trans-project queries, or being able to cite the results of such queries for the sake of reproducibility.
AIMS AND OBJECTIVES
The challenges described above, which we address in DM4(B)T in the microcosm of the TEDDI projects, are tackled in three ways:
1. Raising awareness with those who are responsible for data management (principal investigators),
2. Developing a framework to guide the process of making the choices for how to go about implementing data management and
3. Demonstrating example tools that will enable researchers to bring together and re-analyse data from different projects more easily,
which together will help researchers (i) to satisfy funding and institutional guidelines for data management, (ii) begin the process of forming a data management culture in science research and (iii) create a substantial case study in science data management which can inform the three primary stakeholders (researchers, institutions and research councils) across a range of issues (see Recommendations below).
Key activities and outputs:
1. Workshops (i) to gather information about current practice, (ii) present data management problems and outline analysis and solutions and (iii) to disseminate knowledge of tools and (new) practices to support effective data management.
2. Tools and techniques: to allow researchers to harness both the variety and volume of data being collected specifically within the (Build)TEDDI projects. The tools will be made available open-source for access by other researchers to expand and adapt.
3. Recommendations: these will take the form of an online report to identify routes to facilitate a sustainable data legacy (management, curation and citation) for projects in the science and engineering domain.
APPLICATIONS AND BENEFITS
1. (Build)TEDDI projects will benefit directly from the above activities and outputs to meet institutional and research council requirements.
2. Other researchers will benefit from being able to access (Build)TEDDI data.
3. The outputs will benefit the wider research community in science and engineering through the provision of an easy-to-adopt (and adapt) data management methodology.
More Information
Potential Impact:
Spreading outwards from the activities taking place in the project, the classes of beneficiaries and consequent impacts are:
1. The investigators and researchers in (Build)TEDDI projects. While some of these are in the investigators' immediate professional circle, the interdisciplinary nature of the portfolio means that many are not. This group are directly exposed to the problems of data management and data legacy creation and can benefit from DM4(B)T's activities to help them analyse those problems and develop appropriate solutions, while also fostering a culture in these projects (and their) researchers that begins to take account of (future) data issues. This will take place during the lifetime of the DM4(B)T project and current (Build)TEDDI projects.
2. The institutions hosting (Build)TEDDI projects. The practices and culture emerging from the (Build)TEDDI projects can inform and contribute to the development of institutional data management guidelines and identification of the resources needed to support them. This can begin to occur during the lifetime of current (Build)TEDDI projects, but will continue afterwards.
3. The EPSRC that is funding the (Build)TEDDI projects. A direct benefit is improved compliance with funder data policies, of benefit to both research councils and researchers within institutions tasked with ensuring compliance. Indirect benefits are enhanced knowledge and awareness of data issues at institutional level enabling capacity for informed discussion between institutions and research councils over guidelines, compliance, resources and implementation. This too can begin during the period of current (Build)TEDDI projects, but will gain momentum subsequently as the data management agenda gathers pace and informed by the recommendations coming out of DM4(B)T's engagement with the (Build)TEDDI project portfolio.
4. Sibling projects in the same institution. Transfer of knowledge and practice to other projects, either through common investigators or local institutional enhancement activities can lead to increased skills amongst RCUK-funded researchers in data management. This would include knowledge of how to curate their data for publication, where and how to publish data, particularly if an institutional data repository is not available, and what ethical issues and licence conditions should be complied with. Expected time frame is during the lifetime of current (Build)TEDDI projects and thereafter.
5. Policy-makers (e.g. DECC/DEFRA) and non-governmental organizations (e.g. Energy Saving Trust, Carbon Trust who often report on field data) or charities (e.g. Centre for Sustainable Energy who worked on the National Household Model). DM4(B)T will initiate the process and prototype the tools to enable increased access to publicly funded data and increased value extraction from existing investment in research by support for and facilitation of publication and re-analysis of data from (Build)TEDDI projects. This in turn can increase knowledge derived from data and hence increased understanding of energy use in homes which may help better inform policy-analysis and policy-making. Impact will start through these stakeholders participating in workshops held with TEDDINET (workshops 1-3) and WHOLESEM (enabled through the CEE, workshop 4), initiating the debate and increasing awareness of what can be achieved through enhanced data management practices. Expected time frame is the duration of the above projects and thereafter.
6. The wider public. Public participation in future studies on domestic energy use will benefit from better understanding of ethical issues by researchers obliged to publish research outputs, thus ensuring privacy of and transparency for participants whilst maximising the potential for data sharing. Time frame is after the end of the current round of projects.
University of Bath | LEAD_ORG |
Julian Padget | PI_PER |
Catherine Pink | COI_PER |
Sukumar Natarajan | COI_PER |
Subjects by relevance
- Projects
- Information management
- Research
- Knowledge management
- Data acquisition
- Project management
- Databases
- Data archives
Extracted key phrases
- DM4(B)T project
- Institutional datum management guideline
- Science datum management
- Datum management practice
- Present datum management problem
- Datum management culture
- Effective datum management
- Build)TEDDI datum
- Build)TEDDI project portfolio
- Funder datum policy
- Datum issue
- Datum legacy creation
- Document datum
- Datum format
- Census datum