An EPSRC National Research Facility to facilitate Data Science in the Physical Sciences: The Physical Sciences Data science Service (PSDS)

Find Similar History 36 Claim Ownership Request Data Change Add Favourite

Title
An EPSRC National Research Facility to facilitate Data Science in the Physical Sciences: The Physical Sciences Data science Service (PSDS)

CoPED ID
07fa84d8-0b05-425b-b063-1fa407c2c3fe

Status
Active

Funders

Value
£5,992,134

Start Date
Jan. 11, 2019

End Date
Jan. 10, 2024

Description

More Like This


The modern physical scientist cannot perform their research without generating significant quantities of data, having recourse to related/prior data, significant data analysis and integrating results with other data. This requires a range of skills and resources that are not available to the majority of physical scientists. There is therefore an urgent need in the physical sciences for providing access to data and integrating them with data science approaches. This requires building a new skills base that enables and empowers working in a data science way.

The Physical Sciences Data-science Service (PSDS) will provide a single place where existing databases, open data sources and data that is still being worked on can be stored and searched in a unified way. This means that it will become trivial to find and combine different types of physical sciences data - from details on structure to measured physical properties of materials. It will also make possible instant comparison of and context for experiment data with that already available.

This is just the start however. There is enormous potential for being able to perform data science across all of these data, that is for example, Machine Learning and Artificial Intelligence approaches, which are becoming a new avenue of research in their own right.

It is vital that data science becomes a routine tool for all physical scientists. For many this will mean learning new skills. The PSDS will therefore develop a training programme around the four main competencies (statistics, programming/tools, computational methods & data visualisation) required to perform data science. Identified links with networks and postgraduate training will enable PSDS users to gain deeper skills in various aspects of data science.

The long-term aim is for the PSDS, and therefore data science, to become a seamless, key part of the research infrastructure for physical scientists.


More Information

Potential Impact:
The purpose of the service is to facilitate new data science approaches to research for a broad cross-section of the physical sciences research community. The impact on the communities and disciplines within physical sciences will be very significant. By providing data resources and analysis processes that would not otherwise be available to the researcher, entirely new results or avenues of research will open up. This will be further enhanced due to the way in which it will be possible to blend and analyse data across different resources. This is not just a matter of breaking down silos, but rather fuelling far-reaching research. The vision is that large aggregations of data can be generated and systematically analysed using machine learning or artificial intelligence approaches. This would effect a paradigm shift in the way in which physical sciences discovery and analysis is performed, so resulting in new science that quite simply could not be achieved by traditional methods.

Early career researchers will be significantly impacted by this service. Our training programme will be particularly aimed at this audience, taking those with a sound, but traditional, physical science background and exposing them to new, emergent methods that promise to revolutionise the subject area. It is the earlier generation of researchers that are best poised to take advantage of this new approach and our training programme is founded in a data science training framework designed specifically to develop researchers in this new direction - we will provide the first rungs on the ladder and link into networks and training schemes that can develop trainees further. We expect a new type of researcher to result from this approach.

The providers of data resources will be particularly impacted. In an open environment where new aggregations of data can be developed on the fly depending on a specific research question, there is enormous opportunity. Data that is currently siloed and therefore only useful to narrow communities will now be available for exploitation in a multitude of new ways that were previously unimagined. The ability for these new collections of data to be mined or systematically analysed provides further opportunity for data providers as well as users.

These new approaches that liberate data and make it available for large scale systematic analysis will ultimately benefit the materials science and chemical manufacturing industries. Time to discovery will be reduced, optimisation and efficiency in manufacture enabled, and new and more elaborate property-driven products engineered. The UK high-value industry needs many more data-science minded researchers and this service will provide a 'low barrier to entry' method for a large number of traditionally educated researchers to engage.

Simon Coles PI_PER
Juan Bicarregui COI_PER
Brian Matthews COI_PER
Jeremy Frey COI_PER

Subjects by relevance
  1. Data mining
  2. Machine learning
  3. Research
  4. Big data
  5. Science
  6. Statistics (data)
  7. Educational methods
  8. Data science
  9. Artificial intelligence
  10. Optimisation

Extracted key phrases
  1. New datum science
  2. Physical Sciences Data science Service
  3. Datum science approach
  4. Physical sciences datum
  5. EPSRC National Research Facility
  6. Significant datum analysis
  7. New science
  8. Physical science background
  9. Open datum source
  10. Datum resource
  11. Datum provider
  12. Prior datum
  13. Datum visualisation
  14. Experiment datum
  15. Data science training framework

Related Pages

UKRI project entry

UK Project Locations