Title
Distributed methods for large scale regression

CoPED ID
40d4cadf-3e40-42b2-921f-d09bcff2ef49

Status
Closed


Value
No funds listed.

Start Date
April 15, 2018

End Date
Dec. 4, 2018

Description

More Like This


Automatically collected data in environmental modelling, energy management and medicine may involve very large data volumes while also requiring richly parameterized models for adequate analysis and prediction. If n is data set size and p the number of model coefficients, this project aims to find O(np) computational methods for estimating penalized regression models, which are susceptible to parallelization in cluster computing environments. The major challenge is to do this in a way that adequately estimates hyper-parameters alongside regression coefficients, and the project will investigate the feasibility of doing this using stochastic log determinant or log trace estimators in the context of marginal likelihood or similar criteria.

Simon Wood SUPER_PER

Subjects by relevance
  1. Energy management
  2. Forecasts
  3. Machine learning
  4. Estimating (statistical methods)

Extracted key phrases
  1. Large datum volume
  2. Large scale regression
  3. Regression model
  4. Regression coefficient
  5. Model coefficient
  6. Computational method
  7. Environmental modelling
  8. Energy management
  9. Stochastic log determinant
  10. Cluster computing environment
  11. Adequate analysis
  12. Project
  13. Medicine
  14. Major challenge

Related Pages

UKRI project entry

UK Project Locations