Search results
Excavations in Malta in 1914
During a week's stay in Malta in October 1914, Thomas Ashby was able to excavate a portion of a megalithic building on a site called id-debdieba ("the Place of the Echo" in Maltese). The site was pointed out to him by Professor T. Zammit, Curator of the Valletta Museum, who frequently visited the excavation and has contributed the report on the objects found which forms the second part of this paper. The funds for the work were provided by the Government of Malta, and the site itself is Government property. ; N/A
BASE
SW-SGD: The Sliding Window Stochastic Gradient Descent Algorithm
Stochastic Gradient Descent (SGD, or 1-SGD in our notation) is probably the most popular family of optimisation algorithms used in machine learning on large data sets, due to its ability to optimise efficiently with respect to the number of complete training set data touches (epochs) used. Various authors have worked on data or model parallelism for SGD, but there is little work on how SGD fits with the memory hierarchies ubiquitous in HPC machines. Standard practice suggests randomising the order of training points and streaming the whole set through the learner, which results in extremely low temporal locality of access to the training set and thus, when dealing with large data sets, makes minimal use of the small, fast layers of memory in an HPC memory hierarchy. Mini-batch SGD with batch size n (n-SGD) is often used to control the noise on the gradient and make convergence smoother and easier to identify, but this can reduce the learning efficiency with respect to epochs when compared to 1-SGD, whilst also having the same extremely low temporal locality. In this paper we introduce Sliding Window SGD (SW-SGD), which uses temporal locality of training point access in an attempt to combine the advantages of 1-SGD (epoch efficiency) with n-SGD (smoother convergence and easier identification of convergence) by leveraging HPC memory hierarchies. We give initial results on part of the Pascal dataset that show that memory hierarchies can be used to improve SGD performance. (C) 2017 The Authors. Published by Elsevier B.V. ; European project ExCAPE [2] from the European Union's Horizon 2020 Research and Innovation programme [671555]
BASE
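The abstract above does not spell out the algorithm, but the idea it describes can be sketched as follows: each step draws one fresh training point (as in 1-SGD) and averages the gradient over a small window of the most recently visited points (as in n-SGD), so the window stays resident in fast memory. This is a minimal illustrative sketch, not the paper's implementation; `sw_sgd` and its parameters are hypothetical names chosen for the example.

```python
import numpy as np
from collections import deque

def sw_sgd(grad_fn, w0, data, window=8, lr=0.1, epochs=1, seed=0):
    """Sliding-window SGD sketch: one fresh point per step (1-SGD-like
    epoch efficiency), gradient averaged over the last `window` points
    (n-SGD-like smoothing). The window is small enough to stay in a
    fast layer of the memory hierarchy, giving high temporal locality."""
    rng = np.random.default_rng(seed)
    w = np.asarray(w0, dtype=float)
    buf = deque(maxlen=window)            # recent points: reused every step
    for _ in range(epochs):
        for i in rng.permutation(len(data)):   # randomised streaming order
            buf.append(data[i])                # touch one new point per step
            g = np.mean([grad_fn(w, x) for x in buf], axis=0)
            w = w - lr * g
    return w

# Toy problem: minimise the mean of (w - x)^2 over scalar points x,
# whose minimiser is the data mean (2.5 here).
data = np.array([1.0, 2.0, 3.0, 4.0])
grad = lambda w, x: 2.0 * (w - x)
w = sw_sgd(grad, 0.0, data, window=4, lr=0.05, epochs=50)
```

Because the window mean fluctuates as points are replaced, `w` hovers near the minimiser rather than converging exactly; larger windows smooth the gradient at the cost of staleness, which mirrors the 1-SGD/n-SGD trade-off the abstract describes.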
Improving Operational Intensity in Data Bound Markov Chain Monte Carlo
Typically, parallel algorithms are developed to leverage the processing power of multiple processors simultaneously, speeding up overall execution. At the same time, the discrepancy between DRAM bandwidth and microprocessor speed hinders reaching peak performance. This paper explores how operational intensity improves by performing useful computation during otherwise stalled cycles. While the proposed methodology is applicable to a wide variety of parallel algorithms, and at different scales, the concepts are demonstrated in the machine learning context. Performance improvements are shown for Bayesian logistic regression with a Markov chain Monte Carlo sampler, either with multiple chains or with multiple proposals, on a dense data set two orders of magnitude larger than the last level cache on contemporary systems. ; Part of the work presented in this paper was funded by Johnson & Johnson. This project has received funding from the European Union's Horizon 2020 Research and Innovation programme under Grant Agreement no. 671555.
BASE
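The data-reuse idea in the abstract above can be illustrated with a small sketch: when several MCMC chains (or proposals) must each evaluate the logistic-regression log-likelihood over a data set far larger than cache, streaming the data once and reusing each fetched block for all chains raises operational intensity (flops per byte moved from DRAM). This is an assumption-laden sketch of the concept, not the paper's code; `loglik_multi` and its signature are invented for the example.

```python
import numpy as np

def loglik_multi(w_all, X, y, block=256):
    """Log-likelihood of Bayesian logistic regression for several chains,
    computed in one streaming pass over X: each data block fetched from
    DRAM is reused by every chain's parameter vector, so the flops done
    per byte moved (operational intensity) grows with the chain count."""
    n_chains = w_all.shape[0]
    ll = np.zeros(n_chains)
    for start in range(0, X.shape[0], block):
        Xb = X[start:start + block]          # one fetch of this block ...
        yb = y[start:start + block]
        z = Xb @ w_all.T                     # ... reused by all chains at once
        ll += np.sum(yb[:, None] * z - np.log1p(np.exp(z)), axis=0)
    return ll

# Usage: 4 chains share a single pass over 1000 points.
rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 3))
y = rng.integers(0, 2, size=1000).astype(float)
W = rng.normal(size=(4, 3))                  # one parameter vector per chain
ll = loglik_multi(W, X, y)
```

A naive implementation would sweep the full data set once per chain, fetching every block from DRAM repeatedly; the blocked form above performs the same arithmetic but amortises each memory transfer across all chains, which is the "useful computation during otherwise stalled cycles" the abstract refers to.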