Scientific reproducibility and Open Science may seem like a "new" approach to Science, but these concepts are in fact fundamental principles of the Scientific Method, established a thousand years ago. By definition, scientists want to follow the Scientific Method. Then why have different studies demonstrated that researchers have difficulties reproducing even their own methods? Clues can be found in a system where science is being drowned by numerical ranking, favouring productivity over discovery. Given the current epoch of economic crisis, where researchers are forced into a competitive game of pandering to panelists in quests for funds, it seems to be a good time for deep reflection on the entire scientific system, and on how to support good ideas versus good marketing. The challenge will be harder when facilities like the Square Kilometre Array Observatory start generating data volumes reaching the exa-scale. Not only will computing, networking or storage resources reach their limits, but the biggest challenge will be scientific: Extracting knowledge will become an impossible task without a fundamental change in methodology and policies in the short-term. Computing facilities enabling science in Big Data oriented facilities should be key in supporting astronomers to build reproducible methods. I will discuss how different research communities (such as individual researchers, large collaborations, data centres, journals and their referees) as well as policy makers, view the challenges/barriers and rewards of reproducibility. For scientific facilities, adoption of Open Science is both a need and a duty. Open Science not only enhances scientific collaboration and knowledge interchange in a transparent way. It also brings a wider impact in other areas, by encouraging the democratisation of science, contributing to achieving UN's Sustainable Development Goals. With this talk I aim to provoke extra critical thinking among the different actors involved in our system (applicants for jobs/grants, referees, or ...
Contributions to the XIV.0 Scientific Meeting (virtual) of the Spanish Astronomical Society, held 13-15 July 2020, online at https://www.sea-astronomia.es/reunion-cientifica-2020, id.241. ; With funding from the Spanish government through the 'Severo Ochoa Centre of Excellence' accreditation SEV-2017-0709.
New scientific instruments are starting to generate an unprecedented amount of data. The Low Frequency Array (LOFAR), one of the Square Kilometre Array (SKA) pathfinders, is already producing data on a petabyte scale. The calibration of these data presents a huge challenge for final users: (a) extensive storage and computing resources are required; (b) the installation and maintenance of the software required for the processing is not trivial; and (c) the requirements of calibration pipelines, which are experimental and under development, are quickly evolving. After encountering some limitations in classical infrastructures like dedicated clusters, we investigated the viability of cloud infrastructures as a solution. We found that the installation and operation of LOFAR data calibration pipelines is not only possible, but can also be efficient in cloud infrastructures. The main advantages were: (1) the ease of software installation and maintenance, and the availability of standard APIs and tools, widely used in the industry; this reduces the requirement for significant manual intervention, which can have a highly negative impact in some infrastructures; (2) the flexibility to adapt the infrastructure to the needs of the problem, especially as those demands change over time; (3) the on-demand consumption of (shared) resources. We found that a critical factor (also in other infrastructures) is the availability of scratch storage areas of an appropriate size. We found no significant impediments associated with the speed of data transfer, the use of virtualization, the use of external block storage, or the memory available (provided a minimum threshold is reached). Finally, we considered the cost-effectiveness of a commercial cloud like Amazon Web Services. While a cloud solution is more expensive than the operation of a large, fully-utilized cluster completely dedicated to LOFAR data reduction, we found that its costs are competitive if the number of datasets to be analysed is not high, or if the costs of maintaining a system capable of calibrating LOFAR data become high. Coupled with the advantages discussed above, this suggests that a cloud infrastructure may be favourable for many users. ; We acknowledge the useful comments of the anonymous referee. We would like to acknowledge the work of all the developers and packagers of the LOFAR software that constitute the core of the processing pipelines (including factor, prefactor, LSMTool, LoSoTo, and the Kern Suite), as well as the useful discussions with the participants in the LOFAR blank fields and direction dependent calibration teleconferences over the years. JS and PNB are grateful for financial support from STFC via grant ST/M001229/1. This work has been also supported by the projects 'AMIGA5: gas in and around galaxies. Scientific and technological preparation for the SKA' (AYA2014-52013-C2-1-R) and 'AMIGA6: gas in and around galaxies. Preparation for SKA science and contribution to the design of the SKA data flow' (AYA2015-65973-C3-1-R) both of which were co-funded by MICINN and FEDER funds and the Junta de Andalucía (Spain) grants P08-FQM-4205 and TIC-114. We would like to explicitly acknowledge Dr Jose Ruedas – chief of the computer centre and responsible of the computing and communications infrastructures at IAA-CSIC – and Rafael Parra – system administrator of the IAA computing cluster – for their technical assistance. We acknowledge the joint SKA and AWS Astrocompute proposal call that was used to fund all the tests in the AWS infrastructure with the projects "Calibration of LOFAR ELAIS-N1 data in the Amazon cloud" and "Amazon Cloud Processing of LOFAR Tier-1 surveys: Opening up a new window on the Universe". This work made use of the University of Hertfordshire's high-performance computing facility and the LOFAR-UK computing facility, supported by STFC [grant number ST/P000096/1]. This work benefited from services and resources provided by the fedcloud.egi.eu Virtual Organization, supported by the national resource providers of the EGI Federation. We acknowledge the resources and support provided by the STFC RAL Cloud infrastructure. LOFAR, the Low Frequency Array designed and constructed by ASTRON, has facilities in several countries, that are owned by various parties (each with their own funding sources), and that are collectively operated by the International LOFAR Telescope (ILT) foundation under a joint scientific policy. ; Peer Reviewed ; Award-winning ; Postprint (author's final draft)
New scientific instruments are starting to generate an unprecedented amount of data. The Low Frequency Array (LOFAR), one of the Square Kilometre Array (SKA) pathfinders, is already producing data on a petabyte scale. The calibration of these data presents a huge challenge for final users: (a) extensive storage and computing resources are required; (b) the installation and maintenance of the software required for the processing is not trivial; and (c) the requirements of calibration pipelines, which are experimental and under development, are quickly evolving. After encountering some limitations in classical infrastructures like dedicated clusters, we investigated the viability of cloud infrastructures as a solution. We found that the installation and operation of LOFAR data calibration pipelines is not only possible, but can also be efficient in cloud infrastructures. The main advantages were: (1) the ease of software installation and maintenance, and the availability of standard APIs and tools, widely used in the industry; this reduces the requirement for significant manual intervention, which can have a highly negative impact in some infrastructures; (2) the flexibility to adapt the infrastructure to the needs of the problem, especially as those demands change over time; (3) the on-demand consumption of (shared) resources. We found that a critical factor (also in other infrastructures) is the availability of scratch storage areas of an appropriate size. We found no significant impediments associated with the speed of data transfer, the use of virtualization, the use of external block storage, or the memory available (provided a minimum threshold is reached). Finally, we considered the cost-effectiveness of a commercial cloud like Amazon Web Services. While a cloud solution is more expensive than the operation of a large, fully-utilized cluster completely dedicated to LOFAR data reduction, we found that its costs are competitive if the number of datasets to be analysed is not high, or if the costs of maintaining a system capable of calibrating LOFAR data become high. Coupled with the advantages discussed above, this suggests that a cloud infrastructure may be favourable for many users. ; We acknowledge the useful comments of the anonymous referee. We would like to acknowledge the work of all the developers and packagers of the LOFAR software that constitute the core of the processing pipelines (including factor, prefactor, LSMTool, LoSoTo, and the Kern Suite), as well as the useful discussions with the participants in the LOFAR blank fields and direction dependent calibration teleconferences over the years. JS and PNB are grateful for financial support from STFC via grant ST/M001229/1. This work has been also supported by the projects 'AMIGA5: gas in and around galaxies. Scientific and technological preparation for the SKA' (AYA2014-52013-C2-1-R) and 'AMIGA6: gas in and around galaxies. Preparation for SKA science and contribution to the design of the SKA data flow' (AYA2015-65973-C3-1-R) both of which were co-funded by MICINN and FEDER funds and the Junta de Andalucía (Spain) grants P08-FQM-4205 and TIC-114. We would like to explicitly acknowledge Dr Jose Ruedas – chief of the computer centre and responsible of the computing and communications infrastructures at IAA-CSIC – and Rafael Parra – system administrator of the IAA computing cluster – for their technical assistance. We acknowledge the joint SKA and AWS Astrocompute proposal call that was used to fund all the tests in the AWS infrastructure with the projects "Calibration of LOFAR ELAIS-N1 data in the Amazon cloud" and "Amazon Cloud Processing of LOFAR Tier-1 surveys: Opening up a new window on the Universe". This work made use of the University of Hertfordshire's high-performance computing facility and the LOFAR-UK computing facility, supported by STFC [grant number ST/P000096/1]. This work benefited from services and resources provided by the fedcloud.egi.eu Virtual Organization, supported by the national resource providers of the EGI Federation. We acknowledge the resources and support provided by the STFC RAL Cloud infrastructure. LOFAR, the Low Frequency Array designed and constructed by ASTRON, has facilities in several countries, that are owned by various parties (each with their own funding sources), and that are collectively operated by the International LOFAR Telescope (ILT) foundation under a joint scientific policy. ; Peer Reviewed ; Award-winning ; Postprint (author's final draft)
We thank the anonymous referee for a constructive and detailed report. This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska Curie grant agreement No 893673. We acknowledge financial support from the European Union's Horizon 2020 research and innovation programme under Marie Sklodowska-Curie grant agreement No 721463 to the SUNDIAL ITN network, from the State Research Agency (AEI-MCINN) of the Spanish Ministry of Science and Innovation under the grant "The structure and evolution of galaxies and their central regions" with reference PID2019-105602GBI00/10.13039/501100011033, and from IAC project P/300724, financed by the Ministry of Science and Innovation, through the State Budget and by the Canary Islands Department of Economy, Knowledge and Employment, through the Regional Budget of the Autonomous Community. SDG acknowledges support from the Spanish Public Employment Service (SEPE). Furthermore, we acknowledge support by the research project AYA2017-84897-P from the Spanish Ministerio de Economia y Competitividad, from the European Regional Development Funds (FEDER) and the Junta de Andalucia (Spain) grants FQM108. DE acknowledges support from a Beatriz Galindo senior fellowship (BG20/00224) from the Ministry of Science and Innovation. LVM acknowledges financial support from the grants AYA2015-65973-C3-1-R and RTI2018096228-B-C31 (MINECO/FEDER, UE), as well as from the State Agency for Research of the Spanish MCIU through the Center of Excellence Severo Ochoa award to the Instituto de Astrofisica de Andalucia (SEV-2017-0709). This research makes use of python (http://www.python.org), Matplotlib (Hunter 2007), and Astropy (Astropy Collaboration 2013, 2018). We acknowledge the usage of the HyperLeda database (http://leda.univ-lyon1.fr).We thank Alexandre Bouquin for providing us with the GALEX FUV and NUV images used in this work. We thank Stephane Courteau, Estrella Florido, Raul Infante-Sainz, Tom Jarrett, Johan H. Knapen, Heikki Salo, and Miguel Querejeta for useful discussions. We thank Sebastien Comeron and Facundo D. Moyano for valuable comments on the manuscript. Facilities: GALEX, WISE, Spitzer (IRAC). ; Context. While some galactic bars show recent massive star formation (SF) along them, some others do not. Whether bars with low level of SF are a consequence of low star formation efficiency, low gas inflow rate, or dynamical effects remains a matter of debate. Aims. In order to study the physical conditions that enable or prevent SF, we perform a multi-wavelength analysis of 12 strongly barred galaxies with total stellar masses log(10)(M-*/M-circle dot)is an element of[10.2,11], chosen to host different degrees of SF along the bar major axis without any prior condition on gas content. We observe the CO(1-0) and CO(2-1) emission within bars with the IRAM-30 m telescope (beam sizes of 1.7-3.9 kpc and 0.9-2.0 kpc, respectively; 7-8 pointings per galaxy on average). Methods. We estimated molecular gas masses (M-mol) from the CO(1-0) and CO(2-1) emissions. SF rates (SFRs) were calculated from GALEX near-ultraviolet (UV) and WISE 12 mu m images within the beam-pointings, covering the full bar extent (SFRs were also derived from far-UV and 22 mu m). Results. We detect molecular gas along the bars of all probed galaxies. Molecular gas and SFR surface densities span the ranges log(10)(sigma(mol)/[M-circle dot pc(-2)]) is an element of [0.4,2.4] and log(10)(sigma(SFR)/[M-circle dot pc(-1) kpc(-2)]]) is an element of [-3.25, -0.75], respectively. The star formation efficiency (SFE; i.e., SFR/M-mol) in bars varies between galaxies by up to an order of magnitude (SFE is an element of[0.1,1.8] Gyr(-1)). On average, SFEs are roughly constant along bars. SFEs are not significantly different from the mean value in spiral galaxies reported in the literature (similar to 0.43 Gyr(-1)), regardless of whether we estimate M-mol from CO(1-0) or CO(2-1). Interestingly, the higher the total stellar mass of the host galaxy, the lower the SFE within their bars. In particular, the two galaxies in our sample with the lowest SFE and sigma(SFR) (NGC 4548 and NGC 5850, SFE less than or similar to 0.25 Gyr(-1), sigma(SFR)less than or similar to 10(-2.25)M(circle dot) yr(-1) kpc(-2), M greater than or similar to 10(10.7)M(circle dot)) are also those hosting massive bulges and signs of past interactions with nearby companions. Conclusions. We present a statistical analysis of the SFE in bars for a sample of 12 galaxies. The SFE in strong bars is not systematically inhibited (either in the central, middle, or end parts of the bar). Both environmental and internal quenching are likely responsible for the lowest SFEs reported in this work. ; European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska Curie 893673 ; European Commission 721463 ; State Research Agency (AEI-MCINN) of the Spanish Ministry of Science and Innovation under the grant "The structure and evolution of galaxies and their central regions" - Ministry of Science and Innovation P/300724 PID2019-105602GBI00 ; Spanish Public Employment Service (SEPE) ; Spanish Ministerio de Economia y Competitividad, from the European Regional Development Funds (FEDER) AYA2017-84897-P ; Junta de Andalucia ; European Commission FQM108 ; Spanish Government BG20/00224 ; State Agency for Research of the Spanish MCIU through the Center of Excellence Severo Ochoa award SEV-2017-0709 ; Canary Islands Department of Economy, Knowledge and Employment, through the Regional Budget of the Autonomous Community ; MINECO/FEDER, UE AYA2015-65973-C3-1-R RTI2018096228-B-C31
Contributions to the XIV.0 Scientific Meeting (virtual) of the Spanish Astronomical Society, held 13-15 July 2020, online at https://www.sea-astronomia.es/reunion-cientifica-2020, id.52 ; With funding from the Spanish government through the Severo Ochoa Centre of Excellence'accreditation SEV-2017-0709