The study of extra-solar planets, or simply, exoplanets, planets outside our own Solar System, is fundamentally a grand quest to understand our place in the Universe. Discoveries in the last two decades have re-defined what we know about planets, and helped us comprehend the uniqueness of our very own Earth. In recent years, however, the focus has shifted from planet detection to planet characterisation, where key planetary properties are inferred from telescope observations using Monte Carlo-based methods. However, the efficiency of sampling-based methodologies is put under strain by the high-resolution observational data from next generation telescopes, such as the James Webb Space Telescope and the Ariel Space Mission. We propose to host a regular competition with the goal of identifying a reliable and scalable method to perform planetary characterisation. Depending on the chosen track, participants will provide either quartile estimates or the approximate distribution of key planetary properties. They will have access to synthetic spectroscopic data generated from the official simulators for the ESA Ariel Space Mission. The aims of the competition are three-fold. 1) To offer a challenging application for comparing and advancing conditional density estimation methods. 2) To provide a valuable contribution towards reliable and efficient analysis of spectroscopic data, enabling astronomers to build a better picture of planetary demographics, and 3) To promote the interaction between ML and exoplanetary science.
Wed 5:05 a.m. - 5:10 a.m.
|
Welcome
(
Introduction
)
|
🔗 |
Wed 5:10 a.m. - 5:25 a.m.
|
Introduction to Exoplanet Characterisation
(
Introduction
)
A brief introduction by Dr Ingo Waldmann, the co-PI of the Ariel data challenge, on the current demographics of exoplanets, recent findings and the use of AI in the field. |
🔗 |
Wed 5:25 a.m. - 5:40 a.m.
|
Introduction to the Ariel Mission
(
Talk
)
An introduction to the Ariel Space Mission by the PI Prof Giovanna Tinetti. She will discuss about the science of Ariel, its aim, capability and how the Ariel mission can push the field forward in the coming decades. |
🔗 |
Wed 5:40 a.m. - 5:50 a.m.
|
The Ariel Data Challenge 2022 - Introduction
(
Introduction
)
A brief introduction to this year's data challenge and a short summary of the results obtained from each track. |
🔗 |
Wed 5:50 a.m. - 5:55 a.m.
|
5 minutes Break
|
🔗 |
Wed 5:55 a.m. - 6:10 a.m.
|
The “Gators” solution to the Ariel Data Challenge
(
Talk
)
In this talk we present the solution to the Ariel Data Challenge of the “Gators” team from the Physics Department at the University of Florida. We train a model of interconnected neural networks to estimate a posterior distribution over possible exoplanetary atmospheric chemical compositions and surface temperatures from their transit spectrum and system auxiliary priors. A significant improvement of the model’s prediction was made by preprocessing the data using physically motivated feature engineering. The constructed model consists of several fully connected neural networks which use concatenations or products of the outputs of previous modules as inputs. To minimize the Wasserstein-2 distance while reducing the complexity of our model, we trained on a parameterization of the estimate of the posterior distribution. In cases when a concentration is too small to be detected, a functional term is added to reproduce the observed effect of the prior where the posterior distribution ends up being uniform across all compatible concentrations. |
🔗 |
Wed 6:10 a.m. - 6:25 a.m.
|
Stefan_Stefanov's Solution
(
Talk
)
This talk will provide an overview of the second place solution for the Regular Track. The presentation will describe what method is used for modeling the distribution of planetary atmospheric properties, model architecture, how multi-task and semi-supervised learning are applied for the challenge solution. |
🔗 |
Wed 6:25 a.m. - 6:40 a.m.
|
Deep Ensemble predicts Exoplanets’ Atmospheres Composition
(
Talk
)
Standard methods for inferring planetary characteristics from exoplanets’ atmospheric spectra are slow. We present a fast machine learning method: a deep ensemble of convolutional neural networks (CNNs) that outputs mixtures of normal distributions for planetary characteristics. The architecture of our CNN was inspired by VGG networks. We train each CNN with Kullback–Leibler divergence as its loss function on simulated exoplanet’ atmospheric spectra and their auxiliary data from Ariel Data Challenge. We expect that the performance of our deep ensemble would be worse on real data because machine learning methods assume that (in this case) both simulated and real are independent and identically distributed. However, it is highly probable that probability distributions of simulated and real data differ. We suggest using an active domain adaptation (ADA) method to mitigate the difference and thus improve performance on real data. In ADA, the deep ensemble (trained with simulated data) would query real spectra (that would improve its performance most) in rounds. In each round, queried real spectra might be annotated by a slow standard method or human annotator, and the deep ensemble would use them to improve its performance. |
🔗 |
Wed 6:40 a.m. - 6:45 a.m.
|
5 minutes Break
|
🔗 |
Wed 6:45 a.m. - 6:50 a.m.
|
Short Solution Pitch from LeoPulga
(
Pitch
)
The challenge of predicting confidence intervals and distributions can be faced with several statistical techniques. The chosen method for my approach was a direct inference of a weighted combination of multivariate normal distributions along the 6 dimensions of the target variables, to provide information about mutual correlations, using a custom version of the Mixture Density Network, on top of a series of convolution layers for the spectra, and a dense layer for some engineered features from the auxiliary data, against an adapted version of the dataset. |
🔗 |
Wed 6:50 a.m. - 6:55 a.m.
|
MonsieurSolver - Feature extraction from spectra using a multi-instance learning approach
(
Pitch
)
I would like to introduce an overview of the idea which might be useful for someone. |
🔗 |
Wed 6:55 a.m. - 7:00 a.m.
|
Discussion and Concluding Remarks
(
Conclusion
)
|
🔗 |