Artificial Intelligence helps in the identification of astronomical objects

SHEEP is a new artificial intelligence software, developed by researchers at Instituto de Astrofísica e Ciências do Espaço, in Portugal, to help classify astronomical sources. 

Classifying celestial objects is a long-standing problem. With sources at nearly unimaginable distances, sometimes it’s difficult for researchers to distinguish, for example, between stars, galaxies, quasars, or supernovae. Tridimensional Universe map, made by the eBOSS collaboration at SDSS. (Credit: EPFL)

Instituto de Astrofísica e Ciências do Espaço’s (IA) researchers Pedro Cunha and Andrew Humphrey tried to solve this classical problem by creating SHEEP, a machine learning algorithm, which determines the nature of astronomical sources. 

The first author of the article now published in the journal Astronomy & Astrophysics, Pedro Cunha, a Ph.D. student at IA and in the Dep. of Physics and Astronomy of the Science Faculty of the University of Porto, thinks: “This work was born as a side project from my MSc thesis. It combined the lessons learned during that time into a unique project.”. Andrew Humphrey, Pedro Cunha’s MSc advisor and now Ph.D. co-advisor pointed out that: “It was very cool to get such an interesting result, especially from a master’s thesis!”

SHEEP is a supervised machine learning pipeline that estimates photometric redshifts and uses this information when subsequently classifying the sources as a galaxy, quasar, or star. “The photometric information is the easiest to obtain and thus is very important to provide a first analysis about the nature of the observed sources.”, says Pedro Cunha.

A novel step in our pipeline is that before performing the classification, SHEEP first estimates photometric redshifts, which are then placed into the data set as an additional feature for classification model training.

The team found that including the redshift and the coordinates of the objects allows the AI to understand them within a 3D map of the Universe, and use that together with color information, to make better estimations of source properties. For example, the AI learned that there is a higher chance of finding stars closer to the Milky Way plane than at the Galactic Poles. Andrew Humphrey (IA & University of Porto) added: “When we allowed the AI to have a 3D view of the Universe, this really improved its ability to make accurate decisions about what each celestial object was.”

Wide-area surveys, both ground- and space-based, like the Sloan Digital Sky Survey (SDSS), have yielded high volumes of data, revolutionizing the field of astronomy. Future surveys, carried out by the likes of the Vera C. Rubin Observatory, the Dark Energy Spectroscopic Instrument (DESI), the Euclid (ESA) space mission or the James Webb Space Telescope (NASA/ESA) will continue to give us more detailed imaging. However, analyzing all the data using traditional analysis methods can be time-consuming due to the extremely high volume of data. AI or machine learning will be crucial for analyzing and making the best scientific use of this new data.

This work is part of the team’s effort toward exploiting the expected deluge of data to come from those surveys, by developing artificial intelligence systems that efficiently classify and characterize billions of sources.

Imaging and spectroscopic surveys are one of the main resources for understanding the visible content of the Universe. The data from these surveys enable statistical studies of stars, quasars, and galaxies, and the discovery of more peculiar objects.

For the Principal Investigator of the research group “The assembly history of galaxies resolved in space and time” at IA, Polychronis Papaderos: “The development of advanced Machine Learning algorithms, such as SHEEP, is an integral component of IA’s coherent strategy toward scientific exploitation of unprecedentedly large sets of photometric data for billions of galaxies with ESA’s Euclid space mission, scheduled for launch in 2023.”

Euclid will provide detailed cartography of the Universe and shed light on the nature of the enigmatic dark matter and dark energy. The IA coordinates the Portuguese participation in Euclid and leads or co-leads, within the Euclid consortium, several work packages in the field of Cosmology and Extragalactic Astronomy.