Open Access
Issue
A&A
Volume 670, February 2023
Article Number A107
Number of page(s) 22
Section Galactic structure, stellar clusters and populations
DOI https://doi.org/10.1051/0004-6361/202244813
Published online 14 February 2023

© The Authors 2023

Licence Creative CommonsOpen Access article, published by EDP Sciences, under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This article is published in open access under the Subscribe-to-Open model. Subscribe to A&A to support open access publication.

1. Introduction

Starting in the 2010s several high-resolution stellar spectroscopic surveys started to map the chemical composition of our Galaxy in order to better understand its formation, structure, and evolution. Initially, the lower resolution surveys of the Large sky Area Multi-Object fiber Spectroscopic Telescope (LAMOST, Cui et al. 2012) and Radial Velocity Experiment (RAVE, Steinmetz et al. 2020) focused mostly on the radial velocity of stars, while the later high-resolution multi-object programs such as the Gaia-ESO Survey (GES, Gilmore et al. 2012; Randich et al. 2022), Apache Point Observatory Galactic Evolution Experiment (APOGEE-1 and APOGEE-2, Majewski et al. 2017), and Galactic Archaeology with HERMES (GALAH, De Silva et al. 2015) measured the chemical composition of hundreds of thousands of stars with higher precision and accuracy than the lower resolution surveys. Rigorous constraints of evolutionary models of the Milky Way generally require a combination of these abundances with the extremely precise astrometric and photometric measurements of the Gaia space mission (Gaia Collaboration 2016, 2018, 2021). The Milky Way Mapper project (MWM, Kollmeier et al. 2017) in the fifth iteration of the Sloan Digital Sky Survey (SDSS-V), William Herschel Telescope Enhanced Area Velocity Explorer (WEAVE, Dalton et al. 2018), Multi-Object Spectrograph Telescope (4MOST, de Jong et al. 2019), and Subaru Prime Focus Spectrograph (PFS, Takada et al. 2014) are additional upcoming surveys continuing to contribute to our understanding and theories of the Galactic chemo-dynamical evolution and history. As the successor of APOGEE, MWM will target more than five million stars across the Milky Way, collecting both near-infrared and optical spectra, with a significant portion of the total number of stars planned to be observed in both wavelength regimes. The stellar parameters and elemental abundances provided by this survey will enable a unique global Galactic map of the fossil records and chemo-dynamical structure. This will enable the quantitative testing of galaxy formation physics (Kollmeier et al. 2017). Here we use the APOGEE DR17 results as the reference frame for future surveys, particularly the MWM currently underway.

The APOGEE-2 survey (Majewski et al. 2017) is part of SDSS-IV (Blanton et al. 2017); its latest data release (DR17, Abdurro’uf et al. 2022) has published the main atmospheric parameters and abundances of 20 species for 733,901 stars across the entire sky (Beaton et al. 2021; Santana et al. 2021). The APOGEE team has published both raw and calibrated effective temperatures and surface gravities, and has applied small offsets to individual abundance ratios based on solar neighborhood stars in DR17. The original APOGEE spectrograph (300 fibers) was placed on the 2.5-m Sloan Foundation Telescope (Gunn et al. 2006) at Apache Point Observatory (APO), and a clone APOGEE spectrograph was subsequently made and positioned (in early 2017) on the 2.5-m Irénée du Pont Telescope (Bowen & Vaughan 1973) of Las Campanas Observatory (LCO) in Atacama de Chile (Wilson et al. 2019). APOGEE is unique among the large spectroscopic surveys because it observes stars in the near-infrared, specifically from 15 140 Å to 16 940 Å with the resolving power of R ∼ 22 500, which allows it to reach the dust-obscured regions of the Milky Way in the disk, center, and bulge.

The GALAH observations are conducted with the multi-object 400-fiber HERMES spectrograph (Sheinis et al. 2015) at the 3.9-m Anglo-Australian Telescope in the Southern Hemisphere. The content of its latest data release (DR3) is described by Buder et al. (2021). GALAH has published abundances for 30 species of elements in DR3. The GALAH survey obtains spectra in the following four optical windows: 4713−4903 Å, 5648−5873 Å, 6478−6737 Å, and 7585−7887 Å (De Silva et al. 2015). The resolving power is ∼28 000 in all four optical bands. GALAH DR3 contains detailed atmospheric parameters and abundances for 588 571 stars.

The Gaia-ESO Survey (Gilmore et al. 2012; Randich et al. 2022) has made use of the capabilities of the FLAMES spectrographs (Pasquini et al. 2002) mounted on the 8m ESO Very Large Telescope (VLT). This survey is unique in that it uses multiple pipelines (by various working groups) throughout the analysis of the same set of measurements, which is then followed by data homogenization. GES has multiple narrow optical bands from 4033 Å to 9001 Å, and the resolution varies between ∼16 000 and 26 000 for GIRAFFE and is around ∼47 000 for the UVES spectrograph (Sacco et al. 2014). The latest GES catalog is DR5 (Randich et al. 2022); it contains stellar parameters and chemical data for 114,324 targets in the southern sky. To date, GES has made public the abundances of 31 elements.

Given the nature of these three different data sets, different methodologies were employed, leading to different systematic uncertainties associated with each of the analyses. Comparing the results in common between these data sets helps us to understand systematics and correlations, and potentially correct them for future programs. Based on metallicity measurements from open clusters, Spina et al. (2022) performed the homogenisation of the APOGEE, GALAH, and GES high-quality data sets which was analyzed as a whole, for example to trace the radial metallicity distributions in the Galaxy.

In this paper we therefore aim to determine the differences and underlying systematic effects between the latest data releases of three independent high-resolution spectroscopic surveys: APOGEE DR17, GALAH DR3, and GES DR5. As it is widely understood, abundances on the absolute scale not only depend on the selected line lists and continuum placement, but also strongly on the theoretical assumptions made in the construction of the model atmosphere and synthesis calculations (e.g., Jofré et al. 2019). By combining tens of thousands of stars all observed by these surveys and with the application of several quality criteria, we can investigate the accuracy of the stellar atmospheric parameters and abundances of several elements on the absolute scale in a large effective temperature, surface gravity, and metallicity range covering the majority of the Hertzsprung–Russell Diagram (HRD). Understanding how the abundances differ between the various analyses feeds back into fine-tuning Galactic formation and evolutionary models.

This paper is structured as follows. In Sect. 2 we provide an overview of the target and data selection, covering the determination of our parameter space, and the three resulting common quality data sets. Section 3 focuses on the methods and the steps we implemented in our analysis. Radial velocity and stellar parameter differences, along with the application of all global filters, are explained in Sects. 4 and 5, respectively. Individual elemental and α-abundance ratios from the APOGEE−GALAH overlapping catalog are thoroughly discussed in Sect. 6. We briefly examine how these systematic differences translate to the existing chemical maps of our Galaxy in Sect. 7. We go into detail about the effect of the performed APOGEE calibrations on the discovered variants in Sect. 8. Our conclusions are presented in Sect. 9.

2. Target and data selection

2.1. Target selection

In this section we describe how the SDSS-IV APOGEE-2 DR17 (Abdurro’uf et al. 2022), GALAH DR3 (Buder et al. 2021), and Gaia-ESO DR5 (iDR6, Randich et al. 2022) data sets were combined and how the overlapping stellar catalogs were created. We also briefly discuss the features of these common data sets, as well as the atmospheric parameters derived by these sky surveys.

The main stellar parameters derived by all three surveys investigated in this paper are radial velocity (vrad), effective temperature (Teff), surface gravity (log g), overall metallicity ([M/H]=[Fe/H]), and microturbulent velocity (vmicro). In addition to these common parameters, APOGEE published [α/Fe] and v sin i (vmac for giants), GALAH derived [α/Fe] and vbroad, and GES calculated v sin i. The full list of individual parameters and those present in the overlapping data sets are listed in Tables 1 and 2, respectively. As listed in the relevant column of Table 1, 20 individual elemental abundances (including C and C I, Ti I and Ti II) were released as part of DR17 by APOGEE. From the latest data release of GALAH (DR3), the 30 species of abundances are listed in the second column of Table 1. In its fifth data release, GES has made public abundances of 31 elements, which are listed in the third column of Table 1. The description of this data release can be found in an ESO document1 and in Randich et al. (2022).

Table 1.

Details of the latest data releases of the APOGEE, GALAH, and Gaia−ESO sky survey programs.

Table 2.

Comparison of the common catalogs of the surveys.

In order to create the APOGEE−GALAH, APOGEE−GES, and GALAH−GES stellar catalogs, we combined the published data from APOGEE DR17, GALAH DR3, and GES DR5 using TOPCAT version 4.8-2 (Tool for OPerations on Catalogues And Tables; Taylor 2005). Before matching stars, we removed all stars with STAR_BAD flags from the APOGEE DR17 sample. We performed a 1″ cross-match in TOPCAT and then removed GALAH and GES stars with flags indicating one or more problematic parameters. More details about the flagging systems and our additional filtering aspects are given in Sect. 2.2. Entries with APOGEE STAR_BAD flags were also removed (see Sect. 2.2 for discussion).

The combined APOGEE−GALAH data set contains 37 770 stars, the APOGEE−GES catalog 2,502 stars, and the GALAH−GES common sample 1510 stars before applying other quality cuts to the data. We define these high-quality data sets as golden samples. The positions in Galactic coordinates (l, b) of all APOGEE stars and the positions from the three common data sets are shown in Fig. 1. While APOGEE often publishes multiple values of the parameters, if the particular star was part of multiple science programs, none of the stars in our common sample had multiple entries in the final DR17 catalog.

thumbnail Fig. 1.

Mollweide projection of the Milky Way with the distribution of all targets from APOGEE DR17 (olive green) and from the APOGEE−GALAH (orange), APOGEE−GES (blue), and GALAH−GES (red) common catalogs shown in Galactic coordinates.

The stars observed in APOGEE are from both hemispheres, and these targets are located in the highly reddened Galactic disk and bulge, and in the Galactic halo (Zasowski et al. 2017; Beaton et al. 2021; Santana et al. 2021). Interstellar extinction is less significant in the near-infrared H-band than in the optical (A(H)/A(V) ∼ 1/6, Cardelli et al. 1989), which allowed APOGEE to probe the Galactic thin and thick disks in more detail than GALAH and GES. In addition to observing the Milky Way, the APOGEE program also included the Magellanic Clouds (MCs), eight dwarf spheroidal satellites, and observations of both M33 and M31 (Beaton et al. 2021; Santana et al. 2021). GALAH observes stars from the southern hemisphere, mostly along the ecliptic or Galactic latitudes lower than ±30° and the MCs, but not in the thin disk, avoiding high-extinction areas. Stars in GES are from the southern hemisphere along the ecliptic, and field stars in the Galactic halo, the thick and thin disks, and the Galactic bulge.

While the three surveys computed chemical abundances for many elements, not all elements can be studied by each survey. The common parameters and species between APOGEE, GALAH, and GES can be found in Table 2. Between APOGEE and GALAH the common derived elemental abundances are C, O, Na, Mg, Al, Si, K, Ca, Ti, Ti II, V, Cr, Mn, Co, Ni, and Cu. Between APOGEE and GES we are able to compare the species C, O, Na, Mg, Al, Si, S, Ca, Ti, Ti II, V, Cr, Mn, Co, Ni, and Cu. From the GALAH-GES sample we can examine differences in the abundances of Li, C, O, Na, Mg, Al, Si, Ca, Sc, Sc II, Ti, Ti II, V, Cr, Cr II, Mn, Co, Ni, Cu, Zn, Sr, Zr, Mo, Ba II, La II, Ce II, Nd II, Sm II, and Eu II. Figure 2 shows a diagram of all elements appearing in at least one survey’s parameter list.

thumbnail Fig. 2.

Venn diagram of the elements derived by each survey. The overlap of elements in all three surveys is in brown and those from only two surveys in dark green. Light green indicates no overlap between surveys. Species of any intersection are discussed in detail in Sect. 6.

2.2. Selecting the parameter space

In this section we describe the quality cuts applied to each survey data set to ensure that unreliable measurements are not included in the comparisons, so that we can reliably explore the underlying systematic differences.

We applied several global and survey specific local cuts for each survey, which are defined here. Global cuts are applied on the samples before any calculation, and affect all parameters. Moreover, we use local cuts and flags where individual abundances are considered, which makes the analysis of discrepancies of each elemental abundance independent.

In the first step we filtered out APOGEE stars that have the quality control flag STAR_BAD set, but retained stars with the STAR_WARN flag. This corresponded to the bit 23 part of APOGEE_ASPCAPFLAG bitmask. The STAR_BAD flag is set for a star if any of TEFF_BAD, LOGG_BAD, CHI2_BAD, COLORTE_BAD, ROTATION_BAD, SN_BAD or GRIDEDGE_BAD occur2 (Abdurro’uf et al. 2022). We also selected APOGEE stars within the following global criteria: signal-to-noise ratio S/N > 100 per half-resolution element per pixel to minimize uncertainties caused by random noise; VSCATTER < 1 km s−1 for the scatter in the radial velocity (RV) to eliminate most binaries and variable stars; VERR < 1 km s−1 for the error of RV to discard potentially bad measurements; and VSINI < 15 km s−1 for the rotational velocity to avoid significant rotational broadening.

From the GALAH survey we selected stars that satisfy the following criteria: S/N > 30 (snr_c3_iraf > 30, as recommended in GALAH DR3 documentation3); VBROAD < 15 km s−1 to discard significant stellar rotation. In addition, only stars with quality flags flag_sp = 0 (global), flag_fe_h = 0 (global), and flag_X_fe = 0 (local) were selected to avoid identified problems with stellar parameter determination, metallicity, and abundance determination of element [X/Fe], respectively (as advised in GALAH DR3 documentation, and discussed in Buder et al. 2021).

The GES data set was filtered by utilizing the survey’s flagging system and recommendations defined in ESO Phase 3 Data Release 5 Description, Tables 36. We list the criteria, and we report the values of the eliminated flags of SLFAGS column in brackets. We removed stars with a relative error in effective temperature larger than 5%, with S/N < 50 (and SFLAG==SNR), with RV error of E_VRAD > 1 km s−1 and with significant rotation: VSINI > 15 km s−1 (and SFLAG==SRO). We also removed stars with unreliable RV results that are presumable binary or variable stars (SRV, BIN), stars with spectral reduction issues (SRP), stars that are out of grid boundaries (PSC), and spectra containing any emission lines (e.g., Hα; EML).

Table 3.

Comparison of the elemental abundance values from the surveys.

Table 4.

Overall average of the reported individual parameter errors from the golden samples.

Table 5.

Median absolute deviation of the reported parameters and abundances from the golden samples.

Table 6.

Median absolute deviation of the reported abundances from the golden samples.

In addition to the considerations listed above, stars with missing parameter values in their respective databases were removed. When analyzing specific individual abundances stars are excluded if the particular abundance value of that element is not reported or flagged.

2.3. Common data description

Before discussing the discrepancies in atmospheric parameters and abundance ratios, we briefly describe the characteristics of the selected common data sets obtained after cuts.

Quality cuts left 15,537 stars, 804 stars, and 441 stars in the APOGEE−GALAH, APOGEE−GES, and GALAH−GES common catalogs, respectively. Figure 3 shows Teff versus log g (Kiel) diagrams. The top row presents the full set of stars from each individual survey separately. The bottom row contains the diagrams of common stars from the APOGEE−GALAH, APOGEE−GES, and GALAH−GES catalogs color-coded by metallicity values after all cuts have been applied. GALAH stars with Teff < 4100 K were filtered out by the flagging system implemented by GALAH. Using Kiel diagrams it is relatively easy to separate the main sequence (MS) and red-giant branch (RGB) stars. We defined APOGEE−GALAH stars to be giants using the following equations: Teff < 5250 K and log g < 3.80 dex, or Teff > 5250 K and log g + (Teff − 4950 K)/1500 K < 4 dex; in the other two cases giants are those with log g < 3.75 dex irrespective of Teff, as indicated by the dashed lines in Fig. 3 plots in the bottom row of plots. The reason behind having different definitions for the three overlapping data sets is that they are different data sets and have their own small offsets, so a cut that is suitable for one data set may not work for another in practice. After applying global cuts the APOGEE−GALAH common stellar catalog contains 7763 dwarfs and 7774 giants; there are 112 dwarfs and 692 giants in APOGEE−GES; and there are 170 dwarfs and 271 giants in GALAH−GES. The common data description outlined in this section generally covers all the analyses of main atmospheric parameters (see Sect. 5).

thumbnail Fig. 3.

Teff − log g diagrams of quality stars observed in APOGEE DR17, GALAH DR3, GES DR5 surveys (top row) and of stars that are common between these surveys (bottom row) color-coded by metallicity. In the APOGEE−GALAH and APOGEE−GES panels in the bottom row the APOGEE DR17 Teff (K), log g (cgs, dex), and [M/H] (dex) are used on the axes and in the color-coding; in the GALAH−GES panel the GALAH DR3 Teff, log g, and [Fe/H] are used. The separations of dwarf and giant stars are indicated by dashed lines in the common data sets.

It should be noted that the APOGEE−GALAH sample covers a large range of 4000 K−7000 K and 0−4.75 dex in Teff and log g, respectively. As a result of the smaller samples they contain, the APOGEE−GES and GALAH−GES common samples seem to have a narrower distribution in the parameter space. Moreover, in the last two cases the number of MS stars is relatively low.

3. Methods of surveys

Throughout its data releases, the APOGEE survey provided both “raw” parameters, derived from the APOGEE Stellar Parameters and Chemical Abundance Pipeline (ASPCAP; García Pérez et al. 2016), which relies on the FERRE4 multidimensional χ2 optimization code (Allende Prieto et al. 2006), and from the APOGEE line list (Smith et al. 2021), and also parameters calibrated to values that were independently derived. Because we are interested in how well the various methods used to derive parameters and abundances behave, we decided to use the APOGEE raw parameters and abundance values for the discussions presented in this paper. A brief description of the calibrations is provided in Sect. 8 where the effect of the APOGEE Teff, log g, and [M/H] calibrations on the discrepancies between the surveys is studied specifically.

The raw values of Teff, log g, [M/H], and vmicro used here were taken from the FPARAM array, while the uncalibrated individual abundances are listed under the FELEM label (Abdurro’uf et al. 2022). We note that elemental abundances are determined relative either to hydrogen ([X/H]) or iron ([X/M]), and the conversion between them is given by the following relation:

(1)

We note that APOGEE calculated both overall metallicity denoted by [M/H] and iron abundance [Fe/H] (Abdurro’uf et al. 2022), while the other two surveys use the iron abundance as a proxy for metallicity. The two metallicity indicators generally provide the same values within the expected uncertainties. Here, a mean difference between iron and all metal abundances of 0.002 ± 0.011 dex is obtained for the APOGEE−GALAH catalog. We discuss further details about the differences between the various definitions in Sect. 5.3.

In terms of the output of GALAH DR3, the elemental abundances are derived relative to Fe, while in GES DR5 the provided values are absolute abundances. Generally in our calculations, we also apply the

(2)

relation, where [X/H] is the relative abundance and A(X) is the absolute abundance, for which the related solar reference value is A(X).

The absolute abundance is a definition based on how the solar reference table is constructed, and these values vary from survey to survey, as indicated in the last row in Table 1. Our main motivation is to explore the abundance differences between the results for the three surveys on an absolute scale. APOGEE and GES reported the use of Grevesse et al. (2007) as solar reference for the abundances. In addition, GALAH derived their own solar reference abundances; these zero points are published in Buder et al. (2021), and we note that there are several elements with multiple reference values reported for each absorption line used in the fitting instead of a combined one. We therefore use the mean of these values in those cases.

4. Comparison of radial velocities

The radial velocities (RV) in the three surveys are calculated first with the help of preliminary estimated effective temperatures and surface gravities before deriving the final parameters for each star. In order to assess the magnitude of the differences and scatter between surveys, we compare them to the average uncertainties reported by each survey.

In APOGEE DR17 the data reduction algorithm (Nidever et al. 2015) performs a least-squares fit to a set of visit spectra, solving simultaneously for basic stellar atmospheric parameters and the radial velocity for each visit. The measurements are reported under the VHELIO_AVG label and corrected for the barycentric motion (Abdurro’uf et al. 2022). The overall average of RV uncertainty, σvrad for APOGEE DR17 quality stars is reported in Table 4 (both for dwarfs and giants shown in the top left panel of Fig. 3).

For each GALAH DR3 spectrum, RV is fitted as part of the spectrum synthesis comparison and has the barycentric correction applied. For analyses we use preferably rv_nogr_obst first, and if there is no entry for that parameter we use rv_sme_v2, which are not corrected for gravitational redshifts (Buder et al. 2021). The average uncertainties of individual RVs in our quality GALAH sample is provided in Table 4.

In GES DR5 RVs are derived by cross-correlating each spectrum with a grid of synthetic template spectra. In the quality GES DR5 data set the average uncertainty of RVs is listed in Table 4.

It should be noted that the RV difference, Δvrad, is calculated as vrad(APOGEE) − vrad(GALAH), vrad(APOGEE) − vrad(GES), and vrad(GALAH) − vrad(GES) for the APOGEE−GALAH, APOGEE−GES, and GALAH−GES catalogs, respectively. Radial velocity differences as functions of Teff, metallicity, log g, vmicro, and vrad are shown in Fig. 4. We do not distinguish MS stars from red giants in Fig. 4, but Table 2 also contains calculations of the average and scatter of RV discrepancies for dwarfs and giants separately. For all stars in common after quality cuts between APOGEE and GALAH, a vrad discrepancy of −0.02 ± 3.51 km s−1 is found (see Fig. 4 bottom row), and 95.0% of these stars are located in its ±1σ confidence range. We find no clear significant correlations in RV measurements as a function of atmospheric parameters, except for a weak trend for metal-poor red giants (bottom row, second panel from the right for [M/H] < −1.0, Fig. 4). When separating MS dwarfs from RGB giants, the offset is less significant for dwarfs (−0.01 ± 4.10 km s−1) than for giants (−0.04 ± 2.81 km s−1). These offsets do not indicate systematic errors as they are significantly smaller than the estimated dispersion of both APOGEE and GALAH. Although the discrepancy distributions are not exactly Gaussian, the σvrad uncertainties are combined by using the square root of the sum of squares. Therefore, for the APOGEE−GALAH golden sample the estimated RV internal uncertainty is 0.12 km s−1.

thumbnail Fig. 4.

Distributions of the radial velocity differences with respect to the main stellar parameters for the GALAH−GES (top row), APOGEE−GES (middle row), and APOGEE−GALAH (bottom row) common data sets, including dwarfs and giants. Bins containing only one star are in gray in the background; the color-coding (according to the number of stars per bin) starts from two stars. The mean and standard deviation of the differences are also shown in each case. Horizontal axes indicate Teff (K), metallicity, log g (dex), and vmicro and vrad (km s−1) from APOGEE DR17 (middle and bottom rows) and from GALAH DR3 (top row).

Although the global cuts have already been applied, we note that the number of stars with an absolute RV difference larger than 1 km s−1 is 1608. The most likely candidates would be variable stars (pulsators, binaries, eclipsing systems) that remained in the filtered quality sample. In order to clarify this idea, we matched the APOGEE−GALAH table with the APOGEE DR17 Value Added Catalog (VAC) of double-lined spectroscopic binaries (Kounkel et al. 2021), and with the General Catalog of Variable Stars (GCVS, Samus’ et al. 2017). After cross-matching our sample with these catalogs, only six matches were found in the two cases together. Global cuts included the removal of stars with a GALAH flag_sp that indicates stars with likely unreliable astrometry in Gaia, defined as ruwe_dr2 > 1.4. Below this value of the RUWE parameter we find no significant correlation between Δvrad and ruwe_dr2.

Furthermore, we made use of the Variable Summary of Gaia DR3 (Eyer et al. 2023). A source_id-based cross-match between the APOGEE−GALAH quality data set and this catalog resulted in 437 common stars. The combined RV uncertainty of APOGEE and GALAH is 0.12 km s−1 (calculated with the values from Table 4), and we conclude that only 2.24% of the matched stars lie outside of this uncertainty range. Therefore, the reason for this variation is unclear, while we can conclude that our global cuts that remove known variables from the samples have been effective. The presence of as-yet-unknown measurement discrepancies or unidentified variables might explain this issue. In addition, Anguiano et al. (2018) have found similarly large differences in RVs from comparing APOGEE DR14 and LAMOST DR3.

The average difference in RV is 0.15 ± 2.54 km s−1 for the APOGEE−GES catalog, and 94.3% of stars are within 1σ uncertainty (see Fig. 4 middle row). The average and scatter of RV discrepancies are 0.49 ± 1.26 km s−1 and 0.09 ± 2.69 km s−1 for dwarfs and giants, respectively. We note that the offset is more significant for MS stars, and is larger than the combined uncertainty of the APOGEE−GES catalog, which is 0.28 km s−1. This offset is reduced to 0.42 km s−1 when we filter out the dwarfs with a RV difference larger than ±5 km s−1. Such large discrepancies may happen when the initial atmospheric parameters used during the cross-correlation have a large offset from the actual parameters of the stars.

In the GALAH−GES common sample (Fig. 4, top row) we found an offset and scatter of 0.11 km s−1 and 4.18 km s−1 in RV differences, and 92.9% of the considered stars are included in this interval. For dwarfs and giants the discrepancy is 0.69 ± 4.37 km s−1 and −0.21 ± 4.09 km s−1, respectively. Similarly to the case of APOGEE−GES catalog, the offset is more significant for MS stars, which exceeds the GALAH−GES combined estimated precision of 0.30 km s−1. This relatively high offset in dwarfs is partially the result of 12 MS stars with Δvrad > 5 km s−1 in common between GALAH and GES data set. After filtering these stars out the mean is reduced to 0.39 km s−1.

Using Gaia as a reference, Tsantaki et al. (2022) present a comprehensive catalog of RVs, built by homogeneously merging the RV determinations of the largest ground-based spectroscopic surveys to date, namely APOGEE, GALAH, Gaia-ESO, RAVE, and LAMOST. In their work, the uncertainties are normalized using repeated measurements, or the three-cornered hat method, and a cross-calibration of the RVs as a function of the main parameters is included. They estimate the accuracy of the RV zero-point to be about −0.01 km s−1, 0.04 km s−1, and 0.10 km s−1 and the RV precision to be in the range 3.89 km s−1, 3.97 km s−1, and 4.72 km s−1 compared to APOGEE DR16, GALAH DR2, and GES DR3, respectively. While these numbers are in the range of what we find here, an “apples to apples” comparison with our values is not possible as we use more recent data releases for all surveys and other factors.

Overall we find that APOGEE, GALAH, and GES generally measure the same RVs within uncertainties, but a higher than expected scatter around the mean differences is observed. Currently, we do not have an explanation of such large radial velocity differences.

In the bottom panel of the APOGEE−GALAH row in Fig. 4, one can see a set of stars at particularly high RVs: ∼235 km s−1. We conclude that most of these stars are part of the massive globular cluster ω Cen (NGC 5139), and RV differences show a clear 0.27 km s−1 offset in the APOGEE−GALAH overlapping data set. For these 130 cluster members, the average RVs are 234.95 ± 0.55 km s−1 and 234.68 ± 0.56 km s−1 from APOGEE DR17 and GALAH DR3, respectively. The latter result is closer to the value reported by Baumgardt et al. (2019) as they used Gaia DR2 to study the motion and proper kinematics of globular clusters and published a mean RV of 234.28 ± 0.24 km s−1 for ω Cen.

5. Discussion of differences: Main stellar parameters

In this section we discuss in detail the discrepancies of the main stellar parameters (Teff, log g, metallicity, and vmicro) derived by the three spectroscopic surveys. We note that the relevant atmospheric parameter (AP) difference, Δ(AP) = {ΔTeff,Δlog g,Δ[M/H] or Δ[Fe/H],Δvmicro}, is calculated as AP(APOGEE) − AP(GALAH), AP(APOGEE) − AP(GES), and AP(GALAH) − AP(GES) for the APOGEE−GALAH, APOGEE−GES, and GALAH−GES catalogs, respectively. The three surveys followed different approaches to determine these parameters, which we briefly outline here before discussing the differences.

As described in Sect. 3 above, stellar parameters and abundances for APOGEE DR17 were determined using ASPCAP, which finds the best fit between combined observed visit spectra and synthetic model spectra in APOGEE DR17 (Abdurro’uf et al. 2022). As in previous data releases, the reported individual uncertainties are estimated by analyzing the scatter for stars having repeated observations (Abdurro’uf et al. 2022). Here and in Sect. 6 we make use of uncalibrated Teff, log g, log(vmicro), [α/M], and [M/H] values reported under the ASPCAP output array FPARAM with their associated errors. Similarly to previous data releases, systematic differences between spectra taken from APO and LCO were corrected by applying the median difference for the stars observed at the two observatories (Abdurro’uf et al. 2022).

In GALAH DR3 all stellar parameters and elemental abundances were estimated via the spectrum synthesis code Spectroscopy Made Easy (SME, Valenti & Piskunov 1996; Piskunov & Valenti 2017) with 1D MARCS stellar atmosphere models (Gustafsson et al. 2008). Tests were performed to validate the obtained parameters in terms of their accuracy and precision (Buder et al. 2021). Accuracy assessment involves commonly used comparison samples: the Sun, Gaia FGK benchmark stars (GBS, Heiter et al. 2015; Jofré et al. 2018), photometric Teff from the infrared flux method (IRFM, Casagrande et al. 2010), stars with asteroseismic data, and cluster members, while internal uncertainty estimations and repeated observations are used for the precision analysis (Buder et al. 2021).

Based on a common suitable line list, set of model atmospheres, grid of synthetic spectra, and approach to data formats and standards for the analyses, the parameters returned by the various working groups (WGs) contributing to the GES program at the end of the parameter determination are processed by the homogenization WG to produce a set of recommended parameters for the survey. The same kind of quality assessment and calibration as used by the GALAH team is applied by the GES team (Randich et al. 2022). Parameters from the quality GES DR5 catalog are displayed in the third panel of Fig. 3, top row.

In order to properly discuss the differences, we calculate the average Teff, log g, metallicity uncertainties of those APOGEE DR17, GALAH DR3, and GES DR5 stars that are contained by the relevant filtered catalogs (golden samples in Fig. 3, first row) and use these average uncertainties in the evaluation of the observed discrepancies. The mean values of these errors, σTeff, σlog g, σ[Fe/H] are presented in Table 4. If the discrepancy distribution is approximately a symmetric Gaussian and there are no systematic effects, we can use the sum of squares of corresponding uncertainty values to compare the observed differences. This is the case if the uncertainties originating from each survey are combined. This also applies to the [X/H] errors calculated from the reported [X/Fe] and [Fe/H] uncertainties. However, it is worth noting that this is an oversimplification in estimating the combined errors as systematic effects always exist between parameters, though their extent is often not known precisely enough to correctly take them into account. We also calculate and evaluate the dispersion of the various distributions measured through their median absolute deviation (MAD). For the main parameters, the MAD values are presented in Table 5, and for the abundances in Table 6. We conclude that the MAD values are typically ∼40 to 80% lower than the standard deviation of the differences.

Figure 5 displays the correlations found between parameter pairs in all three overlapping catalogs. Each cell corresponds to the analysis of a specific parameter plane, and for each parameter pair a linear least-squares fit was made for the set of overlapping stars between the surveys. The degree to which a correlation exists or not is represented by the color-coding. Based on the slope of the fits, denoted by a, we deduce the estimated uncertainty of the parameter discrepancies. Cell by cell, this analysis is based on an approximation that the parameter difference is completely caused by the uncertainty of its pair parameter only. Therefore, we define three categories: (i) no significant correlation if a < 0.8σ, (ii) potential weak correlation if 0.8σ < a < 1.5σ, (iii) substantial correlation if 1.5σ < a. No substantial correlations (case iii) are found between any of the surveys. For the majority of cells we found no statistically relevant correlations (case i), implying that the uncertainty is dominated simply by the individual errors determined for the individual derived parameters from each survey. However, as Fig. 5 indicates, several weak correlations (case ii) exist among the parameters. For example, specifically for the APOGEE−GALAH dwarf subsample, there is a weak correlation between ΔTeff (first row) and Δ[M/H] (sixth column) implying a Teff discrepancy 0.8 − 1.5 times higher than from metallicity uncertainties alone. In the following subsections we go into the details of the differences in parameters.

thumbnail Fig. 5.

Correlations between the main parameter pairs in the three overlapping catalogs. The strength of the correlations between each pair is color-coded: light blue cells represent no significant correlations (< 0.8σ); dark blue cells represent weak correlations (0.8σ − 1.5σ). Black cells are not relevant due to the absence of reported uncertainties of vmicro or because they link the same quantities (e.g., ΔTeff with ΔTeff). The estimated strength of the correlations is based on the comparison of the correlation slopes with the average of the reported individual uncertainties of stellar parameters and RVs. The stars have been separated into dwarfs (left column) and giants (right column).

5.1. Effective temperature

Effective temperature is arguably the most important main stellar parameter that has the largest effect on the structure of stellar atmosphere and the formation of absorption lines. Similar to the RV analysis performed earlier, we compare the mean differences and scatters to the average of the reported individual errors combined for the relevant survey pairs.

Discrepancies in Teff are shown in the top row of Fig. 6 for the APOGEE−GALAH overlapping globally filtered sample. The overall statistics, including the mean offset and standard deviation of Teff separated for MS and RGB stars, are listed in Table 2. That table also contains the calculated differences for the APOGEE−GES and GALAH−GES common stars.

thumbnail Fig. 6.

Distributions of effective temperature (top row), surface gravity (second row), metallicity (third row), and microturbulence velocity differences (bottom row) as functions of the main stellar parameters from the APOGEE−GALAH overlapping stellar catalog, for dwarfs and giants combined. The number density of stars (starting with two stars) over the parameter space spanned is provided by the color-coding. The binned star densities over the parameter spaces, and the (overall) mean and standard deviation of parameter differences are shown in each panel as well. Bins representing one star are shown in gray. The horizontal axes indicate Teff (K), metallicity, log g (dex), and vmicro (km s−1) from APOGEE DR17, and the parameter differences are calculated as Δ(AP) = AP(APOGEE) − AP(GALAH).

In the APOGEE−GALAH quality data set, we find a discrepancy of −5.4 ± 126.6 K for Teff, as shown in Fig. 6. While 78.2% of these stars are located within the ±1σ range, there is also a linear trend in the Teff − ΔTeff plane. The offset is significantly different for dwarfs (43.6 ± 137.3 K) and giants (−54.3 ± 91.7 K), as seen in Table 2, but the collective offset spans the interval of predicted uncertainties of σTeff = 91.5 K propagated from the APOGEE and GALAH golden samples. Separately, the offsets still remain lower than the preliminary calculated APOGEE−GALAH sample uncertainty of 93.3 K and 94.6 K for MS and RGB stars, respectively. The GALAH team reported a trend toward increasingly underestimated Teff for hotter dwarfs, similar to previous GALAH analyses (Buder et al. 2021). This is in a good agreement with the positive offset of MS presented here.

We note that the issue of altering the sign of the offsets is visible in the ΔTeff plot as a function of Teff and log g in the relevant panels of Fig. 6. We do not find a symmetric distribution or a linear trend in the [M/H] versus ΔTeff plot, but a systematic deviation for metal-poor stars is apparent. Lastly, we find no notable correlation between microturbulence and ΔTeff. In addition to the overall analysis of the stellar distributions in Fig. 6, the bottom row of Fig. 5 depicts the following considerations: for dwarfs the correlations of ΔTeff with Δ[M/H] and Δlog g remain weak, within the determined uncertainties, and in the other cases and for the giants, Teff remains within the combined uncertainty ranges determined for the APOGEE−GALAH catalog.

In APOGEE−GES catalog we observe an offset in Teff of about −57 K together with a scatter of 118 K; 70.8% of these stars cover the variation range of ±1σ. When looking at the difference between the dwarfs and giants, we find that the mean Teff discrepancy for both MS and RGB stars are negative, namely −15.0 ± 114.7 K and −64.3 ± 117.2 K, respectively (see Table 2); the GES temperatures are generally hotter than those determined by APOGEE by the average amounts noted. These offsets are not significantly higher than the expected combined σTeff = 53.1 K precision that is calculated for the APOGEE and GES quality stars.

We also present the Teff differences between GALAH and GES with respect to the main stellar parameters. The overall offset and scatter is −12.0 ± 125.5 K for our filtered data set of common targets, and 75.3% of the sample falls in this range, while the separate results presented in Table 2 reveal that the effects of different measurements and/or derivations play a role in dwarfs and giants. Although neither of the offsets is higher than the average uncertainty of the propagated errors (σTeff = 100.3 K), it is interesting that the discrepancy seems more notable for dwarfs (−60.4 K) than for giants (21.2 K). However, there are relatively few objects in the MS parameter space (at high log g and Teff) to perform meaningful statistics (see Fig. 3).

We conclude that the three high-resolution spectroscopic programs measure similar Tteff values for the quality samples within the determined uncertainty of each survey. The previously found systematic difference between APOGEE spectroscopic and photometric Teff can be seen in the APOGEE−GALAH comparison as well.

5.2. Surface gravity

In this section we discuss the differences in surface gravity as a function of the stellar parameters, as above for Teff (see Sect. 2.3).

The second row in Fig. 6 shows the log g discrepancies with respect to the measured stellar parameters for the APOGEE−GALAH common stars. The mean offset and scatter calculated for MS and RGB stars are listed in Table 2. Comparing the APOGEE log g with GALAH is interesting because APOGEE measures it from fitting the spectra, while GALAH updated the spectroscopic surface gravities with isochrones (Buder et al. 2021).

In the APOGEE−GALAH catalog we see an overall offset of 0.05 dex with a scatter of 0.19 dex, and 77.5% of these stars belong to this uncertainty range. In terms of dwarfs, the discrepancy is 0.01 ± 0.13 dex, and we find an interesting curving downturn in the log g − Δlog g parameter space around dwarfs, although a slight offset and a weak linear rising trend at the cool end exist with respect to Teff. We can see a larger mean difference and standard deviation for giants, including the red clump, which is 0.09 ± 0.22 dex. APOGEE seems to have derived systematically higher log g for giants than GALAH, and this is also stated in Holtzman et al. (2018): APOGEE ASPCAP (raw) spectroscopic log g values for giants are systematically higher than those derived from asteroseismology. Instead, De Silva et al. (2015) found good agreement between the GALAH’s and asteroseismic log g values. Therefore, calibration was necessary in APOGEE (see Sect. 8). Comparing calibrated results from APOGEE DR14 and uncalibrated values from LAMOST DR3, Anguiano et al. (2018) also found a positive deviation of 0.14 dex with a scatter of 0.25 dex in log g. We note that the metal-poor giant members of ω Cen account for the significant Δlog g < −0.4 dex values shown in the second panel from the left in the second row of Fig. 6.

The calculated uncertainty of σlog g = 0.21 dex propagated from APOGEE and GALAH is always smaller than the discrepancy seen for the MS and the RGB stars. Moreover, Fig. 5 (third row) indicates no significant correlation between the main parameter errors and the differences between the APOGEE−GALAH parameters. The log g discrepancy between APOGEE and GALAH is large at very low metallicities (third panel from the left in the second row); however, the spectroscopic surface gravities for cooler dwarfs tend to be underestimated (Holtzman et al. 2018), which is also found within our analysis. Nandakumar et al. (2020) also found systematically higher log g values for APOGEE DR16 than for GALAH SME, in agreement with our results. The pattern of discrepancy drawn by red clump stars (log g ∼ 2.5 dex) and MS stars is retained as well.

In APOGEE−GES a mean discrepancy of −0.03 dex and a scatter of 0.31 dex is calculated. The ±1σ interval covers 75.6% of the stars in common. Separating the MS and RGB stars, we find a −0.03 ± 0.17 dex average log g difference for dwarfs and −0.03 ± 0.33 dex for giants (see Table 2). The APOGEE−GES combined uncertainty is estimated as σlog g = 0.12 dex, which is larger than the offset observed. No clear trends can be observed in the discrepancy distributions.

The log g discrepancies as functions of stellar parameters of stars included in the GALAH−GES quality catalog follow. We find that the mean deviation of the measured log g values is −0.09 dex with a scatter of 0.20 dex, and 75.3% of the sample is located in the ±1σ range. As listed in Table 2, the offset is −0.02 ± 0.16 dex for dwarfs and a more dominant offset of −0.14 ± 0.22 dex for giants can be seen. Similarly to the APOGEE−GES data set, the majority of the stars is located on the RGB. While GALAH−GES giants have the largest average log g difference among all the discussed data sets, this discrepancy stays within the combined internal precision of σlog g = 0.24 dex. Finally, no systematic trends in Δlog g can be found with respect to the main parameters.

As a result, no significant surface gravity offset and no substantial correlation beyond the average (and combination) of individual parameter uncertainties are found. APOGEE, GALAH, and GES determine their log g values within their estimated uncertainties.

5.3. Metallicity

Here we discuss the differences between the derived metallicity values, which is an important tracer of stellar populations. Similarly to the previous main parameters, the discrepancy is compared to the combined estimated uncertainties.

In APOGEE, the overall metallicity is derived by ASPCAP fitting all spectral lines globally, not only by lines of iron (García Pérez et al. 2016). However, it is still a valid decision to consider that [M/H] and [Fe/H] are equivalent, as the number of iron lines in the spectra of FGK-type stars is by far the largest among all elements, so overall metallicity is essentially the iron abundance within the margin of precision of ±0.1 dex (Jofré et al. 2019). Figure 7 depicts the differences between uncalibrated [M/H] and [Fe/H] for stars included in the APOGEE−GALAH quality catalog. While we can find a higher scatter at low metallicities (and at the highest Teff values), the average discrepancy and scatter is 0.002±0.011 dex, which lies within 0.1 dex (denoted by dashed lines in Fig. 7). When discussing the precision of abundances we use this 0.1 dex in Sects. 5.3 and 6 as a guide of an indicative precision that we expect the high-resolution surveys can reach for elements with well-defined absorption lines.

thumbnail Fig. 7.

Overall metallicity vs. the difference between overall metallicity and iron abundance color-coded by Teff published in APOGEE DR17. The indicated stars are from the APOGEE−GALAH common stellar catalog and global cuts are applied as well. The dashed lines indicate a difference of ±0.1 dex (expected uncertainty from abundance analysis).

Furthermore, García Pérez et al. (2016) reported that uncertainties tend to be more significant toward low metallicities and temperatures, which we also experience here. Therefore, we consider overall metallicity and iron abundance equivalent in APOGEE DR17, and we use [M/H] in the comparisons.

For all stars included in the APOGEE−GALAH overlapping quality catalog, the published metallicity discrepancies with respect to main parameters are represented in the third row of Fig. 6. We find that the mean metallicity difference is 0.023 dex with a scatter of 0.105 dex between the stars collectively, and 78.9% of the samples ranges in the ±1σ interval. In the [M/H]−Δ[M/H] and vmicro − Δ[M/H] parameter spaces, it is clearly recognized that APOGEE systematically measures higher metallicity values.

Specifically for the MS stars, we find an offset of 0.006 dex along with a 0.082 dex scatter. As shown in the relevant (third row) Teff − Δ[M/H] and log g − Δ[M/H] planes in Fig. 6, there is a larger mean and standard deviation (0.039 ± 0.121 dex) in Δ[M/H] for RGB stars (see Table 2), and the offset is strongly dependent on the log g differences between APOGEE and GALAH (Fig. 5). This may suggest that systematic offsets in surface gravity may translate to discrepancies in metallicity. The APOGEE−GALAH combined internal uncertainty is σ[M/H] = 0.077 dex, we conclude that the derived mean discrepancies stay within this range. Therefore, an overall agreement is found in the measured metallicities between APOGEE DR17 and GALAH DR3, but users should be aware that the Δ[M/H] is larger at low Teff values.

Soubiran et al. (2022) performed the assessment of [Fe/H] determinations of FGK stars in spectroscopic surveys. In addition to conducting several tests, they looked for trends in the distribution of residuals versus GaiaG magnitude and Teff, log g, and [M/H]. In general, they found a consistency between APOGEE DR16 (calibrated values), GALAH DR3, and GES DR3. The offsets and scatters are on the same order as in our results, and their plotted distribution patterns also correspond to those in Fig. 6.

Anguiano et al. (2018) also observed a small offset (between APOGEE and LAMOST) in calibrated [Fe/H] of about 0.06 dex together with a scatter of 0.13 dex, and also discovered a more significant discrepancy for cooler stars. Furthermore, when comparing GALAH DR3 with GALAH DR2, Buder et al. (2021) also found a trend of underestimated [Fe/H] for the metal-rich giants and red clump stars, which suggests that their synthetic spectra have a slight inaccuracy (Buder et al. 2021). The difference for all stars with unflagged abundances between APOGEE DR16 and GALAH shows a slightly lower [Fe/H] for GALAH (−0.05 ± 0.14 dex; Buder et al. 2021). According to our study, this underestimation is translated into the positive offset between APOGEE and GALAH.

The main parameter dependences of Δ[Fe/H] for the APOGEE−GES quality stellar data set is discussed in the next step. The average difference is −0.015 dex with a scatter of 0.099 dex, and 76.4% of the stars is located within this uncertainty range. As the vast majority of these stars are cooler than 5500 K and have log g < 3.75 dex, stars contributing to the statistics are mostly giants. The discrepancy is −0.020 ± 0.077 dex and −0.014 ± 0.102 dex for MS and RGB stars, respectively. The average reported error is σ[M/H] = 0.060 dex in the combined APOGEE and GES quality samples. In conclusion, the offset is small without any remarkable systematic trends.

In terms of the GALAH−GES overlapping stars, an overall combined uncertainty of σ[Fe/H] = 0.097 dex is calculated, which suggests that the present offset of −0.107 ± 0.165 dex for giants is significant. Since the number of the comprised GALAH−GES stars is relatively small, we may conclude that the large offset may be the result of the small sample size.

In summary, for the metallicity discrepancies we find no significant offsets and correlations of main parameters beyond the expected uncertainties, except for the mean discrepancy being larger than the expected uncertainty within GALAH−GES and a weak linear relation between Δlog g and Δ[M/H] in the case of APOGEE−GALAH. We note that stars with Teff < 4500 K have larger metallicity differences than the rest of the stars in the APOGEE−GALAH common sample.

5.4. Microturbulent velocity

Here we present a discussion of the empirical (line broadening) atmosphere parameter vmicro. Microturbulent velocity strongly influences the derivation of abundances, so it is important to assess its accuracy among the sky surveys.

The vmicro discrepancy distribution for the overlapping quality catalog APOGEE−GALAH is presented in the panels of Fig. 6 in the bottom row. The discrepancy ranges from ∼ − 1.1 km s−1 to +1.5 km s−1 and we observe an average difference of −0.18 km s−1 with a standard deviation of 0.43 km s−1, and specifically −0.34 ± 0.42 km s−1 and −0.03 ± 0.36 km s−1 for MS and RGB stars (see Table 2). In the log g versus Δvmicro plane it is clear that we have a significant negative offset at high log g values, suggesting that the two surveys use vastly different vmicro for the dwarf stars. In addition to the strong correlation with respect to vmicro (last panel), further interesting patterns appear in the Teff − Δvmicro and [M/H]−Δvmicro planes, for example a strong negative Δvmicro gradient as a function of Teff < 5000 K, and we find detached groups of stellar densities near solar-like metallicities, which are the MS stars. The substantial correlation with respect to vmicro is also represented in Fig. 5. Here we fit Δvmicro as a linear function of vmicro, and find a slope of ∼0.9 which means that the entire difference is comparable to the combined uncertainties in the APOGEE−GALAH sample.

The two surveys used different approaches in determining the value of vmicro. In APOGEE DR17 microturbulence is derived during the global fit of the spectrum (Abdurro’uf et al. 2022). The microturbulence values published in GALAH DR3 are determined via the estimated empirical relations

if Teff≤5500 K and log g≥4.2 dex or

otherwise (Buder et al. 2021). According to Buder et al. (2021) the implementation of these empirical relations of vmicro provides more precise atomic abundance results. In addition, several temperature-dependent trends (in the coolest and hottest regions) have been discovered by the GALAH team during the validation of element abundances. This issue can also be partially caused by over- or underestimated vmicro (Buder et al. 2021).

Furthermore, we examine the derived vmicro discrepancies between APOGEE and GES. Since cuts beyond the global filtering are applied to opt out GES stars without any reported microturbulence data, we have 405 (mostly RGB) stars used in determining the mean difference of −0.16 km s−1 with a scatter of 0.38 km s−1. As in the case of APOGEE−GALAH, we recognize a growing linear trend in the vmicro − Δvmicro plane (Fig. 5). We note that the difference vanishes around 1.5 km s−1 and is systematically proportional to the vmicro data.

As for the GALAH−GES catalog, we only take GES stars with available microturbulence data into account. Therefore, the statistics is made on 228 stars here. An overall mean offset of −0.14 km s−1 is observed with a scatter of 0.22 km s−1, which means a −0.10 ± 0.23 km s−1 and a −0.18 ± 0.19 km s−1 scatter deviation for dwarfs and giants, respectively (see Table 2). Due to the small number of GALAH−GES stars considered here, we find no relevant trends in Δvmicro.

As for the analysis of microturbulent velocity, we find that APOGEE uses systematically lower vmicro values than GALAH and GES. We observe moderate discrepancy trends with respect to Teff, [M/H], and log g, and a substantial correlation as a function of vmicro.

6. Discussion of differences: Chemical abundances

In this section we describe the comparison of chemical abundance ratios, especially the combined α abundance and the individual species of elements that are present in the APOGEE−GALAH catalog (see the relevant intersections in Fig. 2). As discussed in Sect. 2.3, the APOGEE−GALAH sample covers a wide range of effective temperature and surface gravity. Therefore, it is reasonable to provide a detailed further investigation on abundance discrepancy trends and correlations of the APOGEE−GALAH common data as it covers the largest parameter space and has the largest number of samples.

6.1. Abundances from APOGEE, GALAH, and GES

The APOGEE DR17 pipeline (ASPCAP, for details see Sect. 3 and García Pérez et al. 2016, Jönsson et al. 2020) determines the chemical abundances based on non-local thermodynamic equilibrium (NLTE) populations for Na, Mg, K, and Ca (Osorio et al. 2020), and LTE for the other elements. DR17 used the spectral synthesis code Synspec (Hubeny & Lanz 2017; Hubeny et al. 2021). Moreover, certain modifications to the linelists (Smith et al. 2021) have been added (Abdurro’uf et al. 2022).) We note that the estimated uncertainties of abundances are calculated by the APOGEE team using repeated target observations and result in a typical precision of 0.05 dex (Abdurro’uf et al. 2022).

As for GALAH, the abundance determination is also the second step of the parameter analysis of SME; the main parameters are kept fixed while iteratively fitting the individual abundances (Buder et al. 2021). We note that NLTE models are adopted for several elements common between APOGEE DR17 and GALAH DR3: C, O, Na, and Mg. In the final release, GALAH reports relative abundances5 [X/Fe] (Buder et al. 2021). Considering only the abundances from APOGEE−GALAH catalog, both line-by-line (α, C, K, Ti, V, Co, Ni, Cu) and combined (O, Na, Mg, Al, Si, Ca, Cr, Mn) analyses were introduced in the GALAH pipeline. The combination of some elemental lines at different wavelengths is dependent on the detection of these lines and these estimates are a combination of 1 to 9 measurements. As described by Buder et al. (2021), the Sun and solar twins were used in the zero-point validation of the abundance accuracy (further validations involved comparisons to Arcturus, GBS and cluster stars). Precision of individual abundances is assessed with a method based on the S/N scaled uncertainties of repeat observations and internal fitting covariance uncertainties from SME (Buder et al. 2021). For our calculations of absolute abundances, we make use of the A(X) zero-points of Table A2 in Buder et al. (2021).

6.2. Comparison of surveys

We limit our discussion of comparisons with GES just to the absolute abundances due to the lack of enough common stars for a detailed analysis. Individual elemental abundance analysis uses local cuts (introduced in Sect. 2.2) along with opting out stars with abundance values [X/H] < −1 or [X/H] > 1. The mean and scatter of individual discrepancies for stars of the three common catalogs are shown in Table 3.

The number of species of elements reported by both APOGEE and GES is 17, for all of which the mean discrepancy ranges from −0.257 ± 0.319 dex (in [Ti II/H], for dwarfs) to 0.086 ± 0.097 dex (in [Ca/H], for dwarfs). The combined uncertainties exceed dex for the following elements: Na, S, Sc, Ti II, V, Co (dwarfs); and Si, S, Co (giants). This estimated precision is exceeded notably by the offsets from Ti II (Δ[Ti II/H] = −0.257) for the dwarfs and O (Δ[O/H] = −0.202) for the giants. We note that the sample size is reduced by the local cuts to an extent that the final considered number of data points is usually too low for a more detailed analysis.

There are 29 elements measured in common between GALAH DR3 and GES DR5. Table 3 shows the average discrepancies with scatters. Ce II (for giants) and Co (for dwarfs) have the most significant offsets among the listed elements: −0.196 ± 0.224 dex and 0.384 ± 0.467 dex. A precision of 0.1 dex is estimated for both surveys for all elements, which results in a combined uncertainty of σ[X/H] ≈ 0.14 dex, which do not cover the mean discrepancy range of the following species: Co, La II, Nd II, and Ce II for dwarfs and giants (see Table 3). As before, there are not enough stars in common to do a deeper analysis.

Below we give a detailed discussion about the differences and correlations for the APOGEE−GALAH common chemical abundances. Figure 8 shows the result of the analysis that is described here, and is analogous to Fig. 5. Each cell connects two quantities in a way that we fit a linear function on the abundance discrepancy, indicated by the row, with respect to the parameter indicated by the column. Considering the slope of the linear fit and knowing the expected precisions of the two relevant parameters and abundances, we deduce the significance of the correlations of uncertainties. In Figs. 9 and 10, we show the elemental distributions in the Δ(AP) − Δ[X/H] planes, and possible interpretations are pointed out in this section. We note that samples represented in these plots have already met the previously described global and local filtering criteria. Moreover, Fig. 11 shows the average and scatter of the elemental abundances contained in the three common catalogs relative to hydrogen (left column) and on an absolute scale where the zero-point offsets have been applied (right column). We skip the analysis of [Cu/H] because Abdurro’uf et al. (2022) consider the derivation of copper abundance in APOGEE DR17 to be unsuccessful, and this issue had also been recognized by us. We also leave out elements that have no available data after all the cuts.

thumbnail Fig. 8.

Strength of correlations in the APOGEE−GALAH parameter and abundance pairs. The color-coding is as follows: Black cells are indeterminable, light blue cells represent no significant correlation (< 0.8σ), dark blue cells indicate a weak correlation (0.8σ − 1.5σ), and the magenta cells mean strong correlation (> 1.5σ). Our estimations are based on the average correlation of reported individual uncertainties of stellar parameters and abundances. Dwarf (left panel) and giant stars (right panel) are separated.

Alpha-process elements, [α/H]. The [α/H] abundance is one of the independent parameters that ASPCAP uses in its simultaneous multidimensional fit for determining stellar parameters, and contains the following elements: O, Ne, Mg, Si, Ca, and Ti (Jönsson et al. 2020). For GALAH DR3, the reported alpha-enhancement [α/Fe] is computed as the error-weighted combination of abundances of Mg (Mg5711), Si (combined), Ca (combined), and Ti (Ti4758, Ti4759, Ti4782, Ti4802, Ti4820, Ti5739) obtained by 1 to 9 measurements, depending on the detection of the lines; the relevant selected lines are written in brackets. As discussed in Buder et al. (2021), the most dominant α-element is Si, followed by Ti.

The mean Δ[α/H] is −0.005 ± 0.096 dex and there is good agreement between APOGEE DR17 and GALAH DR3. However, linear trends are present as functions of the parameter differences. According to Jönsson et al. (2020), determinations by APOGEE in the near-infrared generally provide lower values of α-abundance than measurements conducted in the optical ranges, which might be reflected in the mean overall discrepancy being below zero.

We find a mean difference of −0.014 ± 0.082 dex in [α/H] for the MS stars. We note that no systematic trend is found with respect to Δvmicro and the distribution is roughly symmetric with its low scatter. Figure 8 indicates a weak correlation with respect to Δ[M/H] and Δlog g. In contrast, the offset is 0.005 ± 0.107 dex when considering RGB stars, with a decreasing trend in the Δvmicro − Δ[α/H] distribution. Moreover, ΔTeff and Δ[M/H] show weak correlations, while a strong correlation is found for Δlog g.

The main distinction for the α-elements is that O is included in APOGEE, while O is excluded in the GALAH α-elements. However, since similar trends in the O comparison in Fig. 8 are not seen, the exclusion of O within the GALAH procedure does not account for the full discrepancy seen (see the paragraph of O later).

Carbon [C/H]. Similarly to the case of α-abundance, the [C/H] abundance determined by APOGEE is based on the CO molecular lines whose abundance is fitted simultaneously along with the stellar parameters and subsequently held fixed. We find good agreement between dwarfs in the APOGEE and GALAH quality samples, with a mean difference of −0.031 ± 0.124 dex in the carbon abundance, and no correlations are found for Δ[C/H] for the dwarf sample. Due to the lack of GALAH quality giants with responsible [C/H] data, results for giants are not found (see Table 3). According to Fig. 11, the mean differences remain around the same value when considering absolute carbon abundances.

Oxygen, [O/H]. The oxygen abundance is determined with OH lines by the APOGEE pipeline. As molecular lines become weak with increasing Teff, the O abundance tag is populated only for stars with Teff < 5000 K. Oxygen shows a high precision and accuracy in the APOGEE data (Jönsson et al. 2020). No significant trend is seen with respect to the parameters (see Figs. 810), but note that the average difference is 0.027 dex with a scatter of 0.198 dex and −0.147 dex with a scatter of 0.210 dex for dwarfs and giants, respectively (see Table 3), meaning that GALAH systematically delivers higher [O/H] values for giants. The mean offsets become a little smaller by placing the O abundances on an absolute scale (see Fig. 11).

Sodium, [Na/H]. APOGEE spectra contain two weak lines of sodium, which may coincide with telluric lines as well; therefore, the reported precision is low (Jönsson et al. 2020). The final calibrated NA_FE tag contains reliable data for giants, but deviant values for dwarfs (online documentation: abundances6). As for the dwarfs, we find an offset of −0.094 ± 0.278 dex in Δ[Na/H], with no recognizable trends. We note that numerous stars with Δ[Na/H] <  − 1 are removed by local cuts, and a significant scatter is still observed in Fig. 9. For the giants there is a smaller offset and scatter: −0.074 and 0.225 dex (Table 3). Giant stars display a growing trend in Δ[Na/H] as a function of metallicity discrepancy, but a decreasing trend is observed with respect to Δvmicro. Moreover, we also find good agreement between APOGEE and GALAH on an absolute scale (ΔA(Na)). Weak correlations are only found between Δ[Na/Fe] and [Na/M] for both dwarfs and giants and no other sets of parameters (see Fig. 8).

thumbnail Fig. 9.

Distributions of Δvmicro (km s−1), Δlog g (dex), Δ[M/H], and Δ[X/H] abundance discrepancies with respect to ΔTeff for MS stars common between the APOGEE and GALAH quality data sets. The color-coding indicates the number of stars in the parameter planes, and bins representing one star are in gray. Black squares represent the median value within 50 K ranges. Red lines, with a slope of a, are fitted on the individual data points. The mean of the parameter differences (μ) and the standard deviations (σ) are displayed in the top left corner of each panel.

Magnesium, [Mg/H]. According to Jönsson et al. (2020), Mg is one of the most precisely derived chemical elements in APOGEE, together with high accuracy, and is derived in DR17 assuming NLTE. The analysis of magnesium abundance indicates a linear trend for dwarfs, similar to those shown for metallicity and α-abundance in Fig. 10 (as Mg contributes to both) with respect to ΔTeff, Δlog g, and Δ[M/H]. The mean difference is −0.064 ± 0.117 dex. As Fig. 8 indicates, there is a weak correlation with Δ[M/H] and Δlog g. For the giants, no correlations are found (see Fig. 8), though a growing linear trend is present with respect to Δ[M/H] (see Fig. 10), together with a mean difference of −0.016 ± 0.143 dex (Table 3). Figure 11 indicates good agreement both in [Mg/H] and A(Mg) between APOGEE and GALAH.

thumbnail Fig. 10.

Distributions of Δvmicro (km s−1), Δlog g (dex), Δ[M/H], and Δ[X/H] abundance discrepancies with respect to ΔTeff for RGB stars common between the APOGEE and GALAH quality data sets. The color-coding indicates the number of stars in the parameter planes, and bins representing one star are shown in gray. Black squares represent the median value within 50 K ranges. Red lines, with a slope of a, are fitted on the individual data points. The mean of the parameter differences (μ) and the standard deviations (σ) are displayed in the top left corner of each panel.

Aluminum, [Al/H]. Three relatively strong Al lines are detected within the APOGEE spectral range, and good precision is reported for the derived aluminum abundances both for giants and dwarfs. No correlations are found for Δ[Al/Fe] (see Fig. 8). For Δ[Al/H], a weak correlation is found between Δ[M/H] and a strong correlation with Δlog g for the dwarf sample, but none for the giants (as in Fig. 8). We find an offset of 0.082 dex with a scatter of 0.123 dex, and an increasing linear trend with respect to ΔTeff for dwarfs as seen in Fig. 9. For giants there is slight trend with ΔTeff yielding a mean discrepancy of 0.052 dex with a low scatter of 0.098 dex (see Table 3 and Figs. 9, 10), though the slope of the fitted trend remains lower than 0.8σ, so there is no formal correlation. We conclude that there is agreement of the aluminum abundances, both relative to hydrogen and in an absolute way (see Fig. 11). Aluminum was derived by assuming LTE by APOGEE and NLTE effects may introduce a systematic offset as a function of ΔTeff and Δlog g over certain parameter space.

thumbnail Fig. 11.

Individual elemental abundance differences for APOGEE−GALAH (bottom), APOGEE−GES (middle), and GALAH−GES (top) common samples relative to hydrogen (left column) and on an absolute scale (right column). The shaded interval accounts for a ±0.1 dex range. The calculated results for GB and MS stars are shown as filled red and empty black circles, respectively.

Silicon, [Si/H]. Silicon is reported to be one of the most precisely measured elements in APOGEE, and no significant trend was discovered in previous comparisons to optical measurements (Jönsson et al. 2020). Here we also find a negligible offset and standard deviation both for MS and RGB stars (Table 3, Figs. 911). In addition to the close overall agreement between APOGEE and GALAH, we note that a slight systematic difference is indicated in Δlog g (also shown in Fig. 8) for dwarfs.

Potassium, [K/H]. Jönsson et al. (2020) report a medium estimated precision of measured K abundance, which is based on two absorption lines. In DR17, NLTE calculations are introduced for K. In GALAH DR3, K is estimated from the 7699 Å resonance line, which is also present in interstellar matter. Therefore, [K/H] should be used with caution in highly extinct regions (Buder et al. 2021). As can be seen in Fig. 8 potassium has no correlations for either dwarfs and giants, and [K/H] abundances suffer from a discrepancy of −0.114 ± 0.174 dex and 0.025 ± 0.252 dex between APOGEE and GALAH quality dwarf and giant stellar samples, respectively. With respect to metallicity, a growing systematic difference is displayed for both dwarfs and giants, suggesting that higher metallicity values convey higher measured abundances. We note that all these trends remain lower than 0.8σ. Again NLTE effects may explain some of these discrepancies.

Calcium, [Ca/H]. In APOGEE, the precision of Ca abundance is reported to be high, but a small offset may be required when compared to the optically determined Ca abundance. Except for microturbulence, there appears to be a weak positive linear trend in the Ca abundance with increasing stellar parameters, but overall show fair agreement among the surveys. MS stars produce an average difference of 0.019 dex with a 0.126 dex scatter, while an offset of −0.009 dex and a scatter of 0.150 dex is found in RGB stars. We note, however, that by placing the abundance on the absolute scale, the mean offsets become larger than 0.1 dex. Some weak correlations in [Ca/H] are shown in Fig. 8, with Δ[M/H] in both dwarfs and giants, and for Δlog g in dwarfs.

Titanium I/II, [Ti/H]/[TiII/H]. As stated in Jönsson et al. (2020), Ti I in APOGEE shows low scatter if compared to optical data, but issues with some of the lines may occur, or the strong uncalibrated Teff dependence of Ti I lines propagate to individual abundance values. As there is only one Ti II line in the APOGEE spectral window, its abundance is reported to have low precision and a significant scatter when compared to optical measurements (Jönsson et al. 2020). Since spectroscopic surface gravities are used in deriving the abundances, elements (such as Ti II) that are sensitive to log g could have systematic offsets (Jönsson et al. 2020).

In the APOGEE−GALAH quality data set, Ti I abundance values show a discrepancy of −0.155 ± 0.264 dex and −0.007 ± 0.194 dex for MS and RGB stars, respectively. We note that for giants the Ti I abundance is derived more precisely than for dwarfs; however, the discovered trend is substantial especially in the ΔTeff − Δ[Ti/H], Δ log g − Δ [Ti/H], and Δ[M/H] − Δ[Ti/H] planes that indicate strong correlations (see also Fig. 8). The case of ionized titanium is similar: there is an average difference of −0.223 ± 0.326 dex and no clear trends among dwarfs, although the agreement is better (0.058 ± 0.235 dex) for giants than for dwarfs, as also found by Jönsson et al. (2020). We also note that the effective temperature dependence is weaker for Δ[Ti II/H] than for Δ[Ti/H] (see Table 3 and Figs. 9, 10).

Vanadium, [V/H]. For APOGEE, the derived vanadium abundance is reported to have rather low precision, accuracy, and reliability by the online documentation (abundances), and it is advised to use the V abundance with caution. Similarly, Buder et al. (2021) also gives a warning about the use of the V abundance, as blended lines result in possible incorrect systematics. A weak correlation is found between [V/M] and Δ[V/Fe] in our comparison for both dwarfs and giants. For the dwarfs, there is a mean [V/H] difference of −0.053 dex and a 0.255 dex scatter. However, the giants show a −0.268 ± 0.311 dex discrepancy of [V/H] on average. For the absolute abundance A(V), we observe almost the same offsets (see Fig. 11).

Chromium, [Cr/H]. Comparison with optical abundances showed that in APOGEE, chromium is measured with moderate precision and accuracy (online documentation: abundances). For the MS stars, a discrepancy of 0.056 ± 0.229 dex is present in Δ[Cr/H], together with no weak systematic correlations with the parameters. The giants show an offset of 0.032 dex and a scatter of 0.183 dex in Δ[Cr/H] and A(Cr) as well, and we observe various slight systematic trends with respect to all atmospheric parameters, which all have slopes lower than 0.8σ.

Manganese, [Mn/H]. Zero-point Mn abundance shifts are included in the final APOGEE release, although reporting a high precision and reliability. We establish weak correlations with Δ[M/H] for both Ms and RGB stars, and with Δlog g for the giants only a seen in Fig. 8. We find a linear ΔTeff trend with an offset of −0.126 dex and a low scatter of 0.111 dex for the dwarfs (Fig. 9), and an offset of −0.083 dex with a scatter 0.198 dex for the giants (Fig. 10). In addition to the offset, the main parameter correlations mostly originate from the metallicity trends (see also Fig. 8). We note that by calculating the offsets on an absolute scale, the mean differences decrease to < 0.1 dex.

Cobalt, [Co/H]. APOGEE spectral windows involve one single cobalt line from which to derive its abundance, and thus a significant scatter is expected, especially among dwarfs. For the dwarfs we find a substantial offset and scatter of −0.217 ± 0.331 dex in Co, with no identified systematic pattern. However, Δ[Co/H] shows a slight correlation, but lower than 0.8σ, similar to Δ[M/H] with respect to the effective temperature differences for giants (see Fig. 10), but the mean discrepancy is lower: 0.004 ± 0.195 dex. As indicated in Fig. 8, there is a weak correlation in Δ[Co/H] with Δlog g for giants.

Nickel, [Ni/H]. Nickel is derived with a high precision in APOGEE (online documentation: abundances), although some systematic deviations compared to optical studies have been revealed. Nickel is claimed to have the most accurate and precise elemental abundance among the iron-peak elements in APOGEE (Jönsson et al. 2020). In Δ[Ni/H], a clear linear trend is observed with respect to effective temperature, and the offset is 0.046 ± 0.127 dex for MS stars (Fig. 9), while the giants have an offset of −0.030 ± 0.133 dex (Fig. 10). For the dwarfs a systematic trend in Δ[M/H] is seen in Fig. 8. When considering absolute abundances the mean offsets lie within 0.1 dex, indicating a general agreement.

7. Effect on the Milky Way chemical maps

Comparison analyses like those presented in this work not only provide a reference basis for future stellar spectroscopy, but also reveal the effect of discrepancies on existing Milky Way chemical maps. Using data from APOGEE DR14 and GALAH DR2, Griffith et al. (2019) discovered that the GALAH and APOGEE median trends for Si and Ca are similar, but O has a much stronger metallicity dependence in GALAH, while the APOGEE trends are nearly flat.

Figure 12 represents the [O/Mg], [Si/Mg], and [Ca/Mg] median trends versus [Mg/H] for the low-Ia (high-α) and high-Ia (low-α) Type Ia supernovae stellar populations of the APOGEE−GALAH filtered data set (additional criteria applied: 4500 K < Teff < 6200 K and the absence of relevant elemental flags), using the same definitions as Weinberg et al. (2019) for the “two-process model”. In the updated version of Fig. 3 in Griffith et al. (2019) and in the current paper we reveal that similar negatively sloped median trends are still present, as in APOGEE DR14, and also a stronger correlation in GALAH DR2. The top panel of Fig. 12 indicates close agreement of the low-Ia and high-Ia sequences. As is theoretically known, O and Mg yields are both dominated by core collapse supernovae. The disagreements in the slopes of the [M/H] or [Mg/H] dependences could arise from the difficulty of deriving oxygen abundances from both optical (GALAH, atomic lines) and near-IR (APOGEE, molecular lines) spectra (Weinberg et al. 2019). The flat trend drawn by APOGEE data agrees better with theoretical expectations, as the O and Mg enrichment show a slight dependence on metallicity. Following Weinberg et al. (2019) and Griffith et al. (2019), Weinberg et al. (2022) used Mg as a reference and also fit the median trends of low-Ia and high-Ia populations and approximate stellar abundance patterns as the sum of a CCSN contribution and an SNIa contribution. Sharper separations between the low and high sequences are observed in the median trends of Si and Ca in the middle and bottom panels, respectively, suggesting that both elements have a non-negligible contribution from SNIa.

thumbnail Fig. 12.

Reproduction of Fig. 3 from Griffith et al. (2019) based on the most current versions of the databases. GALAH DR3 α-element median abundances of the high-Ia supernova (empty blue circles) and low-Ia supernova (empty red circles) populations. Data are binned by 0.1 dex in [Mg/H]. APOGEE DR17 median abundances are also included (squares).

In general, it would be crucial to understand and correct for the trends arising in O, as investigating the chemical evolution of the Galaxy requires reference elements produced and scattered dominantly by CCSN explosions (e.g., Mg and O).

8. Effect of APOGEE DR17 calibration

From APOGEE DR17 only raw parameter and chemical abundance values are used throughout our analysis, and this section includes the investigation of the effect of the APOGEE DR17 surface gravity and effective temperature calibrations on the previously seen discrepancy distributions. For the APOGEE−GALAH quality stars, Fig. 13 shows the discrepancies in Teff and log g before and after the APOGEE DR17 calibrations. Calibrated values come from the PARAM array, and the horizontal axes represent the raw main parameter values derived by APOGEE.

thumbnail Fig. 13.

Main parameters vs. Teff and log g discrepancies based on raw (first and third rows) and calibrated (second and fourth rows) values from the APOGEE−GALAH common data set. Horizontal axes indicate APOGEE DR17 data in all panels. Every data point is marked in the parameter planes, while the color-coding of stellar density starts from number two. The mean and standard deviation of the differences are also shown in each case.

Calibrated Teff is set by a comparison to photometric effective temperatures (Jönsson et al. 2020). Unlike in DR16 (Jönsson et al. 2020), the log g calibration in DR17 was applied using a neural network, and this new approach removes small discontinuities that were apparent (online documentation: parameters7).

By comparing the raw and corrected data in Fig. 13, the conclusion is clear. The implementation of the calibration processes results in a smaller offset with a scatter, and each substantial trend is eliminated with respect to the main parameters; therefore, the stellar density distributions show nearly symmetrical patterns. However, it is arguable whether the solar sample of calibrators used in the photometric effective temperature calibration represents the truth, and thus justifies those calibrations.

9. Conclusion

Comparing abundances between high-resolution surveys will become more and more important as new surveys such as MWM, 4MOST, and WEAVE will observe millions of stars and will be able to map the chemical makeup of our Galaxy in more detail than ever before. Revealing variants in the results between the surveys yields insight into the underlying physics and methodologies employed and informs future surveys. The work presented here contributes to the preparation for the abundances of the next generation of surveys, particularly the MWM. Similarly to APOGEE, MWM also uses SDSS spectrographs, and therefore adds another reason why APOGEE is the main reference throughout the paper in addition to the more extensive detailed coverage of the HRD.

APOGEE, GALAH, and Gaia-ESO are the first three major high-resolution spectroscopic sky survey programs operating over the last decade and providing publicly available data. In this work we created the APOGEE−GALAH, APOGEE−GES, and GALAH−GES stellar catalogs containing golden samples using their latest data releases.

Quality stars are collected according to a consistent quality criteria: sufficient S/N, low v sin i, low scatter, and error of RV, and for technical issues the parameters and chemical abundances also utilize the flagging systems of the three surveys. The filtered common data set covers 4000 K < Teff < 7000 K and 0 dex < log g < 4.75 dex ranges in the HRD, and the metallicity spans −1.5 dex to 0.5 dex.

We investigated the discrepancies and potential correlations of the RV, primary stellar parameters and common species of elemental abundances (both on relative and absolute scales) between the surveys. The analysis is based on comparing average differences and the slope of correlations with the reported individual uncertainties from APOGEE, GALAH, and GES. In most cases the median absolute deviation (MAD) of the differences is lower than the standard deviation of the differences both in the parameters and in the abundances, which indicates that in addition to the offset and scatter there are no unexpected dispersion issues in the parameter planes.

Radial velocity analysis reveals that the mean of the differences is below the combined uncertainty intervals for APOGEE−GALAH, APOGEE−GES, and GALAH−GES quality stars. By separating the MS and RGB we find similarly good agreements between the surveys; however, significant scatters are still observed in RVs. The origin of this variation in Δvrad that range from 1.26 km s−1 (APOGEE−GES dwarfs) to 4.37 km s−1 (GALAH−GES dwarfs) is still unclear, even after eliminating variable stars through cross-matching with the GCVS, with the APOGEE DR17 VAC of spectroscopic binaries and with the Gaia Variable Summary table. The presence of as-yet-unknown measurement discrepancies or undiscovered variables might explain this issue.

Discrepancies and possible correlations of effective temperature, surface gravity, metallicity, and microturbulence are discussed. According to the mean differences, all the main stellar parameters are generally measured consistently by the surveys, although we find possible weak correlations in the parameter-planes ΔTeff versus Δ[M/H] (for the APOGEE−GALAH dwarfs), ΔTeff versus Δlog g (for the APOGEE−GALAH dwarfs and APOGEE−GES stars), and in the Δ[M/H] versus Δlog g (for APOGEE−GALAH stars). Moreover, there is a clear growing linear trend in Δvmicro as a function of vmicro, probably caused by the fact that APOGEE determines it as part of the global spectral fit, while GALAH calculates it from an empirical relation of Teff.

We discuss the mean and standard deviation of the common individual abundances for all three overlapping quality stellar catalogs; in addition, the strength of the correlations between species from APOGEE−GALAH and the main parameters is investigated. Generally, we draw the conclusion that the vast majority of the common chemical abundance ratios are consistent in the relative scale. The strength of correlations are represented in Fig. 8, and, for example, [Al/H] differences do have a strong linear dependence on Δlog g. The comparison of the relative and absolute abundance values (see Fig. 11) suggests that implementing the solar reference values slightly improves the overall agreement between the elements, but there are still large variations within the reference values used by the surveys. Although most of the differences are within the uncertainties, evaluating and understanding the sources of the error remains crucial to put Milky Way stellar data on the same scale as the new data and methods increasingly improve the precision.


1

ESO Phase 3 Data Release 5 Description: https://www.eso.org/rm/api/v1/public/releaseDescriptions/191

2

https://www.sdss.org/dr17/irspec/apogee-bitmasks/

The web documentation contains the details of the APOGEE Bitmasks. This documentation also includes detailed flag descriptions.

5

The customary scale for logarithmic abundances is defined as A(X) = log(NX/NH)+A(H), where A(H)≡12 and NX is the number of atoms of element X per unit volume in the atmosphere.

Acknowledgments

VH and SzM acknowledge the support of Hungarian Academy of Sciences through MTA-ELTE Lendület “Momentum” Milky Way Research Group. Funding for the Sloan Digital Sky Survey IV has been provided by the Alfred P. Sloan Foundation, the U.S. Department of Energy Office of Science, and the Participating Institutions. SDSS-IV acknowledges support and resources from the Center for High Performance Computing at the University of Utah. The SDSS website is www.sdss.org. SDSS-IV is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS Collaboration including the Brazilian Participation Group, the Carnegie Institution for Science, Carnegie Mellon University, Center for Astrophysics – Harvard & Smithsonian, the Chilean Participation Group, the French Participation Group, Instituto de Astrofísica de Canarias, The Johns Hopkins University, Kavli Institute for the Physics and Mathematics of the Universe (IPMU)/University of Tokyo, the Korean Participation Group, Lawrence Berkeley National Laboratory, Leibniz Institut für Astrophysik Potsdam (AIP), Max-Planck-Institut für Astronomie (MPIA Heidelberg), Max-Planck-Institut für Astrophysik (MPA Garching), Max-Planck-Institut für Extraterrestrische Physik (MPE), National Astronomical Observatories of China, New Mexico State University, New York University, University of Notre Dame, Observatário Nacional/MCTI, The Ohio State University, Pennsylvania State University, Shanghai Astronomical Observatory, United Kingdom Participation Group, Universidad Nacional Autónoma de México, University of Arizona, University of Colorado Boulder, University of Oxford, University of Portsmouth, University of Utah, University of Virginia, University of Washington, University of Wisconsin, Vanderbilt University, and Yale University. PJ acknowledges financial support of FONDECYT Regular Grant Number 1200703 and Millenium Nucleus ERIS Number NCN2021_017.

References

  1. Abdurro’uf, Accetta, K., Aerts, C., et al. 2022, ApJS, 259, 35 [CrossRef] [Google Scholar]
  2. Allende Prieto, C., Beers, T. C., Wilhelm, R., et al. 2006, ApJ, 636, 804 [NASA ADS] [CrossRef] [Google Scholar]
  3. Anguiano, B., Majewski, S. R., Allende Prieto, C., et al. 2018, A&A, 620, A76 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  4. Baumgardt, H., Hilker, M., Sollima, A., & Bellini, A. 2019, MNRAS, 482, 5138 [Google Scholar]
  5. Beaton, R. L., Oelkers, R. J., Hayes, C. R., et al. 2021, AJ, 162, 302 [NASA ADS] [CrossRef] [Google Scholar]
  6. Blanton, M. R., Bershady, M. A., Abolfathi, B., et al. 2017, AJ, 154, 28 [Google Scholar]
  7. Bowen, I. S., & Vaughan, A. H. 1973, Appl. Opt., 12, 1430 [Google Scholar]
  8. Buder, S., Sharma, S., Kos, J., et al. 2021, MNRAS, 506, 150 [NASA ADS] [CrossRef] [Google Scholar]
  9. Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 [Google Scholar]
  10. Casagrande, L., Ramírez, I., Meléndez, J., Bessell, M., & Asplund, M. 2010, A&A, 512, A54 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  11. Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, Res. Astron. Astrophys., 12, 1197 [Google Scholar]
  12. Dalton, G., Trager, S., Abrams, D. C., et al. 2018, SPIE Conf. Ser., 10702, 107021B [Google Scholar]
  13. de Jong, R. S., Agertz, O., Berbel, A. A., et al. 2019, The Messenger, 175, 3 [NASA ADS] [Google Scholar]
  14. De Silva, G. M., Freeman, K. C., Bland-Hawthorn, J., et al. 2015, MNRAS, 449, 2604 [NASA ADS] [CrossRef] [Google Scholar]
  15. Eyer, L., Audard, M., Holl, B., et al. 2023, A&A, submitted [arXiv:2206.06416] [Google Scholar]
  16. Gaia Collaboration (Brown, A. G. A., et al.) 2016, A&A, 595, A2 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  17. Gaia Collaboration (Brown, A. G. A., et al.) 2018, A&A, 616, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  18. Gaia Collaboration (Brown, A. G. A., et al.) 2021, A&A, 649, A1 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  19. García Pérez, A. E., Allende Prieto, C., Holtzman, J. A., et al. 2016, AJ, 151, 144 [Google Scholar]
  20. Gilmore, G., Randich, S., Asplund, M., et al. 2012, The Messenger, 147, 25 [NASA ADS] [Google Scholar]
  21. Grevesse, N., Asplund, M., & Sauval, A. J. 2007, Space Sci. Rev., 130, 105 [Google Scholar]
  22. Griffith, E., Johnson, J. A., & Weinberg, D. H. 2019, ApJ, 886, 84 [Google Scholar]
  23. Gunn, J. E., Siegmund, W. A., Mannery, E. J., et al. 2006, AJ, 131, 2332 [NASA ADS] [CrossRef] [Google Scholar]
  24. Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  25. Heiter, U., Jofré, P., Gustafsson, B., et al. 2015, A&A, 582, A49 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  26. Holtzman, J. A., Hasselquist, S., Shetrone, M., et al. 2018, AJ, 156, 125 [Google Scholar]
  27. Hubeny, I., & Lanz, T. M. 2017, ArXiv eprints [arXiv:1706.01859] [Google Scholar]
  28. Hubeny, I., Allende Prieto, C., Osorio, Y., & Lanz, T. 2021, ArXiv eprints [arXiv:2104.02829] [Google Scholar]
  29. Jofré, P., Heiter, U., Maia, M. T., et al. 2018, Res. Notes AAS, 2, 152 [CrossRef] [Google Scholar]
  30. Jofré, P., Heiter, U., & Soubiran, C. 2019, ARA&A, 57, 571 [Google Scholar]
  31. Jönsson, H., Holtzman, J. A., Allende Prieto, C., et al. 2020, AJ, 160, 120 [Google Scholar]
  32. Kollmeier, J. A., Zasowski, G., Rix, H.-W., et al. 2017, ArXiv eprints [arXiv:1711.03234] [Google Scholar]
  33. Kounkel, M., Covey, K. R., Stassun, K. G., et al. 2021, AJ, 162, 184 [NASA ADS] [CrossRef] [Google Scholar]
  34. Majewski, S. R., Schiavon, R. P., Frinchaboy, P. M., et al. 2017, AJ, 154, 94 [Google Scholar]
  35. Nandakumar, G., Hayden, M. R., Sharma, S., et al. 2020, The GALAH survey: Milky Way Disc Metallicity and Alpha-abundance Trends in Combined APOGEE-GALAH Catalogues [Google Scholar]
  36. Nidever, D. L., Holtzman, J. A., Allende Prieto, C., et al. 2015, AJ, 150, 173 [NASA ADS] [CrossRef] [Google Scholar]
  37. Osorio, Y., Allende Prieto, C., Hubeny, I., Mészáros, S., & Shetrone, M. 2020, A&A, 637, A80 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  38. Pasquini, L., Avila, G., Blecha, A., et al. 2002, The Messenger, 110, 1 [Google Scholar]
  39. Piskunov, N., & Valenti, J. A. 2017, A&A, 597, A16 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  40. Randich, S., Gilmore, G., Magrini, L., et al. 2022, A&A, 666, A121 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  41. Sacco, G. G., Morbidelli, L., Franciosini, E., et al. 2014, A&A, 565, A113 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  42. Samus’, N. N., Kazarovets, E. V., Durlevich, O. V., Kireeva, N. N., & Pastukhova, E. N. 2017, Astron. Rep., 61, 80 [Google Scholar]
  43. Santana, F. A., Beaton, R. L., Covey, K. R., et al. 2021, AJ, 162, 303 [NASA ADS] [CrossRef] [Google Scholar]
  44. Sheinis, A., Anguiano, B., Asplund, M., et al. 2015, J. Astron. Telescopes Instrum. Syst., 1, 035002 [NASA ADS] [CrossRef] [Google Scholar]
  45. Smith, V. V., Bizyaev, D., Cunha, K., et al. 2021, AJ, 161, 254 [NASA ADS] [CrossRef] [Google Scholar]
  46. Soubiran, C., Brouillet, N., & Casamiquela, L. 2022, A&A, 663, A4 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  47. Spina, L., Magrini, L., & Cunha, K. 2022, Universe, 8, 87 [NASA ADS] [CrossRef] [Google Scholar]
  48. Steinmetz, M., Matijevič, G., Enke, H., et al. 2020, AJ, 160, 82 [NASA ADS] [CrossRef] [Google Scholar]
  49. Takada, M., Ellis, R. S., Chiba, M., et al. 2014, PASJ, 66, R1 [Google Scholar]
  50. Taylor, M. B. 2005, ASP Conf. Ser., 347, 29 [Google Scholar]
  51. Tsantaki, M., Pancino, E., Marrese, P., et al. 2022, A&A, 659, A95 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  52. Valenti, J. A., & Piskunov, N. 1996, A&AS, 118, 595 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
  53. Weinberg, D. H., Holtzman, J. A., Hasselquist, S., et al. 2019, ApJ, 874, 102 [NASA ADS] [CrossRef] [Google Scholar]
  54. Weinberg, D. H., Holtzman, J. A., Johnson, J. A., et al. 2022, ApJS, 260, 32 [NASA ADS] [CrossRef] [Google Scholar]
  55. Wilson, J. C., Hearty, F. R., Skrutskie, M. F., et al. 2019, PASP, 131, 055001 [NASA ADS] [CrossRef] [Google Scholar]
  56. Zasowski, G., Cohen, R. E., Chojnowski, S. D., et al. 2017, AJ, 154, 198 [Google Scholar]

All Tables

Table 1.

Details of the latest data releases of the APOGEE, GALAH, and Gaia−ESO sky survey programs.

Table 2.

Comparison of the common catalogs of the surveys.

Table 3.

Comparison of the elemental abundance values from the surveys.

Table 4.

Overall average of the reported individual parameter errors from the golden samples.

Table 5.

Median absolute deviation of the reported parameters and abundances from the golden samples.

Table 6.

Median absolute deviation of the reported abundances from the golden samples.

All Figures

thumbnail Fig. 1.

Mollweide projection of the Milky Way with the distribution of all targets from APOGEE DR17 (olive green) and from the APOGEE−GALAH (orange), APOGEE−GES (blue), and GALAH−GES (red) common catalogs shown in Galactic coordinates.

In the text
thumbnail Fig. 2.

Venn diagram of the elements derived by each survey. The overlap of elements in all three surveys is in brown and those from only two surveys in dark green. Light green indicates no overlap between surveys. Species of any intersection are discussed in detail in Sect. 6.

In the text
thumbnail Fig. 3.

Teff − log g diagrams of quality stars observed in APOGEE DR17, GALAH DR3, GES DR5 surveys (top row) and of stars that are common between these surveys (bottom row) color-coded by metallicity. In the APOGEE−GALAH and APOGEE−GES panels in the bottom row the APOGEE DR17 Teff (K), log g (cgs, dex), and [M/H] (dex) are used on the axes and in the color-coding; in the GALAH−GES panel the GALAH DR3 Teff, log g, and [Fe/H] are used. The separations of dwarf and giant stars are indicated by dashed lines in the common data sets.

In the text
thumbnail Fig. 4.

Distributions of the radial velocity differences with respect to the main stellar parameters for the GALAH−GES (top row), APOGEE−GES (middle row), and APOGEE−GALAH (bottom row) common data sets, including dwarfs and giants. Bins containing only one star are in gray in the background; the color-coding (according to the number of stars per bin) starts from two stars. The mean and standard deviation of the differences are also shown in each case. Horizontal axes indicate Teff (K), metallicity, log g (dex), and vmicro and vrad (km s−1) from APOGEE DR17 (middle and bottom rows) and from GALAH DR3 (top row).

In the text
thumbnail Fig. 5.

Correlations between the main parameter pairs in the three overlapping catalogs. The strength of the correlations between each pair is color-coded: light blue cells represent no significant correlations (< 0.8σ); dark blue cells represent weak correlations (0.8σ − 1.5σ). Black cells are not relevant due to the absence of reported uncertainties of vmicro or because they link the same quantities (e.g., ΔTeff with ΔTeff). The estimated strength of the correlations is based on the comparison of the correlation slopes with the average of the reported individual uncertainties of stellar parameters and RVs. The stars have been separated into dwarfs (left column) and giants (right column).

In the text
thumbnail Fig. 6.

Distributions of effective temperature (top row), surface gravity (second row), metallicity (third row), and microturbulence velocity differences (bottom row) as functions of the main stellar parameters from the APOGEE−GALAH overlapping stellar catalog, for dwarfs and giants combined. The number density of stars (starting with two stars) over the parameter space spanned is provided by the color-coding. The binned star densities over the parameter spaces, and the (overall) mean and standard deviation of parameter differences are shown in each panel as well. Bins representing one star are shown in gray. The horizontal axes indicate Teff (K), metallicity, log g (dex), and vmicro (km s−1) from APOGEE DR17, and the parameter differences are calculated as Δ(AP) = AP(APOGEE) − AP(GALAH).

In the text
thumbnail Fig. 7.

Overall metallicity vs. the difference between overall metallicity and iron abundance color-coded by Teff published in APOGEE DR17. The indicated stars are from the APOGEE−GALAH common stellar catalog and global cuts are applied as well. The dashed lines indicate a difference of ±0.1 dex (expected uncertainty from abundance analysis).

In the text
thumbnail Fig. 8.

Strength of correlations in the APOGEE−GALAH parameter and abundance pairs. The color-coding is as follows: Black cells are indeterminable, light blue cells represent no significant correlation (< 0.8σ), dark blue cells indicate a weak correlation (0.8σ − 1.5σ), and the magenta cells mean strong correlation (> 1.5σ). Our estimations are based on the average correlation of reported individual uncertainties of stellar parameters and abundances. Dwarf (left panel) and giant stars (right panel) are separated.

In the text
thumbnail Fig. 9.

Distributions of Δvmicro (km s−1), Δlog g (dex), Δ[M/H], and Δ[X/H] abundance discrepancies with respect to ΔTeff for MS stars common between the APOGEE and GALAH quality data sets. The color-coding indicates the number of stars in the parameter planes, and bins representing one star are in gray. Black squares represent the median value within 50 K ranges. Red lines, with a slope of a, are fitted on the individual data points. The mean of the parameter differences (μ) and the standard deviations (σ) are displayed in the top left corner of each panel.

In the text
thumbnail Fig. 10.

Distributions of Δvmicro (km s−1), Δlog g (dex), Δ[M/H], and Δ[X/H] abundance discrepancies with respect to ΔTeff for RGB stars common between the APOGEE and GALAH quality data sets. The color-coding indicates the number of stars in the parameter planes, and bins representing one star are shown in gray. Black squares represent the median value within 50 K ranges. Red lines, with a slope of a, are fitted on the individual data points. The mean of the parameter differences (μ) and the standard deviations (σ) are displayed in the top left corner of each panel.

In the text
thumbnail Fig. 11.

Individual elemental abundance differences for APOGEE−GALAH (bottom), APOGEE−GES (middle), and GALAH−GES (top) common samples relative to hydrogen (left column) and on an absolute scale (right column). The shaded interval accounts for a ±0.1 dex range. The calculated results for GB and MS stars are shown as filled red and empty black circles, respectively.

In the text
thumbnail Fig. 12.

Reproduction of Fig. 3 from Griffith et al. (2019) based on the most current versions of the databases. GALAH DR3 α-element median abundances of the high-Ia supernova (empty blue circles) and low-Ia supernova (empty red circles) populations. Data are binned by 0.1 dex in [Mg/H]. APOGEE DR17 median abundances are also included (squares).

In the text
thumbnail Fig. 13.

Main parameters vs. Teff and log g discrepancies based on raw (first and third rows) and calibrated (second and fourth rows) values from the APOGEE−GALAH common data set. Horizontal axes indicate APOGEE DR17 data in all panels. Every data point is marked in the parameter planes, while the color-coding of stellar density starts from number two. The mean and standard deviation of the differences are also shown in each case.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.