Skip to content
The following article is Open access

Inferring Host-galaxy Properties of LIGO–Virgo–KAGRA’s Black Holes

, , , and

Published 2024 September 4 © 2024. The Author(s). Published by the American Astronomical Society.
, , Citation Aditya Vijaykumar et al 2024 ApJ 972 157DOI 10.3847/1538-4357/ad6140

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/972/2/157

Abstract

Observations of gravitational waves from binary black hole (BBH) mergers have measured the redshift evolution of the BBH merger rate. The number density of galaxies in the Universe evolves differently with redshift based on their physical properties, such as their stellar masses and star formation rates. In this work we show that the measured population-level redshift distribution of BBHs sheds light on the properties of their probable host galaxies. We first assume that the hosts of BBHs can be described by a mixture model of galaxies weighted by stellar mass or star formation rate, and find that we can place upper limits on the fraction of mergers coming from a stellar-mass-weighted sample of galaxies. We then constrain the parameters of a physically motivated power-law delay-time distribution using GWTC-3 data, and self-consistently track galaxies in the UniverseMachine simulations with this delay-time model to infer the probable host galaxies of BBHs over a range of redshifts. We find that the inferred host galaxy distribution at redshift z = 0.21 has a median star formation rate ∼ 0.9 Myr−1 and a median stellar mass of ∼1.9 × 1010M. We also provide distributions for the mean stellar age, halo mass, halo radius, peculiar velocity, and large-scale bias associated with the host galaxies, as well as their absolute magnitudes in the B and Ks bands. Our results can be used to design optimal electromagnetic follow-up strategies for BBHs, and also to aid the measurement of cosmological parameters using the statistical dark-siren method.

Export citation and abstractBibTeXRIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The host galaxies of astrophysical electromagnetic (EM) transients have offered interesting insights into the astrophysics of their progenitors. For instance, the host-galaxy population of Type Ia supernovae and core-collapse supernovae are different (e.g., Foley & Mandel 2013); this is expected given that both these classes have different progenitors. Similarly, long and short gamma-ray bursts (GRBs) are known to trace different host-galaxy properties (Fong et al. 2010, 2022; Levesque 2014; Lyman et al. 2017; Nugent et al. 2022). The inferred star formation histories (SFHs) of short GRB host galaxies have also been used to constrain their delay-time distribution, finding a preference for low delay times but also evidence for a high delay-time tail (Zevin et al. 2022). Interesting conclusions about fast radio burst (FRB) progenitors have also been drawn using their host galaxies (Bhandari et al. 2022). The follow-up and cataloging of host galaxies of supernovae, short GRBs, and FRBs continues to be an important research direction.

In a similar vein, information about the host galaxies of gravitational-wave (GW) transients could shed light on many interesting aspects of their formation and evolution. For instance, Safarzadeh & Berger (2019) propose to use the population of host galaxies of ${ \mathcal O }(1000)$ binary neutron stars (BNSs) to measure their underlying delay-time distribution, whereas Adhikari et al. (2020) find that ${ \mathcal O }(10)$ detections of localized BNSs would be sufficient to provide interesting constraints on BNS progenitor models, including the relation to star formation and delay-time distributions. Any information about probable host galaxies could also be used to reduce the number of candidate galaxies searched to localize a GW transient, and in designing optimal weighting schemes in the statistical method to measure cosmological parameters (Schutz 1986; Del Pozzo 2012; Chen et al. 2018; Fishbach et al. 2019; Abbott et al. 2021a, 2023a).

A number of works have made assumptions on the formation and evolution of both galaxies and compact binaries to forward model the distribution of GW transient host galaxies (O’Shaughnessy et al. 2010; Lamberts et al. 2016; Artale et al. 2019; Mapelli et al. 2019; Toffano et al. 2019; Santoliquido et al. 2022; Rauf et al. 2023; Srinivasan et al. 2023). For example, Santoliquido et al. (2022) found that, contingent on their modeling assumptions, binary black holes (BBHs) prefer merging in high-mass galaxies. However, unlike their EM cousins, the host galaxies of GW transients cannot usually be identified, owing to poor pointing accuracy of GW detectors. The notable exception to this is the BNS merger GW170817 (Abbott et al. 2017a), whose EM emission helped localize the source to the galaxy NGC 4993 (Abbott et al. 2017b). It comes as no surprise that although the LIGO–Virgo–KAGRA (LVK; Acernese et al. 2015; LIGO Scientific Collaboration et al. 2015; Akutsu et al. 2021) network of detectors has reported the detection of ∼90 BBHs (The LIGO Scientific Collaboration et al. 2021), none of them have been confidently associated with a host galaxy. 9 Even with upgrades to the current detector network with the inclusion of LIGO-India (Saleem et al. 2022), or with more sensitive next-generation detectors such as Cosmic Explorer (Evans et al. 2021) and the Einstein Telescope (Punturo et al. 2010), only a small fraction of BBHs are expected to be sufficiently localized such that the host galaxy can be confidently identified (Chen & Holz 2016; Nishizawa 2017; Abbott et al. 2020a; Borhanian et al. 2020; Branchesi et al. 2023; Gupta et al. 2023).

Thinking about the detected BBHs as a population instead of individual events has allowed measurements of the underlying mass and spin distribution of black holes in binaries, as well the merger rate of binaries as a function of redshift, z (Abbott et al. 2019, 2021b, 2023b). Specifically, the redshift evolution of the merger rate (Fishbach et al. 2018, 2021; Abbott et al. 2021b, 2023b) has yielded constraints on the delay-time distribution and formation metallicity of BBHs (Fishbach & Kalogera 2021; Fishbach & van Son 2023; Turbang et al. 2023), and the properties of globular clusters (GCs; Fishbach & Fragione 2023). In this work, we show that the redshift evolution of the merger rate inferred from the latest GW transient catalog GWTC-3 (The LIGO Scientific Collaboration et al. 2021) can already begin to constrain the host-galaxy parameter space of detected BBHs. This results directly from the observation that galaxies weighted by different physical properties have different evolutions for their (weighted) number density (see, e.g., Faber et al. 2007). For instance, there are more star-forming galaxies per unit volume at z = 1 as compared to z ≈ 0, while there is a lesser number of massive galaxies at higher redshifts.

Our work is structured as follows. In Section 2 we assume that the measured GWTC-3 merger rate is described by a simple mixture model of galaxies weighted by their star formation rate (SFR) and stellar mass, and infer the mixing fraction between them. We derive an upper limit for the possible contribution of BBHs from galaxies weighted by stellar mass. In Section 3 we constrain the delay-time distribution of BBHs using a physically motivated model. We also seed the SFHs of galaxies taken from UniverseMachine (Behroozi et al. 2019) with the inferred delay-time distribution to study the resulting host-galaxy properties, and present results derived using this procedure. We summarize our results and dwell on directions for future work in Section 4. Throughout we assume cosmological parameters consistent with the Planck 2018 data release (Planck Collaboration et al. 2020) as implemented in the Planck18 class in astropy 10 (Astropy Collaboration et al. 2013, 2018), and that galaxy formation histories are well emulated by UniverseMachine.

2. Do Binary Black Holes Follow Stellar Mass or Star Formation Rate?

We first demonstrate that it is possible to differentiate between the host-galaxy populations of BBHs using the redshift evolution of the merger rate. If BBHs were hosted by a sample of host galaxies purely weighted by their SFR, the rate of BBHs would increase as a function of redshift, roughly as (1 + z)2.5, up to z ∼ 1.5, following the evolution of the cosmic star formation rate density (cSFRD; 11 Madau & Dickinson 2014; Madau & Fragos 2017; Behroozi et al. 2019), 12 which peaks at that redshift. 13 On the other hand, if BBHs were hosted by a sample of galaxies purely weighted by their stellar mass, M*, the BBH number density would decrease as a function of redshift; this is because galaxies build up their mass over cosmic time, and the total mass in stars per unit volume in the local Universe is greater than that at higher redshift. At every redshift where UniverseMachine 14 has support, we weight every galaxy in direct proportion to its corresponding stellar mass, finding that the weighted number density of the galaxies would scale as (1 + z)−0.64.

Following Adhikari et al. (2020), we assume for simplicity that BBHs merge in a sample of galaxies either weighted by their stellar mass or SFR, with their relative abundances specified by a mixing fraction αSM. Such a weighting in stellar mass and SFR physically corresponds to long and short delay times, respectively (see Section 3.2 for a discussion). The effective model for the rate of BBH mergers, R(z), can be written as,

Equation (1)

We plot the inferred R(z) along with the stellar-mass- and SFR-weighted expectations in Figure 1. The black traces show the redshift evolution of the merger rate inferred from GWTC-3. By comparing the redshift evolution predicted by Equation (1) for different values of αSM with the inference from GWTC-3, we ascertain αSM < 43% [90% confidence limit (CL); uniform prior on αSM], i.e., a maximum of 43% of the total mergers follow a sample of galaxies weighted by stellar mass, with the rest weighted by SFR. The hypothesis that BBHs are hosted purely by a set of galaxies weighted by their stellar mass (i.e., αSM = 100%) is ruled out by the redshift evolution of the merger rate at >99.9%. As a corollary, at least 57% host galaxies should belong to a galaxy sample weighted by SFR. As mentioned earlier, this model is rather simple, but it sheds light on the fact that the redshift distribution is a powerful discriminator for the set of host galaxies within which GW events merge. For instance, in the statistical dark-siren analysis to measure the Hubble parameter, individual candidate host galaxies are typically weighted by luminosities in passbands that trace stellar mass or their SFR (see, e.g., Fishbach et al. 2019); our results show that a stellar-mass weighting is inconsistent with the redshift evolution of the BBH merger rate, and applying such a weight during the analysis should be avoided.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. Redshift evolution of the merger rate inferred from GWTC-3 (black traces), plotted along with the redshift evolution of the weighted number density of a sample of galaxies weighted by SFR (blue dotted line) and stellar mass, M* (orange dotted line). Galaxies at higher redshifts are more star forming than those at lower redshifts, and hence their weighted number density follows a power law with a positive exponent: (1 + z)2.5. On the other hand, the average galaxy stellar mass per unit volume decreases as we go to higher redshift, and hence the M*-weighted number density of galaxies follows a power law in (1 + z) with a decreasing exponent: (1 + z)−0.64. The green line shows the redshift evolution with the maximum permissible value of the mixture coefficient: ${\alpha }_{\mathrm{SM}}^{\max }=43 \% $ at 90% CL.

Standard image High-resolution image

3. Inferring Host-galaxy Properties Using a Physically Motivated Model

We now turn to exploring a more physically motivated model for the evolution of binaries.

3.1. Redshift Evolution of the Merger Rate, and Constraints on the Delay-time Distribution

For the purposes of this work we model the redshift evolution of the merger rate as depending on the formation rate, Rf , of the progenitor binaries, and the distribution of delay times, p(tD ), between the formation of the binary progenitors and subsequent merger of the binaries. Assuming these, the merger rate, R, as a function of lookback time, t, can be written as,

Equation (2)

Since redshift, z, and lookback time, t, are related for a given cosmology, R(t) can also be equivalently written as R(z). We assume that Rf (t) follows the cSFRD as inferred by UniverseMachine (Behroozi et al. 2019). 15 For the delay-time distribution model, we employ a power-law prescription parameterized by a power-law index α with a minimum allowed delay time of ${t}_{D}^{\min }$ and maximum allowed delay of 13.5 Gyr. That is,

Equation (3)

The latest results from the LVK collaboration (Abbott et al. 2023b) derive constraints on the redshift evolution of the merger rate assuming that it follows R(z) ∼ (1 + z)κ , and determine the posterior probability distribution on κ: p(κ∣data). The LVK GWTC-3 analysis assumed a flat prior on κ, and so the likelihood satisfied ${ \mathcal L }(\mathrm{data}| \kappa )=p(\kappa | \mathrm{data})$. In order to compare the model expectation from Equation (2) to redshift evolution inference, we need to map the delay-time distribution parameters onto a prediction on κ. Let us denote this model prediction by $\tilde{\kappa }(\alpha ,{t}_{D}^{\min })$. Due to the functional form chosen for R(z), $\tilde{\kappa }$ can be approximated as,

Equation (4)

Since current observations measure the merger rate only out to z = 1, this is a good approximation; however, with more sensitive detectors, the parameters of the delay-time distribution would have to be constrained directly by jointly sampling over them along with hyperparameters of the underlying population model related to mass, spin, etc. 16 Using the approximation in Equation (4), we can write a likelihood function for α and ${t}_{D}^{\min }$,

Equation (5)

where the right-hand side is given by the LVK GWTC-3 population analysis.

We construct the joint posterior on α and ${t}_{D}^{\min }$ using the likelihood function above. For this, we assume flat priors on α between [−4, 1], and on ${t}_{D}^{\min }$ between [0, 6] Gyr. We use the Markov Chain Monte Carlo sampler emcee (Foreman-Mackey et al. 2013) to sample from the posterior distribution. Constraints on α and ${t}_{D}^{\min }$ from GWTC-3 are shown in Figure 2. As expected, the constraints on the parameters are stronger than those derived from GWTC-2 (Fishbach & Kalogera 2021) due to the 50% additional events included in GWTC-3. We constrain α < −1.55 and ${t}_{D}^{\min }\lt 2.23\ \mathrm{Gyr}$ at 90% CL. On the whole, the posteriors for both parameters rail against the lower limit of the prior, illustrating a preference for short delay times in the population. This is consistent with expectations that BBHs follow the cSFRD, and also with other recent work (Fishbach & Kalogera 2021; Fishbach & van Son 2023; Karathanasis et al. 2023; Turbang et al. 2023).

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Constraints on the model parameters of the delay-time distribution, power-law slope α, and minimum delay time ${t}_{D}^{\min }$ inferred from GWTC-3 data. GWTC-3 provides stronger constraints as compared to GWTC-2, with α < −1.55 and ${t}_{D}^{\min }\lt 2.23\ \mathrm{Gyr}$ at 90% CL.

Standard image High-resolution image

3.2. Modeling Galaxy Formation and Evolution, and Constraining Host-galaxy Distributions

To obtain observed galaxy properties across cosmic time, we use the UniverseMachine (Behroozi et al. 2019) set of semianalytical galaxy formation simulations. Instead of directly simulating the gas and star formation during galaxy formation as is done in hydrodynamical simulations such as IllustrisTNG (Pillepich et al. 2018; Nelson et al. 2019) or EAGLE (Crain et al. 2015), UniverseMachine populates galaxies within dark-matter halos in a pure cold dark-matter simulation with a semiempirical model. This model relates the SFR to the halo mass accretion rate, and constrains this with a Monte Carlo scheme using a wide variety of observations over a range of redshifts out to z = 10. More details about UniverseMachine can be found in Behroozi et al. (2019). As a result, one can track the evolution of galaxy properties (e.g., stellar mass, host halo mass, and SFH) across redshift. We use publicly available UniverseMachine mock catalogs 17 created using the Bolshoi–Planck dark-matter-only simulations (Klypin et al. 2011) with a box size of 250 h−1 Mpc and 20483 particles. These catalogs allow us to extract galaxy properties in 170 logarithmically spaced redshift bins between z = 0 and z = 13. While we use UniverseMachine for results in this work, the procedure can be repeated with any galaxy formation simulation that tracks galaxy properties across redshift.

For each galaxy at redshift z0, we calculate the merger rate as,

Equation (6)

where ${R}_{i}^{\mathrm{merg}}(z)$ is merger rate of the ith galaxy at redshift z, while ${R}_{i}^{\mathrm{SFH}}(t(z))$ is the SFH of the ith galaxy numerically interpolated across the redshift bins. The equation above is the same as Equation (2), except that the quantities pertain to a specific galaxy rather than the total merger rate across all galaxies. When the delay times are short, the merger rate of BBHs will have a redshift evolution very close to that of the SFR density, since in our model the rate of production of binaries is directly proportional to the rate of production of stars. On the other hand when the delay times are long, the galaxies would have evolved significantly, and would hence no longer necessarily be star forming at the merger epoch. These quiescent galaxies would have a very different evolution for their weighted number density as a function of redshift, which would be more similar to a sample of host galaxies weighted by their stellar mass rather than their SFR.

We illustrate this phenomenon in Figure 3, where we plot a two-dimensional BBH-rate-weighted histogram of the stellar mass and SFR of the z = 0.21 host galaxies assuming p(tD t0) = δ(tD t0). In this M*–SFR plane, the distribution of galaxies has two broadly distinct branches: an upper branch corresponding to star-forming galaxies and a lower branch corresponding to quiescent galaxies. Within these two branches M* and SFR are, on an average, correlated positively. We choose two representative values, t0 = 0 Gyr and 10 Gyr, corresponding to short and long delay times, respectively. When the delay times are short, the star-forming branch is highly populated, while the quiescent branch is relatively empty—this is because binaries merge at roughly the same redshift that they formed, and their host galaxies do not have much time to quench their star formation (hence stay star forming). Naturally, the redshift evolution of the weighted number density in such a scenario follows the SFR. This scenario changes when the delay times are large—the quiescent branch is now more populated since galaxies have had a long time to evolve and quench their star formation. The level to which the quiescent branch is populated increases with an increase in the delay time.

Figure 3. Refer to the following caption and surrounding text.

Figure 3. Two-dimensional BBH-rate-weighted histogram of host galaxies in the M*–SFR plane at z = 0.21, plotted for representative short (tD = 0 Gyr; left) and long (tD = 10 Gyr; right) delay times. The one-dimensional histograms show the marginalized distributions of logSFR and $\mathrm{log}{M}_{* }$. A general trend in both panels is that galaxies occupy one of two broadly distinct branches in the M*−SFR plane (visually more apparent in the right panel). The upper branch contains star-forming galaxies, while the lower branch contains galaxies that have quenched their star formation. For short delay times, binaries merge quickly after forming; their host galaxies do not evolve significantly in this duration. Hence, if binaries have short delay times, they will always track star-forming galaxies. On the other hand for long delay times between formation and merger, galaxies can evolve significantly and quench their star formation, thereby transferring some support from the star-forming branch to the quiescent branch.

Standard image High-resolution image

3.3. Results

In order to extract host-galaxy properties from the observed redshift evolution of the merger rate, we convolve the constraints on the delay-time distribution model obtained in Section 3.1 with galaxy SFHs taken from UniverseMachine. For each sample drawn from the joint posterior on α and ${t}_{D}^{\min }$, we track every galaxy in the catalog and assign to each one a merger rate of binaries according to Equation (2) at every epoch. We obtain a set of mock candidate BBH host galaxies, with complete information about their current stellar mass and SFR, mass assembly and SFH, as well as merger-rate history. Since UniverseMachine models the connection between galaxies and their host halos, we also obtain information about their host halo mass, radius, and peculiar velocity. Corresponding to every posterior sample, one can plot a histogram of galaxy properties weighted appropriately by the merger rate of BBHs in that galaxy; this BBH-rate-weighted histogram will indicate the population of galaxies hosting BBH events.

3.3.1. Stellar Mass, Star Formation Rate, and Mean Stellar Age

Figure 4 shows the median distribution of BBH host galaxies inferred in the M*−SFR plane using our prescription at redshifts z = 0.21, 0.4, and 0.81. One noticeable broad feature is that the star-forming branch is shifted to higher values of SFR at high redshifts. This is because, consistent with expectations, galaxies at higher redshift on an average form more stars. Notably, there is significant support for BBH host galaxies both in the star-forming branch as well as the quiescent branch, although there is more support in the star-forming branch. This is a direct result of the small but nonzero delay times that are allowed by the constraints in Section 3.1—these delay times are large enough to induce some support for galaxies in the quiescent branch, but not so large as to entirely populate this branch. The median stellar mass of the hosts at z = 0.21 is 1.9 × 1010 M, while the median SFR is 0.9 M yr−1. The median stellar mass at z = 0.4 and z = 0.81 is 2.1 × 1010 M and 2.1 × 1010 M respectively, while the median SFR is 1.6 M yr−1 and 4.5 M yr−1 respectively.

Figure 4. Refer to the following caption and surrounding text.

Figure 4. Median inferred BBH-rate-weighted host-galaxy distribution in the $\mathrm{log}{M}_{* }$–logSFR plane for three different redshifts z = 0.21, 0.4, and 0.81. These specific values of redshift are chosen for illustrative purposes; our model can predict the host-galaxy distribution at any redshift at which a simulation snapshot exists. At all three redshifts, the distribution has support both in the star-forming branch as well as the quiescent branch, although it has more support in the former. This is consistent with the fact that the delay-time distribution has support for nonzero delay times.

Standard image High-resolution image

Figure 5 shows one-dimensional BBH-rate-weighted histograms of M*, SFR, and mean stellar age, tage, inferred from our prescription at z = 0.21. Here, tage is defined on a per galaxy level as,

Equation (7)

We also plot, for comparison, histograms for each of the parameters weighted equally by M* and by SFR. The equal weights case should be thought of as the set of all galaxies in the Universe at a given redshift, while the M*-weighted and the SFR-weighted galaxies track massive and star-forming galaxies, respectively. The inferred histograms are plotted in green, and the width of the green region should be thought of as coming from the uncertainties in measuring the parameters of the delay-time distribution.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. Inferred rate-weighted histograms (green) of M*, SFR, and mean stellar age tage of BBH host galaxies at z = 0.21, plotted along with histograms for samples weighted equally (gray) by SFR (blue) and by M* (orange). For all the quantities, the inferred curves are close to the SFR-weighted curves, with the scatter being due to the nonzero delay times.

Standard image High-resolution image

We see that the distribution of M* is broadly consistent with the SFR-weighted sample of galaxies, and is inconsistent with an M*-weighted sample of galaxies following expectations from Figure 1. Most of the inferred histograms cluster around the SFR-weighted histograms, consistent with the railing of α and ${t}_{D}^{\min }$ toward the lower bound on the prior in Figure 2. A similar trend can be seen in the distribution of SFRs as well, which is consistent with an SFR-weighted sample of galaxies, but also has support for galaxies with slightly lower SFRs. The inferred mean stellar age of host galaxies is 4.4 Gyr, consistent with that of star-forming galaxies.

The general picture at z = 0.21 also holds at the higher redshifts which we considered, albeit with larger errors, since the merger rate at higher redshifts is less well constrained. We discuss results for z = 0.4, 0.81 briefly in the Appendix.

3.3.2. Magnitudes

The light that we observe from galaxies encodes signatures of their star formation and mass assembly history. Given information about the initial mass function and SFH of galaxies, one can forward model galaxy spectra using stellar population synthesis models (see, e.g., Bruzual & Charlot 2003; Conroy et al. 2009). These spectra can then be bandpassed appropriately to derive luminosities and magnitudes. For our inferred host galaxies, we derive (absolute) magnitudes in the Johnson–Morgan B band (Johnson & Morgan 1953) and the Two Micron All-Sky Survey Ks band (Skrutskie et al. 2006), 18 thought to be proxies for the SFR and stellar mass, respectively (Bell et al. 2003; Singer et al. 2016). To simulate mock galaxy spectra, we use the stellar population synthesis model from Bruzual & Charlot (2003) implemented in the bagpipes software package (Carnall et al. 2018). Since we already have simulated SFHs from UniverseMachine for our mock galaxies, we use the “custom” SFH implemented in bagpipes instead of relying on semianalytical prescriptions for the SFH. Also, since UniverseMachine does not model metallicity evolution, we fix the metallicity Z = 0.5 Z at all redshifts, where Z = 0.02 is solar metallicity. 19

As before, we plot the BBH-rate-weighted histogram of the inferred magnitudes in the B band and Ks band, along with histograms weighted equally by M* and by SFR, in Figure 6. In both bands, the equally weighted set of galaxies has a preference for high magnitudes (i.e., low luminosities), since there are significantly more low-luminosity galaxies in the Universe (see, e.g., Schechter 1976). In contrast, the histogram for the inferred weights peaks at lower magnitudes (i.e., higher luminosities). The median inferred magnitude in the B band is −19.6, while that in the Ks band is −21.5. These inferred histograms of the magnitudes can be directly used to calculate luminosity weights in the dark-siren method for measuring the cosmic expansion history with GW events. In addition, they may enable the design of more optimized follow-up strategies to search for possible EM counterparts to BBH mergers.

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Rate-weighted histogram of inferred absolute magnitudes of BBH host galaxies in the B band and the Ks band (commonly used as proxies for the SFR and stellar mass, respectively) at z = 0.21.

Standard image High-resolution image

3.3.3. Halo Mass, Halo Radius, and Peculiar Velocity

The formation and evolution of galaxies is intrinsically connected to their host dark-matter halos (see Wechsler & Tinker 2018 for a review); thus, the properties of a galaxy are correlated with its parent dark-matter halo. 20 Specifically, larger halos host larger and more massive galaxies. This correlation also naturally causes massive galaxies to have a greater peculiar velocity, due to the denser environment (see, e.g., Sheth & Diaferio 2001).

Since UniverseMachine also tracks host halo properties along with the galaxy properties, we also have host halo information for the host galaxies. Figure 7 shows the rate-weighted histograms of inferred host dark-matter halo mass, Mhalo, halo radius, rhalo, and galaxy peculiar velocity, vpec. We note that the M*-weighted galaxies have higher Mhalo, vpec, and rhalo, as compared to those weighted by SFR; this is because the former galaxies are larger and hence lie in larger dark-matter halos. Similar to the case of galaxies presented in the previous subsections, we find that the inferred histograms for host dark-matter halos lie closest to the halos associated with SFR-weighted galaxies. The median Mhalo and rhalo are 5.4 × 1011 h−1 M and 180 pc, respectively. The median vpec is 150 km s−1, but has a tail extending out to vpec ≈ 800 km s−1. This means that the peculiar velocities can shift the redshifts of BBHs at most by vpec/c ≈ 0.003, and thus are unlikely to affect the uncertain estimates of distances derived from BBHs even in next-generation detectors.

Figure 7. Refer to the following caption and surrounding text.

Figure 7. Inferred distributions of the host dark-matter halo mass Mhalo, host peculiar velocity vpec, and halo radius rhalo at z = 0.21

Standard image High-resolution image

3.3.4. Large-scale Bias

Galaxies are biased tracers of the underlying matter density in the Universe—they prefer forming at rare, highly dense peaks of the underlying matter density field. Kaiser (1984) showed that the clustering of galaxies, quantified by the two-point correlation ξgal(r), can be modeled as a constant enhancement over the clustering of the underlying matter distribution on large scales: ${\xi }_{\mathrm{gal}}(r)={b}_{\mathrm{gal}}^{2}{\xi }_{{\rm{m}}}(r)$. The value of the bias, bgal, is directly related to the masses of halos that host galaxies (see, e.g., Cooray & Sheth 2002).

Since BBHs also form and evolve in galaxies, they are a biased tracer of the underlying matter density. We denote bBBH as the large-scale bias of BBHs, akin to the same quantity for galaxies. Indeed, multiple works have proposed using the bias to probe the astrophysical formation channels of GW transients (Namikawa et al. 2016; Raccanelli et al. 2016; Scelfo et al. 2018; Mukherjee et al. 2021; Vijaykumar et al. 2023). Using the halo mass–bias relation of Tinker et al. (2010), implemented in the colossus software package (Diemer 2018), we are able to predict values for the large-scale bias of BBHs from the inferred host-galaxy distributions.

We plot the 90% credible region of the inferred BBH-rate-weighted bias as a function of redshift in Figure 8. Overall, the bias has an increasing trend toward high redshift—this is a natural consequence of hierarchical structure formation in the Universe. We note that the bias at z = 0 is predicted to be in the range 0.86–0.88 at z = 0.1 at 90% CL, while that at z = 0.9 is predicted to be 1.16–1.40 (90% CL). The inferred bias is consistent with that of the SFR-weighted galaxies, but is inconsistent with the M*-weighted galaxies; this is because the bias is purely a function of Mhalo, and the inferred Mhalo distribution is inconsistent with the M*-weighted galaxies (see Figure 7).

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Large-scale bias of BBHs, bBBH, as a function of redshift, z, as predicted by the host properties inferred in this work.

Standard image High-resolution image

4. Summary and Outlook

We have shown that the measured redshift evolution of the BBH merger rate from GWTC-3 constrains the set of host galaxies with which BBHs are associated. We started with a simple mixture model of galaxies weighted either by SFR or stellar mass. We demonstrated that the hosts of merging BBHs do not exclusively track stellar mass at the >99.9% level—at most 43% of the host galaxies come from a stellar-mass-weighted sample. We then used UniverseMachine simulations, along with data-driven constraints on the BBH delay-time distribution, to infer the host-galaxy population. Our results indicate that most BBHs are hosted by actively star-forming galaxies, but there is support for a small fraction of them being hosted by quiescent galaxies. We provide inferred distributions for the halo masses, radii, peculiar velocities, and stellar ages. We also make predictions for the observed Ks- and B-band magnitudes of BBH host galaxies, as well as the large-scale bias associated with the clustering of BBHs. Our results assume that R(z) ∼ (1 + z)κ is a good description for the redshift evolution of the merger rate. There are some hints of structure in R(z) beyond this power-law prescription (Edelman et al. 2023; Callister & Farr 2024); if these results hold up with newer data, it might necessitate invoking the presence of two (or more) populations of BBHs with distinct delay-time distributions. However, the inferred delay-time distribution would still prefer short delay times, and hence we do not expect our results to change significantly.

Unlike previous works which provide distributions of host galaxies based on prescriptions from population synthesis, our framework uses constraints from observed GW data to solve the inverse problem: we infer a set of host galaxies without assumptions for any specific prescription for binary formation and evolution. While we have concentrated on BBHs in this work, our framework will extend trivially to BNSs and NS–BHs, once we are able to measure the redshift evolution of their merger rates. This framework can also be extended to infer the host-galaxy population of astronomical objects for which source localization is poor (e.g., poorly localized GRBs and FRBs). Our inferred distributions of various parameters can be used to design optimal weighting schemes for the dark-siren method to constrain the expansion history of the Universe. In addition, our predictions for host-galaxy properties may aid in the design of optimal follow-up strategies for BBH events. This will be of particular interest once the BNS and NS–BH extensions to our methods have been implemented, since those searches are expected to have transient EM counterparts.

For the main results in this work, we have assumed that binaries followed a physically motivated merger-rate evolution, given by the UniverseMachine cSFRD convolved with a delay-time distribution. This prescription is expected to work well if all binaries form and merge in isolated galactic field environments, and our results should be interpreted in this context. For instance, binaries could also form and merge in dynamical environments such as GCs or AGN, and these populations will have different progenitor formation rates and delay-time distributions, thus requiring changes to our modeling procedure. Our framework can be readily extended to include these channels given some assumption of the progenitor formation rate, although adding new parameters is likely to increase the statistical error on the results. Since compact binary formation is expected to be a strong function of metallicity (Chruślińska 2024, and references therein), another natural extension of our framework would be to include the metallicity-specific SFH from hydrodynamical simulations of galaxy and structure formation while assigning merger rates to individual galaxies. This would also necessitate using a metallicity-specific cSFRD as the progenitor formation rate in constraining delay-time distributions. It is also possible to relax our assumption of specific progenitor formation rates and delay-time distributions, and instead generalize our approach to compare redshift evolution of galaxies and BBHs in a nonparametric fashion, although this would still be subject to any modeling differences inherent in different galaxy formation models.

We have demonstrated the power of the observed properties of the existing GW population to constrain properties of the associated host galaxies. The relation between GW mergers and their parent galaxies offers a rich vein to mine for astrophysical insights. A more precise future measurement of the rate evolution of binaries will help us further constrain the host environments, possibly helping to disentangle the various formation channels that contribute to the total population of merging binaries observable with ground-based GW detectors.

Acknowledgments

We are grateful to Jacqueline Antwi-Danso for patiently answering all our questions about galaxy spectral energy distributions, and Peter Behroozi for help with interpreting the UniverseMachine catalogs. We also thank Surhud More for useful discussions, and Alexandra Hanselman and Aditya Kumar Sharma for a reading of our draft. All computations for this work were performed on the Alice cluster at ICTS-TIFR.

A.V.’s work is supported by the Department of Atomic Energy, Government of India, under Project No. RTI4001. A.V. also acknowledges support by a Fulbright Program grant under the Fulbright-Nehru Doctoral Research Fellowship, sponsored by the Bureau of Educational and Cultural Affairs of the United States Department of State and administered by the Institute of International Education and the United States-India Educational Foundation. A.V. and M.F. are supported by the Natural Sciences and Engineering Research Council of Canada (NSERC; funding reference number 568580). D.E.H is supported by NSF grants AST-2006645 and PHY-2110507, as well as by the Kavli Institute for Cosmological Physics (KICP) through an endowment from the Kavli Foundation and its founder Fred Kavli.

We thank KICP for support through “The quest for precision gravitational wave cosmology” workshop where this work was initiated, and ICTS for support through the Largest Cosmological Surveys and Big Data Science (code: ICTS/BigDataCosmo2023/5) workshop. This research has made use of the Spanish Virtual Observatory (https://svo.cab.inta-csic.es) project funded by MCIN/AEI/10.13039/501100011033/ through grant PID2020-112949GB-I00.

This material is based upon work supported by NSF’s LIGO Laboratory which is a major facility fully funded by the National Science Foundation.

Software: numpy (Harris et al. 2020), scipy (Virtanen et al. 2020), matplotlib (Hunter 2007), astropy (Astropy Collaboration et al. 2013, 2018), jupyter (Kluyver et al. 2016), pandas (McKinney 2010), seaborn (Waskom 2021), bagpipes (Carnall et al. 2018), and colossus (Diemer 2018).

Appendix: Inferred Host-galaxy Properties at Redshifts of 0.4 and 0.81

In this section we provide inferred rate-weighted histograms of M*, SFR, tage, Mhalo, rhalo, and vpec, at redshifts of 0.40 (Figure 9) and 0.81 (Figure 10). We also show inferred histograms of B-band and Ks-band absolute magnitudes in Figure 11. We reiterate that our prescription is able to infer galaxy properties at any redshift where a UniverseMachine simulation snapshot exists.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. Host-galaxy properties at z = 0.40.

Standard image High-resolution image
Figure 10. Refer to the following caption and surrounding text.

Figure 10. Host-galaxy properties at z = 0.81.

Standard image High-resolution image
Figure 11. Refer to the following caption and surrounding text.

Figure 11.  B-band and Ks-band absolute magnitudes inferred at z = 0.40 (left) and z = 0.81 (right).

Standard image High-resolution image

Footnotes

  • 9  

    There is a claimed EM counterpart ZTF19abanrhr (Graham et al. 2020) to the massive BBH GW190521 (Abbott et al. 2020b), believed to be produced due to the interaction of the BBH with the accretion disk of an active galactic nucleus (AGN); however, there is insufficient evidence to confidently associate GW190521 with ZTF19abanrhr (Ashton et al. 2021; Palmese et al. 2021).

  • 10  

    We note that the UniverseMachine simulation snapshots use a slightly different set of cosmological parameter values (h = 0.678 and ΩM = 0.307) compared to the Planck18 implementation in astropy (h = 0.6766 and ΩM = 0.30966). Both of these estimates are consistent with Planck Collaboration et al. (2020); we do not expect this small discrepancy to impact our results significantly.

  • 11  

    Throughout this paper, SFR refers to that of an individual galaxy, whereas cSFRD refers to the SFR integrated over all galaxies per unit volume as a function of redshift.

  • 12  

    We use the cSFRD inferred by UniverseMachine (Behroozi et al. 2019) instead of other prescriptions (Madau & Dickinson 2014; Madau & Fragos 2017) to be consistent with choices made later in this work.

  • 13  

    This is a good assumption for current GW detectors, whose horizon redshift (z ∼ 1) is well below the peak star-forming epoch.

  • 14  

    See Section 3.2 for a brief description of UniverseMachine.

  • 15  

    We should note that UniverseMachine uses a different data set as well as a different procedure for constraining the cSFRD as compared to other works (Madau & Dickinson 2014; Madau & Fragos 2017). The two results are consistent within their respective error bars.

  • 16  

    Recent work (Edelman et al. 2023; Callister & Farr 2024) has found hints of structure in the redshift evolution of the BBH merger rate beyond the (1 + z)κ model we assume here, and this model mis-specification could impact our results. However, since the data are still consistent with the (1 + z)κ model, we ignore the effects of these deviations.

  • 17  
  • 18  

    The passbands are downloaded from the SVO Filter Profile Service (Rodrigo et al. 2012; Rodrigo & Solano 2020): http://svo2.cab.inta-csic.es/theory/fps/index.php.

  • 19  

    We have checked that varying Z between 10−3 Z and 3 Z only changes the calculated magnitude by at most 0.75 units. The shape of the histogram of magnitudes stays approximately the same, with a constant shift to the left or the right.

  • 20  

    This correlation is referred to as the “galaxy–halo” connection.

Please wait… references are loading.
10.3847/1538-4357/ad6140