====================
A quickstart example
====================

In this page we show how to use |galario| in some typical use cases, e.g. the fit of some interferometric data sets.

|galario| has been designed to simplify and accelerate the task of computing the synthetic visibilities given a model
image, leaving to the user the freedom to choose the most appropriate statistical tool for the parameter space exploration.

For the purpose of these examples we will adopt a Bayesian approach, using Monte Carlo Markov chains to explore the
parameter space and to produce a sampling of the posterior probability function. In particular, we will use the MCMC
implementation in the `emcee <http://dfm.io/emcee/current/>`_ Python package.

In this page we will show how to fit the mock observations of a protoplanetary disk. In particular, in the example we will
analyse mock visibilities of the disk continuum emission at :math:`\lambda=` 1 mm whose synthetized map is shown in this figure:

.. image:: images/disk_example.jpg
    :scale: 90 %
    :alt: Protoplanetary disk continuum map
    :align: center

You can download `here <uvtable.txt>`_ an ASCII version of the uv-table used in this example.

--------------


Fit a single-wavelength data set
--------------------------------
**1) Import the uv table**

    First, let's import the table containing the interferometric observations. Typically, an interferometric data set
    can be exported to a table containing the :math:`(u_j, v_j)` coordinates (:math:`j=1...M`), the Real and Imaginary part of the complex visibilities
    :math:`V_{obs\ j}`, and their theoretical weight :math:`w_{j}`, for example:

    .. code-block:: python

        u  [m]   	v  [m]   	Re  [Jy] 	Im  [Jy] 	w
        -------------------------------------------------------------------------
        -155.90093	234.34887	 0.01810	 0.13799	200.05723
        9.290660	362.97853	-0.05827	 0.02820	216.95405
        95.23531	109.22704	 0.06314	-0.16727	167.18789
        94.01319	251.97293	 0.01974	 0.04358	179.69114
        -60.45751	211.33346	 0.14856	-0.07756	188.09953
        91.59843	68.94947	 0.12741	-0.12871	156.32432
        23.29531	251.71338	 0.01568	-0.12316	189.58017
        -135.83509	-29.77982	-0.02017	-0.00010	207.29354
        59.38624	144.99431	 0.04759	-0.08606	201.32301
        192.43093	171.57617	-0.02176	-0.02661	208.52030
        -243.91416	76.18790	-0.02306	-0.01430	207.16036
        58.72442	276.64959	 0.03325	 0.04560	173.15922
        35.56591	111.28235	 0.03777	-0.11856	194.83899
        ...     	...     	...     	...      	...

    You can download `here <uvtable.txt>`_ an ASCII version of the uv-table used in this example.

    A table like this one can be read with:

    .. code-block:: python

        u, v, Re, Im, w = np.require(np.loadtxt("uvtable.txt", unpack=True), requirements='C')
        wle = 1e-3  # [m]
        u /= wle
        v /= wle

    where the :math:`u_j` and :math:`v_j` coordinates have been converted in units of the observing wavelength, 1 mm in this example. The `np.require` command is necessary to ensure that the arrays are C-contiguous as required by |galario| (see :ref:`FAQ 1.1 <FAQ1.1>`).

**2) Determine the image size**
    Once imported the uv table, we can start using |galario| to compute the optimal image size

    .. code-block:: python

        from galario.double import get_image_size

        nxy, dxy = get_image_size(u, v, verbose=True)

    where the returned values are the number of pixels (`nxy`) and the pixel size (`dxy`) in radians.
    `nxy` and `dxy` are chosen to fulfil criteria that ensure a correct computation of the synthetic visibilities.
    For more details, refer to Sect. 3.2 in `Tazzari, Beaujean and Testi (2018) MNRAS 476 4527 <https://doi.org/10.1093/mnras/sty409>`_.

**3) Define the brightness model**
    Let us define the model: for this example, we will use a very simple Gaussian profile:

    .. code-block:: python

        from galario import arcsec

        def GaussianProfile(f0, sigma, Rmin, dR, nR):
            """ Gaussian brightness profile. """

            # radial grid
            R = np.linspace(Rmin, Rmin + dR*nR, nR, endpoint=False)

            return f0 * np.exp(-0.5*(R/sigma)**2)

    where `f0` (Jy/sr) is a normalization, `sigma` is the width of the Gaussian, `Rmin` is the
    innermost radius of the grid, `dR` is the size of radial grid and `nR` is the number of radial grid cells.
    `sigma`, `Rmin`, `dR` should be passed to `GaussianProfile()` in arcseconds and `f0` in Jy/sr.

**4) Setup the MCMC Ensemble Sampler**
    In our fit we will have 6 free parameters: on top of the model parameters `f0` and `sigma` we want to fit
    the inclination `inc`, the position angle `PA`, and the angular offsets :math:`(\Delta RA, \Delta Dec)`
    with respect to the phase center.
    Following the notation of the `emcee <http://dfm.io/emcee/current/>`_ documentation, we initialise the EnsembleSampler

    .. code-block:: python

        from emcee import EnsembleSampler

        # radial grid parameters
        Rmin = 1e-4  # arcsec
        dR = 0.01    # arcsec
        nR = 2000

        # parameter space domain
        p_ranges = [[1, 20],
                    [0., 8.],
                    [0., 90.],
                    [0., 180.],
                    [-2., 2.],
                    [-2., 2.]]

        ndim = len(p_ranges)        # number of dimensions
        nwalkers = 40               # number of walkers

        nthreads = 4                # CPU threads that emcee should use

        sampler = EnsembleSampler(nwalkers, ndim, lnpostfn,
                                  args=[p_ranges, Rmin, dR, nR, nxy, dxy, u, v, Re, Im, w],
                                  threads=nthreads)

    where:

        - `p_ranges` is a rectangular domain in the parameter space that defines the search region;
        - `lnpostfn` is the posterior probability function;
        - `args` defines an array of fixed parameters that `lnpostfn` takes additionally in input.

**5) Define the posterior and the prior probability functions**
    Let us now implement the posterior function, using |galario| to compute the :math:`\chi^2`. Since in this example
    we are assuming an axisymmetric brightness profile we will use the `chi2Profile` function, but the same design holds
    for the `chi2Image` function that should be used for non-axisymmetric profiles.

    .. code-block:: python

        from galario import deg, arcsec
        from galario.double import chi2Profile

        def lnpostfn(p, p_ranges, Rmin, dR, nR, nxy, dxy, u, v, Re, Im, w):
            """ Log of posterior probability function """

            lnprior = lnpriorfn(p, p_ranges)  # apply prior
            if not np.isfinite(lnprior):
                return -np.inf

            # unpack the parameters
            f0, sigma, inc, PA, dRA, dDec = p

            f0 = 10.**f0        # convert from log to real space

            # convert to radians
            sigma *= arcsec
            Rmin *= arcsec
            dR *= arcsec
            inc *= deg
            PA *= deg
            dRA *= arcsec
            dDec *= arcsec

            # compute the model brightness profile
            f = GaussianProfile(f0, sigma, Rmin, dR, nR)

            chi2 = chi2Profile(f, Rmin, dR, nxy, dxy, u, v, Re, Im, w,
                               inc=inc, PA=PA, dRA=dRA, dDec=dDec)

            return -0.5 * chi2 + lnprior

    where the normalization `f0` is explored in the logarithmic space to achieve a faster convergence and `lnpriorfn`
    is the prior probability function defined as a uniform prior:

    .. code-block:: python

        def lnpriorfn(p, par_ranges):
            """ Uniform prior probability function """

            for i in range(len(p)):
                if p[i] < par_ranges[i][0] or p[i] > par_ranges[i][1]:
                    return -np.inf

            jacob = -p[0]       # jacobian of the log transformation

            return jacob

    which, up to a constant, basically checks that `p` lies inside the rectangular domain defined by the extents in `p_ranges`.

**6) Ready to go: run the MCMC!**
    We are now ready to start the MCMC:

    .. code-block:: python

        nsteps = 3000     # total number of MCMC steps

        # initial guess for the parameters
        p0 = [10, 0.5, 70., 60., 0., 0.] #  3 parameters for the model + 4 (inc, PA, dRA, dDec)

        # initialize the walkers with an ndim-dimensional Gaussian ball
        pos = [p0 + 1e-4*np.random.randn(ndim) for i in range(nwalkers)]

        # execute the MCMC
        pos, prob, state = sampler.run_mcmc(pos, nsteps, rstate0=state, lnprob0=prob)

    It is possible to run the whole fit collecting the code blocks above into a single `quickstart.py` file and running `python quickstart.py`. For reference, using `nthreads=4`, the run takes approximately 5-8 mins on a laptop with an Intel i5 2.9GHz.

**7) Plot the fit results**

    Once the run has completed, we can inspect the fit results. We will produce two informative plots. First, the so called corner plot, which shows the 1D and 2D marginalised posterior distributions of the free parameters (bottom left figure). To produce this plot we use the `corner <https://github.com/dfm/corner.py>`_ package, which can be easily installed with `pip install corner`.

    .. code-block:: python

        # do the corner plot
        import corner
        samples = sampler.chain[:, -1000:, :].reshape((-1, ndim))
        fig = corner.corner(samples, labels=["$f_0$", "$\sigma$", r"$i$", r"PA", r"$\Delta$RA", r"$\Delta$Dec"],
                            show_titles=True, quantiles=[0.16, 0.50, 0.84],
                            label_kwargs={'labelpad':20, 'fontsize':0}, fontsize=8)
        fig.savefig("triangle_example.png")

    Second, the so called uv-plot which shows the comparison between the visibilities of the bestfit model and the observed ones (bottom right figure). To produce the uv-plot we use the `uvplot <https://github.com/mtazzari/uvplot>`_ package, which can be easily installed with `pip install uvplot`.

    .. code-block:: python

        # do the uv-plot
        # select the bestfit model (here, e.g., the model with median parameters)
        bestfit = [np.percentile(samples[:, i], 50) for i in range(ndim)]

        f0, sigma, inc, PA, dRA, dDec = bestfit

        f0 = 10.**f0        # convert from log to real space

        # convert to radians
        sigma *= arcsec
        Rmin *= arcsec
        dR *= arcsec
        inc *= deg
        PA *= deg
        dRA *= arcsec
        dDec *= arcsec

        f = GaussianProfile(f0, sigma, Rmin, dR, nR)

        # compute the visibilities of the bestfit model
        vis_mod = g_double.sampleProfile(f, Rmin, dR, nxy, dxy, u, v,
                                         inc=inc, PA=PA, dRA=dRA, dDec=dDec)

        from uvplot import UVTable

        uvbin_size = 30e3     # uv-distance bin, units: wle

        # observations uv-plot
        uv = UVTable(uvtable=[u*wle, v*wle, Re, Im, w], wle=wle)
        uv.apply_phase(-dRA, -dDec)         # center the source on the phase center
        uv.deproject(inc, PA)
        axes = uv.plot(linestyle='.', color='k', label='Data', uvbin_size=uvbin_size)

        # model uv-plot
        uv_mod = UVTable(uvtable=[u*wle, v*wle, vis_mod.real, vis_mod.imag, w], wle=wle)
        uv_mod.apply_phase(-dRA, -dDec)     # center the source on the phase center
        uv_mod.deproject(inc, PA)
        uv_mod.plot(axes=axes, linestyle='-', color='r', label='Model', yerr=False, uvbin_size=uvbin_size)

        axes[0].figure.savefig("uvplot_example.pdf")


    +-------------------------------------------------------+-----------------------------------------------+
    |.. image:: images/quickstart_triangle_whole_chain.png  |  .. image:: images/uvplot.png                 |
    |  :width: 80%                                          |          :width: 98%                          |
    |  :alt: Chains                                         |          :alt: Chains                         |
    +-------------------------------------------------------+-----------------------------------------------+
    | Corner plot showing the marginalised posteriors       | Uv-plot showing the deprojected visibilities  |
    +-------------------------------------------------------+-----------------------------------------------+

**7) CPU vs GPU execution**
    So far we have run |galario| on the CPU. Running it on a GPU can be done by just changing the import at the beginning:

    .. code-block:: python

        from galario import double_cuda as g_double

    All the rest of the code remains the same!

    For more details on the GPU vs CPU execution, see the :ref:`Cookbook <cookbook>`.