The aim of this project is to analyze globular star clusters in the Milky Way, in order to understand their dynamics.
The conducted study examined the properties that affect the central velocity dispersion, their impact and the correlations between them.
The task was to find an accurate model that could fit the given data (GlobClus_prop dataset)
and be used to efficiently explain the statistical distribution of the contained records.
Note: The distribution in this study has been assumed as approxymately linear (known beforehand).
In order to identify an initial good model containing only the most significant variables for the central velocity dispersion, several approaches have been used,
including p-value comparisons and penalization criteria.
The evaluations inlcuded all the directions, forward, backward and mixed.
More specifically, a combination of all the following methods have been used:
- P-Value
- Likelihoods Comparison
- AIC (Akaike Information Criterion)
- BIC (Bayesian Information Criterion)
This phase consisted in the evaluation of the model and the correlation analysis between the identified features.
It has been carried out by plotting the features' data and analyzing: confidence intervals, correlation matrices, generic linear models, and graphical model representations.
The graphs included:
- Gaussian linear graphs (undirected updated with a BIC forward stepwise procedure)
- Bayesian networks (directed)
A detailed technical report for the whole analysis can be found here.
If you are interested into the compiled analysis script (html generated report) you can find it rendered here instead.
Copyright 2024 Mattia Bennati
Licensed under the GNU GPL V2: https://www.gnu.org/licenses/old-licenses/gpl-2.0.html