A Comparative Benchmark of Fairness Metrics in Machine Learning