Application of Logistic Regression on Passenger Survival Data of the Titanic Liner

  • Sajjida Reza PAF, Karachi Institute of Economics and Technology
  • Bilal Sarwar Balochistan University of Information Technology, Engineering and Management Sciences, Quetta https://orcid.org/0000-0003-3969-1235
  • Raja Rub Nawaz Karachi University Business School, Karachi, Pakistan
  • S. M. Nabeel Ul Haq Balochistan University of Information Technology, Engineering and Management Sciences, Quetta
Keywords: Binary, Dichotomous, Generalized Linear Model (GLM), Logistic Regression

Abstract

Purpose: This empirical research aims to predict the distinguishing variables of passengers who did or did not survive while traveling in the famous Titanic liner, which sunk in 1912.

Design/Methodology/Approach: The binary logistic regression analysis empirically analyzes the secondary dataset available for 1046 passengers. Variables such as passenger’s gender, age, family composition, ticket class, number of parents with/without children, and number of siblings and/or spouses were opted to examine the differences between the binary dependent variable (Passenger Survived/ Not Survived).

Findings: The study results indicate that all the variables are statistically significant in the model, with passenger's gender being the most significant predictor followed by passenger’s ticket class. The survival chances of passengers decreased for male passengers compared to their counterparts (female passengers) for the sample data [Exp(β)=0.080], for the passengers of age more than 21 years compared to passengers of age less than and equal to 21 years [Exp(β)=0.576], and for passengers with ticket class second and third compared to first-class ticket holders [Exp(β)=0.412]. In contrast, there was a greater chance of survival for families traveling together with parents, siblings, spouses compared to single travelers [Exp(β)=1.823].

Implications/Originality/Value: The study is a classic example of the application of binary logistic regression analysis using EVIEWS software.

Downloads

Download data is not yet available.

Article Analytics Summary

Author Biography

Bilal Sarwar, Balochistan University of Information Technology, Engineering and Management Sciences, Quetta

Assistant Professor, Department of Management Sciences

References

Agresti, A. (2003). Categorical data analysis (Vol. 482): John Wiley & Sons. DOI: https://doi.org/10.1002/0471249688

Berkson, J. (1953). A Statistically Precise and Relatively Simple Method of Estimating the Bio-Assay with Quantal Response, Based on the Logistic Function. Journal of the American Statistical Association, 48(263), 565-599. doi:10.1080/01621459.1953.10483494 DOI: https://doi.org/10.1080/01621459.1953.10483494

Breiman, L., & Friedman, J. H. (1997). Predicting Multivariate Responses in Multiple Linear Regression. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 59(1), 3-54. doi:https://doi.org/10.1111/1467-9868.00054 DOI: https://doi.org/10.1111/1467-9868.00054

Cox, D. R. (1958). The Regression Analysis of Binary Sequences. Journal of the Royal Statistical Society: Series B (Methodological), 20(2), 215-232. doi:10.1111/j.2517-6161.1958.tb00292.x DOI: https://doi.org/10.1111/j.2517-6161.1958.tb00292.x

Cramer, J. S. (2002). The origins of logistic regression. doi:10.2139/ssrn.360300 DOI: https://doi.org/10.2139/ssrn.360300

De La Viña, L., & Ford, J. (2001). Logistic Regression Analysis of Cruise Vacation Market Potential: Demographic and Trip Attribute Perception Factors. Journal of Travel Research, 39(4), 406-410. doi:10.1177/004728750103900407 DOI: https://doi.org/10.1177/004728750103900407

De Noble, A., Galbraith, C. S., Singh, G., & Stiles, C. H. (2007). Market justice, religious orientation, and entrepreneurial attitudes. Journal of Enterprising Communities: People and Places in the Global Economy, 1(2), 121-134. doi:10.1108/17506200710752548 DOI: https://doi.org/10.1108/17506200710752548

Fehrman, E., Muhammad, A. K., Mirkes, E. M., Egan, V., & Gorban, A. N. (2017). The five factor model of personality and evaluation of drug consumption risk Data science (pp. 231-242): Springer. DOI: https://doi.org/10.1007/978-3-319-55723-6_18

Friedman, J., Hastie, T., & Tibshirani, R. (2001). The elements of statistical learning (Vol. 1): Springer series in statistics New York. DOI: https://doi.org/10.1007/978-0-387-21606-5_1

Hosmer Jr, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (Vol. 398): John Wiley & Sons. DOI: https://doi.org/10.1002/9781118548387

Kirk, R. E. (2003). The importance of effect magnitude. Handbook of research methods in experimental psychology, 83-105. DOI: https://doi.org/10.1002/9780470756973.ch5

Meyers, L. S., Gamst, G., & Guarino, A. J. (2016). Applied multivariate research: Design and interpretation: Sage publications.

Miller, M. E., Hui, S. L., & Tierney, W. M. (1991). Validation techniques for logistic regression models. Statistics in Medicine, 10(8), 1213-1226. doi:10.1002/sim.4780100805 DOI: https://doi.org/10.1002/sim.4780100805

Molenberghs, G., & Verbeke, G. (2006). Models for discrete longitudinal data: Springer Science & Business Media.

Pampel, F. C. (2000). Sage Publications i. Logistic Regression: A Primer: SAGE Publications. DOI: https://doi.org/10.4135/9781412984805

Penninx, B. W. J. H., Beekman, A. T. F., Smit, J. H., Zitman, F. G., Nolen, W. A., Spinhoven, P., . . . Van Dyck, R. (2008). The Netherlands Study of Depression and Anxiety (NESDA): rationale, objectives and methods. International Journal of Methods in Psychiatric Research, 17(3), 121-140. doi:10.1002/mpr.256 DOI: https://doi.org/10.1002/mpr.256

Pituch, K. A., & Stevens, J. P. (2015). Applied multivariate statistics for the social sciences: Analyses with SAS and IBM’s SPSS: Routledge. DOI: https://doi.org/10.4324/9781315814919

Spinhoven, P., Elzinga, B. M., Hovens, J. G. F. M., Roelofs, K., Zitman, F. G., van Oppen, P., & Penninx, B. W. J. H. (2010). The specificity of childhood adversities and negative life events across the life span to anxiety and depressive disorders. Journal of Affective Disorders, 126(1), 103-112. doi:https://doi.org/10.1016/j.jad.2010.02.132 DOI: https://doi.org/10.1016/j.jad.2010.02.132

Studenmund, A. H. (2014). Using econometrics a practical guide: Pearson.

Thompson, B. (1999). Statistical Significance Tests, Effect Size Reporting and the Vain Pursuit of Pseudo-Objectivity. Theory & Psychology, 9(2), 191-196. doi:10.1177/095935439992007 DOI: https://doi.org/10.1177/095935439992007

Vugteveen, J., De Bildt, A., Hartman, C. A., & Timmerman, M. E. (2018). Using the Dutch multi-informant Strengths and Difficulties Questionnaire (SDQ) to predict adolescent psychiatric diagnoses. European Child & Adolescent Psychiatry, 27(10), 1347-1359. doi:10.1007/s00787-018-1127-y DOI: https://doi.org/10.1007/s00787-018-1127-y

Warner, R. M. (2012). Applied statistics: From bivariate through multivariate techniques: Sage Publications.

Zewude, B. T., & Ashine, K. M. (2016). Binary Logistic Regression Analysis in Assessment and Identifying Factors That Influence Students' Academic Achievement: The Case of College of Natural and Computational Science, Wolaita Sodo University, Ethiopia. Journal of Education and Practice, 7(25), 3-7.

Published
2022-01-18
How to Cite
Reza, S., Sarwar, B., Nawaz, R. R., & Ul Haq, S. M. N. (2022). Application of Logistic Regression on Passenger Survival Data of the Titanic Liner. Journal of Accounting and Finance in Emerging Economies, 7(4), 861-867. https://doi.org/10.26710/jafee.v7i4.1994