Application of Logistic Regression on Passenger Survival Data of the Titanic Liner
Abstract
Purpose: This empirical research aims to predict the distinguishing variables of passengers who did or did not survive while traveling in the famous Titanic liner, which sunk in 1912.
Design/Methodology/Approach: The binary logistic regression analysis empirically analyzes the secondary dataset available for 1046 passengers. Variables such as passenger’s gender, age, family composition, ticket class, number of parents with/without children, and number of siblings and/or spouses were opted to examine the differences between the binary dependent variable (Passenger Survived/ Not Survived).
Findings: The study results indicate that all the variables are statistically significant in the model, with passenger's gender being the most significant predictor followed by passenger’s ticket class. The survival chances of passengers decreased for male passengers compared to their counterparts (female passengers) for the sample data [Exp(β)=0.080], for the passengers of age more than 21 years compared to passengers of age less than and equal to 21 years [Exp(β)=0.576], and for passengers with ticket class second and third compared to first-class ticket holders [Exp(β)=0.412]. In contrast, there was a greater chance of survival for families traveling together with parents, siblings, spouses compared to single travelers [Exp(β)=1.823].
Implications/Originality/Value: The study is a classic example of the application of binary logistic regression analysis using EVIEWS software.
Downloads
Article Analytics Summary
References
Agresti, A. (2003). Categorical data analysis (Vol. 482): John Wiley & Sons. DOI: https://doi.org/10.1002/0471249688
Berkson, J. (1953). A Statistically Precise and Relatively Simple Method of Estimating the Bio-Assay with Quantal Response, Based on the Logistic Function. Journal of the American Statistical Association, 48(263), 565-599. doi:10.1080/01621459.1953.10483494 DOI: https://doi.org/10.1080/01621459.1953.10483494
Breiman, L., & Friedman, J. H. (1997). Predicting Multivariate Responses in Multiple Linear Regression. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 59(1), 3-54. doi:https://doi.org/10.1111/1467-9868.00054 DOI: https://doi.org/10.1111/1467-9868.00054
Cox, D. R. (1958). The Regression Analysis of Binary Sequences. Journal of the Royal Statistical Society: Series B (Methodological), 20(2), 215-232. doi:10.1111/j.2517-6161.1958.tb00292.x DOI: https://doi.org/10.1111/j.2517-6161.1958.tb00292.x
Cramer, J. S. (2002). The origins of logistic regression. doi:10.2139/ssrn.360300 DOI: https://doi.org/10.2139/ssrn.360300
De La Viña, L., & Ford, J. (2001). Logistic Regression Analysis of Cruise Vacation Market Potential: Demographic and Trip Attribute Perception Factors. Journal of Travel Research, 39(4), 406-410. doi:10.1177/004728750103900407 DOI: https://doi.org/10.1177/004728750103900407
De Noble, A., Galbraith, C. S., Singh, G., & Stiles, C. H. (2007). Market justice, religious orientation, and entrepreneurial attitudes. Journal of Enterprising Communities: People and Places in the Global Economy, 1(2), 121-134. doi:10.1108/17506200710752548 DOI: https://doi.org/10.1108/17506200710752548
Fehrman, E., Muhammad, A. K., Mirkes, E. M., Egan, V., & Gorban, A. N. (2017). The five factor model of personality and evaluation of drug consumption risk Data science (pp. 231-242): Springer. DOI: https://doi.org/10.1007/978-3-319-55723-6_18
Friedman, J., Hastie, T., & Tibshirani, R. (2001). The elements of statistical learning (Vol. 1): Springer series in statistics New York. DOI: https://doi.org/10.1007/978-0-387-21606-5_1
Hosmer Jr, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (Vol. 398): John Wiley & Sons. DOI: https://doi.org/10.1002/9781118548387
Kirk, R. E. (2003). The importance of effect magnitude. Handbook of research methods in experimental psychology, 83-105. DOI: https://doi.org/10.1002/9780470756973.ch5
Meyers, L. S., Gamst, G., & Guarino, A. J. (2016). Applied multivariate research: Design and interpretation: Sage publications.
Miller, M. E., Hui, S. L., & Tierney, W. M. (1991). Validation techniques for logistic regression models. Statistics in Medicine, 10(8), 1213-1226. doi:10.1002/sim.4780100805 DOI: https://doi.org/10.1002/sim.4780100805
Molenberghs, G., & Verbeke, G. (2006). Models for discrete longitudinal data: Springer Science & Business Media.
Pampel, F. C. (2000). Sage Publications i. Logistic Regression: A Primer: SAGE Publications. DOI: https://doi.org/10.4135/9781412984805
Penninx, B. W. J. H., Beekman, A. T. F., Smit, J. H., Zitman, F. G., Nolen, W. A., Spinhoven, P., . . . Van Dyck, R. (2008). The Netherlands Study of Depression and Anxiety (NESDA): rationale, objectives and methods. International Journal of Methods in Psychiatric Research, 17(3), 121-140. doi:10.1002/mpr.256 DOI: https://doi.org/10.1002/mpr.256
Pituch, K. A., & Stevens, J. P. (2015). Applied multivariate statistics for the social sciences: Analyses with SAS and IBM’s SPSS: Routledge. DOI: https://doi.org/10.4324/9781315814919
Spinhoven, P., Elzinga, B. M., Hovens, J. G. F. M., Roelofs, K., Zitman, F. G., van Oppen, P., & Penninx, B. W. J. H. (2010). The specificity of childhood adversities and negative life events across the life span to anxiety and depressive disorders. Journal of Affective Disorders, 126(1), 103-112. doi:https://doi.org/10.1016/j.jad.2010.02.132 DOI: https://doi.org/10.1016/j.jad.2010.02.132
Studenmund, A. H. (2014). Using econometrics a practical guide: Pearson.
Thompson, B. (1999). Statistical Significance Tests, Effect Size Reporting and the Vain Pursuit of Pseudo-Objectivity. Theory & Psychology, 9(2), 191-196. doi:10.1177/095935439992007 DOI: https://doi.org/10.1177/095935439992007
Vugteveen, J., De Bildt, A., Hartman, C. A., & Timmerman, M. E. (2018). Using the Dutch multi-informant Strengths and Difficulties Questionnaire (SDQ) to predict adolescent psychiatric diagnoses. European Child & Adolescent Psychiatry, 27(10), 1347-1359. doi:10.1007/s00787-018-1127-y DOI: https://doi.org/10.1007/s00787-018-1127-y
Warner, R. M. (2012). Applied statistics: From bivariate through multivariate techniques: Sage Publications.
Zewude, B. T., & Ashine, K. M. (2016). Binary Logistic Regression Analysis in Assessment and Identifying Factors That Influence Students' Academic Achievement: The Case of College of Natural and Computational Science, Wolaita Sodo University, Ethiopia. Journal of Education and Practice, 7(25), 3-7.
Copyright (c) 2021 Sajjida Reza, Bilal Sarwar, Raja Rub Nawaz, S. M. Nabeel Ul Haq

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
CSRC Publishing and JAFEE adhere to Creative Commons Attribution-Non Commercial 4.0 International License. The authors, submitting and publishing in the Journal of Accounting and Finance in Emerging Economies published by CSRC Publishing, retain the copyright of their work and give the journal right to publish their work agreeing to the licensing policy under Creative Common Attribution-Non Commercial (NC-BY-NC 4.0) International. Under this license, the published authors let others remix, tweak, and build upon their work non-commercially. Yet all the other authors using the content of CSRC Publishing are required to cite author(s), journal name and publisher in their work. CSRC Publishing and JAFEE follow an Open Access Policy for copyright and licensing.