Using the Logistic Regression in Analysis of Results from Statistical Observations

Authors

DOI:

https://doi.org/10.31767/su.3(94)2021.03.01

Keywords:

logit model, binary choice, multiple choice, chance logarithm, maximum likelihood method, self-assessment of health status

Abstract

The article is focused on investigating the problems arising in the statistical analysis with use of logistic models of the ordered multiple choice, which are constructed by the results of statistical observations involving the existence of a categorical dependent variable. This group of models should be used when a discrete dependent variable takes several alternative values. The examples include assessment of student performance (perfect, good, satisfactory, unsatisfactory). These models’ parameters are estimated using the algorithms based on elements of the probability theory. The purpose of constructing the multiple choice model is to determine the factors with impact on the probability of the occurrence of a particular event and the choice of an alternative, as well as the strength of this impact. A detailed description of the algorithms for estimating logit models of binary and multiple choice is given, with demonstrating the model application in solving a particular problem (statistical analysis of the results of self-assessment of health status by household members) by use of SPSS package. It should be noted that statistical packages like SPSS, STATISTICA or STATA contain the modules for constructing logit and probit models.

The assessment of population’s health status includes the objective assessment of their health status by the official statistics data on the prevalence of deceases and the cumulative subjective assessment of the individual health status by the results of sociological studies  It is important to know to what extent the objective assessment of the population’s health status complies with the subjective perception of the health status by individuals. Because the primary files of the sample survey of households are confidential, the multiple choice model was constructed by the author using the proxy data with characteristics close to actual values. Variables such as residence place, gender, age, assessment of health status, sports practicing, smoking and income were reported in the process of the sample survey. In constructing the model, the variable “health” was used as a dependent variable; “gender” and “education” were used as categorical variables; “age” and “income” were used as covariates. Once the model was constructed and its identification capacity (i. e. the correctness of the predicted dependent variable) estimated, its specification was saved in a special file for the subsequent rebuilding.        

Downloads

Download data is not yet available.

References

Anatolyev, S. (2009). Neparametricheskaya regressiya [Nonparametric regression]. Kvantil – Quantile, 7, 37–52. Retrieved from http://www.quantile.ru/07/07-SA.pdf [in Russian].

Ayvazyan, S. A., & Mkhitaryan, V. S. (1998). Prikladnaya statistika i osnovy ekonometriki [Applied statistics and foundations of econometrics]. Moscow: Yuniti [in Russian].

Buyul, A. & Zöfel P. (2002). SPSS: iskusstvo obrabotki informatsii. Analiz statisticheskikh dannykh i vosstanovlenie skrytykh zakonomernostey [SPSS: The Art of Information Processing. Analysis of statistical data and restoration of hidden patterns]. V. E. Momot (Ed.). St. Petersburg: Dia Soft UP.

Dougherty, K. (2009). Vvedenie v ekonometriku [Introduction to Econometrics]. (3d ed.). Moscow: INFRA-M.

Eliseeva, I. I., Kurysheva, S. V., Kosteeva, T. V. (2007). Ekonometrika [Econometrics]. Moscow: Financy i statistika.

Green, W. G. (2016). Ekonometricheskiy analiz [Econometric Analysis]. S. S. Sinelnikov, M. Yu. Turuntseva (Eds.). Book 1. Moscow: ID “Delo” RANKhiGS.

Magnus, Ya. R., Katyshev, P. K., & Peresetskiy, A. A. (2004). Ekonometrika. Nachalnyy kurs [Econometrics. Elementary course]. (6th ed.). Moscow: ID “Delo”.

Pautova, N. I., & Pautov, I. S. (2015). Gendernye osobennosti samootsenki zdorovya i ego vospriyatiya kak sotsiokulturnoy tsennosti (po dannym 21-y volny RLMS-HSE) [Gender characteristics of health self-assessment and perception as a socio-cultural value (Based on the data of the 21st round of RLMS-HSE)]. Zhenschina v rossiyskom obschestve – Woman in Russian Society, 2 (75), 60–75. Retrieved from https://womaninrussiansociety.ru/wp-content/uploads/2015/06/Pautova-Pautov_64_80.pdf

Peresetskiy, A. A. (2021). Ekonometricheskie metody v distantsionnom analize deyatelnosti rossiyskikh bankov [Econometric methods in remote analysis of the activities of Russian banks]. Moscow: NIU “Vysshaya shkola ekonomiki”.

Perova, M. B. (2016). Obektivnaya i subektivnaya otsenka sostoyaniya zdorovya naseleniya Rossii [Objective and subjective assessment of population health status in Russia]. Sistemnoe upravlenie – System management, 1 (30). Retrieved from http://sisupr.mrsu.ru/2016-1/PDF/Perova_2016-1.pdf

Sedova, E. N. (2008). Modeli mnozhestvennogo vybora v zadachakh otsenki i upravleniya ekologo-ekonomicheskimi riskami [Multiple choice models in the tasks of assessing and managing environmental and economic risks]. Vestnik OGU – Bulletin of OSU, 10, 96–102. Retrieved from https://cyberleninka.ru/article/n/modeli-mnozhestvennogo-vybora-v-zadachah-otsenki-i-upravleniya-ekologo-ekonomicheskimi-riskami/viewer

Tsypina, N. Yu. (2010). Statisticheskiy analiz faktorov, okazyvayuschikh vliyanie na kachestvo predostavlyaemykh uslug naseleniyu Orenburgskoy oblasti [Statistical analysis of factors influencing the quality of services provided to the population of the Orenburg region]. Ekonomicheskie nauki – Economic Sciences, 9 (70), 227–231. Retrieved from https://ecsn.ru/files/pdf/201009/201009_227.pdf

Verbik, M. (2008). Putevoditel po sovremennoy ekonometrike [A guide to modern econometrics]. S. A. Ayvazyan (Ed.). Moscow: Nauchnaya kniga.

Voischeva, O. S. (2006). Ekonometricheskie modeli kachestvennykh peremennykh v prognoznykh zadachakh marketinga [Econometric models of qualitative variables in predictive marketing problems]. Vestnik VGU, Seriya: Ekonomika i upravleniye – Proceedings of VSU, Series: Economics and Management, 2, 261–268. Retrieved from http://www.vestnik.vsu.ru/pdf/econ/2006/02/2006-02-42.pdf

Zsolt S. (2009). Multinomialnye modeli diskretnogo vybora [Multinomial discrete choice models]. Kvantil – Quantile, 7, 9–19. Retrieved from http://quantile.ru/07/07-ZS.pdf

Published

2021-09-15

How to Cite

Soshnikova, L. A. . (2021). Using the Logistic Regression in Analysis of Results from Statistical Observations . Statistics of Ukraine, 94(3), 4–11. https://doi.org/10.31767/su.3(94)2021.03.01