Regression analysis is a commonly used method in data processing, which can identify unknown relationships between data variables, obtain mathematical expressions that are more in line with variable relationships, and help users complete data analysis.
Next, we will introduce the multiple regression analysis method in regression analysis. IBM SPSS Statistics provides users with a mature multiple logistic regression analysis algorithm.
I. Overview
1. Data
Figure 1: Data Sample
This is a survey data on breakfast choices among different populations. Through multiple regression analysis using SPSS, population characteristic variables can be analyzed for breakfast types to find their relationships.
2. Functional location
Figure 2: Functional Location
Under the "Analysis" menu, we can find the "Multivariate Logistic" analysis in "Regression" and enter the window for multivariate regression analysis.
2、 Analysis method
1. Dependent variable setting
Figure 3: Dependent variable setting
The dependent variable is the quantity that follows the change of the independent variable, in this case referring to the variable of "preferred breakfast".
Click on 'Reference Category' and set the reference category for the dependent variable. This is the reference sample for analysis, and we set all categories to be compared with the last category in ascending order.
2. Factors and covariates
Figure 4: Factors and covariates
Factors can be simply understood as independent variables. Here, we will treat age segmentation, marital status, and lifestyle as dependent variables.
Covariance is a control variable that needs to be controlled during the analysis process and has a certain impact on the dependent variable. Here, it is set as gender.
3. Analyze the model
Figure 5: Analysis Model
There are three types of models available for multiple regression analysis in SPSS. The main effect refers to the analysis of the relationship between pre-set factors and covariates and the dependent variable; The full factor model includes both the main effect and the interaction analysis between factors and covariates; Customized step-by-step analysis allows users to define their own analysis types.
We can choose the main effect for analysis here.
4. Statistical settings
Figure 6: Statistical Settings
The settings in this window are for the statistical data analysis that needs to be performed, including multiple types of statistical data that can be selected. We have checked the pseudo R-square, cell likelihood, step summary, classification table, model fit information and fit, estimation under parameters (with a confidence interval set to 95%), and likelihood ratio test under the model.
Define subpopulation selection as a covariate pattern defined by factors and covariates.
5. Convergence conditions
Figure 7: Convergence Settings
In the condition dialog box, make convergence settings.
The maximum iteration number is the number of iterations that can be performed during regression analysis of data. This value must be an integer greater than or less than 100. The maximum step split is set as an equal fraction during iteration, and the system defaults to 5.
Logarithmic likelihood convergence can set a convergence value, and the logarithmic likelihood ratio function is greater than the set value during the regression process; The numerical settings for parameter convergence are similar.
In this example, the dialog box should remain default.
6. Option settings
Figure 8: Option Settings
Set the discrete metric to 'none' in the options dialog box.
The probability of data entering is 0.05 and the probability of data exiting is 0.1. Among these two parameters, the larger the former, the more data will enter the model; The smaller the latter, the more data is excluded, and both the entry and exit methods choose likelihood.
Leave the rest as default.
7. Save Settings
Set the variables to be saved in this dialog box, and if you need to output model information to an XML file, you can also do so in the following settings.
8. Complete the analysis
Figure 10: Result output
After completing the above settings, you can view the analysis results in the log output window








