Original link:tecdat.cn/?p=4124

Original source:Tuo End number according to the tribe public number

 

 

Partial least squares regression:

I will do some consulting around structural equation modeling (SEM) techniques to solve unique business problems. We try to identify customers’ preferences for various products, and traditional regression is not enough because of the high components of the data set and the multicollinearity of the variables. PLS is a powerful and efficient way to deal with these problematic data sets.

Principal component regression is one option we will explore, but when doing background research, I found that PLS might be the better choice. We will look at PLS regression and PLS path analysis. I don’t believe a return to tradition is valuable at this point because we don’t have a good sense or theory to make assumptions about the underlying structure. In addition, due to the large number of variables in the data set, we are extending SEM technology to its limits. An interesting discussion of this limitation can be found in Haenlein, M&Kaplan, A., 2004, “Preliminary Guideline Partial least squares Analysis”, Understanding Statistics, 3 (4), 283-297.

[1] "diesel" and "turbo" and "two doors" "hatchback" "wheel. Base" [6] "length" width "height" "" "skills he. Weight "" eng. Size" [16] "Highway. MPG" [16] "Highway. MPG"Copy the code

The data consisted of 16 variables and 30 observations.

One of the interesting things about PLS regression is that you can have multiple response variables and PLsDepot can accommodate this type of analysis. In this case, I want to analyze only one Y variable, and that is price.

The package places the response variable column at the end of the data frame.

Pls1 $x.lads X-loadings $y.lads X-loadings (U-components) $y.lads Y-loadings $ Cor.xyt score correlation $raw.wGS Original weight $mod.WGS Modified weight $STd.coEFS Standard coefficient $reg.CoEFS Conventional coefficient $R2 R squared $R2Xy explain the variance of Xy T $y.piny - Predict $resid Residual $T2 T2 economic coefficientCopy the code

Word-wrap: break-word! Important; “> Notice what is highly correlated with price

#plot Predictions and actual observations for each observation

We will have to continue to look at different numbers of components to determine the best model and see if the underlying variables make sense from a practical point of view.


Most welcome insight

1.R language multiple Logistic Logistic regression application case

2. Panel smooth transfer regression (PSTR) analysis case implementation

3. Partial least squares regression (PLSR) and principal component regression (PCR) in MATLAB

4.R language Poisson regression model analysis cases

5. Hosmer-lemeshow goodness of fit test in R language regression

6. Implementation of LASSO regression, Ridge regression and Elastic Net model in R language

7. Realize Logistic Logistic regression in R language

8. Python predicts stock prices using linear regression

9. How to calculate IDI and NRI indices for R language in survival analysis and Cox regression