我無法回圈遍歷回歸模型,每次都會丟棄一個觀察值來估計有影響的觀察值的效果。
我想多次運行模型,每次都放棄第 i 個觀察值并提取相關系數估計并將其存盤在向量中。我認為這可以很容易地通過一個相當直接的回圈來完成,但是,我被困在細節上。
我想留下一個向量,其中包含來自同一模型的 n 次迭代的 n 系數估計值。任何幫助都是有益的!
下面我提供一些虛擬資料和示例代碼。
#Dummy data:
set.seed(489)
patientn <- rep(1:400)
gender <- rbinom(400, 1, 0.5)
productid <- rep(c("Product A","Product B"), times=200)
country <- rep(c("USA","UK","Canada","Mexico"), each=50)
baselarea <- rnorm(400,400,60) #baseline area
baselarea2 <- rnorm(400,400,65) #baseline area2
sfactor <- c(
rep(c(0.3,0.9), times = 25),
rep(c(0.4,0.5), times = 25),
rep(c(0.2,0.4), times = 25),
rep(c(0.3,0.7), times = 25)
)
rashdummy2a <- data.frame(patientn,gender,productid,country,baselarea,baselarea2,sfactor)
Data <- rashdummy2a %>% mutate(rashleft = baselarea2*sfactor/baselarea*100) ```
## Example of how this can be done manually:
# model
m1<-lm(rashleft ~ gender baselarea sfactor, data = data)
# extracting relevant coefficient estimates, each time dropping a different "patient" ("patientn")
betas <- c(lm(rashleft ~ gender baselarea sfactor, data = rashdummy2b, patientn !=1)$coefficients[2],
lm(rashleft ~ gender baselarea sfactor, data = rashdummy2b, patientn !=2)$coefficients[2],
lm(rashleft ~ gender baselarea sfactor, data = rashdummy2b, patientn !=3)$coefficients[2])
# the betas vector now stores the relevant coefficient estimates (coefficient nr 2, for gender) for three different variations of the model.
uj5u.com熱心網友回復:
我們可以使用 for 回圈。在您的問題中,您使用了一個rashdummy2b未定義的物件。現在我使用 data了,但您可以用選擇的物件替換它。
#create list to bind results to
result <- list()
#loop through patients and extract betas
for(i in unique(data$patientn)){
#construct linear model
lm.model <- lm(rashleft ~ gender baselarea sfactor, data = subset(data, data$patientn != i))
#create data.frame containing patient left out and coefficient
result.dt <- data.frame(beta = lm.model$coefficients[[2]],
patient_left_out = i)
#bind to list
result[[i]] <- result.dt
}
#bind to data.frame
result <- do.call(rbind, result)
結果
head(result)
beta patient_left_out
1 1.381248 1
2 1.345188 2
3 1.427784 3
4 1.361674 4
5 1.420417 5
6 1.454196 6
uj5u.com熱心網友回復:
您可以使用負索引洗掉特定的行(或列)。在您的情況下,您按以下方式進行:
betas <- numeric(nrow(rashdummy2b)) # memory preallocation
for (i in 1:nrow(rashdummy2b)) {
betas[i] <- lm(rashleft ~ gender baselarea sfactor, data=rashdummy2b[-i,])$coefficients[2]
}
轉載請註明出處,本文鏈接:https://www.uj5u.com/qianduan/486726.html
