使用具有密度核的蒙特卡羅方法計算均值和方差-有解無憂

假設密度核等于：使用具有密度核的蒙特卡羅方法計算均值和方差，我可以使用哪些蒙特卡羅方法來估計分布的均值和方差？

uj5u.com熱心網友回復：

我們可以在這里使用數值方法。首先，我們創建了一個函式來表示您的概率密度函式（盡管它還沒有進行縮放，因此其積分為 1）：

pdf <- function(x) x^2 * exp(-x^2/4)

plot(pdf, xlim = c(0, 10))

使用具有密度核的蒙特卡羅方法計算均值和方差

我們可以看到曲線下幾乎所有的面積都發生在 x < 10 處，所以如果我們在 x = 100 處找到積分，我們應該有一個非常準確的比例因子來生成真正的 pdf：

integrate(pdf, 0, 100)$value
#> [1] 3.544908

所以現在我們可以生成一個真正的pdf：

pdf <- function(x)  x^2 * exp(-x^2/4) / 3.544908

plot(pdf, xlim = c(0, 10))

使用具有密度核的蒙特卡羅方法計算均值和方差

現在我們有了一個 pdf，我們可以創建一個帶有數值積分的 cdf：

cdf <- function(x) sapply(x, \(i) integrate(pdf, 0, i)$value)

plot(cdf, xlim = c(0, 10))

使用具有密度核的蒙特卡羅方法計算均值和方差

cdf 的倒數是我們需要能夠將從 0 和 1 之間的均勻分布中抽取的樣本轉換為從我們的新分布中抽取的樣本。我們可以創建一個反函式，uniroot用于查找 cdf 的輸出與 0 和 1 之間的任意數字匹配的位置：

inverse_cdf <- function(p) 
{
  sapply(p, function(i) {
              uniroot(function(a) {cdf(a) - i}, c(0, 100))$root
  })
}

逆 cdf 如下所示：

plot(inverse_cdf, xlim = c(0, 0.99))

使用具有密度核的蒙特卡羅方法計算均值和方差

我們現在準備從我們的分布中抽取樣本：

set.seed(1) # Makes this draw reproducible

x_sample <- inverse_cdf(runif(1000))

現在我們可以繪制樣本的直方圖并確保它與 pdf 匹配：

hist(x_sample, freq = FALSE)

plot(function(x) pdf(x), add = TRUE, xlim = c(0, 6))

使用具有密度核的蒙特卡羅方法計算均值和方差

Now we are confident that we have a sample drawn from x, we can use the sample mean and standard deviation as estimates for the distribution's mean and standard deviation:

mean(x_sample)
#> [1] 2.264438

sd(x_sample)
#> [1] 0.9625839

We can increase the accuracy of these estimates by taking larger and larger samples in our call to inverse_cdf(runif(1000)), by increasing the 1000 to a larger number.

^{Created on 2021-11-06 by the reprex package (v2.0.0)}

轉載請註明出處，本文鏈接：https://www.uj5u.com/qita/350424.html

標籤：r 统计数据意思凯德方差

上一篇：R-如何按名稱從串列的子串列中選擇元素

下一篇：ggplot在特定情況下生成鋸齒狀/斷線