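Goal:
I want to create a stacked histogram of the PaperlessBilling categorical feature (Telco customer churn dataset), with the Y axis shown as a percentage and the churn distribution shown as the hue. However, the percentages I get are not computed within each category, so the stacked bar for a category does not add up to 100%.
If I were using R, this is what I would expect: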
ggplot(Churn, aes(SeniorCitizen, fill = Churn)) +
  geom_bar(position = "fill") +
  xlab("Senior Citizen status") +
  ylab("Percent") +
  scale_y_continuous(labels = scales::percent) +
  scale_x_discrete(labels = c("Non-Senior Citizens", "Senior Citizens")) +
  scale_fill_manual(name = "Churn Status", values = c("green2", "red1"), labels = c("No", "Yes")) +
  ggtitle("The Ratio of Churns by Senior Citizen status") +
  theme_classic() +
  theme(legend.position = "bottom",
        plot.title = element_text(hjust = 0.5, size = 15))
This is the output of the code above (note that each of the two categories sums to 100%):

This is what I did:
import matplotlib.pyplot as plt
import seaborn as sns

# df is the Telco customer churn DataFrame
fig, axs = plt.subplots(figsize=(5, 5))
sns.histplot(
    df,
    x="PaperlessBilling",
    hue="Churn",
    multiple="stack",
    stat="percent",
)
This is the output of the code above:

Reply from a uj5u.com user:
With
