df <- read.csv("https://query.data.world/s/gzjmftivszsy44ukfak2e7ksig35jm", header=TRUE, stringsAsFactors=FALSE);
library(ggplot2)
library(qqplotr)
library(stats)
library(dplyr)
coverage_by_Geography = data.frame(avgcancerdiag= df$avgAnnCount, county = df$Geography, PubCoverage = df$PctPublicCoverage, privcoverage = df$PctPrivateCoverage, deathrt = df$avgDeathsPerYear)
ggplot(data = coverage_by_Geography, aes(x = privcoverage, y = deathrt)) geom_col()
ggplot(data = coverage_by_Geography, aes(x = PubCoverage, y = deathrt)) geom_col()
我試圖在一列中取一堆縣,將它們濃縮成州,并將他們的資料平均到州數字而不是縣。我很難過如何做到這一點。
uj5u.com熱心網友回復:
一個通用的tidyverse解決方案如下:
library(tidyverse)
df <- read_csv("https://query.data.world/s/gzjmftivszsy44ukfak2e7ksig35jm")
df %>%
separate(Geography, c("county", "state"), ", ") %>%
select(state, county, everything()) %>%
group_by(state) %>%
summarize(across(-c(county), mean))
該代碼將縣和州分為兩列。按狀態分組允許您匯總資料。在這里,我詢問了所有列的平均值,但這可能對所有不同的資料型別都沒有意義。希望這能讓你更接近你正在尋找的東西。
轉載請註明出處,本文鏈接:https://www.uj5u.com/gongcheng/466738.html
