所以這是我正在使用的示例資料集的一部分：

   `D1`    `D2`        'D3'     `D4`     `D5`     `D6`     `D7`
   <chr>   <chr>      <dbl>    <dbl>    <dbl>    <dbl>    <dbl>
 1 921     917          935      457      462      451      465
 2 898     E9           914      446      452      440      455
 3 817     806          814      407      412      398      411
 4 644     632          624      321      327      314      324
 5 E9      399          385      207      213      200      206
 6 136     127          127       69       72       66       66
 7 223     233          209      117      106      117      118
 8 475     E9           443      239      234      238      246
 9 684     685          665      340      341      337      348
10 816     814          828      406      409      400      412
...

這是在我使用它之后，您可以看到前兩列中有幾個“E9”實體，這是我希望通過運行它來計算的：

df2 <- df %>%  select(-c(Time))
devices$Exclusions <- str_count(df2, "E9")

這是我的最終結果：

  Device        ID             Exclusions
  <chr>         <int>          <int>
1 D4            145287         14
2 D5            145286         16
3 D6            145285          0
4 D7            145284          0
5 D1            145280          0
6 D2            145277          0
7 D3            145278          0

所以這導致了我的問題。這些設備不一定以相同的順序排列，當它計算“E9”的實體時，它只是按照這些設備所在的順序將它們附加到另一個資料幀，而不是將它們與它們的名稱匹配。為了將 D1 列中的 str_count 添加到另一個資料框中的 D1 行，而不僅僅是頂行，我可以添加什么？

uj5u.com熱心網友回復：

這是tidyverse.

解決方案

library(tidyverse)


# ...
# Code to generate 'df'.
# ...


df_counts <- df %>%
  # Homogenize columns as text.
  mutate(across(everything(), as.character)) %>%
  # Pivot columns into a 'Device | Code' format.
  pivot_longer(everything(), names_to = "Device", values_to = "Code") %>%
  # For each device...
  group_by(Device) %>%
  # ...count how many times "E9" appears among its codes.
  summarize(Exclusions = sum(Code == "E9"))

推測您的devices資料集的結構，我可以用ID您的示例輸出中的那些s來豐富結果：

# ...
# Code to generate 'devices'.
# ...


devices <- devices %>%
  full_join(df_counts, by = "Device", keep = FALSE)

結果

給定一個像你的例子這樣的df資料集

df <- structure(
  list(
    D1 = c("921", "898", "817", "644", "E9", "136", "223", "475", "684", "816"),
    D2 = c("917", "E9", "806", "632", "399", "127", "233", "E9", "685", "814"),
    D3 = c(935, 914, 814, 624, 385, 127, 209, 443, 665, 828),
    D4 = c(457, 446, 407, 321, 207, 69, 117, 239, 340, 406),
    D5 = c(462, 452, 412, 327, 213, 72, 106, 234, 341, 409),
    D6 = c(451, 440, 398, 314, 200, 66, 117, 238, 337, 400),
    D7 = c(465, 455, 411, 324, 206, 66, 118, 246, 348, 412)
  ),
  class = c("tbl_df", "tbl", "data.frame"),
  row.names = c(NA, -10L)
)

此作業流程應產生如下結果df_counts：

# A tibble: 7 x 2
  Device Exclusions
  <chr>       <int>
1 D1              1
2 D2              2
3 D3              0
4 D4              0
5 D5              0
6 D6              0
7 D7              0

此外，給定一個像您的示例這樣的devices資料集

devices <- structure(
  list(
    Device = c("D4", "D5", "D6", "D7", "D1", "D2", "D3"),
    ID = c(145287L, 145286L, 145285L, 145284L, 145280L, 145277L, 145278L)
  ),
  class = c("tbl_df", "tbl", "data.frame"),
  row.names = c(NA, -7L)
)

此解決方案應生成如下devices資料集：

# A tibble: 7 x 3
  Device     ID Exclusions
  <chr>   <int>      <int>
1 D4     145287          0
2 D5     145286          0
3 D6     145285          0
4 D7     145284          0
5 D1     145280          1
6 D2     145277          2
7 D3     145278          0

轉載請註明出處，本文鏈接：https://www.uj5u.com/net/409478.html

標籤：

上一篇：使用np.where()比較4列后得到最后一列

下一篇：創建計算滾動平均值同時排除值的資料框列

計算一個資料幀中字串的實體，然后將結果附加到另一個資料幀中具有匹配設備名稱的行？

解決方案

結果