Sample data from dput() below:
structure(list(group = c(34676739L, 45938970L, 22731473L, 40083768L,
22527333L, 51629537L, 26299463L, 27420157L, 24898717L, 43569190L,
34573189L, 44503577L, 25471327L, 44630117L, 19048782L, 39710425L,
33535680L, 54358561L, 27363448L, 39386432L, 44150096L, 24614702L,
36219027L, 39609036L, 10803983L, 54770896L, 27574728L, 40912817L,
24679610L, 40261463L), partners = c("US-GB", "US-JP", "US-JP",
"US-GB", "GB-US", "US-GB", "GB-US", "US-GB", "US-GB", "US-JP",
"US-GB", "US-GB", "US-GB", "GB-US", "JP-US", "US-JP", "JP-US",
"JP-US", "US-GB", "US-GB", "US-JP", "US-GB", "GB-US", "GB-US",
"US-GB", "US-GB", "US-JP", "JP-US", "US-GB", "US-GB")), row.names = c(NA,
-30L), class = c("data.table", "data.frame"))
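What I want to do: I want to create a new variable, say partners_final, that normalizes partners. As you can see, partners has the entries US-GB, GB-US, US-JP, and JP-US. These just represent relationships between business partners, so technically US-JP == JP-US and US-GB == GB-US.
However, these entries are (obviously) not equivalent in R, which makes empirical work difficult. So I would like to create a new variable partners_final that gives one unified business-partner pair regardless of the order of the two partners.
Note that my actual dataset has many, many partners. I need something that works across the whole dataset, i.e. partners_final must reflect AB-CD === CD-AB for all pairs AB, CD. Is there a way to do this in R (preferably avoiding a pivot, because some country pairs that may need to be considered later do not appear in the data)?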
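Reply from a uj5u.com user:
You need some rule for ordering the two codes, I assume. Here I will assume the normalized pair is in alphabetical order. You can split each entry, sort the parts, and rejoin them.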
library(stringr)
d <- structure(list(
group = c(
34676739L, 45938970L, 22731473L, 40083768L,
22527333L, 51629537L, 26299463L, 27420157L, 24898717L, 43569190L,
34573189L, 44503577L, 25471327L, 44630117L, 19048782L, 39710425L,
33535680L, 54358561L, 27363448L, 39386432L, 44150096L, 24614702L,
36219027L, 39609036L, 10803983L, 54770896L, 27574728L, 40912817L,
24679610L, 40261463L
),
partners = c(
"US-GB", "US-JP", "US-JP",
"US-GB", "GB-US", "US-GB", "GB-US", "US-GB", "US-GB", "US-JP",
"US-GB", "US-GB", "US-GB", "GB-US", "JP-US", "US-JP", "JP-US",
"JP-US", "US-GB", "US-GB", "US-JP", "US-GB", "GB-US", "GB-US",
"US-GB", "US-GB", "US-JP", "JP-US", "US-GB", "US-GB"
)
),
row.names = c(NA, -30L), class = c("data.table", "data.frame")
)
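# Split each pair on "-", sort the two codes alphabetically, and rejoin them.
# vapply() returns a plain character vector; lapply() would create a list column.
d$cleaned <- vapply(str_split(d$partners, "-"),
                    function(x) paste0(sort(x), collapse = "-"),
                    character(1))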
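Since the question mentions a much larger real dataset, here is a minimal base R sketch of the same split, sort, and rejoin idea that avoids calling sort() once per row. The helper name normalize_pair is hypothetical (it is not part of the original answer), and the sketch assumes every entry holds exactly two codes separated by a single "-". As with sort(), the ordering follows the current locale's collation.

```r
# Hypothetical helper, not from the original answer.
# Assumes each entry looks like "AB-CD": two codes and one separator.
normalize_pair <- function(x, sep = "-") {
  parts  <- strsplit(x, sep, fixed = TRUE)
  first  <- vapply(parts, `[`, character(1), 1L)  # code before the separator
  second <- vapply(parts, `[`, character(1), 2L)  # code after the separator
  # pmin()/pmax() compare the two codes element-wise, so the
  # alphabetically smaller code always comes first.
  paste(pmin(first, second), pmax(first, second), sep = sep)
}

partners <- c("US-GB", "GB-US", "US-JP", "JP-US")
normalize_pair(partners)
```

On the sample data this could be applied as d$partners_final <- normalize_pair(d$partners). Nothing here depends on which pairs are present, so pairs that only show up later are handled the same way.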
