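I want to use a regular expression to replace patterns in a DataFrame
For example, I have the table below. I want to replace each digit of the account number with N; for example, if the account number has 5 digits, it should be replaced with 5 Ns (NNNNN).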
Source
Account_Num,Facility Name,Address,City
10605,SAGE MEMORIAL HOSPITAL,STATE ROUTE 264 SOUTH 191,GANADO
2425,WOODRIDGE BEHAVIORAL CENTER,600 NORTH 7TH STREET,XDSDSD
Target
Account_Num,Facility Name,Address,City
NNNNN,AAAA AAAAAAAA AAAAAAA,STATE ROUTE 264 SOUTH 191,GANADO
NNNN,WOODRIDGE BEHAVIORAL CENTER,600 NORTH 7TH STREET,XDSDSD
I am trying the following code:
print(df.replace(to_replace=([re.search(r'\d ',str(df_str))]),value='NNNNN', regex=True))
A uj5u.com user replied:
You can pass several regex patterns to .replace, each with its own replacement:
df = df.astype(str).replace([r'[a-zA-Z]', r'\d'], ['A', 'N'], regex=True)
Output:
>>> df
   Account_Num                Facility Name                    Address    City
0        NNNNN       AAAA AAAAAAAA AAAAAAAA  AAAAA AAAAA NNN AAAAA NNN  AAAAAA
1         NNNN  AAAAAAAAA AAAAAAAAAA AAAAAA       NNN AAAAA NAA AAAAAA  AAAAAA
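Note that the call above masks every letter and digit in every column. If only the account numbers should be masked, as the question's first sentence suggests, the same idea can be restricted to one column. The following is a minimal sketch, assuming the DataFrame is built by hand from the sample rows in the question; the name `masked` is only illustrative and does not come from the original post.

```python
import pandas as pd

# Sample rows copied from the question (built by hand here as an assumption;
# the original post does not show how df was loaded).
df = pd.DataFrame({
    "Account_Num": [10605, 2425],
    "Facility Name": ["SAGE MEMORIAL HOSPITAL", "WOODRIDGE BEHAVIORAL CENTER"],
    "Address": ["STATE ROUTE 264 SOUTH 191", "600 NORTH 7TH STREET"],
    "City": ["GANADO", "XDSDSD"],
})

# Work on a copy so the original data stays untouched.
masked = df.copy()

# Convert the numbers to strings, then replace each digit with one "N",
# so the masked value keeps the length of the original account number.
masked["Account_Num"] = (
    masked["Account_Num"].astype(str).str.replace(r"\d", "N", regex=True)
)

print(masked["Account_Num"].tolist())  # ['NNNNN', 'NNNN']
```

The pattern is `r"\d"` rather than `r"\d+"` on purpose: matching one digit at a time yields one N per digit, whereas `\d+` would collapse the whole number into a single N.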
Please credit the source when reposting. Link to this article: https://www.uj5u.com/qita/448669.html
