I have a DataFrame like this:
import pandas as pd
df = pd.DataFrame({'text':['Lary Page is visiting today',' His boss, Maria Jackson is here.']})
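I have already extracted the names into the list below. I then used the faker library to generate as many fake names as there are entries in the person_name list, and built a dictionary that maps each real name to a fake one.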
from faker import Faker
fake = Faker()
person_name = ['Lary Page', 'Maria Jackson']
fake_name = [fake.name() for _ in person_name]
name_dict = dict(zip(person_name, fake_name))
Now I want to use the dictionary to replace the names in the DataFrame, but the following call raises an error.
df.text.str.replace(name_dict)
Desired output (for example):
print(df)
Angela Mindeston is visiting today
His boss, Emanuel Smith is here.
Answer from a uj5u.com user:
Use Series.str.replace with a lambda callback, or use Series.replace:
regex = '|'.join(r"\b{}\b".format(x) for x in name_dict.keys())
df['text1'] = df.text.str.replace(regex, lambda x: name_dict[x.group()], regex=True)
df['text2'] = df.text.replace(name_dict, regex=True)
print (df)
                                text                                text1  \
0        Lary Page is visiting today           Gary Cox is visiting today
1   His boss, Maria Jackson is here.  His boss, Mr. George Jones is here.

                                 text2
0           Gary Cox is visiting today
1  His boss, Mr. George Jones is here.
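The callback approach above can be made a little more defensive. The following is only a sketch, not part of the original answer: it assumes a fixed name_dict (standing in for the random Faker output so the result is reproducible), escapes each name with re.escape in case a name contains regex metacharacters, and tries longer names first so that a short name cannot pre-empt a longer one that starts with it.

```python
import re
import pandas as pd

df = pd.DataFrame({'text': ['Lary Page is visiting today',
                            ' His boss, Maria Jackson is here.']})

# Assumption: fixed replacements stand in for the Faker-generated names.
name_dict = {'Lary Page': 'Angela Mindeston',
             'Maria Jackson': 'Emanuel Smith'}

# Build one alternation pattern. Names are escaped and sorted longest first.
names = sorted(name_dict, key=len, reverse=True)
pattern = '|'.join(r'\b{}\b'.format(re.escape(name)) for name in names)

# The callback receives a match object and looks up the replacement.
df['text'] = df['text'].str.replace(
    pattern, lambda m: name_dict[m.group()], regex=True)

print(df['text'].tolist())
```

Assigning the result back to df['text'] matters here, because str.replace returns a new Series and leaves the original column unchanged.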
