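I want to iterate over DataFrame columns and turn them into a processed Series.
Here is an example DataFrame with two rows and four columns: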
import pandas as pd
from rapidfuzz import process as process_rapid
from rapidfuzz import utils as rapid_utils
data = [['r/o ac. nephritis. /. nephrotic syndrome', ' ac. nephritis. /. nephrotic syndrome',1,'ac nephritis nephrotic syndrome'], [ 'sternocleidomastoid contracture','sternocleidomastoid contracture',0,"NA"]]
# Create the pandas DataFrame
df_diagnosis = pd.DataFrame(data, columns = ['diagnosis_name', 'diagnosis_name_edited','is_spell_corrected','spell_corrected_value'])
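I want to use the spell_corrected_value column when the is_spell_corrected column is greater than 1; otherwise, use diagnosis_name_edited.
At the moment, the code below uses the diagnosis_name_edited column directly. How can I add an if-else/lambda check on the is_spell_corrected column?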
# Lazily preprocess each diagnosis string (generator)
unmapped_diag_series = (rapid_utils.default_process(d) for d in df_diagnosis['diagnosis_name_edited'].astype(str))
# Materialize the generator as a Series
unmapped_processed_diagnosis = pd.Series(unmapped_diag_series)
Thank you.
A uj5u.com user replied:
If I understood the question correctly, try this quick solution using numpy.where:
import numpy as np
df_diagnosis['new_column'] = np.where(df_diagnosis['is_spell_corrected'] > 1, df_diagnosis['spell_corrected_value'], df_diagnosis['diagnosis_name_edited'])
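To make the selection concrete, here is a minimal sketch that rebuilds the sample DataFrame and picks one value per row. It rests on an assumption that is not stated in the thread: is_spell_corrected is treated as a 0/1 flag, so the condition is `== 1`. With the `> 1` test as written in the question, neither sample row would select spell_corrected_value. The names `use_corrected`, `chosen` and `chosen_rowwise` are illustrative only.

```python
import numpy as np
import pandas as pd

data = [
    ['r/o ac. nephritis. /. nephrotic syndrome', ' ac. nephritis. /. nephrotic syndrome',
     1, 'ac nephritis nephrotic syndrome'],
    ['sternocleidomastoid contracture', 'sternocleidomastoid contracture', 0, 'NA'],
]
df_diagnosis = pd.DataFrame(
    data,
    columns=['diagnosis_name', 'diagnosis_name_edited',
             'is_spell_corrected', 'spell_corrected_value'],
)

# Assumption: is_spell_corrected is a 0/1 flag, so we test == 1 rather than > 1.
use_corrected = df_diagnosis['is_spell_corrected'] == 1

# Vectorized choice: corrected value where the flag is set, edited name otherwise.
chosen = pd.Series(
    np.where(use_corrected,
             df_diagnosis['spell_corrected_value'],
             df_diagnosis['diagnosis_name_edited']),
    index=df_diagnosis.index,
)

# Equivalent row-wise if-else, closer in shape to the generator in the question.
chosen_rowwise = pd.Series([
    corrected if flag == 1 else edited
    for flag, corrected, edited in zip(
        df_diagnosis['is_spell_corrected'],
        df_diagnosis['spell_corrected_value'],
        df_diagnosis['diagnosis_name_edited'],
    )
])

print(chosen.tolist())
```

Either series can then be passed through the preprocessing step from the question, for example `chosen.astype(str).map(rapid_utils.default_process)`, in place of iterating over diagnosis_name_edited directly. The vectorized form is usually the better fit for large frames; the row-wise form is easier to extend if the condition grows beyond a single flag.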
