Reading and appending Excel files to build a DataFrame:
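import os
import pandas as pd

folder = r'C:\mypathtodocuments'
files = os.listdir(folder)
frames = []
for file in files:
    if file.endswith('.xlsx'):
        frames.append(pd.read_excel(os.path.join(folder, file)))
# DataFrame.append was removed in pandas 2.0, so concatenate once instead
df = pd.concat(frames)
# Drop extra columns from wrong data; .copy() so later assignments do not hit a view of df
df1 = df[['FIRST_NM', 'LAST_NM', 'CITY_AD']].copy()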
Preview of the CITY_AD column:
>>> df1["CITY_AD"]
0 EL PASO
1 HOUSTON
2 HOUSTON
3 CONROE
4 MCKINNEY
5 MCKINNEY
6 KATY
7 TOMBALL
8 TOMBALL
9 SPRING
10 SPRING
Using .isin() to filter the DataFrame down to only the cities HOUSTON and CONROE:
df1[df1["CITY_AD"].isin(["HOUSTON","CONROE"])]
This returns an empty result... how can I get it to filter correctly?
Reply from a uj5u.com user:
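Try this. When .isin() returns nothing for values that look correct in the preview, the strings most likely carry leading or trailing whitespace, so strip it before filtering: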
df1["CITY_AD"] = df1["CITY_AD"].str.strip()
df1[df1["CITY_AD"].isin(["HOUSTON","CONROE"])]
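To check whether whitespace really is the cause, here is a minimal sketch on made-up data (the sample values below are an assumption, not the asker's actual spreadsheet). It prints the raw strings so any padding becomes visible, then compares the filter before and after stripping:

```python
import pandas as pd

# Hypothetical sample data: some city names carry stray whitespace,
# as can happen with values read from Excel.
df1 = pd.DataFrame({"CITY_AD": ["EL PASO", "HOUSTON ", " HOUSTON", "CONROE  ", "KATY"]})

# tolist() shows the raw Python strings, so padding is visible in the quotes.
print(df1["CITY_AD"].tolist())

wanted = ["HOUSTON", "CONROE"]

# isin() does exact string matching, so padded values do not match.
before = df1[df1["CITY_AD"].isin(wanted)]

# Strip leading/trailing whitespace into a new frame, then filter again.
cleaned = df1.assign(CITY_AD=df1["CITY_AD"].str.strip())
after = cleaned[cleaned["CITY_AD"].isin(wanted)]

print(len(before), len(after))
```

If the printed list shows no padding in your own data, the mismatch probably lies elsewhere (for example letter case or non-breaking spaces), and .str.strip() alone would not be enough.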
