我有3個csv檔案(june.csv,july.csv, august.csv)
July.csv是這樣的:
company_name | number_of_employers | money_spend
comp1 | 20 | 1200 !
comp2 | 30 | 1000 !
August.csv是這樣的:
company_name | number_of_employers | money_spend
comp1 | 25 | 1800 !
comp3 | 50 | 2000 !
我想把這些csvs合并成一個結構,就像這樣:
| august |july
公司名稱 | 雇工人數 | 支出金額 | 雇員人數
comp1 | 25 | 1800 |20 |1200 !
comp2 | 0 | 0 |30 |1000 !
comp3 | 50 | 2000 |0 |0 !
我如何才能做到這一點?
uj5u.com熱心網友回復:
嘗試pd.read_csv與.set_index()。然后用pd.concat連接資料幀:
# adjust `sep=` parameter accordingly:
df_1 = pd.read_csv("july.csv", sep=r",", engine="python").set_index(
"company_name"。
)
df_2 = pd.read_csv("august.csv", sep=r",", engine="python") .set_index(
"company_name"。
)
df_out = pd.concat({"july": df_1, "august": df_2}, axis=1).fillna(0)
print(df_out)
列印:
july august
number_of_employers money_spend number_of_employers money_spend
公司名稱
comp1 20.0 1200.0 25.0 1800.0
Comp2 30.0 1000.0 0.0 0.0 0.0
comp3 0.0 0.0 50.0 2000.0
uj5u.com熱心網友回復:
對于marge,你可以做如下處理
July_df= pd.read_csv("july.csv"/span>)
August_df = pd.read_csv("august.csv")
merge(July_df, August_df, on='company_name'/span>, suffixes=['_july'/span>, '_august'/span>] )
不確定如何自動創建多索引
轉載請註明出處,本文鏈接:https://www.uj5u.com/ruanti/327680.html
標籤:
