我有以下格式的 csv 檔案:
| 一種 | b | C | d | e |
|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 |
| 9 | 8 | 7 | 6 | 5 |
我想將此 csv 檔案轉換為嵌套 JSON 格式,如下所示:
[{"a": 1,
"Purchase" : {
"b": 2,
"c": 3
"d": 4},
"Sales": {
"d": 4,
"e": 5}},
{"a": 9,
"Purchase" : {
"b": 8,
"c": 7},
"Sales": {
"d": 6,
"e": 5}}]
我怎樣才能進行這種轉變?我似乎無法弄清楚如何在 Python 中進行這種轉換。請記住,這只是示例表,我的真實表具有多列和數千行,因此手動操作并不經濟。
直到現在我已經嘗試過這段代碼:
with open("new_data.csv") as f:
reader = csv.DictReader(f)
for r in reader:
r["purchase"] = {"b": r['b'],
"c": r['c'],
}
在這里,我嘗試添加所需字典的另一個鍵值對,但未成功。我也會做同樣的事情,Sales但這只是示例。
uj5u.com熱心網友回復:
一種簡單的方法是添加更多列;然后to_json在熊貓中使用方法:
import pandas as pd
df = pd.read_csv('your_file.csv')
df['Purchase'] = df[['b','c','d']].to_dict('records')
df['Sales'] = df[['d','e']].to_dict('records')
out = df[['a', 'Purchase', 'Sales']].to_json(orient='records', indent=4)
輸出:
[
{
"a":1,
"Purchase":{
"b":2,
"c":3,
"d":4
},
"Sales":{
"d":4,
"e":5
}
},
{
"a":9,
"Purchase":{
"b":8,
"c":7,
"d":6
},
"Sales":{
"d":6,
"e":5
}
}
]
uj5u.com熱心網友回復:
您不需要任何庫,只需指定正確的方言,例如制表符分隔:
import csv
import json
with open("tmp4.csv", "r") as f:
result = [
{
"a": row["a"],
"Purchase": {
"b": row["b"],
"c": row["c"],
},
"Sales": {
"d": row["d"],
"e": row["e"],
},
}
for row in csv.DictReader(f, dialect='excel-tab')
]
assert (
json.dumps(result)
== '[{"a": "1", "Purchase": {"b": "2", "c": "3"}, "Sales": {"d": "4", "e": "5"}}, {"a": "9", "Purchase": {"b": "8", "c": "7"}, "Sales": {"d": "6", "e": "5"}}]'
)
uj5u.com熱心網友回復:
當您這樣做時r["purchase"] = {"b": ...},您將字典分配回每行物件r,該物件在回圈結束時被丟棄。相反,為每條記錄創建一個新字典并將其附加到串列中。喜歡:
result = []
with open("new_data.csv") as f:
reader = csv.DictReader(f)
for r in reader:
result.append({
"a": r["a"],
"Purchase" : {
"b": r["b"],
"c": r["c"],
"d": r["d"],
},
"Sales": {
"d": r["d"],
"e": r["e"],
},
})
并使用串列理解來創建result:
with open("new_data.csv") as f:
reader = csv.DictReader(f)
result = [{
"a": r["a"],
"Purchase" : {
"b": r["b"],
"c": r["c"],
"d": r["d"],
},
"Sales": {
"d": r["d"],
"e": r["e"],
},
} for r in reader]
轉載請註明出處,本文鏈接:https://www.uj5u.com/qukuanlian/448560.html
標籤:Python json python-3.x CSV
