Python lambda groupby with aggregation causes a syntax error

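Hello, I am trying to combine a lambda groupby with a nested structure so that the result comes out in the structure shown in the example below:

Target structure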

#  This already works! ########################################################
# "GM0014": {
# "i1401": {
# "score": 1.178,
# "rawScore": -1.178,
# "year": "2019",
# "id": "i1401"
# },
# "i1021": {
# "score": 1.838,
# "rawScore": -1.838,
# "year": "2020",
# "id": "i1021"
# },
# "i1022": {
# "score": 0.496,
# "rawScore": -0.496,
# "year": "2020",
# "id": "i1022"
# },
# "i1013": {
# "score": 0.415,
# "rawScore": 0.415,
# "year": "2020",
# "id": "i1013"
# },
#  This does not work! ########################################################
# "overAll": {
# "score": 0.982,
# "rawScore": -0.774

I am using the dataset below. Its values should be converted into the "target structure" shown above. I use scoreMax and rawScoreMax for the "overAll" part.

data = [  {'region': 'GM0014', 'variable': 'i1013', 'year': '2020', 'score': 0.415, 'rawScore': 0.415, 'scoreMax': 0.982, 'rawScoreMax': -0.774 }
        , {'region': 'GM0014', 'variable': 'i1021', 'year': '2020', 'score': -1.838, 'rawScore': 1.838, 'scoreMax': 0.982, 'rawScoreMax': -0.774}
        , {'region': 'GM0014', 'variable': 'i1022', 'year': '2020', 'score': -0.496, 'rawScore': 0.496, 'scoreMax': 0.982, 'rawScoreMax': -0.774}
        , {'region': 'GM0014', 'variable': 'i1401', 'year': '2019', 'score': -1.178, 'rawScore': 1.178, 'scoreMax': 0.982, 'rawScoreMax': -0.774}
        , {'region': 'GM0034', 'variable': 'i1013', 'year': '2020', 'score': -0.913, 'rawScore': -0.913, 'scoreMax': -0.071, 'rawScoreMax': -0.385 }
        , {'region': 'GM0034', 'variable': 'i1021', 'year': '2020', 'score': -0.244, 'rawScore': 0.244, 'scoreMax': -0.071, 'rawScoreMax': -0.385}
        , {'region': 'GM0034', 'variable': 'i1022', 'year': '2020', 'score': -0.332, 'rawScore': 0.332, 'scoreMax': -0.071, 'rawScoreMax': -0.385}
        , {'region': 'GM0034', 'variable': 'i1401', 'year': '2019', 'score': -0.053, 'rawScore': 0.053, 'scoreMax': -0.071, 'rawScoreMax': -0.385}
        , {'region': 'GM0037', 'variable': 'i1013', 'year': '2020', 'score': 0.487, 'rawScore': 0.487, 'scoreMax': 0.769, 'rawScoreMax': -0.526}
        , {'region': 'GM0037', 'variable': 'i1021', 'year': '2020', 'score': -2.172, 'rawScore': 2.172, 'scoreMax': 0.769, 'rawScoreMax': -0.526}
        , {'region': 'GM0037', 'variable': 'i1022', 'year': '2020', 'score': -1.654, 'rawScore': 1.654, 'scoreMax': 0.769, 'rawScoreMax': -0.526}
        , {'region': 'GM0037', 'variable': 'i1401', 'year': '2019', 'score': 1.236, 'rawScore': -1.236, 'scoreMax': 0.769, 'rawScoreMax': -0.526}
        , {'region': 'GM0047', 'variable': 'i1013', 'year': '2020', 'score': 0.885, 'rawScore': 0.885, 'scoreMax': 0.562, 'rawScoreMax': -0.12}
        , {'region': 'GM0047', 'variable': 'i1021', 'year': '2020', 'score': -2.19, 'rawScore': 2.19, 'scoreMax': 0.562, 'rawScoreMax': -0.12}
        , {'region': 'GM0047', 'variable': 'i1022', 'year': '2020', 'score': -1.542, 'rawScore': 1.542, 'scoreMax': 0.562, 'rawScoreMax': -0.12}
        , {'region': 'GM0047', 'variable': 'i1401', 'year': '2019', 'score': 2.368, 'rawScore': -2.368, 'scoreMax': 0.562, 'rawScoreMax': -0.12}]

This is the code that works, except for the "overAll" part:

import itertools

# Group by region with a lambda key, then build one nested dict per variable
test = {key : {l['variable'] : { 'score'   : l['score']
                                ,'rawscore': l['rawScore']
                                ,'year'    : l['year']
                                ,'id'      : l['variable']
                                }
               for l in lines}
        for key, lines in itertools.groupby(data, lambda p: p['region']) }

To get the "overAll" part working, I tried changing the code above to the following:

test = {key : {l['variable'] : { 'score'   : l['score']
                                ,'rawscore': l['rawScore']
                                ,'rawscore': l['rawScore']
                                ,'year'    : l['year']
                                ,'id'      : l['variable']
                                } 
                    for l in lines } 
            {'overAll': { 'score'    : l['scoreMax']
                         ,'rawscore' : l['rawScoreMax']
                        }
        for key, lines in  itertools.groupby(data, lambda p: p['region']) }}

But I get this error:

    {'overAll': { 'score'    : l['scoreMax']
    ^
SyntaxError: invalid syntax

Can you help me? Many thanks.

Answer:

I hope I have understood your question correctly:

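from itertools import groupby

out = {}
# groupby only groups adjacent items; data is already ordered by region
for k, g in groupby(data, lambda p: p["region"]):
    g = list(g)  # materialise the group so it can be indexed and iterated

    # scoreMax / rawScoreMax repeat on every row of a region, so take the first
    out[k] = {
        "overAll": {"score": g[0]["scoreMax"], "rawscore": g[0]["rawScoreMax"]}
    }
    for d in g:
        out[k][d["variable"]] = d
        # note: these deletes also modify the dicts inside data
        del out[k][d["variable"]]["scoreMax"]
        del out[k][d["variable"]]["rawScoreMax"]

print(out)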

Prints:

{
    "GM0014": {
        "overAll": {"score": 0.982, "rawscore": -0.774},
        "i1013": {
            "region": "GM0014",
            "variable": "i1013",
            "year": "2020",
            "score": 0.415,
            "rawScore": 0.415,
        },
        "i1021": {
            "region": "GM0014",
            "variable": "i1021",
            "year": "2020",
            "score": -1.838,
            "rawScore": 1.838,
        },
        "i1022": {
            "region": "GM0014",
            "variable": "i1022",
            "year": "2020",
            "score": -0.496,
            "rawScore": 0.496,
        },
        "i1401": {
            "region": "GM0014",
            "variable": "i1401",
            "year": "2019",
            "score": -1.178,
            "rawScore": 1.178,
        },
    },
    "GM0034": {
        "overAll": {"score": -0.071, "rawscore": -0.385},
        "i1013": {
            "region": "GM0034",
            "variable": "i1013",
            "year": "2020",
            "score": -0.913,
            "rawScore": -0.913,
        },
        "i1021": {
            "region": "GM0034",
            "variable": "i1021",
            "year": "2020",
            "score": -0.244,
            "rawScore": 0.244,
        },

...
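
As for the SyntaxError in the question: the second `{'overAll': ...}` sits directly after the inner comprehension with nothing joining the two, and `l` is no longer in scope at that point. If you would rather get exactly the target structure and leave `data` untouched, here is a minimal sketch that merges the per-variable dicts and the "overAll" entry with `**` unpacking. The helper name `build` and the trimmed three-row sample are only for illustration, and the sketch assumes `scoreMax` / `rawScoreMax` are the same on every row of a region. It uses the `rawScore` spelling from the target structure rather than `rawscore`.

```python
from itertools import groupby
from operator import itemgetter

# Trimmed sample taken from the question's data (illustration only).
data = [
    {'region': 'GM0014', 'variable': 'i1013', 'year': '2020', 'score': 0.415,
     'rawScore': 0.415, 'scoreMax': 0.982, 'rawScoreMax': -0.774},
    {'region': 'GM0014', 'variable': 'i1401', 'year': '2019', 'score': -1.178,
     'rawScore': 1.178, 'scoreMax': 0.982, 'rawScoreMax': -0.774},
    {'region': 'GM0034', 'variable': 'i1013', 'year': '2020', 'score': -0.913,
     'rawScore': -0.913, 'scoreMax': -0.071, 'rawScoreMax': -0.385},
]


def build(rows):
    """Hypothetical helper: nest rows by region and add an 'overAll' entry."""
    by_region = itemgetter('region')
    result = {}
    # groupby only merges adjacent rows, so sort by the same key first.
    for region, group in groupby(sorted(rows, key=by_region), key=by_region):
        group = list(group)  # the group iterator can only be consumed once
        result[region] = {
            # One new dict per variable; the input rows are not modified.
            **{
                r['variable']: {
                    'score': r['score'],
                    'rawScore': r['rawScore'],
                    'year': r['year'],
                    'id': r['variable'],
                }
                for r in group
            },
            # Assumption: the max values repeat on every row of the region,
            # so the first row is representative.
            'overAll': {
                'score': group[0]['scoreMax'],
                'rawScore': group[0]['rawScoreMax'],
            },
        }
    return result


test = build(data)
print(test['GM0014']['overAll'])
```

Sorting before grouping is optional for the dataset in the question, since it is already ordered by region, but it keeps the result correct if the rows ever arrive in a different order.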
