我目前正在嘗試創建一種演算法,幫助我從具有以下引數的網站中提取資料http://website/example/example/YYYY-MM-DD HH/1。例如,如果我想提取 2022 年 2 月 3 日下午 3 點的資料,鏈接應該是http://website/example/example/2022-03-02 15/1. 我想提取一整天的資料。(我假設為此需要某種嵌套回圈)。誰能給我一些指導。謝謝
2022 年 2 月 3 日的 json 示例:
[ {
"@id" : "12233071",
"definition" : {
"parkingRecord" : {
"@xmlns" : "http://datex2.eu/schema/2/2_0",
"@xmlns:xsi" : "http://www.w3.org/2001/XMLSchema-instance",
"parkingRecord" : {
"@xsi:type" : "UrbanParkingSite",
"@id" : "12233071",
"@version" : "2.5",
"parkingName" : {
"values" : {
"value" : {
"@lang" : "en",
"#text" : "210: Wilkinson Street Park and Ride"
}
}
},
"parkingDescription" : {
"values" : {
"value" : {
"@lang" : "en",
"#text" : "Wilkinson Street Park and Ride"
}
}
},
"parkingRecordVersionTime" : "2022-03-02T15:00:09.089Z",
"parkingNumberOfSpaces" : "600",
"parkingLocation" : {
"@xsi:type" : "Point",
"locationForDisplay" : {
"latitude" : "52.97",
"longitude" : "-1.18"
}
},
"parkingThresholds" : {
"almostFullDecreasing" : "80",
"almostFullIncreasing" : "85",
"fullDecreasing" : "90",
"fullIncreasing" : "95"
},
"urbanParkingSiteType" : "offStreetParking"
}
}
},
"status" : {
"parkingRecord" : {
"@xmlns" : "http://datex2.eu/schema/2/2_0",
"@xmlns:xsi" : "http://www.w3.org/2001/XMLSchema-instance",
"parkingRecordStatus" : {
"@xsi:type" : "ParkingSiteStatus",
"parkingRecordReference" : {
"@targetClass" : "ParkingRecord",
"@id" : "12233071",
"@version" : "2.5"
},
"parkingStatusOriginTime" : "2020-03-09T12:17:42.000Z",
"parkingOccupancy" : {
"parkingNumberOfOccupiedSpaces" : "227",
"parkingOccupancy" : "37.83",
"vehicleCountAndRate" : {
"measurementTimeDefault" : "2022-03-02T14:58:57.000Z",
"vehicleRate" : {
"measurementOrCalculationTime" : "2022-03-02T14:58:57.000Z",
"fillRate" : {
"vehicleFlowRate" : "0"
},
"exitRate" : {
"vehicleFlowRate" : "1"
}
}
}
},
"parkingSiteStatus" : "other",
"parkingSiteOpeningStatus" : "closed"
}
}
}
}]
uj5u.com熱心網友回復:
這應該可以回答您有關構建所需 URL 的問題:
import datetime
prefix= "http://website/example/example/"
start = datetime.datetime(2022, 1, 1)
delta = datetime.timedelta(7)
end = start delta
hourdelta = datetime.timedelta(hours=1)
urls = []
while start < end:
urls.append(prefix start.strftime("%Y-%m-%d% %H/1"))
start = hourdelta
[print(urls[i]) for i in range(4)]
print()
[print(urls[len(urls) - 4 i]) for i in range(4)]
示例輸出(前 4 個 URL 和后 4 個 URL):
http://website/example/example/2022-01-01%2000/1
http://website/example/example/2022-01-01%2001/1
http://website/example/example/2022-01-01%2002/1
http://website/example/example/2022-01-01%2003/1
http://website/example/example/2022-01-07%2020/1
http://website/example/example/2022-01-07%2021/1
http://website/example/example/2022-01-07%2022/1
http://website/example/example/2022-01-07%2023/1
要遍歷這些 URL,以 json 格式提取資料并將其存盤在 csv 檔案中,我建議查看此.
轉載請註明出處,本文鏈接:https://www.uj5u.com/ruanti/441877.html
