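Hi all, I'm reading a CSV file in chunks with pandas, but I've found that a chunk never contains its first record. That first record seems to be taken as the chunk's _info_axis (its column labels) instead.
Is there any way to make the chunk include the first record as data?
Here is the code: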
import csv

import pandas as pd

with open('OD2011_ForCluster.csv', 'w', newline='') as csv_file:
    writer = csv.writer(csv_file)
    reader = pd.read_csv(r'OD2011_ALL.csv', iterator=True, encoding='GBK', skiprows=None)
    loop = True
    chunkSize = 8000000  # 4000000
    while loop:
        try:
            # Slot 0 holds the value from column 1; the other 168 slots are counters.
            CalculateNum = [''] + [0] * 168
            chunk = reader.get_chunk(chunkSize)
            for i in range(0, len(chunk) - 2):
                if chunk.iat[i, 1] == chunk.iat[i + 1, 1]:
                    # Integer arithmetic, so that KeyHour is a valid list index.
                    KeyHour = int(chunk.iat[i, 6] - 19) * 24 + int(chunk.iat[i, 8] // 3600)
                    CalculateNum[KeyHour] = CalculateNum[KeyHour] + 1
                else:
                    CalculateNum[0] = chunk.iat[i, 1]
                    writer.writerow(CalculateNum)
                    # Start a fresh row of counters for the next value in column 1.
                    CalculateNum = [''] + [0] * 168
        except StopIteration:
            loop = False
            print("Iteration is stopped.")
print('!!!!!!')
From the debugger:
chunk: 1 020089614 購物公園站 9.0 車公廟站 11.0 22 502 75997
0 2 20089614 車公廟站 11.0 購物公園站 9.0 22 601 78103
1 3 20089627 通新嶺站 62.0 國貿站 2.0 21 621 55352
2 4 20089659 湖貝站
Clearly the chunk is treating the first record as the column names. How can I fix this?
Reply from a uj5u.com user:
Isn't there a header=None parameter?
Reply from a uj5u.com user:
Try setting header=None in the read_csv call.
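A minimal sketch of what the replies suggest, assuming the input file has no header line. By default pd.read_csv uses the first line as column names, so that line never shows up as data; header=None makes pandas number the columns 0, 1, 2, ... and keep every line as a row. SAMPLE and read_chunks are hypothetical names made up for this illustration, and the station names are replaced by single letters so the snippet runs without the original file. It iterates with chunksize rather than calling get_chunk by hand, which yields the same chunks.

```python
import io

import pandas as pd

# Hypothetical stand-in for OD2011_ALL.csv: three data lines, no header line.
SAMPLE = (
    "1,20089614,A,9.0,B,11.0,22,502,75997\n"
    "2,20089614,B,11.0,A,9.0,22,601,78103\n"
    "3,20089627,C,62.0,D,2.0,21,621,55352\n"
)


def read_chunks(text, chunk_size, **kwargs):
    """Read CSV text in chunks and return the chunks as a list of DataFrames."""
    reader = pd.read_csv(io.StringIO(text), chunksize=chunk_size, **kwargs)
    return list(reader)


# Default behaviour: the first line becomes the column labels.
default_chunks = read_chunks(SAMPLE, 2)
default_rows = sum(len(c) for c in default_chunks)

# With header=None every line is kept as data.
fixed_chunks = read_chunks(SAMPLE, 2, header=None)
fixed_rows = sum(len(c) for c in fixed_chunks)
first_id = int(fixed_chunks[0].iat[0, 1])

print(default_rows, fixed_rows, first_id)
```

With header=None the positional access used in the question (chunk.iat[i, 1] and so on) still works, because iat indexes by position rather than by column label.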
