我已經列印了從檔案目錄復制的檔案。我想要的是同時列印重復檔案和從中復制它的相應真實檔案。下面是我的代碼。
path = "Z:/PMT_Training/SoftCo/d_i"
def duplicatecheck():
DATA_DIR = Path(path)
files = sorted(DATA_DIR.glob('*.xml'))
invoice_number = {}
duplicateFiles = []
for i in range(0,len(files)):
tree = ET.parse(files[i])
root = tree.getroot()
record = root.findall('record')
for item in record:
invoice = item.find('invoice_number').text
if invoice in invoice_number:
duplicateFiles.append(files[i])
print("Duplicate file found: ", files[i])
break
else:
invoice_number[invoice] = files[i]
duplicatecheck()
以下是我的輸出:
Duplicate file found: file (1).xml
Duplicate file found: file (2).xml
Duplicate file found: file (3).xml
我要列印的是重復檔案和從中發現它是重復檔案的相應檔案
如下所示:
Duplicate file found: file (1).xml, file (a).xml
Duplicate file found: file (2).xml, file (a).xml
Duplicate file found: file (3).xml, file (a).xml
我的意思是,如果發現一個檔案重復,我想列印這兩個檔案
uj5u.com熱心網友回復:
if invoice in invoice_number確保您的字典存盤了該專案,因此在內部它看起來像這樣:
{
'my_invoice_number': 'file.xml',
'my_other_invoice_number': 'file2.xml',
}
所以你需要做的就是列印它:
print("Duplicate file found: ", files[i], invoice_number[invoice])
轉載請註明出處,本文鏈接:https://www.uj5u.com/caozuo/536335.html
標籤:Python文件重复
上一篇:如何按字數對檔案中的行進行排序?
