我在 URL 的文本檔案中有一個串列,其中包含一些不需要的文本:
檔案內容 URL 串列:
http://www.example.com/52 (Status: 403) [Size: 919]
http://www.example.com/details (Status: 403) [Size: 919]
http://www.example.com/h (Status: 403) [Size: 919]
http://www.example.com/affiliate (Status: 403) [Size: 919]
http://www.example.com/56 (Status: 403) [Size: 919]
我使用的正則運算式是: "^[://.a-zA-Z0-9-_]*"
輸出如下:
['http://www.example.com/52']
['http://www.example.com/details']
['http://www.example.com/h']
['http://www.example.com/affiliate']
['http://www.example.com/56']
我需要輸出如下所示:
http://www.example.com/52
http://www.example.com/details
http://www.example.com/h
http://www.example.com/affiliate
http://www.example.com/56
用于此程式的代碼如下:
import re
with open("test.txt","r") as test:
for i in test:
x = re.findall("^[://.a-zA-Z0-9-_]*",i)
print(x)
uj5u.com熱心網友回復:
findall生成一個字串串列,您可以列印出結果中的第一個元素,也可以print(x[0])僅用match于此用例,因為每行有 1 個 url。
with open("test.txt","r") as test:
for i in test:
x = re.match(r"[://.a-zA-Z0-9-_]*", i)
print(x.group(0))
轉載請註明出處,本文鏈接:https://www.uj5u.com/qiye/364257.html
下一篇:顫動比較檢查兩個網址是否相似
