我試圖從網站上獲取有關其營業時間的資訊,但結果非常令人失望。
import requests
from bs4 import BeautifulSoup
import xlsxwriter
i = "90460"
URL = "https://www.tuodi.it/negozi-dettaglio.cfm?negozio=%s" % i
page = requests.get(URL)
soup = BeautifulSoup(page.content, "html.parser")
results = soup.find(id="orario" , style="width:50%;float:left")
orari = results.find_all("div", class_="tab", style="width:220px;line-height: 25px")
print(orari)
我的輸出如下所示
[<div style="width:220px;line-height: 25px">
8,30
- 20,00
<br/>
8,30
- 20,00
<br/>...
但我寧愿有一個可以匯出到excel形式的結果
Excel結果
提前致謝!
uj5u.com熱心網友回復:
要獲得您的結果,您可以使用.stripped_strings和list comprehension:
[''.join(x.split()) for x in orari[0].stripped_strings]
這將為您提供一個串列,您可以將其寫入檔案:
['8,30-20,00', '8,30-20,00', '8,30-20,00', '8,30-20,00', '8,30-20,00', '8,30-20,00', '8,00-13,00']
例子
import requests
from bs4 import BeautifulSoup
import pandas as pd
i = "90460"
URL = "https://www.tuodi.it/negozi-dettaglio.cfm?negozio=%s" % i
page = requests.get(URL)
soup = BeautifulSoup(page.content, "html.parser")
results = soup.find(id="orario" , style="width:50%;float:left")
orari = results.find_all("div", class_="tab", style="width:220px;line-height: 25px")
data = [''.join(x.split()) for x in orari[0].stripped_strings]
pd.DataFrame([data]).to_excel('test.xslx', index=False)
轉載請註明出處,本文鏈接:https://www.uj5u.com/houduan/347507.html
