我正在嘗試從http://www.comuni-italiani.it/110/index.html獲取來自城市的電子郵件
我有使用 xPath Finder 的特定子方向,即/html/body/span[3]/table[2]/tbody/tr[1]/td[2]/table/tbody/tr[11]/td/b/a. 現在我正在嘗試從此頁面檢索電子郵件,但我對圖書館知之甚少BeatifulSoup(我才剛剛開始)。在閱讀了幾個指南之后,我設法撰寫了以下代碼,但我沒有成功正確地指示子路由
from bs4 import BeautifulSoup
import requests
# sample web page
sample_web_page = 'http://www.comuni-italiani.it/110/index.html'
# call get method to request that page
page = requests.get(sample_web_page)
# with the help of beautifulSoup and html parser create soup
soup = BeautifulSoup(page.content, "html.parser")
child_soup = soup.find('span')
for i in child_soup.children:
print("child : ", i)
我究竟做錯了什么??
uj5u.com熱心網友回復:
請在下面找到我解決您問題的嘗試。它以與您的代碼相同的方式開始,只是有一點魔力可以找到電子郵件并將其列印出來。
from bs4 import BeautifulSoup
import requests
sample_web_page = 'http://www.comuni-italiani.it/110/index.html'
page = requests.get(sample_web_page)
soup = BeautifulSoup(page.content, "html.parser")
email = soup.select_one('b > a[href^="mail"]')['href']
print(email.split(':')[1])
轉載請註明出處,本文鏈接:https://www.uj5u.com/houduan/473584.html
上一篇:如何鏈接內部文本檔案交叉參考?
