我無法使用“get”函式讀取從 Excel 檔案“tennis3.xlsx”中的單元格 A1 獲取的 URL。我嘗試了不同的解決方案,但我不知道如何讓它閱讀并使用它來獲得網頁回應。問題可能從'sheet["A1"].value'開始。
我已經通過 Visual Studio 應用了這個程式,它使用的是 chrome 瀏覽器。單元格 A1 中的 URL 是https://www.betexplorer.com/tennis/atp-singles/paris/evans-daniel-nakashima-brandon/WAqNf5ao/。
編輯:我遇到的實際問題是我忘記包含保存功能。
import requests
from bs4 import BeautifulSoup
from openpyxl import load_workbook
workbook = load_workbook(filename="tennis3.xlsx")
sheet = workbook.active
urlcell = sheet["A1"].value
response = requests.get(urlcell)
webpage = response.content
soup = BeautifulSoup(webpage, "html.parser")
sheet["B1"] = soup.select('h1 a')[0].text.replace(' ','_')
uj5u.com熱心網友回復:
您需要保存所做的更改:
import requests
from bs4 import BeautifulSoup
from openpyxl import load_workbook
filename = r"tennis3.xlsx"
workbook = load_workbook(filename=filename)
sheet= workbook['Sheet1']
urlcell = sheet["A1"].value
print(urlcell)
response = requests.get(urlcell)
webpage = response.content
soup = BeautifulSoup(webpage, "html.parser")
sheet["B1"] = soup.select('h1 a')[0].text.replace(' ','_')
workbook.save(filename=filename)
轉載請註明出處,本文鏈接:https://www.uj5u.com/gongcheng/527419.html
標籤:Python网页抓取
上一篇:/form處的ValueError視圖authentication.views.my_form未回傳HttpResponse物件。它回傳None而不是
