I am trying to scrape this URL:
Answer from a uj5u.com user:
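To extract the desired text, e.g. 64.1 mm, 5x114.3, etc., note that these are Google Tag Manager enabled elements, so you need to wait for them explicitly: induce WebDriverWait for visibility_of_element_located(). You can use the following locator strategy: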
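# These imports are needed in addition to the ones listed in the note below.
from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.chrome.service import Service

# Start Chrome maximized and switch off the usual automation indicators.
options = Options()
options.add_argument("start-maximized")
options.add_experimental_option("excludeSwitches", ["enable-automation"])
options.add_experimental_option('useAutomationExtension', False)
options.add_argument('--disable-blink-features=AutomationControlled')
# Path to the local chromedriver binary; adjust it for your machine.
s = Service('C:\\BrowserDrivers\\chromedriver.exe')
driver = webdriver.Chrome(service=s, options=options)
driver.get('https://www.wheel-size.com/size/acura/mdx/2001/')
# Wait up to 20 seconds for the span that follows each label to become visible, then print its text.
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//span[contains(., 'Center Bore')]//following::span[1]"))).text)
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//span[contains(., 'PCD')]//following::span[1]"))).text)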
Console output:
64.1 mm
5x114.3
Note: You have to add the following imports:
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
You can find a relevant discussion in "How to retrieve the text of a WebElement using Selenium - Python".
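Both lookups in the answer follow the same pattern: find the span containing a label, then read the first span after it. The sketch below factors that pattern into a helper so more labels can be read with one wait object. The names xpath_literal, value_xpath and read_specs are hypothetical, not part of Selenium, and the XPath simply mirrors the one used in the answer; it assumes every value on the page sits in the span following its label, which the source only shows for 'Center Bore' and 'PCD'.

```python
def xpath_literal(text: str) -> str:
    """Quote text as an XPath 1.0 string literal (hypothetical helper)."""
    if "'" not in text:
        return f"'{text}'"
    if '"' not in text:
        return f'"{text}"'
    # XPath 1.0 has no escape character, so split on apostrophes and
    # rejoin the single-quoted pieces with a double-quoted apostrophe.
    parts = text.split("'")
    return "concat(" + ", \"'\", ".join(f"'{p}'" for p in parts) + ")"


def value_xpath(label: str) -> str:
    """Build the same XPath shape as the answer for an arbitrary label."""
    return f"//span[contains(., {xpath_literal(label)})]//following::span[1]"


def read_specs(driver, labels, timeout=20):
    """Return {label: visible text of the span that follows it}.

    Raises selenium's TimeoutException if a value never becomes visible.
    """
    # Imported here so the XPath helpers above work without Selenium installed.
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support import expected_conditions as EC
    from selenium.webdriver.support.ui import WebDriverWait

    wait = WebDriverWait(driver, timeout)  # one wait object reused for all labels
    return {
        label: wait.until(
            EC.visibility_of_element_located((By.XPATH, value_xpath(label)))
        ).text
        for label in labels
    }
```

With the driver created in the answer, read_specs(driver, ["Center Bore", "PCD"]) should return a dict keyed by label. Calling driver.quit() afterwards closes the browser.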
Tags: javascript python ajax selenium web-scraping
