在此頁面每一行的末尾,都有一個包含 URL 的“查看海報”鏈接。
我在我的代碼中拉入的第一個,拉得很好,作為“你”
我不知道如何拉查看海報 url。
rom selenium import webdriver
import time
import pandas as pd
driver = webdriver.Chrome()
import requests
from bs4 import BeautifulSoup
val=[]
absinfo=[]
sesinfo=[]
url = 'https://meetings.asco.org/meetings/2022-gastrointestinal-cancers-symposium/286/program-guide/search?q=&filters={"sessionType":[{"key":"Poster Session"}]}'
res=requests.get(url)
soup=BeautifulSoup(res.content,'html.parser')
driver.get(url)
time.sleep(4)
productlist =driver.find_elements_by_xpath(".//div[@class='session-card']")
#times = soup.select('.time')
for b in productlist:
ur=b.find_element_by_css_selector('a').get_attribute('href')
uj5u.com熱心網友回復:
如果您想使用,請selenium嘗試使用以下方法xpath來識別href產品串列下的兩個鏈接。
driver.get("https://meetings.asco.org/meetings/2022-gastrointestinal-cancers-symposium/286/program-guide/search?q=&filters={"sessionType":[{"key":"Poster Session"}]}")
productlist =driver.find_elements_by_xpath(".//div[@class='session-card']")
for item in productlist:
print("Url 1 :" item.find_element_by_xpath(".//span[@data-cy='sessionTitle']//a").get_attribute('href'))
print("View Poster :" item.find_element_by_xpath(".//a[.//span[text()='View Posters']]").get_attribute('href'))
輸出:
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14170
View Poster :https://meetings.asco.org/session/14170
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14145
View Poster :https://meetings.asco.org/session/14145
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14169?presentation=205955
View Poster :https://meetings.asco.org/session/14169
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14168
View Poster :https://meetings.asco.org/session/14168
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14450
View Poster :https://meetings.asco.org/session/14450
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14163
View Poster :https://meetings.asco.org/session/14163
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14449
View Poster :https://meetings.asco.org/session/14449
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14451
View Poster :https://meetings.asco.org/session/14451
Url 1 :https://meetings.asco.org/2022-asco-gastrointestinal-cancers-symposium/14166
View Poster :https://meetings.asco.org/session/14166
轉載請註明出處,本文鏈接:https://www.uj5u.com/caozuo/382744.html
上一篇:從容器中抓取資料
