Help needed, experts: why do I only get one result?


uj5u.com helpful user replied:
Did you put the print outside the loop?
uj5u.com helpful user replied:
Also, here is a simpler approach:
from simplified_scrapy.simplified_doc import SimplifiedDoc
from simplified_scrapy.request import req
html = req.get('https://news.sogou.com/news?query=%BA%AB%B9%FA%D7%D4%C9%B1%C2%CA%BE%D3%C7%B0%C1%D0&mode=1&w=01025001&oq=&ri=0&sourceid=sugg&sut=0&sst0=1574739190312&lkt=0%2C0%2C0&p=40240100')
doc = SimplifiedDoc()
lst = doc.getElementsByClass('vrwrap',html)
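for item in lst:
    # First link inside this result block: URL and headline text
    a = doc.getElementByTag('a', item.innerHtml)
    print(a.href, a.text)
    # The 'news-from' element may be missing, so check before using it
    info = doc.getElementByClass('news-from', item.innerHtml)
    if info:
        info = info.text.split()
        print(info)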
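
On the original symptom: a print that sits after the loop instead of inside it runs only once, so it shows just the last item. The asker's code is not shown in this thread, so the following is only a minimal sketch with made-up data; the `items` list and both function names are hypothetical.

```python
# Hypothetical stand-in for the scraped results; not the asker's real data.
items = ["a", "b", "c"]


def report_outside(items):
    """Mimics a print placed after the loop body."""
    lines = []
    for item in items:
        line = f"found: {item}"
    lines.append(line)  # dedented: runs once, after the loop has finished
    return lines


def report_inside(items):
    """Mimics a print placed inside the loop body."""
    lines = []
    for item in items:
        line = f"found: {item}"
        lines.append(line)  # indented: runs once per item
    return lines


print(report_outside(items))  # ['found: c']
print(report_inside(items))   # ['found: a', 'found: b', 'found: c']
```

The only difference between the two functions is one level of indentation on the line that records the output.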
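uj5u.com helpful user replied:
Go easy on regular expressions for web pages; the slightest change in the markup forces you to rewrite them.
uj5u.com helpful user replied:
Thanks
uj5u.com helpful user replied:
I don't really follow; I was just learning from a book.
uj5u.com helpful user replied:
Regular expressions are still quite good, and more intuitive than XPath. Used well, they let you write methods that are more general than XPath ones.
uj5u.com helpful user replied:
Add one level of indentation before the print so that it sits inside the loop.

Please credit the source when reposting. Link to this post: https://www.uj5u.com/qita/122774.html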
