
uj5u.com熱心網友回復:
xpath好像就能操作到這里了,后續用串列的字符的一些操作提取就好了吧uj5u.com熱心網友回復:
麻煩網址貼出來uj5u.com熱心網友回復:
爬的這個網址https://jn.lianjia.com/zufang/
uj5u.com熱心網友回復:
給你一個非xpath的例子
from simplified_scrapy.simplified_doc import SimplifiedDoc
from simplified_scrapy.request import req
html = req.get('https://jn.lianjia.com/zufang/')
doc = SimplifiedDoc()
divs = doc.getElementsByClass('content__list--item--main',html)
lst = []
for div in divs:
item = {}
ps = doc.getChildren(div.innerHtml)
item['title']=ps[0].text
item['mianji']=ps[1].text.split('/')[1]
item['brand']=ps[2].text
item['time']=ps[3].text
item['other']=ps[4].text
if len(ps)>5:
item['price']=ps[5].text
lst.append(item)
print (item)
轉載請註明出處,本文鏈接:https://www.uj5u.com/qita/122801.html
