前言

本文的文字及圖片來源于網路,僅供學習、交流使用,不具有任何商業用途,如有問題請及時聯系我們以作處理，

Python爬蟲、資料分析、網站開發等案例教程視頻免費在線觀看

https://space.bilibili.com/523606542

前文內容

Python爬蟲新手入門教學（一）：爬取豆瓣電影排行資訊

Python爬蟲新手入門教學（二）：爬取小說

Python爬蟲新手入門教學（三）：爬取鏈家二手房資料

Python爬蟲新手入門教學（四）：爬取前程無憂招聘資訊

Python爬蟲新手入門教學（五）：爬取B站視頻彈幕

Python爬蟲新手入門教學（六）：制作詞云圖

Python爬蟲新手入門教學（七）：爬取騰訊視頻彈幕

Python爬蟲新手入門教學（八）：爬取論壇文章保存成PDF

Python爬蟲新手入門教學（九）：多執行緒爬蟲案例講解

Python爬蟲新手入門教學（十）：爬取彼岸4K超清壁紙

Python爬蟲新手入門教學（十一）：最近王者榮耀皮膚爬取

Python爬蟲新手入門教學（十二）：英雄聯盟最新皮膚爬取

基本開發環境

Python 3.6
Pycharm

一、明確需求

如圖所示爬取里面的高清壁紙

二、網頁資料分析

點擊下載原圖，會自動給你下載壁紙圖片，

所以只需要獲取這個鏈接就可以了爬取壁紙圖片了，

回傳串列的可以發現，網頁是瀑布流加載方式，當你往下滑才會有資料出現，所以可以在下滑網頁的前，先打開開發者工具，當下滑網頁的時候新加載出來的資料會出現，

通過對比可以知道，這個資料包中包含了，壁紙圖片下載的地址，

需要注意的就是這個資料鏈接是post請求，并不是get請求

需要提交的data引數，就是對應的頁碼，

三、代碼實作

1、獲取圖片ID

    for page in range(1, 11):
        url = 'https://wallpaper.wispx.cn/cat/%E5%8A%A8%E6%BC%AB'
        headers = {
            'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36',
            'x-requested-with': 'XMLHttpRequest',
        }
        data = {
            'page': page
        }
        response = requests.post(url=url, headers=headers)
        result = re.findall('detail(.*?)target=', response.text)
        for index in result:
            image_id = index.replace('\\', '').replace('" ', '')
            page_url = f'https://wallpaper.wispx.cn/detail{image_id}'

2、獲取壁紙url地址，并保存

def main(page_url):
    html_data = get_response(page_url).text
    image_url = re.findall('<a  href="https://www.cnblogs.com/hhh188764/archive/2021/02/03/(.*?)">', html_data)[0]
    image_title = re.findall('<title>(.*?)</title>', html_data)[0].split(' - ')[0]
    image_content = get_response(image_url).content
    path = 'images\\'
    if not os.path.exists(path):
        os.makedirs(path)
    with open(path + image_title + '.jpg', mode='wb') as f:
        f.write(image_content)
        print('正在保存：', image_title)

需要注意的點：

請求頭里面要防盜鏈，不然就下載不了，

def get_response(html_url):
    header = {
        'referer': 'https://wallpaper.wispx.cn/detail/1206',
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.138 Safari/537.36'
    }
    resp = requests.get(url=html_url, headers=header)
    return resp

四、實作效果

轉載請註明出處，本文鏈接：https://www.uj5u.com/houduan/256203.html

標籤：其他

上一篇：C/C++編程筆記：陣列和字串丨多維陣列詳解

下一篇：陌陌面試官：說說Spring AOP 的原理、SpringMVC 的處理程序？

Python爬蟲新手入門教學（十三）：爬取高質量超清壁紙