需要幫助設定從文本檔案進行決議。Python3-有解無憂

這是一個示例代碼。您需要使用 txt 從請求中獲取 url

import requests
from bs4 import BeautifulSoup

headers = {
"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15;rv:84.0) Gecko/20100101 Firefox/84.0",}

page = requests.get('https://duckduckgo.com/html/?q=test', headers=headers).text
soup = BeautifulSoup(page, 'html.parser').find_all("a", class_="result__url", href=True)

for link in soup:
print(link['href'])

uj5u.com熱心網友回復：

您可以使用f 字串

search_text = "foo"
page = requests.get(f'https://duckduckgo.com/html/?q={search_text}', headers=headers).text

uj5u.com熱心網友回復：

import requests, argparse, ScrapeSearchEngine, time, threading
from bs4 import BeautifulSoup

headers = {
    "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:84.0) Gecko/20100101 Firefox/84.0",
}
parser = argparse.ArgumentParser()
parser.add_argument("-d", "--dorks", help="Your dorks list", required=True)
args = parser.parse_args()
with open(args.dorks, 'r') as f:
    dorks = [line.strip('\n') for line in f]

scraped = 0
for dork in dorks:
    if os.name == "nt":
        os.system('title SQLI Crawler ^| Dork: ' str(dork) ' ^| Scraped Links: ' str(scraped))
search = (dork)
page = requests.get(f'https://duckduckgo.com/html/?q={search_text}', headers=headers).text
soup = BeautifulSoup(page, 'html.parser').find_all("a", class_="result__url", href=True)

for link in soup:
    print(link['href'])

uj5u.com熱心網友回復：

添加了新的更改。我無法在搜索中添加頁數并保存到 links.txt

import os, requests, argparse, colorama, ScrapeSearchEngine, time, threading
from bs4 import BeautifulSoup

headers = {
    "User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:84.0) Gecko/20100101 Firefox/84.0",
}
parser = argparse.ArgumentParser()
parser.add_argument("-d", "--dorks", help="Your dorks list", required=True)
args = parser.parse_args()
with open(args.dorks, 'r') as f:
    dorks = [line.strip('\n') for line in f]

scraped = 0
for dork in dorks:
    if os.name == "nt":
        os.system('title SQLI Crawler ^| Dork: ' str(dork) ' ^| Scraped Links: ' str(scraped))
search = (dork)
page = requests.get(f'https://duckduckgo.com/html/?q={search}', headers=headers).text
soup = BeautifulSoup(page, 'html.parser').find_all("a", class_="result__url", href=True)
for link in ScrapedLinks:
            scraped  = 1
            open('links.txt', 'a ').write(link "\n")

            print(f"[{Fore.CYAN}{time.strftime('%H:%M:%S')}{Fore.RESET}] [{Fore.YELLOW}INFO{Fore.RESET}] " link)

            if args.scan == 'true':
                threading.Thread(target=scanner, args=(link, )).start()

轉載請註明出處，本文鏈接：https://www.uj5u.com/gongcheng/461350.html

標籤：Python python-3.x 解析

上一篇：為什么print()不受運算子優先級的影響？

下一篇：轉換語法以進行預測分析