我撰寫了一個用于決議新聞文章提要的函式。
def save_new_articles(feed, source_id, category_id):
channel_feed_title = feed.channel.title.title()
channel_feed_link = feed.channel.link
channel_feed_desc = feed.channel.description
official_source_id = source_id
post_category_id = category_id
for item in feed.entries:
parsed_summary = item.summary
soup = BeautifulSoup(parsed_summary, 'lxml')
images = soup.findAll('img')
for image in images:
image_url_link = (image['src'])
if image_url_link is not None:
image_link = image_url_link
else:
image_link = "https://www.publicdomainpictures.net/pictures/280000/velka/not-found-image-15383864787lu.jpg"
parsed_title = item.title
formatted = re.sub("<.*?>", "", parsed_title)
post_title = formatted
post_link = item.link
description = item.description
output_summary = re.sub("<.*?>", "", description)
title = item.title
capital = title.title()
tags = capital.split()
date_published = parser.parse(item.published)
if not Posts.objects.filter(guid=item.guid).exists():
post = Posts(
title = post_title,
link = post_link,
summary = output_summary,
image_url = image_link,
tags = tags,
pub_date = date_published,
guid = item.guid,
feed_title = channel_feed_title,
feed_link = channel_feed_link,
feed_description = channel_feed_desc,
source_id = official_source_id,
category_id = post_category_id
)
post.save()
else:
logger.info("Duplicate Post Detected! Skipping...")
但是在運行代碼時,我得到:
image_url = image_link,
UnboundLocalError: local variable 'image_link' referenced before assignment
我不明白錯誤來自哪里,正如我image_link在image for loop statement上面定義的那樣。我已經在 SO 上檢查過類似的答案,但我似乎沒有找到合適的答案。請幫我除錯這個。
uj5u.com熱心網友回復:
當images(您用 初始化soup.findAll('img'))為空時會發生此錯誤。
可能你不僅item.summary沒有img標簽,而且由于之前的一些錯誤,它也可能是空的。
因此要修復您的代碼,您應該image_link在此之前進行初始化:
if not Posts.objects.filter(guid=item.guid).exists():
正如我所假設的,默認值是:
image_link = "https://www.publicdomainpictures.net/pictures/280000/velka/not-found-image-15383864787lu.jpg"
例如,像這樣,而不是:
for image in images:
image_url_link = (image['src'])
if image_url_link is not None:
image_link = image_url_link
else:
image_link = "https://www.publicdomainpictures.net/pictures/280000/velka/not-found-image-15383864787lu.jpg"
你可以這樣:
image_link = "https://www.publicdomainpictures.net/pictures/280000/velka/not-found-image-15383864787lu.jpg"
for image in images:
image_url_link = (image['src'])
if image_url_link is not None:
image_link = image_url_link
轉載請註明出處,本文鏈接:https://www.uj5u.com/shujuku/395944.html
