当前位置：首页 > news >正文

做外链选择那些网站做网站网页的专业

news 2026/4/8 14:03:30

做外链选择那些网站,做网站网页的专业,安徽工程建设发展有限公司,推介网站本文是本人最近学习Python爬虫所做的小练习。如有侵权#xff0c;请联系删除。页面获取url 代码 import requests import os import re# 创建文件夹 path os.getcwd() /images if not os.path.exists(path):os.mkdir(path)# 当前页数 page 1 # 总页数 total_page 2# 自动… 本文是本人最近学习Python爬虫所做的小练习。如有侵权请联系删除。页面获取url 代码 import requests import os import re# 创建文件夹 path os.getcwd() /images if not os.path.exists(path):os.mkdir(path)# 当前页数 page 1 # 总页数 total_page 2# 自动翻页获取全部数据 def get_data():global page, total_pagewhile page total_page:# 地址url fhttps://api.bilibili.com/pgc/season/index/result?st1order3season_version-1spoken_language_type-1area-1is_finish-1copyright-1season_status-1season_month-1year-1style_id-1sort0page{page}season_type1pagesize20type1# 请求头headers {user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 SLBrowser/9.0.3.1311 SLBChan/105,}# 发送请求response requests.get(url, headersheaders)# json数据格式items response.json()# 循环遍历for data in items[data][list]:# 标题title data.get(title)# 封面cover data.get(cover)# 下载图片到images文件夹中文件名titleif title ! and cover ! :download_image(title, cover)total items.get(data)[total]size items.get(data)[size]total_page get_page_count(total, size)page 1# 下载图片 def download_image(title, cover):# 请求头headers {user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 SLBrowser/9.0.3.1311 SLBChan/105,}res requests.get(cover, headersheaders)# 判断标题是否含有\/:*?|文件命名不能含有这些如果有则用下划线_取代new_title validateTitle(title)with open(path / new_title .jpg, modewb) as f:# 图片内容写入文件f.write(res.content)print(f正在下载图片图片名{title}.jpg)# 去除文件中的非法字符(正则表达式) def validateTitle(title):pattern r[\\\/\:\*\?\\\\|]new_title re.sub(pattern, _, title)return new_title# 求页数 def get_page_count(total, page):page_count total // pageif total % page ! 0:page_count 1return page_countif __name__ __main__:get_data() 效果

查看全文

http://www.w-s-a.com/news/710424/