Python crawler: downloading images from a page and fixing the errors encountered.

阿新 • Published: 2018-12-31

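#!/usr/bin/python
# Original version, written in Python 2 style
import re
import urllib

def getHtml(url):
    # Fetch the page and return its raw contents
    page = urllib.urlopen(url)
    html = page.read()
    return html

def getImage(html):
    # Capture the .jpg URL of every src attribute that is followed by a title attribute
    reg = r'src="(.*?\.jpg)" title'
    image = re.compile(reg)
    imglist = re.findall(image, html)
    # Save the images as 0.jpg, 1.jpg, ... in the current directory
    x = 0
    for imgurl in imglist:
        urllib.urlretrieve(imgurl, '%s.jpg' % x)
        x += 1

html = getHtml("http://desk.zol.com.cn/tiyu/1920x1080/")
getImage(html)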

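Running it under Python 3 raises:

"AttributeError: 'module' object has no attribute 'urlopen'"

The cause is that the urllib module was reorganized in Python 3: urlopen and urlretrieve now live in the urllib.request submodule, so every urllib call in this script should be changed to urllib.request.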
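
If the script also has to keep working under Python 2, a guarded import is one common way to handle the move. The following is a minimal sketch and not part of the original script; it simply binds urlopen and urlretrieve from whichever location exists:

```python
try:
    # Python 3: the URL helpers live in the urllib.request submodule
    from urllib.request import urlopen, urlretrieve
except ImportError:
    # Python 2: they are attributes of the top-level urllib module
    from urllib import urlopen, urlretrieve
```

The rest of the script would then call urlopen(url) and urlretrieve(imgurl, name) directly. The listing below takes the simpler route of targeting Python 3 only.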

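#!/usr/bin/python
import re
import urllib.request

def getHtml(url):
    # Fetch the page; in Python 3, read() returns bytes
    page = urllib.request.urlopen(url)
    html = page.read()
    return html

def getImage(html):
    # Capture the .jpg URL of every src attribute that is followed by a title attribute
    reg = r'src="(.*?\.jpg)" title'
    image = re.compile(reg)
    # Decode the bytes to str so the str pattern can be applied
    html = html.decode('GBK')
    imglist = re.findall(image, html)
    # Save the images as 0.jpg, 1.jpg, ... in the current directory
    x = 0
    for imgurl in imglist:
        urllib.request.urlretrieve(imgurl, '%s.jpg' % x)
        x += 1

html = getHtml("http://desk.zol.com.cn/tiyu/1920x1080/")
getImage(html)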

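Before the html.decode line was added to the listing above, the page was read successfully but execution stopped at line 12, the re.findall call, with: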

can't use a string pattern on a bytes-like object

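After looking it up: in Python 3, page.read() returns a bytes object, while the regular expression here is a str pattern, and the re module will not apply a str pattern to bytes. Decoding the page to str before matching fixes it: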

html = html.decode('GBK')
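
The mismatch can be reproduced without any network access. The sketch below uses a made-up HTML fragment (the example.com URL and the title text are placeholders, not taken from the site), encoded as GBK to stand in for what page.read() returns:

```python
import re

pattern = re.compile(r'src="(.*?\.jpg)" title')

# Stand-in for page.read(): a GBK-encoded bytes object (made-up content)
raw = '<img src="http://example.com/a.jpg" title="桌面壁纸">'.encode('gbk')

try:
    pattern.findall(raw)  # str pattern applied to bytes raises TypeError
except TypeError as exc:
    error = str(exc)      # the "string pattern on a bytes-like object" message

# Decoding first gives re a str to search, so the match succeeds
urls = pattern.findall(raw.decode('gbk'))
print(error)
print(urls)
```

The same rule applies in the other direction: a bytes pattern such as rb'...' can only be used on bytes.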

With that change the script runs successfully and downloads the images from the page.
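
The hard-coded 'GBK' matches this particular site. As a hedged extension that is not in the original script, the encoding could instead be taken from the response's Content-Type header, with GBK as the fallback. decode_body is a hypothetical helper name, and the header strings in the usage lines are made-up examples:

```python
import re

def decode_body(raw, content_type, default='gbk'):
    """Decode raw bytes using the charset named in a Content-Type value.

    Falls back to `default` when the header is missing or names no charset.
    """
    match = re.search(r'charset=([\w-]+)', content_type or '', re.I)
    encoding = match.group(1) if match else default
    # errors='replace' keeps a few bad bytes from aborting the whole crawl
    return raw.decode(encoding, errors='replace')

# Usage with made-up header values
text_gbk = decode_body('壁纸'.encode('gbk'), 'text/html; charset=GBK')
text_utf8 = decode_body('壁纸'.encode('utf-8'), 'text/html; charset=utf-8')
text_default = decode_body('壁纸'.encode('gbk'), 'text/html')
```

In the script, the header value would come from page.headers.get('Content-Type') on the object returned by urlopen.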
