Python爬蟲錯誤記錄

阿新 • • 發佈：2019-01-15

本文注意是用於記錄在用Python寫爬蟲的過程中所經歷的一些問題及其解決方法，便於後續翻查。

語法錯誤

錯誤檔案已存在時無法建立檔案

出錯程式碼

fp = open("filetest.txt","w")
fp.write("Hello World \n")
fp.close()
import os
os.rename("filetest.txt","newfiletest.txt")
fp = open("newfiletest.txt","r")
print ("the new file name is:",fp.name)

詳細錯誤資訊

FileExistsError: [WinError 183] 當檔案已存在時，無法建立該檔案。: 'filetest.txt' -> 'newfiletest.txt'

原因及解決方法

由於歷史原因，在本程式碼出錯之前已有其他錯誤，導致檔案已經被重新命名為filetest.txt,導致無法重複對其進行重新命名，因而出錯。解決方法是將程式碼目錄下的同名檔案刪除，再執行程式碼，即可。

‘ResultSet’ object has no attribute ‘get’

出錯程式碼

def getMMAlbumList(personal_id, file):
    # get the ambum list of a sigle mm
    data = urllib.request.urlopen(personal_id)
    soup = BeautifulSoup(data, 'lxml' 
)
    tag=soup.find_all('div')
    album_url=tag.get('href')

    return data

詳細錯誤資訊

    album_url=tag.get('href')
AttributeError: 'ResultSet' object has no attribute 'get'

原因及解決方法

通過列印tag可知其為一個tag的集合（類似於一個結構體），而程式碼需要從中找出href的屬性值（可以理解為結構體中的某個變數的型別），如此在獲取單個tag的屬性值之前必須要定位到該tag，不能對整個tag集合進行操作。因此在該段程式碼中，若想取得某個tag的屬性，需要遍歷整個tag集合，找出該tag，最後呼叫get函式，獲得相應屬性值。修改後的程式碼如下：

def getMMAlbumList(personal_id, file):
    # get the ambum list of a sigle mm
    data = urllib.request.urlopen(personal_id)
    soup = BeautifulSoup(data, 'lxml')
    for tag in soup.find_all('a')
        print (tag)
    return data

Python爬蟲錯誤記錄

語法錯誤

錯誤檔案已存在時無法建立檔案

出錯程式碼

詳細錯誤資訊

原因及解決方法

‘ResultSet’ object has no attribute ‘get’

出錯程式碼

詳細錯誤資訊

原因及解決方法

python 爬蟲錯誤記錄

Python爬蟲錯誤記錄

Python爬蟲實踐 -- 記錄我的第一只爬蟲

【Python爬蟲錯誤】ConnectionResetError: [WinError 10054] 遠端主機強迫關閉了一個現有的連線

Python爬蟲實踐 -- 記錄我的第二隻爬蟲

python3+selenium自動化測試：除錯python程式錯誤記錄，呼叫類時格式出錯

一次簡單Python爬蟲程式碼記錄

記錄一次python爬蟲批量下載一個校花網站的妹子圖片

python-爬蟲技能升級記錄

python爬蟲執行scrapy crawl demo出現： import win32api ModuleNotFoundError: No module named 'win32api'錯誤

[Python爬蟲]通過分析胸罩銷售記錄發現了驚人的祕密

python爬蟲學習之日誌記錄模組

錯誤記錄： linux 使用yum安裝軟體出錯 basn: /usr/bin/yum: /usr/bin/python: bad interpreter: no such file or

Ubuntu下搭建Appium+python自動化環境記錄及遇到的錯誤記錄

python爬蟲：從頁面下載圖片以及編譯錯誤解決。

python爬蟲解決403禁止訪問錯誤

【Python】學習遇到錯誤記錄

Python爬蟲實戰之爬取鏈家廣州房價_04鏈家的模擬登入(記錄)

華為2016校園招聘上機筆試題：簡單錯誤記錄 [python]

python爬蟲（爬取蜂鳥網高畫素圖片）_空網頁,錯誤處理

Python爬蟲錯誤記錄

語法錯誤

錯誤檔案已存在時無法建立檔案

出錯程式碼

詳細錯誤資訊

原因及解決方法

‘ResultSet’ object has no attribute ‘get’

出錯程式碼

詳細錯誤資訊

原因及解決方法

相關推薦