爬蟲學習日誌3 ajax json 和 post請求

阿新 • • 發佈：2021-08-08

1 題目要求

破解百度翻譯

2 要求分析

首先題目的要求只需要我們獲取翻譯之後的內容，而不需要其他無關的內容。而Ajax請求可以解決這個問題，（Ajax 在瀏覽器與 Web 伺服器之間使用非同步資料傳輸（HTTP 請求），這樣就可使網頁從伺服器請求少量的資訊，而不是整個頁面。 Ajax可使因特網應用程式更小、更快，更友好。 Ajax 是一種獨立於 Web 伺服器軟體的瀏覽器技術。）接下來就來分佈介紹如何獲取所需的資訊。

3 獲取url

url是爬蟲爬取資料的基礎，在本題中獲取url的操作是在瀏覽器中右鍵點選檢查，再點選網路，選擇XHR，在輸入需要翻譯的資料時會出現多個sug資料包，單擊sug

資料包找到kw為檢索詞的，複製其請求url即可。

4 程式碼實現

import requests
import json
post_url = 'https://fanyi.baidu.com/sug'
#指定url
headers = {
    'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0.4515.107 Safari/537.36 Edg/92.0.902.62'
}
#UA偽裝
keyword = input('input (a) word(s):')
data = {
    'kw':keyword
}
response = requests.post(url=post_url,data=data,headers=headers)
#data引數與get的params類似
#發起請求
page_json = response.json()
#僅在相應資料為json時，才能夠使用response.json
#獲取響應資料
FileName = keyword + '.json'
fp = open(FileName,'w',encoding='utf-8')
json.dump(page_json,fp=fp,ensure_ascii=False)
#因為中文是不支援ascii編碼的，所以需要將第三個引數設定為False
#持久化儲存
print('Accomplished!')

在程式碼實現時，我們還是需要進行UA偽裝

爬蟲學習日誌3 ajax json 和 post請求

1 題目要求

2 要求分析

3 獲取url

4 程式碼實現

爬蟲學習日誌3 ajax json 和 post請求

OpenGL高階版本學習日誌3：網格模型的載入與顯示

【FastAPI 學習七】GET和POST請求引數接收以及驗證

SpringMVC 學習筆記08：解決Get和Post請求的亂碼問題

爬蟲學習（3）：獲取網站cookies

4.27 jQuery AJAX get() 和 post() 方法

jQuery - AJAX get() 和 post() 方法

RTKLIB學習日誌3—精密定位流程

AJAX入門以及get和post請求

淺談http get和post請求

js get和post請求實現程式碼解析

golang使用http client發起get和post請求示例

webapi設定一個Action同時支援get和post請求

PHP如何使用cURL實現Get和Post請求

requests post/get請求params引數和post請求正文的資料型別記錄

前端匯出下載分別傳送get和post請求的寫法

【5】基於Python-基礎知識：環境搭建和模擬Get 和Post請求（1）

用python傳送GET和POST請求

RestTemplate傳送get和post請求,下載檔案的例項

iOS開發網路篇—傳送GET和POST請求（使用NSURLSession） - 轉

爬蟲學習日誌3 ajax json 和 post請求

1 題目要求

2 要求分析

3 獲取url

4 程式碼實現

相關推薦