Parsing data with XPath (scraping the names of cities nationwide)

阿新 • Published: 2020-12-28

Target site: https://www.aqistudy.cn/historydata/

# Created: 2020/12/27 22:00
# IDE: PyCharm
# Author: Friday
# URL: https://www.aqistudy.cn/historydata/
import requests
from lxml import etree

if __name__ == "__main__":
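    headers = {
        # Note: this Referer points at a different site (pic.netbian.com), not the target site
        'Referer': 'http://pic.netbian.com/4kmeinv/index_2.html',
        # The standard header name is 'User-Agent' (hyphen, not underscore)
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.88 Safari/537.36'
    }
    url = 'https://www.aqistudy.cn/historydata/'
    response = requests.get(url=url, headers=headers)
    page_text = response.text
    # Build an element tree from the page source so it can be queried with XPath
    tree = etree.HTML(page_text)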

    # Method 1:
    # # Popular cities
    # host_city_list = tree.xpath('//div[@class="bottom"]/ul/li')
    # host_name_list = []
    # for li in host_city_list:
    #     host_name = li.xpath('./a/text()')[0]
    #     host_name_list.append(host_name)
    # # print(host_name_list)
    #
    # # All cities, variant 1: walk each <ul>, then the <li> tags inside its <div> children
    # # all_city_list = []
    # # all_city_ul_list = tree.xpath('//div[@class="bottom"]/ul')
    # # for ul in all_city_ul_list:
    # #     get_li_list = ul.xpath('./div/li')
    # #     for li in get_li_list:
    # #         name = li.xpath('./a/text()')[0]
    # #         host_name_list.append(name)
    # # All cities, variant 2: select the <li> tags directly with one expression
    # # all_city_li = tree.xpath('//div[@class="bottom"]/ul/div[2]/li')
    # # for li in all_city_li:
    # #     name = li.xpath('./a/text()')[0]
    # #     host_name_list.append(name)
    # print(host_name_list)
    # print(len(host_name_list))

    # Method 2: select the <a> tags of both groups with a single XPath union (|)
    a_list = tree.xpath('//div[@class="bottom"]/ul/li/a | //div[@class="bottom"]/ul/div[2]/li/a')
    all_city_names = []
    for a in a_list:
        city_name = a.xpath('./text()')[0]
        all_city_names.append(city_name)
    print(all_city_names)
    print(len(all_city_names))

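Summary: Looking at the page's HTML structure, the obvious approach is to run two XPath queries, one for the <li> tags under "Popular cities" and one for those under "All cities". On closer thought, this can be optimized: every city name we want sits inside an <a> tag, so a single XPath expression can select the <a> tags of both groups at once, and the results can then be processed in one uniform loop.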
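
To make the comparison concrete, here is a minimal offline sketch of both approaches. It does not fetch the real page: SAMPLE_HTML is a hypothetical, simplified snippet that only mimics the structure the XPath expressions above assume, and the function names and city entries are made up for illustration. The real page's markup may differ.

```python
from lxml import etree

# Hypothetical markup imitating the assumed page structure:
# the first "bottom" block holds popular cities as ul/li,
# the second holds all cities as ul/div[2]/li.
SAMPLE_HTML = """
<html><body>
<div class="bottom">
  <ul>
    <li><a href="#">Beijing</a></li>
    <li><a href="#">Shanghai</a></li>
  </ul>
</div>
<div class="bottom">
  <ul>
    <div><b>A.</b></div>
    <div>
      <li><a href="#">Aba</a></li>
      <li><a href="#">Beijing</a></li>
    </div>
  </ul>
</div>
</body></html>
"""


def city_names_two_passes(tree):
    """Method 1: query each group separately, then concatenate the results."""
    hot = tree.xpath('//div[@class="bottom"]/ul/li/a/text()')
    rest = tree.xpath('//div[@class="bottom"]/ul/div[2]/li/a/text()')
    return hot + rest


def city_names_union(tree):
    """Method 2: one XPath union (|) that matches the <a> tags of both groups."""
    a_list = tree.xpath(
        '//div[@class="bottom"]/ul/li/a | //div[@class="bottom"]/ul/div[2]/li/a'
    )
    # a.text is None for an empty <a>, so skip those rather than indexing [0]
    return [a.text.strip() for a in a_list if a.text and a.text.strip()]


tree = etree.HTML(SAMPLE_HTML)
two_pass_names = city_names_two_passes(tree)
union_names = city_names_union(tree)

# The union removes duplicate nodes, not duplicate text: a city listed in
# both groups is two distinct <a> elements, so its name appears twice.
unique_names = list(dict.fromkeys(union_names))

print(union_names)
print(two_pass_names == union_names)
print(unique_names)
```

On this sample both approaches return the same list, because a union returns nodes in document order and the popular cities come first. If a city appears in both groups, de-duplicating by name afterwards (as unique_names does) is one option; whether that is desirable depends on what the list is used for.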
