Biquge Novel Scraper (Optimized Version)

阿新 • Published: 2018-12-14

#-*-coding:utf-8-*-
# Biquge novel scraper
import requests
from lxml import etree

def url_processing(url):   # Fetch a page and return its HTML text
    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.62 Safari/537.36'}
    print('Opening', url)
    res = requests.get(url=url, headers=headers, timeout=10)   # request once and reuse the response
    if not 200 <= res.status_code < 300:   # anything outside 2xx is treated as a bad URL
        print('Invalid URL, please try again. Status code returned: %s' % res.status_code)
        return []
    return res.text

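def extract(html):   # Extract the chapter links from the table-of-contents page
    if not html:                    # nothing was fetched, so there is nothing to parse
        return []
    tree = etree.HTML(html)         # parse the HTML so it can be queried with XPath
    urs = tree.xpath('//dd/a/@href')
    return urs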

def urls_cl(urs):   # Download every chapter and save each one to its own file
    headers = {
        'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.15 Safari/537.36'}
    # The first 9 links are skipped, as in the original script
    # (presumably they are "latest chapter" shortcuts above the full list).
    for n, chapter_url in enumerate(urs[9:], start=1):
        print('Start crawling chapter', n)
        res = requests.get(url=chapter_url, headers=headers)
        tr = etree.HTML(res.text)
        txt_a = tr.xpath('//div[@class="bookname"]/h1/text()')[0]   # title
        txt_b = '\n'.join(tr.xpath('//div[@id="content"]/p/text()'))   # body: every paragraph, not just the first
        file = 'chapter_' + str(n) + '.txt'
        with open(file, 'w', encoding='utf-8') as fp:   # one file per chapter
            fp.write(txt_a + '\n' + txt_b)
        print('Chapter', n, 'finished')
    return 'Full book crawl complete'

if __name__ == '__main__':
    ur = 'https://www.biquge5200.cc/'
    a = input('Enter the book ID: ')   # e.g. 0_844
    url = ur + a
    print(urls_cl(extract(url_processing(url))))
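
The script hands each extracted `href` straight to `requests.get`, which only works when the site emits absolute URLs. Below is a minimal sketch, assuming the table-of-contents page might use relative links (an assumption about the page, not something the script confirms). The helper name `absolute_links` and the sample file names are hypothetical, and only the standard library is used:

```python
from urllib.parse import urljoin


def absolute_links(base_url, hrefs):
    """Resolve every href against the page it was found on.

    Absolute URLs pass through unchanged; relative ones are joined to base_url.
    """
    return [urljoin(base_url, href) for href in hrefs]


if __name__ == '__main__':
    # Hypothetical hrefs in the three forms a site might emit.
    base = 'https://www.biquge5200.cc/0_844/'
    sample = ['1.html', '/0_844/2.html', 'https://www.biquge5200.cc/0_844/3.html']
    for link in absolute_links(base, sample):
        print(link)
```

One way to use it would be to wrap the result of `extract` before passing it to `urls_cl`, with the book's URL as `base_url`.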
