牛客網面經 - 爬蟲整理 - 高效背題

阿新 • • 發佈：2021-06-15

效果

原始碼

import time
from bs4 import BeautifulSoup
import requests
from selenium import webdriver

# 原作
# https://blog.csdn.net/qq_40050586/article/details/105729740

urlbase = "https://www.nowcoder.com"
# Android 面經
targetUrl = "https://www.nowcoder.com/discuss/experience?tagId=642"


def getIndexPage(url):
    driver = webdriver.Chrome(executable_path='/Users/jiangjia/Downloads/chromedriver')
    driver.get(targetUrl)
    time.sleep(3)
    js = "return action=document.body.scrollHeight"
    height = driver.execute_script(js)
    driver.execute_script('window.scrollTo(0, document.body.scrollHeight)')
    time.sleep(5)
    t1 = int(time.time())
    status = True
    num = 0
    # 這裡的一堆程式碼就是將滾動條拉到最下面，讓資源載入完畢。
    while status:
        t2 = int(time.time())
        if t2 - t1 < 30:
            new_height = driver.execute_script(js)
            if new_height > height:
                time.sleep(1)
                driver.execute_script('window.scrollTo(0, document.body.scrollHeight)')
                height = new_height
                t1 = int(time.time())
        elif num < 3:
            time.sleep(3)
            num = num + 1
        else:
            print("滾動條已經處於頁面最下方！")
            status = False
            driver.execute_script('window.scrollTo(0, 0)')
            break
    content = driver.page_source
    return content

def getUrl(page):
    soup = BeautifulSoup(page, 'lxml')
    list = []
    for ul in soup.select(".js-nc-wrap-link"):
        list.append(ul.attrs['data-href'])
    return list


def getPageDetail(urll):
    try:
        response = requests.get(urll)
        if response.status_code == 200:
            return response.text
        return None
    except ConnectionError:
        print('Error occurred')
        return None


def parseContentName(page):
    soup = BeautifulSoup(page, 'lxml')
    return soup.select(".post-title")[0].get_text()


def main():
    file = open("面經整理.md", "a+")
    page = getIndexPage("https://www.nowcoder.com/discuss/experience?tagId=639&order=3&companyId=0&phaseId=2")
    list = getUrl(page)
    print("一共有%d篇" % len(list))
    count = 0
    for item in list:
        content = getPageDetail(urlbase + item)
        name = parseContentName(content)
        file.write("- [ ] &emsp; [{0}]({1})\n\n".format(name, urlbase + item))
        count = count + 1
        print("進行到第{0}篇了 >>> {1}".format(count, name))

    file.close()


if __name__ == '__main__':
    main()

牛客網面經 - 爬蟲整理 - 高效背題

效果原始碼 import time from bs4 import BeautifulSoup import requests from selenium import webdriver # 原作

牛客網面試題

目錄一、JAVA 二、計算機網路三、作業系統四、專案五、資料庫第六部分框架

牛客網劍指offer第十七題解答及知識點

技術標籤：牛客網-劍指offer題解問題：輸入兩棵二叉樹A，B，判斷B是不是A的子結構。（ps：我們約定空樹不是任意一個樹的子結構）。解答1：知識點： 1.二叉樹的簡介。二叉樹是n個有限元素的集合，該集合或者為

牛客網題庫爬蟲

完整程式碼 import requests from urllib.parse import urlencode from multiprocessing.pool import Pool

牛客網--位元組跳動面試題--雀魂啟動

牛客網--位元組跳動面試題--雀魂啟動部落格說明文章所涉及的資料來自網際網路整理和個人總結，意在於個人學習和經驗彙總，如有什麼地方侵權，請聯絡本人刪除，謝謝！

牛客網--位元組跳動面試題--萬萬沒想到之聰明的編輯

牛客網--位元組跳動面試題--萬萬沒想到之聰明的編輯部落格說明文章所涉及的資料來自網際網路整理和個人總結，意在於個人學習和經驗彙總，如有什麼地方侵權，請聯絡本人刪除，謝謝！

牛客網--位元組跳動面試題--萬萬沒想到之抓捕孔連順

牛客網--位元組跳動面試題--萬萬沒想到之抓捕孔連順部落格說明文章所涉及的資料來自網際網路整理和個人總結，意在於個人學習和經驗彙總，如有什麼地方侵權，請聯絡本人刪除，謝謝！

牛客網--位元組跳動面試題--特徵提取

牛客網--位元組跳動面試題--特徵提取部落格說明文章所涉及的資料來自網際網路整理和個人總結，意在於個人學習和經驗彙總，如有什麼地方侵權，請聯絡本人刪除，謝謝！

校招筆試整理牛客網 2020小米校招（1）

前端筆試選擇牛客網 2020小米校招(1) 2020小米校招 localStorage和cookie 在現代瀏覽器中, cookie可以在跨域請求中被攜帶在請求頭中

牛客網演算法——名企高頻面試題143題（5）

技術標籤：牛客網題目資料結構與演算法演算法 package 名企高頻面試題143; import org.junit.Test;

牛客網演算法——名企高頻面試題143題（6）

技術標籤：牛客網題目資料結構與演算法演算法題目描述已知兩顆二叉樹，將它們合併成一顆二叉樹。合併規則是：都存在的結點，就將結點值加起來，否則空的位置就由另一個樹的結點來代替。例如：

牛客網演算法——名企高頻面試題143題（8）

技術標籤：牛客網題目資料結構與演算法演算法題目描述請寫出一個高效的在m*n矩陣中判斷目標值是否存在的演算法，矩陣具有如下特徵：

牛客網 | 高頻面試題 | 判斷連結串列中是否有環

技術標籤：刷題# 牛客連結串列leetcode單鏈表文章目錄題目題解快慢指標雜湊表

【牛客網-名企高頻面試題】 NC66 兩個連結串列的第一個公共結點

技術標籤：# 牛客網【牛客網-名企高頻面試題】 NC66 兩個連結串列的第一個公共結點

【牛客網-名企高頻面試題】 NC72 二叉樹的映象

技術標籤：# 牛客網【牛客網-名企高頻面試題】 NC72 二叉樹的映象題目描述：

[Nowcoder]牛客網週週練15

Before the Beginning 轉載請將本段放在文章開頭顯眼處，如有二次創作請標明。原文連結：https://www.codein.icu/nowcoderweekly15/

牛客網Bogo Sort

題目描述： Today Tonnnny the monkey learned a new algorithm called Bogo Sort. The teacher gave Tonnnny the code of Bogo sort:

牛客網Drop Voicing

題目描述： Inaka composes music. Today\'s arrangement includes a chord of nnn notes that are pairwise distinct, represented by a permutation p1…np_{1 \\dots n}p1…n of integers from 111

牛客網Graph

題目描述： You are now in a big factory. The factory could be recognized as a graph withn vertices andm edges. Every edge has its length. You havek missions to do. The i-th mission is going to verte

牛客網K-Bag

題目描述：連結：https://ac.nowcoder.com/acm/contest/5671/K來源：牛客網A sequence is called kkk-bag, if and only if it is put in order by some (maybe one) permutations of111to kkk. For example,1,2,3,2

牛客網面經 - 爬蟲整理 - 高效背題

效果

原始碼

相關推薦