Scraping a Movie Website

阿新 • Published: 2020-10-21

code

import time

from selenium import webdriver
from bs4 import BeautifulSoup


def asleep(driver):
    # Set the implicit wait used for element lookups, then pause briefly.
    driver.implicitly_wait(3.5)
    time.sleep(2)

driver = webdriver.Chrome()
asleep(driver)

# The listing has 719 pages
for k in range(1,720):
    url="http://zimiyy.com/mov/0/0/all/{}.html".format(k)

    driver.get(url)

    t=driver.find_element_by_xpath("//div[@class='index-tj mb clearfix']/ul").get_attribute('innerHTML')

    soup1 = BeautifulSoup(t, 'html.parser')

    tmp = soup1.find_all('a')

    for i in tmp:
        tmp_movie_url="http://zimiyy.com{}".format(i.get("href"))
        print(tmp_movie_url)
        movie_name=i.get("title")
        print(movie_name)
        pic_url=i.find("img").get("src")
        print(pic_url)

        time.sleep(2)
        # Open the detail page
        driver.get(tmp_movie_url)
        # Get the description
        tmp_desc=driver.find_element_by_class_name("info").get_attribute('innerHTML')
        detail_html=driver.find_element_by_id("stab_1_71").get_attribute('innerHTML')
        soup2 = BeautifulSoup(detail_html, 'html.parser')
        tmp_play_page_list=soup2.findAll('li')

        print(tmp_desc)
        all_movie_url={}
        for j in tmp_play_page_list:
            movie_url_type=j.find("a").string
            play_page_url=j.find("a").get("href")
            # Open the playback page
            driver.get(play_page_url)
            # Get the video link
            try:
                movie_url=driver.find_element_by_xpath("//span[@class='dplayer-info-panel-item-data']").text
            except Exception as e:
                print(e)
                movie_url=None

            # Record the link under its source label
            all_movie_url[movie_url_type]=movie_url

        print(all_movie_url)
        print("*"*17)

    time.sleep(3)

# Close the browser once every page has been processed
driver.quit()
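
The script builds each detail-page URL by prefixing the `href` with the site root, but passes the playback-page `href` to `driver.get()` unchanged. Whether the site's playback links are relative or absolute is not stated here, so the following is only a minimal sketch under that assumption: it uses the standard library's `urljoin`, which handles both cases. The names `BASE_URL` and `absolute_url` are hypothetical and do not appear in the original script.

```python
from urllib.parse import urljoin

# Site root, taken from the URLs used in the script above.
BASE_URL = "http://zimiyy.com"


def absolute_url(href, base=BASE_URL):
    """Return an absolute URL for href, or None when href is missing.

    Hypothetical helper: a relative href is resolved against base,
    while an href that is already absolute is returned unchanged.
    """
    if not href:
        return None
    return urljoin(base, href)
```

With such a helper, `driver.get(absolute_url(j.find("a").get("href")))` would behave the same for either kind of link, and a missing `href` could be skipped instead of raising inside `driver.get()`.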
