A rookie of python_crawler----1(tf)

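阿新 • Published: 2018-12-13

These are a beginner's notes on learning to write web crawlers.

The code below is very simple: it crawls information about the popular lipsticks listed on the TF (Tom Ford) official website.

It uses only the most basic libraries, BeautifulSoup and requests.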

# A simple script that crawls information about the popular TF lipsticks
import re

import requests
from bs4 import BeautifulSoup

url = 'https://www.tom-ford.cn/'
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) '
                  'Chrome/70.0.3538.77 Safari/537.36'
}

response = requests.get(url, headers=headers, timeout=10)
response.raise_for_status()  # stop early on an HTTP error status
html_doc = response.content  # raw bytes of the TF home page
# print(response.status_code)              # status code
# print(response.content.decode("utf-8"))  # page content

soup = BeautifulSoup(
    html_doc,
    'html.parser',
    from_encoding='utf-8'  # encoding of the HTML document
)

# Keep only the <a> tags whose href contains "goods-"
tf_links = soup.find_all('a', href=re.compile(r"goods-"))

for tf_link in tf_links:
    # print(tf_link.name, tf_link['href'], tf_link.get_text())
    print(tf_link.get_text())
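
The link filter in the script above (`find_all('a', href=re.compile(r"goods-"))`) can be tried offline without hitting the site. The following is only a rough sketch: it swaps BeautifulSoup for the standard library's `html.parser`, and the sample markup, the class `GoodsLinkParser`, and the function `extract_goods_names` are all made up for illustration. The real page structure is not shown in this post.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample markup (an assumption, not copied from the real site).
SAMPLE_HTML = """
<ul>
  <li><a href="/goods-101.html">Lip Color 01</a></li>
  <li><a href="/about.html">About</a></li>
  <li><a href="/goods-102.html">Lip Color 02</a></li>
</ul>
"""


class GoodsLinkParser(HTMLParser):
    """Collect the text of <a> tags whose href matches a regex pattern."""

    def __init__(self, pattern):
        super().__init__()
        self.pattern = re.compile(pattern)
        self.names = []
        self._inside = False  # True while inside a matching <a> tag
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            # Same idea as href=re.compile(...): keep the tag if the regex is found in href
            if self.pattern.search(href):
                self._inside = True
                self._buffer = []

    def handle_data(self, data):
        if self._inside:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._inside:
            self.names.append("".join(self._buffer).strip())
            self._inside = False


def extract_goods_names(html, pattern=r"goods-"):
    """Return the link text of every <a> whose href matches the pattern."""
    parser = GoodsLinkParser(pattern)
    parser.feed(html)
    parser.close()
    return parser.names


print(extract_goods_names(SAMPLE_HTML))
```

On the sample markup only the two links whose `href` contains `goods-` are kept; the "About" link is skipped. Testing the filter on a saved or hand-written snippet like this makes it easier to adjust the pattern before sending requests to the live site.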
