Python XPath: text and attrib

xPath = '/html/body/div/div/form/div/table/tbody/tr/td[1]/a'

With this path we can easily get a list of the matching <a> elements.
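To make the "list of <a> elements" concrete, here is a minimal sketch run against a made-up page. `SAMPLE_HTML` and `find_links` are hypothetical names used only for illustration; the real page content is not shown in this post. The sample spells out `tbody` explicitly so that the path above matches it.

```python
from lxml import etree

# Hypothetical sample page; the real HTML comes from the site being scraped.
SAMPLE_HTML = """
<html><body><div><div><form><div><table><tbody>
<tr><td><a href="../a/1.html">First</a></td><td>x</td></tr>
<tr><td><a href="../a/2.html">Second</a></td><td>y</td></tr>
</tbody></table></div></form></div></div></body></html>
"""

xPath = '/html/body/div/div/form/div/table/tbody/tr/td[1]/a'


def find_links(html_text, path=xPath):
    """Parse html_text and return the list of elements matched by path."""
    return etree.HTML(html_text).xpath(path)


links = find_links(SAMPLE_HTML)
# xpath() hands back an ordinary Python list of element objects
print(type(links).__name__, len(links))
print([link.tag for link in links])
```

Because the result is a plain list, it can be looped over, sliced, or checked for emptiness like any other list.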

Then we process it:

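from lxml import etree

# resData is the HTML text of the page fetched earlier
selector = etree.HTML(resData)
res = selector.xpath(xPath)
allData = []
for x in res:
    href = x.attrib.get('href')
    # A missing href or empty link text means there is nothing more to read
    if href is None or not x.text:
        print("Traversal finished")
        break
    # Build a fresh dict per link so earlier entries are not overwritten
    allData.append({
        'value': x.text,
        'href': "http://" + href.replace("..", "www.liuan.mobi"),
    })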

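As the code above shows,

x.attrib['href']

reads an attribute we want from the tag (here the link's href), just as x.text reads the text inside it.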
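The difference between `text` and `attrib` is easier to see on a small fragment. The sketch below uses a made-up snippet of HTML (the variable names and link values are assumptions, not taken from the original page) and also shows two edge cases worth knowing: a missing attribute and an element with no text.

```python
from lxml import etree

# Hypothetical fragment: a normal link, one without href, one without text.
doc = etree.HTML(
    '<p><a href="../news/1.html" title="t1">News</a>'
    '<a>No link</a>'
    '<a href="../news/3.html"></a></p>'
)
first, no_href, no_text = doc.xpath('//a')

print(first.text)            # the text inside the tag
print(dict(first.attrib))    # every attribute of the tag, as a plain dict
print(first.attrib['href'])  # a single attribute, looked up by name
print(no_href.get('href'))   # .get() gives None where attrib[...] would raise KeyError
print(no_text.text)          # an element without text gives None, not ""

# XPath itself can also select attributes or text nodes directly,
# in which case xpath() returns strings instead of elements.
hrefs = doc.xpath('//a/@href')
texts = doc.xpath('//a/text()')
print(hrefs, texts)
```

Using `.get('href')` and a truthiness check on `.text` avoids relying on an exception or a comparison with an empty string to detect incomplete links.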