根據使用者ID爬取Twitter資料

阿新 • • 發佈：2019-01-17

我需要爬取的使用者ID存放在一個.csv檔案下，然後從官網註冊到一個APP，並獲得你的key和secret，寫入下邊的程式碼，就可以爬取tweets了。每個ID會輸出相應的tweet並且s會放在一個.csv檔案裡，而這個.csv檔案就在你執行這段程式碼的資料夾下。 #!/usr/bin/env python # encoding: utf-8 import tweepy import csv consumer_key = "" consumer_secret = "" access_key = "" access_secret = "" def get_all_tweets(user_id): auth = tweepy.OAuthHandler(consumer_key, consumer_secret) auth.set_access_token(access_key, access_secret) api = tweepy.API(auth) # 初始化一個數字來儲存所有的tweets alltweets = [] new_tweets = api.user_timeline(user_id=user_id, count=200) # save most recent tweets alltweets.extend(new_tweets) # save the id of the oldest tweet less one oldest = alltweets[-1].id - 1 # keep grabbing tweets until there are no tweets left to grab while len(new_tweets) > 0: print "getting tweets before %s" % (oldest) # all subsiquent requests use the max_id param to prevent duplicates new_tweets = api.user_timeline(user_id=user_id, count=200, max_id=oldest) # save most recent tweets alltweets.extend(new_tweets) # update the id of the oldest tweet less one oldest = alltweets[-1].id - 1 print "...%s tweets downloaded so far" % (len(alltweets)) # transform the tweepy tweets into a 2D array that will populate the csv outtweets = [[tweet.id_str, tweet.created_at, tweet.text.encode("utf-8")] for tweet in alltweets] # write the csv with open('%s_tweets.csv' % user_id, 'wb') as f: writer = csv.writer(f) writer.writerow(["tweet_id", "created_at", "text"]) writer.writerows(outtweets) pass if __name__ == '__main__': with open(這裡寫你的檔案的位置，例如：'e:/file/userID.csv', 'rb') as f: ID = csv.reader(f) for row in ID: # 這裡運用了錯誤查詢機制，遇到使用者ID出現問題時，可以跳過 try: get_all_tweets(row[0]) except tweepy.TweepError, e: print 'Failed to run the command on that user, Skipping...' except IndexError, e: print 'List index out of range, Skipping...' continue

根據使用者ID爬取Twitter資料

根據使用者ID爬取Twitter資料

利用Twitter開放者平臺爬取Twitter資料

根據地理位置和關鍵詞爬取twitter資料並生成詞雲

無搜尋條件根據url獲取網頁資料(java爬取網頁資料)

有搜尋條件根據url抓取網頁資料(java爬取網頁資料)

高德地圖之根據矩形範圍爬取範圍內的分類POI資料

店鋪商品id爬取

python 根據鏈家爬取的信息生成雲詞

爬取xml資料之R

將爬取的資料傳入到pipeline中，需要對settings.py進行修改

用python爬取股票資料的一點小結

將爬取的資料儲存到mysql中

scrapy框架用post 爬取網站資料的兩種方法區別

爬取貓眼資料

另類爬蟲：從PDF檔案中爬取表格資料

爬蟲練習--爬取股票資料

python 將爬取的資料儲存在資料庫裡

利用linux curl爬取網站資料

爬取京東資料

爬取大規模資料（1）

根據使用者ID爬取Twitter資料

相關推薦