Alice in Wonderland: a word cloud in Python
阿新 • Published: 2018-11-12
What WordCloud does
(1) Text preprocessing
(2) Word frequency counting
(3) Rendering the high-frequency words in color as an image
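To make steps (1) and (2) concrete, here is a minimal sketch using only the standard library. It is not WordCloud's internal implementation; the function name `top_words`, the tiny `DEMO_STOPWORDS` set, and the sample sentence are made up for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for illustration;
# the wordcloud package ships a much longer one as STOPWORDS.
DEMO_STOPWORDS = {"the", "a", "and", "to", "of", "said"}


def top_words(text, stopwords=DEMO_STOPWORDS, limit=3):
    """Return the `limit` most frequent non-stopword tokens with their counts."""
    # Step 1: preprocessing. Lowercase the text and keep runs of letters/apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    # Drop stopwords so filler words do not dominate the result.
    tokens = [t for t in tokens if t not in stopwords]
    # Step 2: frequency counting.
    return Counter(tokens).most_common(limit)


sample = "Alice said the rabbit ran. The rabbit ran, and Alice ran to the rabbit."
print(top_words(sample))
```

Step (3) is the part WordCloud handles for you: a dict of counts like this one can be passed to `WordCloud.generate_from_frequencies`, while `generate(text)` does the counting itself.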
Source of the text and the mask image
https://github.com/amueller/word_cloud/tree/master/examples
Word cloud without a mask
import matplotlib.pyplot as plt
from wordcloud import WordCloud

# Raw string so the backslashes in the Windows path are taken literally.
with open(r'D:\python\CompatingD\Cloud_map\1.txt') as f:
    text = f.read()

# Build the word cloud with the default settings.
wordcloud = WordCloud().generate(text)

# Show the result without axes.
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.show()
Word cloud with a mask
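from os import path

import matplotlib.pyplot as plt
import numpy as np
from PIL import Image
from wordcloud import WordCloud, STOPWORDS

# Directory of this script; the output image is saved here.
d = path.dirname(__file__)

# Raw strings so the backslashes in the Windows paths are taken literally.
with open(r'D:\python\CompatingD\Cloud_map\alice.txt') as f:
    text = f.read()

# Load the mask image as an array; words are drawn only on its non-white area.
alice_mask = np.array(Image.open(r'D:\python\CompatingD\Cloud_map\alice_mask.png'))

# Start from the built-in stopwords and also exclude "said".
stopwords = set(STOPWORDS)
stopwords.add("said")

wc = WordCloud(background_color="white", max_words=2000, mask=alice_mask, stopwords=stopwords)
wc.generate(text)

# Save the word cloud next to the script.
wc.to_file(path.join(d, "alice.png"))

# Show the word cloud, then the mask in a second figure.
plt.imshow(wc, interpolation='bilinear')
plt.axis("off")
plt.figure()
plt.imshow(alice_mask, cmap=plt.cm.gray, interpolation='bilinear')
plt.axis("off")
plt.show()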
Alice!!!
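If no mask picture is at hand, a mask can also be built directly as an array. The sketch below draws a circle, relying on the WordCloud convention that pure-white (255) entries are masked out and every other entry is free to draw on. The helper name `circle_mask` and its default sizes are assumptions made for this example.

```python
import numpy as np


def circle_mask(size=300, radius=130):
    """Build a square mask: 0 inside a centered circle, 255 (white) outside."""
    center = size // 2
    y, x = np.ogrid[:size, :size]
    # True for every pixel that lies outside the circle.
    outside = (x - center) ** 2 + (y - center) ** 2 > radius ** 2
    # WordCloud treats pure-white (255) entries as masked out.
    return (255 * outside).astype(np.uint8)


mask = circle_mask()
print(mask.shape, mask[150, 150], mask[0, 0])
# The array can then be passed as WordCloud(mask=mask, background_color="white").
```

With this array in place of alice_mask, the words should fill a disc instead of the Alice silhouette.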
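Chinese word cloud
simsun.ttf is a Chinese font file (SimSun), not a language pack. WordCloud's default font has no Chinese glyphs, so place simsun.ttf in the same directory as the script and pass it via font_path.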
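import jieba
import matplotlib.pyplot as plt
from wordcloud import WordCloud

# Raw string so the backslashes in the Windows path are taken literally.
# Pass encoding=... to open() if the file is not in the platform default encoding.
with open(r'D:\python\CompatingD\Cloud_map\Chinese.txt') as f:
    text = f.read()

# Chinese text has no spaces between words, so segment it with jieba
# and re-join the segments with spaces for WordCloud to split on.
text = ' '.join(jieba.cut(text))

# font_path must point to a font that contains Chinese glyphs.
wordcloud = WordCloud(font_path="simsun.ttf").generate(text)

plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.show()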
Note: for the Chinese text, please translate the article yourself.