1. 程式人生 > >愛麗絲夢遊仙境---python雲圖

愛麗絲夢遊仙境---python雲圖

目錄

WordCloud功能

文章和底片來源

無底片雲圖

 有底片雲圖

 中文雲圖


WordCloud功能

(1) 文字預處理

(2) 詞頻統計

(3) 將高頻詞以圖片形式進行彩色渲染

文章和底片來源

https://github.com/amueller/word_cloud/tree/master/examples

無底片雲圖

from os import path
from PIL import Image
import numpy as np
import matplotlib.pyplot as plt
from wordcloud import WordCloud, STOPWORDS

d = path.dirname(__file__)
text = open(path.join(d, 'D:\python\CompatingD\Cloud_map\\1.txt')).read()
wordcloud = WordCloud().generate(text)
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.show()

 

 有底片雲圖

from os import path
from PIL import Image
import numpy as np
import matplotlib.pyplot as plt
from wordcloud import WordCloud, STOPWORDS


d = path.dirname(__file__)
text = open(path.join(d, 'D:\python\CompatingD\Cloud_map\\alice.txt')).read()

alice_mask = np.array(Image.open(path.join(d, "D:\python\CompatingD\Cloud_map\\alice_mask.png")))

stopwords = set(STOPWORDS)
stopwords.add("said")

wc = WordCloud(background_color="white", max_words=2000, mask=alice_mask, stopwords=stopwords)

wc.generate(text)

wc.to_file(path.join(d, "alice.png"))

plt.imshow(wc, interpolation='bilinear')
plt.axis("off")
plt.figure()
plt.imshow(alice_mask, cmap=plt.cm.gray, interpolation='bilinear')
plt.axis("off")
plt.show()

愛麗絲!!!

 中文雲圖

 simsun.ttf是中文語言包,放到同一目錄下即可

from os import path
from PIL import Image
import numpy as np
import matplotlib.pyplot as plt
from wordcloud import WordCloud, STOPWORDS
import jieba

file = open('D:\python\CompatingD\Cloud_map\Chinese.txt')
text = file.read()
text = ' '.join(jieba.cut(text))

wordcloud = WordCloud(font_path="simsun.ttf").generate(text)
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.show()

 ****中文文章請自行翻譯