基於共現提取人民的民義人物關係

阿新 • • 發佈：2019-01-08

“`

-- coding: utf-8 --

import os,sys
import re
import jieba,codecs,math
import jieba.posseg as pseg
import string
from zhon.hanzi import punctuation

names = {} # 姓名字典，字典的鍵為人物名稱，值為該人物在全文中出現的次數
relationships = {} # 關係字典，人物關係的有向邊，該字典的鍵為有向邊的起點，值為一個字典edge，
# edge的鍵是有向邊的終點，值是有向邊的權值，代表兩個人物之間聯絡的緊密程度
lineNames = [] # 每集內人物關係，儲存對每一段分詞得到當前集中出現的人物名稱，lineName[i]是一個列表，列表中儲存第i集中出現過的人物。

jieba.load_userdict(“dict.txt”) # 載入字典
with open(“introduction.txt”,”r”) as f:
for line in f.readlines():
line = line.decode(‘GB2312’)
line = line.encode(‘utf-8’)
line = re.sub(ur”[%s]+” % punctuation, “”, line.decode(“utf-8”)) # 去標點
poss = pseg.cut(line)
lineNames.append([])
for w in poss:
if w.flag != “nr” or len(w.word)<2:
continue # 當分詞長度小於2或該詞詞性不為nr時認為該詞不為人名
lineNames[-1].append(w.word) # 為當前段的環境增加一個人物
if names.get(w.word) is None:
names[w.word] = 0
relationships[w.word] = {}
names[w.word] += 1 # 該人物出現次數加 1
for name, times in names.items():
print name,times

for line in lineNames: # 對於每一段
for name1 in line:
for name2 in line: # 每段中的任意兩個人
if name1 == name2:
continue
if relationships[name1].get(name2) is None: # 若兩人尚未同時出現則新建項
relationships[name1][name2]= 1
else:
relationships[name1][name2] = relationships[name1][name2]+ 1

with codecs.open(“node.txt”, “w”, “gbk”) as f:
f.write(“Id Label Weight\r\n”)
for name, times in names.items():
f.write(name + ” ” + name + ” ” + str(times) + “\r\n”)

with codecs.open(“edge.txt”, “w”, “gbk”) as f:
f.write(“Source Target Weight\r\n”)
for name, edges in relationships.items():
for v, w in edges.items():
if w > 3:
f.write(name + ” ” + v + ” ” + str(w) + “\r\n”)“`

基於共現提取人民的民義人物關係

-- coding: utf-8 --

基於共現提取人民的民義人物關係

基於共現發現人物關系的python實現

python基於共現的《紅樓夢》人物關係圖

Gephi例項教程----手把手實現基於共現關係的劇本分析

aaYn各極與質屬將民義什西此給界電特

[原始碼和文件分享]基於Python的Django框架實現的人物資訊檢索系統

基於顏色特徵提取

ECharts--基於力導向佈局圖功能更完善的人物關係圖外掛擴充套件-增加橫縱滾動條

用Python+Gephi畫《人民的名義》人物關係圖

MapReduce 基礎演算法【單詞共現演算法】

基於matlab邊緣提取的幾種方法的比較

OpenCV Using Python——基於SURF特徵提取和金字塔LK光流法的單目視覺三維重建

使用MapReduce實現pairs演算法實現單詞的共現矩陣

如何用VOSviewer分析CNKI關鍵詞共現？

Python實現《人民的名義》關係視覺化

中文人物關係圖譜構建與應用專案(人物關係抽取,關係抽取評測)

基於cytoscape.js 、 d3.js實現的關係圖譜初級版本

MongoDB一個基於分散式檔案儲存的資料庫（介於關係資料庫和非關係資料庫之間的資料庫）

【D3.js】力導向佈局 + 圓形圖片展示的人物關係

echart 人物關係圖新增照片

基於共現提取人民的民義人物關係

-- coding: utf-8 --

相關推薦