部分程式碼3

阿新 • • 發佈：2018-11-05

#!/usr/bin/env python
#-*- coding:utf-8 -*-
#author: Enoch time:2018/10/30 0030

import re
import time
from collections import Counter
import os
import sys
import cProfile

###################################################################################
#Name:count_words
#Inputs:file name,the first n words, stopfile name
#outputs:None
#Author: Thomas
#Date:2018.10.22
###################################################################################
def CountPhrases(file_name,k):

    totalNum = 0

    t0 = time.clock()
    with open(file_name) as f:
        txt = f.read()
    txt = txt.lower()
    txt = re.sub(r'\s+',' ',txt)
    pword = r'(([a-z]+ )+[a-z]+)'  # extract sentence
    pattern = re.compile(pword)
    sentence = pattern.findall(txt)
    txt = ','.join([sentence[m][0] for m in range(len(sentence))])

    pattern = "[a-z]+[0-9]*"
    for i in range(k-1):
        pattern += "[\s|,][a-z]+[0-9]*"
    wordList = []
    for i in range(k):
        if( i == 0 ):
            tempList = re.findall(pattern, txt)
        else:
            wordpattern = "[a-z]+[0-9]*"
            txt = re.sub(wordpattern, '', txt, 1).strip()
            tempList = re.findall(pattern, txt)
        wordList += tempList
    tempc = Counter(wordList)

    dicNum = {}
    phrases = tempc.keys()
    for phrase in phrases:
        if (',' not in phrase):
            dicNum[phrase] = tempc[phrase]
            totalNum += tempc[phrase]
    dicNum = sorted(dicNum.items(), key=lambda k: k[0])
    dicNum = sorted(dicNum, key=lambda k: k[1], reverse=True)
    t1 = time.clock()

    for letter, fre in dicNum[:2]:
        print("|\t{:15}|{:<11.2%}|".format(letter, fre / totalNum))
    print(t1 - t0)


CountPhrases('../gone_with_the_wind.txt', 2)

部分程式碼3

原文連結 #!/usr/bin/env python #-*- coding:utf-8 -*- #author: Enoch time:2018/10/30 0030 import re import time from collections import Counter impo

求（3+開根5） N次方的整數部分最後3位

\n using include none scan alt tdi typedef fine 求（3+開根5） N次方的整數部分最後3位，請補足前導零。分析：首先（1）=（3+開根5） N次方的展開為 an + bn * 根號5 的形式。同時也有（2）=

GPIO模擬SPI介面程式碼(3線8位)

http://blog.csdn.net/sanchuyayun/article/details/48394381 關於SPI，不同的晶片具體通訊方式可能會不大一樣，所以要具體問題具體分析，下面是最近做LCD時碰到的兩個模擬SPI協議的程式碼，晶片通訊方式不同，程式碼也就不同了

Java基礎部分（3）

Java中的常用類2 　　集合陣列與集合的區別：　　1、陣列長度固定，集合長度可變。　　2、陣列可以儲存基本資料型別，集合只能儲存物件。集合類的結構圖以及相關特點： Collection　　|--List 有序,可重複　　　　|--ArrayList　　　　　　

部分程式碼4

#!/usr/bin/env python #-- coding:utf-8 -- #author: Enoch time:2018/10/30 0030 import re import time from collections import Counter ##########

部分程式碼2

原文連結https://blog.csdn.net/qq_36097393/article/details/83574269 import re import time from collections import Counter t0 = time.clock() #!/usr/

部分程式碼1

原文的一部分原文 #!/usr/bin/env python #-*- coding:utf-8 -*- #author: Enoch time:2018/10/30 0030 import re import time from collections import Counter

專案程式碼3

一.使用者登陸(校驗驗證碼：錯誤的驗證碼) 1 package com.itheima.bos.web.action; 2 3 import org.apache.commons.lang3.StringUtils; 4 import org.apache.struts2.ServletAc

RocksDB Persistent Read Cache部分程式碼分析

【github Wiki搜尋Persistent Read cache詳細描述】第二種優勢:直接拿走不影響使用設計時不考慮針對某種裝置三個主要部分： Block Lookup Index：maps a given LSM block address to a cac

CTF-web 第三部分程式碼審計

http://www.mxcz.net/tools/rot13.aspx rot-13加密解密 http://www.zjslove.com/3.decode/ 凱撒當鋪倒敘維吉尼亞密碼實際上就是閱讀有關的校驗程式碼，人為構造特殊的輸入或者引數才能拿到flag。需要了解一般的變數

pytorch 訓練資料以及測試全部程式碼(3)

if epoch % p['epoch_size'] == p['epoch_size'] - 1: lr_ = utils.lr_poly(base_lr=p['lr'], iter_=epoch, max_iter=nEpochs, power=0.9) print('(poly

《2.uboot和系統移植-第3部分-2.3.零距離初體驗uboot》

《2.uboot和系統移植-第3部分-2.3.零距離初體驗uboot》第一部分、章節目錄 2.3.1_2.X210官方uboot配置編譯實踐 2.3.3.uboot的原始碼目錄分析1 2.3.4.uboot的原始碼目錄分析2 2.3.5.uboot的原始碼目錄分析3 2.3.6.Sou

《5.linux驅動開發-第3部分-5.3.字元裝置驅動高階》

《5.linux驅動開發-第3部分-5.3.字元裝置驅動高階》第一部分、章節目錄 5.3.1.註冊字元裝置驅動新介面1 5.3.2.註冊字元裝置驅動新介面2 5.3.3.註冊字元裝置驅動新介面3 5.3.4.註冊字元裝置驅動新介面4 5.3.5.字元裝置驅動註冊程式碼分析1 5.3.6

web前端學習（三）css學習筆記部分（3）-- css常用操作

lan web pre 常用 meta gin 對齊 span web前端 5. CSS常用操作 5.1 對齊　　使用margin屬性進行水平對齊 <!DOCTYPE html> <html lang="en"> <head>

對AssetBundleBulit部分程式碼的個人理解

[MenuItem("Tools/AssetsBoundle/SelectBundle")] //MenuItem是在unity的工具欄中建立一個新的選單欄Tools->AssetBundle->SelectBundle public stati

提交訂單效能優化系列之009-對比整個方法同步與方法中的部分程式碼同步

概括總結在用到synchronized關鍵字的時候，憑直覺就會加在方法上，比如public static synchronized void test(){}，但是這種直覺不見得是對的，估計大部分時候是出圖方便，想偷懶，才直接加到方法上的。推薦的做法是：僅僅

C#萬能工具類部分程式碼

C#萬能工具類 class Utils { //獲取路徑 public static string GetImagePath() { string personImgPath = Path.GetDirect

易語言獲取TeamViewerID密碼部分程式碼

.版本 2 .區域性變數 hwnd, 整數型 .區域性變數 HwndEx, 整數型 .區域性變數 msg, 文字型 .區域性變數陣列, 文字型, , "0" .區域性變數陣列2, 文字型, , "0" .區域性變數二級視窗控制代碼, 整數型 .區域性變數三級視窗

多語言在企業級應用中的實現思路和部分程式碼

需要多語言的地方標題介面欄位資訊提示資訊下拉框資訊選單資訊查詢資訊需要用到的表詞條表 C_lang 元素對映表 C_ui_lable 語言包 C_use_lang 語言資訊 C_lang_temp 詞條表裡是存的是你的系統的原本語言和“多語言

測試層，判斷部分程式碼是否正確

package com.wh.test; import org.junit.Test; import com.wh.entity.User; import com.wh.servie.LoginService; import com.wh.servie.LoginServiceImpl;

部分程式碼3

相關推薦