處理任意格式的文本文件

阿新 • • 發佈：2018-07-08

end 不存在 readlines 創建去除 get 字典更多數據

# 內存--硬盤內容  序列話
# 手動擋
# f = open(‘文件名或類似文件的東西‘， ‘文件打卡模式‘)
# f是文件對象或指針，用來進行讀寫操作
# f.close()

# 三種模式：
# w. write 寫
# r read 讀
# a append  追加內容

import os 
%pwd

‘C:\\study\\jupyter‘

f = open(‘coop.txt‘, ‘w‘)  # 用w的方式打開文件，不存在則創建
f.write(‘coop‘ * 7)  # 向文件寫入字符串
f.close()

with open(‘coop-1.txt‘, ‘w‘) as f:
    f.write(‘coop‘ * 7)

with open(‘coop.txt‘) as f: # 文件名後面的r是默認模式
    data = f.read()# 讀出所有內容，保存到一個變量
    print(data)

coopcoopcoopcoopcoopcoopcoop

# 在打開文件時要考慮此文件是否存在，使用try except

with open(‘coop.txt‘, ‘a‘) as f:
    f.write(‘coop1\n‘)
    f.write(‘coop2\n‘)
    f.write(‘\n111‘)
    f.write(‘\n222‘)

with open(‘coop.txt‘) as f:  
    print(f.readline())  # 每次讀一行
    print(f.readline())
    print(f.readline())
    print(f.readline())

coopcoopcoopcoopcoopcoopcoopcoop

coop

coop1

coop2

with open(‘coop.txt‘) as f:  # 當文件不是很大時用readlines
    print(f.readlines())   # 如何去掉\n

[‘coopcoopcoopcoopcoopcoopcoopcoop\n‘, ‘coop\n‘, ‘coop1\n‘, ‘coop2\n‘, ‘\n‘, ‘111\n‘, ‘222‘]

with open(‘coop.txt‘) as f:  
    print(f.tell())  # tell()告訴我們光標現在的位置(列的位置)
    print(f.readline())  # 每次讀一行
    print(f.tell())
    print(f.readline())
    print(f.tell())
    print(f.seek(0))  # seek（0）讓光標返回到初始0的位置
    print(f.readline())
    print(f.readline())
    f.seek(5)

    print(f.readline())
    print(f.tell())

0
coopcoopcoopcoopcoopcoopcoopcoop

34
coop

40
0
coopcoopcoopcoopcoopcoopcoopcoop

coop

oopcoopcoopcoopcoopcoopcoop

34

f = open(‘coop.txt‘, ‘a‘)
f.write(‘append\n‘)
# print(f.readlines())

with open(‘coop.txt‘,) as f:
    data = f.read()
    print(data)

coopcoopcoopcoopcoopcoopcoopcoop
coop
coop1
coop2

111
222appendappendappendappendappendappendappend
append
append
append
append
append

##############

# 匹配相應後綴名的文件
import fnmatch
for f in os.listdir(‘.‘):
    if fnmatch.fnmatch(f, ‘*.txt‘):
        print(f)
    elif fnmatch.fnmatch(f, ‘*.pdf)‘):
        print(‘find pdf‘, f)

coop-1.txt
coop.txt

# 匹配相應後綴名的文件
import fnmatch
for f in os.listdir(‘.‘):
    if fnmatch.fnmatch(f, ‘?+.txt‘):  # 正則？，一個字符
        print(f)
    elif fnmatch.fnmatch(f, ‘?.pdf)‘):
        print(‘find pdf‘, f)

#################

import fnmatch
for f in os.listdir(‘.‘):
    if fnmatch.fnmatch(f, ‘\w+.txt‘):  # 正則？，一個字符
        print(f)
    elif fnmatch.fnmatch(f, ‘?.pdf)‘):
        print(‘find pdf‘, f)

# 單純匹配某種命名規則的文件
import glob
for f in glob.glob(‘[0-9].txt‘):
    print(f)

0.txt
1.txt

import glob
for f in glob.glob(‘[0-9]+.txt‘):  # 不可以加+號，已匹配更多字符
    print(f)

############################

# 序列化 picle ，持久化， 存盤
# 後綴名隨意，推薦使用pkl
# 存儲python的數據結構
name_list = [‘coop‘, ‘fang‘, ‘beijing‘]
data = {‘name‘:name_list, ‘age‘:(2,3,4)}

import pickle
with open(‘data.pkl‘, ‘wb‘) as f: # 使用wb，通用二進制存儲
    pickle.dump(data, f)

with open(‘data.pkl‘, ‘rb‘) as f:
    data = pickle.load(f)
    print(data)

{‘name‘: [‘coop‘, ‘fang‘, ‘beijing‘], ‘age‘: (2, 3, 4)}

############################

# 虛擬文件，臨時文件，不需要真的存到磁盤
import io
output = io.StringIO()
output.write(‘the first code\n‘)
print(‘ddd‘, file=output)

# 去除內容
contents = output.getvalue()
print(contents)

#關閉文件，清理緩存
output.close()   # 打印順序為什麽是那個樣子

the first code
ddd

# 用類似字典的方式存儲任意的python對象  pickle存儲的是數據結構
import shelve
with shelve.open(‘coop.she‘) as so:
    so[‘coop‘] = ‘fang‘  # 生成三個文件

with shelve.open(‘coop.she‘) as so:
    print(so[‘coop‘])

fang

處理任意格式的文本文件

end 不存在 readlines 創建去除 get 字典更多數據 # 內存--硬盤內容序列話 # 手動擋 # f = open(‘文件名或類似文件的東西‘， ‘文件打卡模式‘) # f是文件對象或指針，用來進行讀寫操作 # f.close() # 三種模式：

處理文本文件及其排序去重

處理文本文件及其排序去重cat file1.txt file2.txt >file3.txtsort file3.txt | uniq >newfile.txtnewfile即為去重後的文件本文出自 “simeon技術專欄” 博客，請務必保留此出處http://simeon.blog.51cto.

將字典的按字符串格式寫入文本文件，刪除中括號

pre 文本文件 open with open value 刪除 user 去掉逗號 a={‘user2‘: [‘234567‘,‘8000‘,‘10000‘], ‘user1‘: [‘123456‘,‘12000‘,‘15000‘]}with open(‘shuru.t

任意一個英文的純文本文件，統計其中的單詞出現的個數（shell python 兩種語言實現）

統計文本英文單詞個數 python shell sort uniq 現有plain text titled test.txt，統計其中的單詞出現的個數。 test.txt的內容： i have have application someday oneday day demo i have some one c

C#處理文本文件TXT實例詳解

技術分享 otto 文件內容名字空間數組 draw mat strong 上傳本文實例講述了C#處理文本文件TXT的方法。分享給大家供大家參考。具體分析如下： 1. 如何讀取文本文件內容：這裏介紹的程序中，是把讀取的文本文件，用一個richTextBox組件顯

python文本文件處理和用戶輸入

abcd 內存執行模式自動 flush 打印一次變量 #用戶輸入 a = input(‘please input: ‘) #這個輸入什麽即是什麽，比如輸入1，則a變量=1，輸入‘abc‘，則a變量 = ‘abc‘，輸入abc則報錯，因為會把abc當做一個變量，而並

使用FileStream向txt格式的文本文件 "追加" 新內容並讀取

dom files res void 追加 ons director 字節數組讀取txt 原文:使用FileStream向txt格式的文本文件 "追加" 新內容並讀取 1 //得到文件路徑。 2 static string filePath

Python 批量處理特定格式文件

mat walk pytho div app append spa 調用 [1] 1 #批量對文件夾下的‘.mat‘進行處理 2 3 def file_name(file_dir,suff): 4 L=[] 5 for root, dirs, file

記一次800多萬XML文本文件預處理經歷

超過 random while 表達式 range utf-8 test 現在其他一.背景由於某些需求，現需對系統在最近幾個月生成的xml文件進行預處理，提取<text>標簽內的數據進行分析。這些需要預處理的數據大概有280GB左右880多萬，存放在gys

bat批處理以當前時間創建文本文件

class test code 文件表示當前設置變量例如 time :: 表示註釋 :: @表示不顯示當前命令，只在後臺執行 :: @echo off 表示以後執行的命令都不顯示 :: set d=%date:~0,10% 表示設置變量d為當前年月日，默認

《文本文件的處理》單元測驗 1

輸出結果所有 tee命令 class 計數組成文件的翻譯 16進制使用more命令逐屏顯示文本文件時，使得顯示內容上滾一行而不是滾動一屏，應按下哪個鍵？回車 Linux中用來實現計數功能，比如：統計系統有多少個登錄用戶，實現計數功能的命令是：

Python 批處理文本文件、進行查找

描述 eba 位置匹配 exist 損壞冒號 back 獲取去年換了一部手機，老手機終於光榮退休了，但是裏面的便簽裏還存有很多文字記錄，這個手機還不能備份到雲，只能將每個便簽保留為一個個的文本文件，我想要把所有的文本文件歸到一個文本文件中，手動操作太麻煩了，剛好去年學

使用字符流讀取文本文件

緩沖區編碼一行 port sys adl () tostring println 1.字符輸入流Reader類　　Reader類是讀取字符流的抽象類，它提供了常用的方法。　　Reader rd= new FileReader("Test/xy.txt");//　 i

字節輸入流寫文本文件【OutputStream、FileOutputStream】

byte[] 方法名 cell end borde 方法 oid 所有寫入文件字節輸入流寫文本文件 1.OutputStream基類作用：把內存中的數據輸出到文件中。 ※OutputStream類的常用方法方法名稱說明

C語言之文件操作06——寫數據到文本文件遇0停止

語言 text null white ont .net main fopen scan //文件 /* =============================================================== 題目：輸入10個籃球運動員的

每天一個liunx命令3之awk實現文本文件的抓取

logs -h 名稱 name $0 rep ray 表達式指定 =============================================================================grep -h -s -E ‘HUAWEI_9000

mysql的load data,高速將文本文件,插入數據庫中

option 子句取數據跳過 expr 數據導入文件名所在 from 1語法 LOAD DATA [ LOW_PRIORITY | CONCURRENT ] [ LOCAL ] INFILE ‘file_name.txt‘ [ REPLACE | IGNORE

奪命雷公狗---linux NO:11 linux的文本文件查看命令

而已分享 cat .cn blog mage 技術 bsp passwd cat 命令使用方法如下：他就會將裏面的內容都給打印出來了。。。這幾個命令其實並沒有什麽卵用，不過知道有他的存在即可。。。。不過都是針對 etc/passwd 文件來進行查看

20161212xlVBA文本文件多列合並

多列 workbook msgbox time minus 清理 number iter 設置 Sub NextSeven_CodeFrame() ‘應用程序設置 Application.ScreenUpdating = False Application

合並多個文本文件方法

font 使用技術分享 sco 簡單 cte 衍生 strong analyst 原創作品，出自 “深藍的blog” 博客，深藍的blog：http://blog.csdn.net/huangyanlong/article/details/47055589 把多

處理任意格式的文本文件

相關推薦