【Fine學習筆記】python 文件l操做方法整理

python腳本能夠對excel進行建立、讀、寫、保存成指定文件名,保存到指定路徑的操做。整理了如下處理方法:
 
首先區別幾個操做方式:
"r" 以讀方式打開,只能讀文件 , 若是文件不存在,會發生異常
"w" 以寫方式打開,只能寫文件, 若是文件不存在,建立該文件,若是文件已存在,先清空,再打開文件
 
"rb" 以二進制讀方式打開,只能讀文件 , 若是文件不存在,會發生異常
"wb" 以二進制寫方式打開,只能寫文件, 若是文件不存在,建立該文件,若是文件已存在,先清空,再打開文件
 
"rt" 以文本讀方式打開,只能讀文件 , 若是文件不存在,會發生異常
"wt" 以文本寫方式打開,只能寫文件, 若是文件不存在,建立該文件,若是文件已存在,先清空,再打開文件
 
"rb+" 以二進制讀方式打開,能夠讀、寫文件 , 若是文件不存在,會發生異常
"wb+" 以二進制寫方式打開,能夠讀、寫文件, 若是文件不存在,建立該文件,若是文件已存在,先清空,再打開文件
 
***************************************************************************************
 
※對excel文件的操做(excel相關的類庫:xlwt,xlrd)

主要有三種方法:html

1> xlrd, xlwt, xlutils: 這三個庫的好處是不須要其它支持,在任何操做系統上均可以使用。xlrd 能夠讀取 .xls, .xlsx 文件,很是好用;但由於 xlwt 不能直接修改 Excel 文檔,必須得複製一份而後另存爲其它文件,並且聽說寫複雜格式的 Excel 文件會出現問題。python

2> openpyxl: 這個庫也是不須要其它支持的,並且聽說對 Office 2007 格式支持得更好。加載 Excel 文件的效率比 xlrd 慢 3 倍以上,內存使用在 10 倍以上。性能

3> win32com: Python Win32 擴展,這個庫須要運行環境爲 Windows+Office 對應版本。因爲 Python Win32 擴展只是把 COM 接口包裝了一下,能夠視爲與 VBA 徹底相同,不會有讀寫格式上的問題。 win32com 讀取 Excel 文件,效率仍是比 xlrd 慢些。字體

新建一個excel文件

file = xlwt.Workbook() #注意這裏的Workbook首字母是大寫google

新建一個sheet操作系統

table = file.add_sheet('sheet name').net

寫入數據table.write(行,列,value)excel

table.write(0,0,'test')code

注意:若是對一個單元格重複操做,會引起orm

         returns error:# Exception: Attempt to overwrite cell:# sheetname=u'sheet 1' rowx=0 colx=0

解決辦法:因此在打開時加cell_overwrite_ok=True

   table = file.add_sheet('sheet name',cell_overwrite_ok=True )

保存文件

file.save('demo.xls')

 

使用style能夠修改樣式

style = xlwt.XFStyle() # 初始化樣式

font = xlwt.Font() #爲樣式建立字體

font.name = 'Times New Roman'

font.bold = True

style.font = font #爲樣式設置字體

table.write(0, 0, 'some bold Times text', style) # 使用樣式

xlwt 容許單元格或者整行地設置格式。還能夠添加連接以及公式。

dates.py, 展現如何設置不一樣的數據格式

hyperlinks.py, 展現如何建立超連接 

merged.py, 展現如何合併格子

row_styles.py, 展現如何應用Style到整行格子中.

================================================================
對txt文件的操做
 
※讀文件
讀文本文件
input = open('data', 'r')
#第二個參數默認爲r
input = open('data')
 
讀二進制文件
input = open('data', 'rb')
 
讀取全部內容
file_object = open('thefile.txt')
try:
all_the_text = file_object.read( )
finally:
file_object.close( )
 
讀固定字節
file_object = open('abinfile', 'rb')
try:
while True:
chunk = file_object.read(100)
if not chunk:
break
do_something_with(chunk)
finally:
file_object.close( )
 
讀每行
list_of_all_the_lines = file_object.readlines( )
 
若是文件是文本文件,還能夠直接遍歷文件對象獲取每行:
for line in file_object:
process line
 
================================================================
 
 ※寫文件
寫文本文件
output = open('data', 'w')
 
寫二進制文件
output = open('data', 'wb')
 
追加寫文件
output = open('data', 'w+')
 
寫數據並保存
file_object = open('thefile.txt', 'w')
file_object.write(all_the_text)
file_object.close( )
 
寫入多行
file_object.writelines(list_of_text_strings)
 
======================================================================
 
代碼示例:
 
f = open(filename,'w') #打開文件open()是file()的別名
f.write(context) #把字符串寫入文件
f.close() #關閉文件
 
!writelines()和write()都能解決一次性寫入的問題,區別在於性能,writelines()性能高於write()。
 
#經過循環訪問readlines()返回列表中的元素
f = open(filename)
try:
lines = f.readlines()
for line in lines:
print line,
finally:
f.close()
 
從文件中讀取全部內容,將內容賦值到content
f = open(filename)
try:
content = f.read()
finally:
f.close()
print content
 
 
# 最簡單的例子
import xlwt
workbook = xlwt.Workbook(encoding = 'ascii')
worksheet = workbook.add_sheet('My Worksheet')
worksheet.write(0, 0, label = 'Row 0, Column 0 Value')
workbook.save('Excel_Workbook.xls')
 
# 格式化cell的font
font = xlwt.Font() # Create the Font
font.name = 'Times New Roman'
font.bold = True
font.underline = True
font.italic = True
style = xlwt.XFStyle() # Create the Style
style.font = font # Apply the Font to the Style
worksheet.write(0, 0, label = 'Unformatted value')
worksheet.write(1, 0, label = 'Formatted value', style) # Apply the Style to the Cell
 
 
# Font對象的屬性
font.bold = True # May be: True, False
font.italic = True # May be: True, False
font.struck_out = True # May be: True, False
font.underline = xlwt.Font.UNDERLINE_SINGLE # May be: UNDERLINE_NONE, UNDERLINE_SINGLE, UNDERLINE_SINGLE_ACC, UNDERLINE_DOUBLE, UNDERLINE_DOUBLE_ACC
font.escapement = xlwt.Font.ESCAPEMENT_SUPERSCRIPT # May be: ESCAPEMENT_NONE, ESCAPEMENT_SUPERSCRIPT, ESCAPEMENT_SUBSCRIPT
font.family = xlwt.Font.FAMILY_ROMAN # May be: FAMILY_NONE, FAMILY_ROMAN, FAMILY_SWISS, FAMILY_MODERN, FAMILY_SCRIPT, FAMILY_DECORATIVE
font.charset = xlwt.Font.CHARSET_ANSI_LATIN # May be: CHARSET_ANSI_LATIN, CHARSET_SYS_DEFAULT, CHARSET_SYMBOL, CHARSET_APPLE_ROMAN, CHARSET_ANSI_JAP_SHIFT_JIS, CHARSET_ANSI_KOR_HANGUL, CHARSET_ANSI_KOR_JOHAB, CHARSET_ANSI_CHINESE_GBK, CHARSET_ANSI_CHINESE_BIG5, CHARSET_ANSI_GREEK, CHARSET_ANSI_TURKISH, CHARSET_ANSI_VIETNAMESE, CHARSET_ANSI_HEBREW, CHARSET_ANSI_ARABIC, CHARSET_ANSI_BALTIC, CHARSET_ANSI_CYRILLIC, CHARSET_ANSI_THAI, CHARSET_ANSI_LATIN_II, CHARSET_OEM_LATIN_I
font.colour_index = 2 # 0:black, 1: white, 2: red, 3:light green, 4:blue
font.get_biff_record = ?
font.height = 0x00C8 # C8 in Hex (in decimal) = 10 points in height.
font.name = ?
font.outline = ?
font.shadow = ?
 
 
# 設置cell的寬度
worksheet.write(0, 0, 'My Cell Contents')
worksheet.col(0).width = 3333 # 3333 = 1" (one inch).
 
# 向cell添加一個日期
style = xlwt.XFStyle()
style.num_format_str = 'M/D/YY' # Other options: D-MMM-YY, D-MMM, MMM-YY, h:mm, h:mm:ss, h:mm, h:mm:ss, M/D/YY h:mm, mm:ss, [h]:mm:ss, mm:ss.0
worksheet.write(0, 0, datetime.datetime.now(), style)
 
# 向cell添加一個Formula
worksheet.write(0, 0, 5) # Outputs 5
worksheet.write(0, 1, 2) # Outputs 2
worksheet.write(1, 0, xlwt.Formula('A1*B1')) # Should output "10" (A1[5] * A2[2])
worksheet.write(1, 1, xlwt.Formula('SUM(A1,B1)')) # Should output "7" (A1[5] + A2[2])
 
# 向cell添加一個Hyperlink
worksheet.write(0, 0, xlwt.Formula('HYPERLINK("http://www.google.com";"Google")')) # Outputs the text "Google" linking to http://www.google.com
 
# 合併行列
worksheet.write_merge(0, 0, 0, 3, 'First Merge') # Merges row 0's columns 0 through 3.
font = xlwt.Font() # Create Font
font.bold = True # Set font to Bold
style = xlwt.XFStyle() # Create Style
style.font = font # Add Bold Font to Style
worksheet.write_merge(1, 2, 0, 3, 'Second Merge', style) # Merges row 1 through 2's columns 0 through 3.
 
# 設置cell內部定位
alignment = xlwt.Alignment() # Create Alignment
alignment.horz = xlwt.Alignment.HORZ_CENTER # May be: HORZ_GENERAL, HORZ_LEFT, HORZ_CENTER, HORZ_RIGHT, HORZ_FILLED, HORZ_JUSTIFIED, HORZ_CENTER_ACROSS_SEL, HORZ_DISTRIBUTED
alignment.vert = xlwt.Alignment.VERT_CENTER # May be: VERT_TOP, VERT_CENTER, VERT_BOTTOM, VERT_JUSTIFIED, VERT_DISTRIBUTED
style = xlwt.XFStyle() # Create Style
style.alignment = alignment # Add Alignment to Style
worksheet.write(0, 0, 'Cell Contents', style)
 
# 添加cell的邊框
# Please note: While I was able to find these constants within the source code, on my system (using LibreOffice,) I was only presented with a solid line, varying from thin to thick; no dotted or dashed lines.
borders = xlwt.Borders() # Create Borders
borders.left = xlwt.Borders.DASHED # May be: NO_LINE, THIN, MEDIUM, DASHED, DOTTED, THICK, DOUBLE, HAIR, MEDIUM_DASHED, THIN_DASH_DOTTED, MEDIUM_DASH_DOTTED, THIN_DASH_DOT_DOTTED, MEDIUM_DASH_DOT_DOTTED, SLANTED_MEDIUM_DASH_DOTTED, or 0x00 through 0x0D.
borders.right = xlwt.Borders.DASHED
borders.top = xlwt.Borders.DASHED
borders.bottom = xlwt.Borders.DASHED
borders.left_colour = 0x40
borders.right_colour = 0x40
borders.top_colour = 0x40
borders.bottom_colour = 0x40
style = xlwt.XFStyle() # Create Style
style.borders = borders # Add Borders to Style
worksheet.write(0, 0, 'Cell Contents', style)
 
# 設置cell的背景顏色
pattern = xlwt.Pattern() # Create the Pattern
pattern.pattern = xlwt.Pattern.SOLID_PATTERN # May be: NO_PATTERN, SOLID_PATTERN, or 0x00 through 0x12
pattern.pattern_fore_colour = 5 # May be: 8 through 63. 0 = Black, 1 = White, 2 = Red, 3 = Green, 4 = Blue, 5 = Yellow, 6 = Magenta, 7 = Cyan, 16 = Maroon, 17 = Dark Green, 18 = Dark Blue, 19 = Dark Yellow , almost brown), 20 = Dark Magenta, 21 = Teal, 22 = Light Gray, 23 = Dark Gray, the list goes on...
style = xlwt.XFStyle() # Create the Pattern
style.pattern = pattern # Add Pattern to Style
worksheet.write(0, 0, 'Cell Contents', style)
相關文章
相關標籤/搜索