### python導入csv文件的三種方法 ```python #原始的方式 lines = [line.split(',') for line in open('iris.csv')] df = [[float(x) for x in line[:4]] for line in lines[1:]] #使用numpy包 import numpy as np lines = np.loadtxt('iris.csv',delimiter=',',dtype='str') df = lines[1:,:4].astype('float') #使用pandas包 import pandas as pd df = pd.read_csv('iris.csv') df=df.ix[:,:4]
這三種方法中最後一種最簡單,不過花費時間比較長一點,第一種最麻煩,不過用時最短。這個能夠經過ipython中的magic函數%%timeit
來看。python
數據:app
將數據轉爲DataFrame格式,代碼以下:函數
def read_file(path,is_test=False): fp = open(path,encoding='utf-8') dataset = [] for line in fp.readlines(): line = line.strip().split('\t') if is_test: line.append('-1') dataset.append(line) data = pd.DataFrame(dataset) data.columns = ['prefix','query_prediction','title','tag','label'] return data
參考文獻:spa
【1】numpy.loadtxt()code