clob字段是oracle專門用於存儲超長字符串的字段類型,通常varchar2只能存4000個字符串,超過4000個就存不下去了。
若是直接使用 pandas.read_sql(sql, conn)會報錯,那麼如何將clob讀取到python中呢? python
1.read方法
import cx_Oracle conn = cx_Oracle.connect("user/pwd@ip/db") cur = conn.cursor() # col 是clob字段 cur.execute("select col from table") pram=[] for i in cur: text = i[0].read() pram.appen(text) cur.close() conn.close()
2.使用DBMS_LOB.SUBSTR模塊
方法是先提取前2000個字符串,接着提取2000個字符串….,而後拼接起來。sql
import cx_Oracle import pandas as pd conn = cx_Oracle.connect("user/pwd@ip/db") # 這裏只提取前6000個字符串 sql1 = "select DBMS_LOB.SUBSTR(col,2000,1) as col1 as fzss from table" # 1-2000個字符串 sql2 = "select DBMS_LOB.SUBSTR(col,4000,2001) as col2 as fzss from table" # 2001-4000個字符串 sql3 = "select DBMS_LOB.SUBSTR(col,6000,4001) as col3 as fzss from table" # 2001-4000個字符串 # 讀取數據 df1 = pd.read_sql(sql1, conn) df2 = pd.read_sql(sql2, conn) df3 = pd.read_sql(sql3, conn) # 有些記錄不必定是很長的字符串,結果多是None,須要填充空字符串,不然下面的拼接會出錯 df1 = df1.fillna('') df2 = df2.fillna('') df3 = df3.fillna('') # 將它們拼接起來,放在tmp字段上 df1['TMP'] = df1.loc['COL1']+df2.loc['COL2']+df3.loc['COL3']
3.將字符串寫入clob字段
方法和將字符串導入varchar2字段是同樣的,不須要特殊設置。 好比:oracle
id='123' clob='a'*2**20 # 重複2的20次方次 param=[id, colb] sql = "insert into table (id,colb) values(:1, :2)" cursor.execute(sql, param) conn.commit()