Oracle 兩個逗號分割的字符串，獲取交集、差集的sql實現過程解析

時間 2019-11-06

標籤 oracle 兩個逗號分割字符串獲取交集 sql 實現過程解析欄目 Oracle 简体版

原文原文鏈接

Oracle數據庫的兩個字段值爲逗號分割的字符串，例如：字段A值爲「1,2,3,5」，字段B爲「2」。
想獲取兩個字段的交集（相同值）2，獲取兩個字段的差集（差別值）1,3,5。正則表達式

1、最終實現的sql語句sql

一、獲取交集（相同值）：數據庫

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1
intersect -- 取交集
select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '2' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1;
/*結果：
2
*/

二、獲取差集（差別值）：express

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1
minus --取差集
select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '2' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1;
/*結果：
1
3
5
*/

2、實現過程用到的函數用法說明函數

一、regexp_substr
正則表達式分割字符串，函數格式以下：spa

function regexp_substr(strstr, pattern [,position] [,occurrence] [,modifier] [subexpression])
__srcstr：須要進行正則處理的字符串
__pattern：進行匹配的正則表達式
__position：可選參數，表示起始位置，從第幾個字符開始正則表達式匹配（默認爲1）
__occurrence：可選參數，標識第幾個匹配組，默認爲1
__modifier：可選參數，表示模式（'i'不區分大小寫進行檢索；'c'區分大小寫進行檢索。默認爲'c'。）

使用例子：code

select 
regexp_substr('1,2,3,5','[^,]+') AS t1, 
regexp_substr('1,2,3,5','[^,]+',1,2) AS t2,
regexp_substr('1,2,3,5','[^,]+',1,3) AS t3,
regexp_substr('1,2,3,5','[^,]+',1,4) AS t4,
regexp_substr('1,2,3,5','[^,]+',2) AS t5,
regexp_substr('1,2,3,5','[^,]+',2,1) AS t6,
regexp_substr('1,2,3,5','[^,]+',2,2) AS t7
from dual; 
/*結果：
1    2    3    5    2    2    3
*/

二、regexp_replaceregexp

經過正則表達式來進行匹配替換，函數格式以下：blog

function regexp_substr(srcstr, pattern [,replacestr] [,position] [,occurrence] [,modifier])
__srcstr：須要進行正則處理的字符串
__pattern：進行匹配的正則表達式
__replacestr：可選參數，替換的字符串，默認爲空字符串
__position：可選參數，表示起始位置，從第幾個字符開始正則表達式匹配（默認爲1）
__occurrence：可選參數，標識第幾個匹配組，默認爲1
__modifier：可選參數，表示模式（'i'不區分大小寫進行檢索；'c'區分大小寫進行檢索。默認爲'c'。）

使用例子：字符串

select 
regexp_replace('1,2,3,5','5','4') t1,
regexp_replace('1,2,3,5','2|3',4) t2,
regexp_replace('1,2,3,5','[^,]+') t3,
regexp_replace('1,2,3,5','[^,]+','') t4,
regexp_replace('1,2,3,5','[^,]+','*') t5
from dual; 
/*結果：
1,2,3,4    1,4,4,5    ,,,    ,,,    *,*,*,*
*/

三、connect by
（1）connect by單獨用，返回多行結果

select rownum from dual connect by rownum < 5;
/*結果：
1
2
3
4
*/

（2）通常經過start with . . . connect by . . .子句來實現SQL的層次查詢

select 
id,
name,
sys_connect_by_path(id,'\') idpath,
sys_connect_by_path(name, '\') namepath
from (
select 1 id, '廣東' name, 0 pid from dual
union 
select 2 id, '廣州' name , 1 pid from dual
union 
select 3 id, '深圳' name , 1 pid from dual
) 
start with pid = 0
connect by prior id = pid;

/*結果：
1    廣東    \1    \廣東
2    廣州    \1\2    \廣東\廣州
3    深圳    \1\3    \廣東\深圳
*/

3、總結

由上面函數用法，可知下面語句能夠把字符串「1,2,3,5」轉換爲4行記錄

select regexp_substr(id, '[^,]+', 1, rownum) id
from (select '1,2,3,5' id from dual)
connect by rownum <= length(regexp_replace(id, '[^,]+')) +1

而後在2個結果中使用集合運算符（UNION/UNION ALL 並集，INTERSECT 交集，MINUS 差集）進行最終處理。

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。