Oracle FIRST/LAST Functions

Date: 2021-02-15


1. Syntax

Purpose

FIRST and LAST are very similar functions. Both are aggregate and analytic functions that operate on a set of values from a set of rows that rank as the FIRST or LAST with respect to a given sorting specification. If only one row ranks as FIRST or LAST, then the aggregate operates on the set with only one element.

If you omit the OVER clause, then the FIRST and LAST functions are treated as aggregate functions. You can use these functions as analytic functions by specifying the OVER clause. The query_partition_clause is the only part of the OVER clause valid with these functions. If you include the OVER clause but omit the query_partition_clause, then the function is treated as an analytic function, but the window defined for analysis is the entire table.

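In short: without the OVER clause, FIRST/LAST act as aggregate functions (see Example 1). With the OVER clause they act as analytic functions (see Example 2, which partitions by department); if OVER is present but query_partition_clause is omitted, the analysis window is the entire table.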

These functions take as an argument any numeric data type or any nonnumeric data type that can be implicitly converted to a numeric data type. The function returns the same data type as the numeric data type of the argument.

When you need a value from the first or last row of a sorted group, but the needed value is not the sort key, the FIRST and LAST functions eliminate the need for self-joins or views and enable better performance.

The aggregate_function argument is any one of the MIN, MAX, SUM, AVG, COUNT, VARIANCE, or STDDEV functions. It operates on values from the rows that rank either FIRST or LAST. If only one row ranks as FIRST or LAST, then the aggregate operates on a singleton (nonaggregate) set.

The KEEP keyword is for semantic clarity. It qualifies aggregate_function, indicating that only the FIRST or LAST values of aggregate_function will be returned.

DENSE_RANK FIRST or DENSE_RANK LAST indicates that Oracle Database will aggregate over only those rows with the minimum (FIRST) or the maximum (LAST) dense rank (also called olympic rank).

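2. Explanation

min(job_id) keep(dense_rank first order by count(job_id) desc) over(partition by department_id)

Meaning: for each department, find the job with the most employees.

min: suppose two jobs tie for the highest headcount in a department, for example job A with 3 people and job B with 3 people. min then returns A, and max correspondingly returns B. If you try a function such as AVG here, it fails with an "invalid number" error, since job_id is a character column. The aggregate's real purpose is to guarantee a single value when several rows tie. It is not completely meaningless, as some posts online claim: max and min can give different results.

keep: a keyword.

dense_rank: the ranking operation. I tried replacing it with row_number and it raised an error straight away.

over: the window the analytic function works over. If you omit over and everything after it, the whole result set is aggregated.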
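To make the roles of dense_rank first and min/max more concrete, here is a minimal Python sketch of the semantics described above. It is only a model of the behaviour, not how Oracle implements it; the helper name keep_dense_rank_first and the sample counts are invented for illustration.

```python
from itertools import groupby


def keep_dense_rank_first(rows, aggregate):
    """Model of: aggregate(job_id) keep (dense_rank first order by cnt desc)
    over (partition by department_id).

    rows is a list of (department_id, job_id, cnt) tuples, i.e. what
    "group by department_id, job_id" would produce.
    """
    result = {}
    by_department = sorted(rows, key=lambda r: r[0])
    for department_id, group in groupby(by_department, key=lambda r: r[0]):
        group = list(group)
        # "order by cnt desc" plus "dense_rank first": keep every row that
        # shares the top-ranked sort key, here the highest count.
        best = max(cnt for _, _, cnt in group)
        tied_jobs = [job for _, job, cnt in group if cnt == best]
        # The aggregate collapses the tied rows into a single value.
        result[department_id] = aggregate(tied_jobs)
    return result


# Hypothetical counts: department 20 has two jobs tied at one employee each.
rows = [
    (20, "MK_MAN", 1),
    (20, "MK_REP", 1),
    (30, "PU_CLERK", 5),
    (30, "PU_MAN", 1),
]

print(keep_dense_rank_first(rows, min))  # {20: 'MK_MAN', 30: 'PU_CLERK'}
print(keep_dense_rank_first(rows, max))  # {20: 'MK_REP', 30: 'PU_CLERK'}
```

Where a department has a single top job, min and max agree; they differ only when several jobs tie for first place.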

3. Examples

1. Example 1

select max(e.job_id) keep(dense_rank last order by count(job_id) desc),
      min(e.job_id) keep(dense_rank last order by count(job_id) desc),
      max(e.job_id) keep(dense_rank first order by count(job_id) desc),
      min(e.job_id) keep(dense_rank first order by count(job_id) desc)
 from employees e
 group by e.department_id, e.job_id;

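The result set is as follows:

SA_REP AC_ACCOUNT SA_REP SA_REP

Observation: the aggregation runs over the whole table, and it also confirms that max and min sometimes give different results.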

2. Example 2

select distinct
      department_id,
      --count(job_id),
      min(job_id) keep(dense_rank first order by count(job_id) desc)over(partition by department_id) job_id
 from employees
 group by department_id, job_id
 order by 1;

Analysis window: partitioned by department.

The result set is as follows:

DEPARTMENT_ID  JOB_ID
10             AD_ASST
20             MK_MAN
30             PU_CLERK
40             HR_REP
50             SH_CLERK
60             IT_PROG
70             PR_REP
80             SA_REP
90             AD_VP
100            FI_ACCOUNT
110            AC_ACCOUNT
(null)         SA_REP

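This returns the job with the most employees in each department. Note that the row with a null department ID is returned as well; this is the boss.

3. Comparing several ways of writing the query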

1. method 1

with t as
 (select department_id, job_id, count(job_id) cnt
   from employees
  group by department_id, job_id)
select department_id, max(job_id)  -- aggregate again to break ties
 from t
 where (department_id, cnt) in (select department_id, max(cnt) from t group by department_id)
 group by department_id
 order by 1;

DEPARTMENT_ID  MAX(JOB_ID)
10             AD_ASST
20             MK_REP
30             PU_CLERK
40             HR_REP
50             ST_CLERK
60             IT_PROG
70             PR_REP
80             SA_REP
90             AD_VP
100            FI_ACCOUNT
110            AC_MGR

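Summary: 1. The boss's "department", i.e. the null department, is not returned, because a null department_id never satisfies the IN comparison;

2. When two jobs tie for the highest headcount in a department, a second aggregation is unavoidable;

3. The code is rather verbose.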
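Point 1 follows from SQL's NULL rules: comparing department_id with NULL is never true, so a row with a null department cannot pass the IN test. The sketch below reproduces this with Python's built-in sqlite3 module rather than Oracle. That is an assumption made so the example is self-contained, and it needs an SQLite build with row-value support (3.15 or later). The table t and its three rows are invented.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# t stands in for the "with t as (...)" subquery of Method 1.
conn.execute("create table t (department_id integer, job_id text, cnt integer)")
conn.executemany(
    "insert into t values (?, ?, ?)",
    [
        (20, "MK_MAN", 1),
        (20, "MK_REP", 1),
        (None, "SA_REP", 1),  # employee with no department
    ],
)

rows = conn.execute(
    """
    select department_id, max(job_id)
      from t
     where (department_id, cnt) in
           (select department_id, max(cnt) from t group by department_id)
     group by department_id
    """
).fetchall()

# The null-department row is filtered out by the IN comparison.
print(rows)  # [(20, 'MK_REP')]
conn.close()
```

The subquery does produce a (NULL, 1) row, but NULL = NULL evaluates to unknown rather than true, so the outer row is never matched.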

2. method 2

select department_id, job_id
 from (select e.department_id,
               e.job_id,
               count(e.job_id),
               row_number() over(partition by department_id order by count(job_id) desc) rk
         from employees e
        group by e.department_id, e.job_id)
 where rk = 1;

DEPARTMENT_ID  JOB_ID
10             AD_ASST
20             MK_MAN
30             PU_CLERK
40             HR_REP
50             ST_CLERK
60             IT_PROG
70             PR_REP
80             SA_REP
90             AD_VP
100            FI_ACCOUNT
110            AC_ACCOUNT
(null)         SA_REP

Summary: 1. Rank the rows with row_number, then filter for row_number = 1 in an outer query;

2. The boss's row (null department) is included in the result.
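One caveat with this method: row_number has to number tied rows 1, 2, ... in some order, and with only count(job_id) desc in the order by, nothing pins down which tied row gets 1. That may be why department 50 comes back as ST_CLERK here but as SH_CLERK in Example 2. The Python sketch below models the effect; first_by_row_number and the two input orders are made up for illustration.

```python
def first_by_row_number(rows):
    """Model of: row_number() over (order by cnt desc), then "where rk = 1".

    rows is a list of (job_id, cnt) tuples for a single department.
    Python's sort is stable, so tied rows keep their input order; the input
    order stands in for whatever order the database happens to use.
    """
    ranked = sorted(rows, key=lambda r: r[1], reverse=True)
    return ranked[0][0]


# Modelled on a department where two jobs tie at 20 employees.
# OTHER_JOB is a placeholder name, not taken from the data.
one_order = [("SH_CLERK", 20), ("ST_CLERK", 20), ("OTHER_JOB", 5)]
other_order = [("ST_CLERK", 20), ("SH_CLERK", 20), ("OTHER_JOB", 5)]

print(first_by_row_number(one_order))    # SH_CLERK
print(first_by_row_number(other_order))  # ST_CLERK
```

Adding job_id as a second sort key in the order by would make the pick deterministic, much as min or max does inside keep.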

3. method 3

select /*distinct*/
      department_id,
      count(job_id),
      min(job_id) keep(dense_rank first order by count(job_id) desc)over(partition by department_id) job_id
 from employees
 group by department_id, job_id
 order by 1;

DEPARTMENT_ID  COUNT(JOB_ID)  JOB_ID
10             1              AD_ASST
20             1              MK_MAN
20             1              MK_MAN
30             5              PU_CLERK
30             1              PU_CLERK
40             1              HR_REP
50             20             SH_CLERK
50             20             SH_CLERK
50             5              SH_CLERK
60             5              IT_PROG
70             1              PR_REP
80             5              SA_REP
80             29             SA_REP
90             1              AD_VP
90             2              AD_VP
100            5              FI_ACCOUNT
100            1              FI_ACCOUNT
110            1              AC_ACCOUNT
110            1              AC_ACCOUNT
(null)         1              SA_REP

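The rows here are grouped by department_id and job_id, so each department repeats once per job. We only care about department_id and the resulting job_id, so the SQL can be adjusted as follows.

Tuned SQL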

select distinct
      department_id,
      --count(job_id),
      min(job_id) keep(dense_rank first order by count(job_id) desc)over(partition by department_id) job_id
 from employees
 group by department_id, job_id
 order by 1;

DEPARTMENT_ID  JOB_ID
10             AD_ASST
20             MK_MAN
30             PU_CLERK
40             HR_REP
50             SH_CLERK
60             IT_PROG
70             PR_REP
80             SA_REP
90             AD_VP
100            FI_ACCOUNT
110            AC_ACCOUNT
(null)         SA_REP