Oracle中number数据类型的存储机制

2023-01-13 12:52:34 浏览数 (1)

Oracle中number数据类型存储的是整型,碰巧看到这篇文章讲解了通过分析索引了解0和1的存储机制,值得学习一下。

P.S. https://www.modb.pro/db/605566

代码语言:javascript复制
create table t1 as select * from dba_objects;
insert into t1 select * from t1; --执行5次
commit;
create table t2 as select * from t1;
update t1 set object_name=null where object_id<7000;
commit;
update t2 set object_name=null where object_id<7000;
commit;
create index ind_t1_name on t1(object_name,0);
create index ind_t2_name on t2(object_name,1);
exec dbms_stats.gather_table_stats('MYTEST','T1',degree => 4,cascade => true,method_opt=>'for all columns size auto',estimate_percent=>100);  
exec dbms_stats.gather_table_stats('MYTEST','T2',degree => 4,cascade => true,method_opt=>'for all columns size auto',estimate_percent=>100);

通过检查索引大小及占用块数,我们发现,常数0的复合索引占用空间更小,

代码语言:javascript复制
--查看表和索引大小
col segment_name for a20
select segment_name,segment_type,bytes/1024/1024 from user_segments where segment_name in('T1','T2','IND_T1_NAME','IND_T2_NAME');


SEGMENT_NAME       SEGMENT_TYPE  BYTES/1024/1024
-------------------- ------------------ ---------------
IND_T1_NAME       INDEX          120
IND_T2_NAME       INDEX          128
T1         TABLE          352
T2         TABLE          352


col index_name for a20
col table_name for a10
col LEAF_BLOCKS for 99999999
col NUM_ROWS for 99999999
select index_name,TABLE_NAME,LEAF_BLOCKS,NUM_ROWS from user_indexes where index_name in ('IND_T1_NAME','IND_T2_NAME');


INDEX_NAME       TABLE_NAME LEAF_BLOCKS  NUM_ROWS
-------------------- ---------- ----------- ---------
IND_T1_NAME       T1         15004   2310176
IND_T2_NAME       T2         15328   2310176

IND_T1_NAME内容,

代码语言:javascript复制
col object_name for a16
select object_id,object_name,object_type from user_objects where object_name in ('IND_T1_NAME','IND_T2_NAME');


 OBJECT_ID OBJECT_NAME      OBJECT_TYPE
---------- ---------------- -----------------------
    242122 IND_T1_NAME      INDEX
    242124 IND_T2_NAME      INDEX


ALTER SESSION SET EVENTS 'immediate trace name treedump level 242122';


----- begin tree dump
branch: 0x2816183 42033539 (0: nrow: 98, level: 2)
   branch: 0x281aa74 42052212 (-1: nrow: 230, level: 1)
      leaf: 0x2816184 42033540 (-1: row:254.254 avs:820)
      leaf: 0x2816185 42033541 (0: row:199.199 avs:819)
      ......
    leaf: 0x241dd88 37870984 (229: row:512.512 avs:828)
      leaf: 0x241dd89 37870985 (230: row:512.512 avs:828)


select DBMS_UTILITY.data_block_address_file(37870984) RELATIVE_FNO,DBMS_UTILITY.data_block_address_block(37870984) blk_id from dual;


--注意,上述转换是相对文件号,需要转换成绝对文件号file_id.
select RELATIVE_FNO,FILE_ID from dba_data_files where RELATIVE_FNO=9;


alter system dump datafile <file_id> block <block_id>;


--IND_T1_NAME ,我们可以看到,索引有列,复合索引两列,第一列是空值,第二列是80,也就是我们写的0,第三列是rowid。
Block header dump:  0x0241dd88
 Object id on Block? Y
 seg/obj: 0x3b1ca  csc:  0x000000000c43aa16  itc: 2  flg: E  typ: 2 - INDEX
     brn: 0  bdba: 0x241da03 ver: 0x01 opc: 0
     inc: 0  exflg: 0


 Itl           Xid                  Uba         Flag  Lck        Scn/Fsc
0x01   0x0000.000.00000000  0x00000000.0000.00  ----    0  fsc 0x0000.00000000
0x02   0xffff.000.00000000  0x00000000.0000.00  C---    0  scn  0x000000000c43aa16
Leaf block dump
===============
header address 3595444324=0xd64e2064
kdxcolev 0
KDXCOLEV Flags = - - -
kdxcolok 0
kdxcoopc 0x80: opcode=0: iot flags=--- is converted=Y
kdxconco 3
kdxcosdc 0
kdxconro 512
kdxcofbo 1060=0x424
kdxcofeo 1888=0x760
kdxcoavs 828
kdxlespl 0
kdxlende 0
kdxlenxt 37870985=0x241dd89
kdxleprv 37870983=0x241dd87
kdxledsz 0
kdxlebksz 8032
row#0[8020] flag: -------, lock: 0, len=12
col 0; NULL
col 1; len 1; (1):  80
col 2; len 6; (6):  02 81 3e ed 00 0b

IND_T2_NAME内容,

代码语言:javascript复制
--通过上述方式,我们dump 索引IND_T2_NAME
select DBMS_UTILITY.data_block_address_file(42066656) RELATIVE_FNO,
(select file_id from dba_data_files where relative_fno=DBMS_UTILITY.data_block_address_file(42066656)) file_id,
DBMS_UTILITY.data_block_address_block(42066656) blk_id from dual;


RELATIVE_FNO  FILE_ID     BLK_ID
------------ ---------- ----------
    10       15     123616


alter system dump datafile 15 block 123616;


--如下所示,最有一行,可看出,IND_T2_NAME 也有三列,空值列,常数1,rowid。其中常数1占用了两个字节
Block header dump:  0x0281e2e0
 Object id on Block? Y
 seg/obj: 0x3b1cc  csc:  0x000000000c43b071  itc: 2  flg: E  typ: 2 - INDEX
     brn: 0  bdba: 0x281e200 ver: 0x01 opc: 0
     inc: 0  exflg: 0


 Itl           Xid                  Uba         Flag  Lck        Scn/Fsc
0x01   0x0000.000.00000000  0x00000000.0000.00  ----    0  fsc 0x0000.00000000
0x02   0xffff.000.00000000  0x00000000.0000.00  C---    0  scn  0x000000000c43b071
Leaf block dump
===============
header address 139662300258404=0x7f05a9cba064
kdxcolev 0
KDXCOLEV Flags = - - -
kdxcolok 0
kdxcoopc 0x80: opcode=0: iot flags=--- is converted=Y
kdxconco 3
kdxcosdc 0
kdxconro 478
kdxcofbo 992=0x3e0
kdxcofeo 1818=0x71a
kdxcoavs 826
kdxlespl 0
kdxlende 0
kdxlenxt 42066657=0x281e2e1
kdxleprv 42066655=0x281e2df
kdxledsz 0
kdxlebksz 8032
row#0[8019] flag: -------, lock: 0, len=13
col 0; NULL
col 1; len 2; (2):  c1 02
col 2; len 6; (6):  02 81 a3 32 00 27

因此,我们能知道,常数0存储占用一个字节,常数1占用两个字节,这跟Oracle数据库存储number机制有关系。

因为有负数、小数点等,Oracle采用了如下方式表示, Oracle中存储的number类型包含3个部分:HEAD(标记占用了几位),DATA,符号位。对正数来说,符号位省略,对0来说,只有80。

Oracle是以十六进制00-FF来表示所有的number,所以为了编码的对称,首先将number分为正负,所以以00-FF的中间位置80,即十进制的128来表示0,HEAD部分小于80,即为负数,大于80即为正数。

  • 00-3E表示 x <= -1
  • 3F-7F 表示 -1< x <0
  • 81-C0 表示 0< x < 1
  • C1-FF 表示 1<= x

Oracle数据库的优化,需要掌握其本身特性,才能更好的发挥它的优势。

参考资料,

http://www.itpub.net/forum.php?mod=viewthread&tid=308317

https://www.likecs.com/show-306981395.html

0 人点赞