Comparing IN and EXISTS in SQL - zjwei1121 - ChinaUnix blog

Comparing IN and EXISTS in SQL

Category: DB2/Informix

2013-05-22 21:17:51

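IN
Tests whether a given value matches any value in a subquery or a list.

EXISTS
Takes a subquery and tests whether it returns any rows.

Both in and exists come up when fetching data in SQL. So what is the difference between them?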

1 Performance comparison
For example:
select * from T1 where x in ( select y from T2 )
is executed roughly as if it were:
select *
from t1, ( select distinct y from t2 ) t2
where t1.x = t2.y;
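
The equivalence above can be checked on a toy data set. The following is only a sketch: it uses Python's built-in sqlite3 module as a stand-in database, the sample rows are invented, and the subquery alias `d` is chosen for illustration (the article reuses the name t2). It also runs the join without DISTINCT to show why the DISTINCT is needed.

```python
import sqlite3

# Hypothetical sample data; table and column names follow the text (t1.x, t2.y).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t1 (x INTEGER);
    CREATE TABLE t2 (y INTEGER);
    INSERT INTO t1 VALUES (1), (2), (3);
    INSERT INTO t2 VALUES (2), (2), (3), (4);
""")

# The IN form from the article.
in_rows = conn.execute(
    "SELECT x FROM t1 WHERE x IN (SELECT y FROM t2) ORDER BY x"
).fetchall()

# The join form: DISTINCT collapses the duplicate 2 in t2 before joining.
join_rows = conn.execute(
    "SELECT t1.x FROM t1, (SELECT DISTINCT y FROM t2) AS d "
    "WHERE t1.x = d.y ORDER BY t1.x"
).fetchall()

# Without DISTINCT the duplicate 2 in t2 produces an extra output row.
naive_rows = conn.execute(
    "SELECT t1.x FROM t1, t2 WHERE t1.x = t2.y ORDER BY t1.x"
).fetchall()

print(in_rows)     # [(2,), (3,)]
print(join_rows)   # [(2,), (3,)]
print(naive_rows)  # [(2,), (2,), (3,)]
```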

By contrast,

select * from t1 where exists ( select null from t2 where y = x )
is executed roughly as if it were:
for x in ( select * from t1 )
   loop
      if ( exists ( select null from t2 where y = x.x ) )
      then
         OUTPUT THE RECORD
      end if
end loop
Table T1 is unavoidably scanned in full.
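
The loop above can be written out by hand and compared with the EXISTS query. This is a sketch under assumptions: sqlite3 is used as a stand-in database, the rows are invented, and the Python loop only mimics the conceptual model; it is not what any particular engine does internally.

```python
import sqlite3

# Hypothetical sample data; names follow the text (t1.x, t2.y).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t1 (x INTEGER);
    CREATE TABLE t2 (y INTEGER);
    INSERT INTO t1 VALUES (1), (2), (3);
    INSERT INTO t2 VALUES (2), (2), (3), (4);
""")

# Hand-written loop: scan every row of t1 and probe t2 once per row.
looped = []
for (x,) in conn.execute("SELECT x FROM t1 ORDER BY x").fetchall():
    hit = conn.execute("SELECT 1 FROM t2 WHERE y = ? LIMIT 1", (x,)).fetchone()
    if hit is not None:
        looped.append((x,))  # "OUTPUT THE RECORD"

# The EXISTS query from the article; x in the subquery refers to the outer t1.x.
exists_rows = conn.execute(
    "SELECT x FROM t1 WHERE EXISTS (SELECT NULL FROM t2 WHERE y = x) ORDER BY x"
).fetchall()

print(looped)       # [(2,), (3,)]
print(exists_rows)  # [(2,), (3,)]
```

Note that the loop issues one probe per row of t1, which is why the cost of this form depends on how large t1 is and how cheap each probe into t2 is.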

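When does each one fit?
Consider it from the point of view of the subquery ( select y from T2 ).
If the subquery's result set is large and expensive to produce, but T1 is small and the probe ( select null from t2 where y = x.x ) runs very fast (for instance because t2.y is indexed), then exists is the better fit.
Conversely, when the subquery's result set is small, in should be used.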

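in and exists

In this simplified model, in hash-joins the outer table with the inner table, whereas exists loops over the outer table and queries the inner table once per iteration.
The long-standing claim that exists is always more efficient than in is therefore inaccurate.

If the two tables being queried are about the same size, in and exists differ little.

If one table is small and the other is large, use exists when the subquery's table is the large one, and in when the subquery's table is the small one: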

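For example, take table A (small) and table B (large).

1:
select * from A where cc in (select cc from B)
Inefficient; uses the index on column cc of table A.

select * from A where exists(select cc from B where cc=A.cc)
Efficient; uses the index on column cc of table B.

Conversely,
2:
select * from B where cc in (select cc from A)
Efficient; uses the index on column cc of table B.

select * from B where exists(select cc from A where cc=B.cc)
Inefficient; uses the index on column cc of table A.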
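
Which index a query really uses depends on the database and its optimizer, so it is worth checking the plan instead of relying on a rule of thumb. The sketch below is an assumption-laden illustration: it uses sqlite3 as a stand-in (the article targets DB2/Informix, whose plans may differ), invented row counts, and hypothetical index names idx_a_cc and idx_b_cc. It confirms that the two forms of case 1 return the same rows and prints SQLite's plan for each.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE A (cc INTEGER);
    CREATE TABLE B (cc INTEGER);
    CREATE INDEX idx_a_cc ON A (cc);  -- hypothetical index names
    CREATE INDEX idx_b_cc ON B (cc);
""")
# A is the small table (10 rows), B the large one (10000 rows).
conn.executemany("INSERT INTO A VALUES (?)", [(i,) for i in range(0, 20, 2)])
conn.executemany("INSERT INTO B VALUES (?)", [(i,) for i in range(10000)])

q_in = "SELECT cc FROM A WHERE cc IN (SELECT cc FROM B) ORDER BY cc"
q_exists = (
    "SELECT cc FROM A WHERE EXISTS "
    "(SELECT cc FROM B WHERE B.cc = A.cc) ORDER BY cc"
)

rows_in = conn.execute(q_in).fetchall()
rows_exists = conn.execute(q_exists).fetchall()
print(rows_in == rows_exists)  # both forms select the same rows

# Print the plan SQLite chose; the last column holds the description text.
for label, query in (("IN", q_in), ("EXISTS", q_exists)):
    print(label)
    for row in conn.execute("EXPLAIN QUERY PLAN " + query):
        print("   ", row[-1])
```

The plan text is engine-specific and changes between versions, so it is printed here without any expected output.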

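not in and not exists
If a query uses not in, both the outer and the inner table are scanned in full and no index is used;
the subquery of not exists, by contrast, can still use an index on its table.
So whichever table is larger, not exists is generally faster than not in.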
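
The two forms can be compared on a small example. This sketch uses sqlite3 as a stand-in with invented rows, and it assumes cc is declared NOT NULL in both tables: if the subquery can return NULL, not in and not exists are no longer interchangeable, because not in then matches no rows at all.

```python
import sqlite3

# Hypothetical sample data; cc is NOT NULL so the two forms are comparable.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE A (cc INTEGER NOT NULL);
    CREATE TABLE B (cc INTEGER NOT NULL);
    INSERT INTO A VALUES (1), (2), (3), (4);
    INSERT INTO B VALUES (2), (4), (6);
""")

not_in_rows = conn.execute(
    "SELECT cc FROM A WHERE cc NOT IN (SELECT cc FROM B) ORDER BY cc"
).fetchall()

not_exists_rows = conn.execute(
    "SELECT cc FROM A WHERE NOT EXISTS "
    "(SELECT 1 FROM B WHERE B.cc = A.cc) ORDER BY cc"
).fetchall()

print(not_in_rows)      # [(1,), (3,)]
print(not_exists_rows)  # [(1,), (3,)]
```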

in versus =

select name from student where name in ('zhang','wang','li','zhao');

and

select name from student where name='zhang' or name='li' or name='wang' or name='zhao';

return the same result.
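
A quick check of this equivalence, again as a sketch with sqlite3 as a stand-in and an invented student table (the extra name 'chen' is added only so that the filter has something to exclude):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE student (name TEXT);
    INSERT INTO student VALUES ('zhang'), ('wang'), ('li'), ('zhao'), ('chen');
""")

in_rows = conn.execute(
    "SELECT name FROM student "
    "WHERE name IN ('zhang', 'wang', 'li', 'zhao') ORDER BY name"
).fetchall()

or_rows = conn.execute(
    "SELECT name FROM student "
    "WHERE name = 'zhang' OR name = 'li' OR name = 'wang' OR name = 'zhao' "
    "ORDER BY name"
).fetchall()

print(in_rows)  # [('li',), ('wang',), ('zhang',), ('zhao',)]
print(or_rows)  # [('li',), ('wang',), ('zhang',), ('zhao',)]
```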
