转:Oracle的函数vsize和length的区别讨论-huaihe0410-ChinaUnix博客

huaihe0410

首页　| 　博文目录　| 　关于我

huaihe0410

博客访问： 1435211
博文数量： 247
博客积分： 10147
博客等级：上将
技术积分： 2776
用户组：普通用户
注册时间： 2008-01-24 15:18

文章分类

全部博文（247）

svn（1）
AIX（1）
协议（2）
编码（8）
测试（10）
编译（0）
python（22）

socket（1）

中文字符（4）

smtp（1）
resin/java（3）
jsp（2）
其他（3）
mysql（22）

cluster（3）
linux/unix（68）

linux性能指令（5）

磁盘（3）

cvs（4）

shell（6）

指令（19）

网络（8）
oracle（92）

oracle字符集（6）

PL/SQL（1）

Oracle9i初始化参（15）

oracle 并行（2）

oracle1011新特性（6）

oracle函数（5）

oracle索引组织表（4）

oracle分区表（9）

oracle性能优化（21）
未分配的博文（13）

文章存档

2013年（11）

2012年（3）

2011年（20）

2010年（35）

2009年（91）

2008年（87）

我的朋友

jiayanfu

相关博文

转:Oracle的函数vsize和length的区别讨论

分类： Oracle

2008-01-29 12:46:52

The "length" functions return the length of char. LENGTH calculates length using characters as defined by the input character set.

LENGTHB uses bytes instead of characters. LENGTHC uses Unicode complete characters. LENGTH2 uses UCS2 codepoints. LENGTH4 uses UCS4 codepoints

length函数返回字符的长度，它使用定义好的输入的字符集计算长度.
lengthb使用bytes代替字符

VSIZE returns the number of bytes in the internal representation of expr.

vsize 返回内部表示的字节的数目。internal representation of expr谁能解释一下。

看sql示例：

select length('adfad合理') "bytesLengthIs" from dual --7

select lengthb('adfad') "bytesLengthIs" from dual --5

select lengthb('adfad合理') "bytesLengthIs" from dual --11

select vsize('adfad合理') "bytesLengthIs" from dual --11

select lengthc('adfad合理')"bytesLengthIs" from dual --7

结论：在utf-8的字符集下
lengthb=vsize
lengthc=length

疑问：中文字符怎么会占用了3个byte?而不是2个。是utf-8字符集的原因？
谁知道??????

************************************************************************

用String的getBytes方法测试了一下.
结论是utf-8的中文字符占用3个字节,gbk的中文字符占用2个字节,iso-8859-1的中文字符被识别为占用2个字节,iso不支持中文字符的编码,应该是都当成某个拉丁字母了.Oracle没有关系，oracle只是负责存储数据.
可以先用 select * from v$nls_parameters 看看oracle的字符集
下边是测试的类:

import java.io.UnsupportedEncodingException;

public class TextEncoding {

/**
*
* @author:sunflower
* @date: 2007-1-24 上午10:09:40
* @todo: 调用的是String的自己的getBytes(encoding)方法,
* 使用指定的字符集将此 String 解码为字节序列，并将结果存储到一个新的字节数组中.
* @param content
* @param encode
* @return
*/
public static byte[] getBytes(String content,String charsetName)
throws UnsupportedEncodingException{
return content.getBytes(charsetName);
}

/**
*
* @author:sunflower
* @date: 2007-1-24 上午10:19:40
* @todo: 调用的是String的自己的getBytes()方法,
* 使用平台默认的字符集将此 String 解码为字节序列，并将结果存储到一个新的字节数组中。
* @param content
* @return
*/
public static byte[] getBytes(String content){
return content.getBytes();
}

public static void main(String[]args){
String content="1e宝宝";
byte[] len;
try{
len=getBytes(content,"UTF-8");
System.out.println(" the byte array length is "+len.length);
len=getBytes(content,"GBK");
System.out.println(" the byte array length is "+len.length);
len=getBytes(content,"ISO-8859-1");
System.out.println(" the byte array length is "+len.length);
}catch(Exception e){
System.out.println("Can 't recognize");
}

// System.out.println("the content byte[] length is "+);
}

}

输出 :
the byte array length is 8
the byte array length is 6
the byte array length is 4

**************************************************

本人测试如下:

Create Table li_test02
(Id Number, Name Varchar2(64), code Char(64) )

Select * From li_test02

    ID    Name         code
1   1     test01       0001
2   10    test002      00002
3   100   test03大狼   00002
4   1111 test04       00002

Select Id,vsize(Id),Name,vsize(Name),lengthb(name),code,vsize(code) From li_test02

Id vsize(id) Name vsize(Name) lengthb(name) code vsize(code)

1 1     2         test01       6              6           0001         64
2 10   2         test002       7              7           00002        64
3 100   2         test03大狼   10            10           00002        64
4 1111 3         test04       6            6           00002        64

字符型: vsize 反应的是所占用的字节(bytes) ,对于varchar2,显示的是实际占用的字节.对于char,显示的是char定义时的长度.
Number :没发现规律

阅读(1549) | 评论(0) | 转发(0) |

上一篇：转:观察analyze table compute statistics 都对什么对象统计了信息

下一篇：[转]ORACLE函数大全

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6