A question about the anchoring metacharacters - CUDev - ChinaUnix blog


2006-02-21 21:20:05

CODE:

wangyao@fisherman ~/test
$ cat fuhao
>hdfgio
>>dfjkhgeriu
...JIdjewai
>>reuhwqi
< hsdfui >
<>

abc
dfh abc huihd hsdf
abd abcd cbabc
.abc .bcd sidfie ..sdfhf
. .
abc. .abc. nsdf .
.. hdsufih .dcbhdj.

wangyao@fisherman ~/test
$ grep '\<abc\>' fuhao

abc
dfh abc huihd hsdf
.abc .bcd sidfie ..sdfhf
. .
abc. .abc. nsdf .
.. hdsufih .dcbhdj.

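\< and \> are called anchoring metacharacters, but they are not metacharacters in the strict sense: POSIX does not define them, and they are an extension. grep supports them; traditional sed, awk, and egrep do not, although the GNU versions of those tools accept them as extensions. A pattern that relies on them is therefore not portable across grep, sed, awk, and egrep.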

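An anchored expression matches exactly the word placed between the angle brackets. Its markers are \< and \>, and the word to be matched exactly goes between the pair. Other regular-expression metacharacters may also appear between the two anchors.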

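\<\> marks the extent of a word, so that exactly one whole word is matched.
\<abc\> should match only the word abc, but as the output above shows, abc. matched as well.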
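Why abc. still matches becomes visible when only the matched text is printed. The following is a minimal sketch, assuming GNU grep (the -o option and the \< \> anchors are not required by POSIX) and a few sample lines modelled on the file above rather than the file itself:

```shell
#!/bin/sh
# Sample lines modelled on the fuhao file (an assumption, not the original file).
sample='abc
dfh abc huihd hsdf
abd abcd cbabc
abc. .abc. nsdf .'

# Lines that contain abc as a whole word.
matched_lines=$(printf '%s\n' "$sample" | grep '\<abc\>')

# -o prints only the matched text. The match is always just "abc":
# the neighbouring "." is a non-word character, so it marks the word boundary
# instead of being part of the word.
matched_text=$(printf '%s\n' "$sample" | grep -o '\<abc\>' | sort -u)

printf '%s\n' "$matched_lines"
printf 'matched text: %s\n' "$matched_text"
```

In this sketch the line "abd abcd cbabc" is skipped because abc is embedded in longer words there, while "abc. .abc. nsdf ." is kept because the dots end the word.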

The following passage is taken from O'Reilly's "Mastering Regular Expressions".

Quote:

There are three types of escaped items:

1. The pairing of \ and a metacharacter is a metasequence to match the literal character (for example, \* matches a literal asterisk).
2. The pairing of \ and selected non-metacharacters becomes a metasequence with an implementation-defined meaning (for example, \< often means "start of word").
3. The pairing of \ and any other character defaults to simply matching the character (that is, the backslash is ignored).

The key is the phrase \< often means "start of word". "Often" does not mean "always".

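So it seems to come down to how "word" is defined. A run of a-z characters is a word, while characters such as <, . and { are not part of one. I only found this out by experiment, though; I have not checked the definition in the official documentation. In Perl, only letters, digits, and the underscore (that is, \w) count as "word" characters; nothing else does.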

A word can therefore be defined as a string bounded on both sides by spaces, tabs, punctuation, or symbols such as <, . and {.
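A quick experiment along these lines, again assuming GNU grep, whose documentation describes a word as a run of letters, digits, and underscores. The sample string is made up for illustration:

```shell
#!/bin/sh
# Made-up sample: one token with an underscore, one with a hyphen, one with a dot.
line='foo_bar foo-bar foo.bar'

# Print every whole word that starts with foo, one per line.
# "_" is a word character, so foo_bar stays in one piece;
# "-" and "." are not, so they end the word right after foo.
words=$(printf '%s\n' "$line" | grep -o '\<foo[[:alnum:]_]*\>')
printf '%s\n' "$words"
```

The underscore token comes back whole, while the hyphenated and dotted tokens are each cut down to foo.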

But this raises a new question: how can we match exactly abc in the file above, without matching

Please don't hold it against me; shouldn't one be a little persistent about a problem? :-)
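One possible answer, sketched under an assumption: that the goal is to accept abc only when it is bounded by whitespace or by the start or end of the line, so that abc. and .abc are rejected. It uses a plain extended regular expression instead of \< and \>, and the sample lines are modelled on the file above rather than copied from it:

```shell
#!/bin/sh
# Sample lines modelled on the fuhao file (an assumption, not the original file).
sample='abc
dfh abc huihd hsdf
abd abcd cbabc
.abc .bcd sidfie ..sdfhf
abc. .abc. nsdf .'

# Accept abc only when each side is whitespace or the start/end of the line.
strict=$(printf '%s\n' "$sample" | grep -E '(^|[[:space:]])abc([[:space:]]|$)')
printf '%s\n' "$strict"
```

Because the delimiters are spelled out explicitly, punctuation no longer counts as a word boundary here. The price is that the match now includes the surrounding whitespace, which matters if the pattern is later reused with grep -o or in a substitution.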
