awk的match函数总结-binary_XY.Z-ChinaUnix博客

binary_XY.Z的ChinaUnix博客binary.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

binary_XY.Z

博客访问： 1516742
博文数量： 263
博客积分： 10851
博客等级：上将
技术积分： 2627
用户组：普通用户
注册时间： 2008-11-26 22:40

文章分类

全部博文（263）

终端开发（3）

android（2）
问题集（1）
linux桌面应用（2）
linux内核（13）

内核开发（1）

内核管理（12）
龙套箱（1）
虚拟技术（9）

xen（1）

vmware（8）
linux网络安全（22）

网络其他（2）

netfilter/iptabl（4）

网络基础（16）
数据库（20）

oracle（0）

mysql（20）
团队建设（2）
linux大杂烩（10）
linux系统管理（95）

lvs + heartbeat（11）

rpm（5）

包管理工具（0）

工具集（2）

vsftpd（2）

nginx（8）

rsync（5）

cacti（0）

resin（0）

apache（2）

nagios（0）
linux编程开发（27）

java web开发（1）

java（2）

系统编程（8）

php（3）

c/c++（11）
linux脚本（57）

tcl与expect（2）

perl（0）

bash（54）
未分配的博文（1）

文章存档

2013年（4）

2012年（25）

2011年（33）

2010年（50）

2009年（138）

2008年（13）

我的朋友

相关博文

awk的match函数总结

分类： LINUX

2013-01-11 14:50:32

开始正文之前，推荐下这里有个介绍awk数组的精华帖：

grep 1083628889 XXYY.TamServer_updateVipAmount_20121227.log | tr -d '][' | awk 'BEGIN{ FS="|" }{ match($10, / money=[0-9]+*/, a); match($10, / vip=[0-9]+*/, b); print $4,a[0],b[0] }'

match(s, r [, a])
Returns the position in s where the regular expression r occurs, or 0 if r is not present, and sets the values of RSTART and RLENGTH. Note that the argument order is the same as for the ~ operator: str ~ re. If array a is provided, a is cleared and then elements 1 through n are filled with the portions of s that match the corresponding parenthesized subexpression in r. The 0'th element of a contains the portion of s matched by the entire regular expression r.   Subscripts a[n, "start"], and a[n, "length"] provide the starting index in the string and length    respectively, of each matching substring.

两种用法：
1. 普通用法
match(字符串,正则表达式)
内置变量RSTART表示匹配开始的位置，RLENGTH表示匹配的长度
如果匹配到了，返回匹配到的开始位置，否则返回0
$ awk 'BEGIN{start=match("Abc Ef Kig",/ [A-Z][a-z]+ /);print RSTART,RLENGTH}'
4 4

2. 建立数组（If array a is provided, a is cleared and then elements 1 through n are filled with the portions of s that match the corresponding
parenthesized subexpression in r. The 0'th element of a contains the portion of s matched by the entire regular
expression r.   Subscripts a[n, "start"], and a[n, "length"] provide the starting index in the string and length
    respectively, of each matching substring.）

echo "foooobazbarrrrr | gawk '{ match($0, /(fo+).+(bar*)/, arr) #匹配到的部分自动赋值到arr中，下标从1开始
          print arr[1], arr[2]
          print arr[1, "start"], arr[1, "length"] #二维数组arr[index,＂start＂]值=RSTART
          print arr[2, "start"], arr[2, "length"] #二维数组arr[index,＂length＂]值=RLENGTH
    }'

foooo barrrrr
1 5
9 7

阅读(17032) | 评论(0) | 转发(1) |

上一篇：ArrayList和数组间的相互转换

下一篇：MySql的Bolb四种类型

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6