常用的正则表达式-ubuntuer-ChinaUnix博客

人生如逆旅，我亦是行人！江湖人称wsjjeremy.blog.chinaunix.net

首页　| 　博文目录　| 　关于我

ubuntuer

博客访问： 4906862
博文数量： 930
博客积分： 12070
博客等级：上将
技术积分： 11448
用户组：普通用户
注册时间： 2008-08-15 16:57

文章分类

全部博文（930）

html5（0）
python（1）
google_gnu fans（8）
高品位（2）
perl（4）
mobile_dev（2）
openssl（1）
libcurl（2）
windows内核安全（5）
自己的C_LIB（5）
高性能MySQL学习（94）
多线程（4）
ldd学习笔记（3）
netfilter（3）
笔试题（5）
师徒之言传身教（1）
转载（15）
work（146）
introduction to （9）
debug（3）

intern（3）
mobile ip（0）
毕业设计（2）
linux防火墙（10）
c++（16）
database（13）
CentOS（11）
data structure（5）
kernel（50）
DIY（4）
酷软（19）
iptables（9）
linux c（105）

string（19）
APUE学习笔记（7）
facetea（13）
shell（68）
tcp_ip（23）
apache（3）
linux（258）

正则表达式（5）
未分配的博文（1）

文章存档

2011年（60）

2010年（220）

2009年（371）

2008年（279）

我的朋友

相关博文

常用的正则表达式

分类： LINUX

2008-10-24 19:27:54

匹配html的嵌入代码

CODE:

<[^>]*>

匹配[....]的嵌入码

CODE:

\[[^]]\{1,\}\]

删除仅由空字符组成的行

CODE:

sed '/^[[:space:]]*$/d' filename

匹配html标签

CODE:

/$<[^>]*>$/

例如：从html文件中剔除html标签

CODE:

sed 's/$<[^>]*>$//g;/^[[:space:]]*$/d' file.html

例如：要从下列代码中去除"[]"及其中包括的代码

CODE:

[b:4c6c2a6554][color=red:4c6c2a6554]一. 替换[/color:4c6c2a6554][/b:4c6c2a6554]
sed 's/\[[^]]\{1,\}\]//g' filename

匹配日期：

CODE:

Month, Day, Year [A-Z][a-z]\{3,9\}, [0-9]\{1,2\}, [0-9]\{4\}
2003-01-28 或 2003.10.18 或 2003/10/10 或 2003 10 10
$[0-9]\{4\}[ /-.][0-2][0-9][ /-.][0-3][0-9]$

匹配IP地址

CODE:

$[0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}\.[0-9]\{1,3\}$
$\([0-9]\{1,3\}\.$\{3\}[0-9]\{1,3\}\)

匹配数字串

CODE:

[-+]*[0-9]\{1,\} 整数
[-+]*[0-9]\{1,\}\.[0-9]\{1,\} 浮点数

从字串中解析出两个子串(前2各字符和后9个字符)

CODE:

echo "WeLoveChinaUnix"|sed -e 'H;s/$..$.*/\1/;x;s/.*$.\{9\}$$/\1/;x;G;s/\n/ /'
We ChinaUnix

分解日期串

CODE:

echo 20030922|sed 's/$....$$..$$..$/\1 \2 \3/'|read year month day
echo $year $month $day

文件内容倒序输出

CODE:

sed '1!G;h;$!d' oldfile >newfile
当然也可以直接使用tac命令实现倒序输出.

阅读(4260) | 评论(0) | 转发(0) |

上一篇：linux下内存释放问题

下一篇：shell实例学习

给主人留下些什么吧！~~

感谢所有关心和支持过ChinaUnix的朋友们

16024965号-6