    정상혁's Notebook

Java Regular Expressions

http://java.sun.com/javase/6/docs/api/java/util/regex/Pattern.html

Java and Regular Expressions - Tutorial

 

HTML extraction (stripping tags)

Pattern p = Pattern.compile("\\<(\\/?)(\\w+)*([^<>]*)>");
Matcher m = p.matcher(body);

body = m.replaceAll("");
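As a rough, self-contained sketch of the snippet above: the class name HtmlTagStripper, the method name strip, and the sample input are assumptions made for illustration; only the pattern itself comes from this page. Compiling the Pattern once and reusing it avoids recompiling on every call.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical wrapper class; the name is not from the original page.
class HtmlTagStripper {
    // Same pattern as in the snippet above, compiled once for reuse.
    private static final Pattern TAG = Pattern.compile("\\<(\\/?)(\\w+)*([^<>]*)>");

    // Removes everything that looks like <...> and returns the remaining text.
    static String strip(String body) {
        Matcher m = TAG.matcher(body);
        return m.replaceAll("");
    }

    public static void main(String[] args) {
        // Sample input invented for this example.
        String body = "<p>Hello, <b>regex</b>!</p><br/>";
        System.out.println(strip(body)); // prints "Hello, regex!"
    }
}
```

This works only for simple markup: a regular expression does not parse HTML, so comments, scripts, and a '>' inside an attribute value are not handled by this pattern.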

 

String content = str.replaceAll("<(/)?([a-zA-Z]*)(\\s[a-zA-Z]*=[^>]*)?(\\s)*(/)?>", "");

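// Alternative: also removes <!...> declarations and comments, and tags whose quoted attribute values contain '>'
content = str.replaceAll("(?:<!.*?(?:--.*?--\\s*)*.*?>)|(?:<(?:[^>'\"]*|\".*?\"|'.*?')+>)","");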
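The two replaceAll patterns behave differently on comments and on attribute values that contain '>'. The sketch below runs both on the same input to make that visible; the class name TagPatternComparison, the method names, and the sample string are assumptions for illustration, while the two patterns are copied from this page.

```java
// Hypothetical comparison class; names are not from the original page.
class TagPatternComparison {
    // First pattern: simple open/close tags with an optional attribute part.
    static final String SIMPLE =
            "<(/)?([a-zA-Z]*)(\\s[a-zA-Z]*=[^>]*)?(\\s)*(/)?>";

    // Second pattern: <!...> declarations/comments, plus tags with quoted attribute values.
    static final String QUOTE_AWARE =
            "(?:<!.*?(?:--.*?--\\s*)*.*?>)|(?:<(?:[^>'\"]*|\".*?\"|'.*?')+>)";

    static String stripSimple(String str) {
        return str.replaceAll(SIMPLE, "");
    }

    static String stripQuoteAware(String str) {
        return str.replaceAll(QUOTE_AWARE, "");
    }

    public static void main(String[] args) {
        // Sample input invented for this example: a comment and a '>' inside a quoted attribute.
        String str = "<!-- note --><a href=\"x>y\">link</a>";
        System.out.println(stripSimple(str));     // the comment and part of the attribute survive
        System.out.println(stripQuoteAware(str)); // only "link" is left
    }
}
```

The first pattern stops at the first '>' it sees, so it cuts the tag in the middle of the quoted value and does not match the comment at all. The second pattern treats quoted strings as units, which is why it is the safer of the two for real pages.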

 

Image file extraction

String source = "<img src=\"
String pattern = "http://[^\\s\"'<>]+\\.(?:jpg|gif)"; // http:// URLs ending in .jpg or .gif
Pattern p = Pattern.compile(pattern);
Matcher m = p.matcher(source);
System.out.println();
while (m.find()) System.out.println(m.group()); // print each matched URL
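A runnable sketch of the same idea follows. The class name ImageUrlExtractor, the method name extract, and the example.com URLs are placeholders invented for this example, not values from the original page. The pattern used here escapes the dot before the extension and stops at quotes, whitespace, and angle brackets, so that each match stays inside one attribute value.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class; the name is not from the original page.
class ImageUrlExtractor {
    // http:// followed by URL characters, ending in a literal ".jpg" or ".gif".
    private static final Pattern IMAGE_URL =
            Pattern.compile("http://[^\\s\"'<>]+\\.(?:jpg|gif)");

    // Collects every image URL found in the given HTML fragment, in order.
    static List<String> extract(String source) {
        List<String> urls = new ArrayList<String>();
        Matcher m = IMAGE_URL.matcher(source);
        while (m.find()) {
            urls.add(m.group());
        }
        return urls;
    }

    public static void main(String[] args) {
        // Placeholder markup; the URLs are made up for illustration.
        String source = "<img src=\"http://example.com/a.jpg\"> text "
                + "<img src='http://example.com/img/b.gif'> "
                + "<a href=\"http://example.com/page.html\">x</a>";
        for (String url : extract(source)) {
            System.out.println(url); // prints the .jpg URL, then the .gif URL
        }
    }
}
```

The match is case-sensitive and covers only http:// URLs; passing Pattern.CASE_INSENSITIVE to Pattern.compile or widening the scheme to https? would be natural extensions.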

 

History

Last edited on 04/30/2009 12:10 by benelog
