2016-01-14 15 views
1

即时尝试编写正则表达式来搜索和替换图像路径和保持相同的文件名存在路径到我的远程服务器路径。Coldfusion虽然上传替换图像路径REGEX

用户正在上传HTML文件,我希望在上传替换路径之前保存在数据库中。

我写了一些正则表达式它似乎没有工作即时通讯不好我在互联网上找到的正则表达式。

请从下面的代码片段...

for example this my HTML content. 
 

 
<cfsavecontent variable="htmlcont"> 
 

 
<html> 
 
<head> 
 
</head> 
 
<body> 
 
<a href='http://www.w3schools.com'><img src='http://www.w3schools.com/images/w3schools.png' alt='W3Schools.com' class='img-responsive'></a> 
 
<div class="w3-row"> 
 
<div class="w3-third w3-center"> 
 
<h2>JPG Images</h2> 
 
<img alt="Mountain View" src="pic_mountain.jpg" style="width: 304px; height: auto" class="img-responsive"> 
 
</div> 
 
<div class="w3-third w3-center"> 
 
<h2>GIF Images</h2> 
 
<img alt="" src="html5.gif" style="width: 128px; height: auto" class="img-responsive"> 
 
</div> 
 
<div class="w3-third w3-center"> 
 
<h2>PNG Images</h2> 
 
<img alt="Graph" src="pic_graph.png" style="width: 170px; height: auto" class="img-responsive"> 
 
</div> 
 
</div> 
 

 
<table class="lamp"><tr> 
 
<th style="width:34px"> 
 
<img src="/images/lamp.jpg" alt="Note" style="height:32px;width:32px"></th> 
 
<td>Always specify the width and height of an image. If width and height are not specified, the page will flicker while the image loads. 
 
</td> 
 
</tr></table> 
 
<hr> 
 

 
<table class="lamp"><tr> 
 
<th style="width:34px"> 
 
<img src="/images/lamp.jpg" alt="Note" style="height:32px;width:32px"></th> 
 
<td>Add &quot;border:0;&quot; to prevent IE9 (and earlier) from displaying a border around the image.</td> 
 
</tr></table> 
 
<hr> 
 
</body> 
 
</html> 
 
</cfsavecontent>

我尝试更换

http://www.w3schools.com/images/w3schools.png 

TO

http://myremoteServer.com/234001/images/w3schools.png 

OR

<img alt="Graph" src="pic_graph.png" style="width: 170px; height: auto" class="img-responsive"> 

TO

<img alt="Graph" src="http://myremoteServer.com/234001/images/pic_graph.png" style="width: 170px; height: auto" class="img-responsive"> 

CODE:

<cfset regxv = 'src="\K[^"]*(?=")' /> 
 
<cfset resluthtml = REReplace (htmlcont,regxv, "http://myremoteServer.com/234001/images/") /> 
 

 

 
<cfdump var="#resluthtml#" label="resluthtml" >

+0

请考虑使用jSoup而不是正则表达式。参见:http://www.bennadel.com/blog/2358-parsing-traversing-and-mutating-html-with-coldfusion-and-jsoup.htm – Henry

回答

1

与ColdFusion的内置发动机(雅加达ORO,Perl等)捕获在一个痛苦***。所以,让我们在这里使用一些Java魔法(POSIX):

<cfset regex = createObject("java", "java.util.regex.Pattern").compile('<img [^>]*src=["'']([^"'']*)["'']')> 

<cfset result = createObject("java", "java.lang.StringBuilder").init()> 
<cfset matcher = regex.matcher(htmlcont)> 
<cfset last  = 0> 

<cfloop condition="matcher.find()"> 

    <cfset result.append(
     htmlcont.substring(
      last, 
      matcher.start() 
     ) 
    )> 

    <cfset token = matcher.group(
     javaCast("int", (matcher.groupCount() gte 1 ? 1 : 0)) 
    )> 

    <!--- go with your replace logic here, token is the value of the [src] attribute ---> 
    <cfset token = ("http://myremoteServer.com/234001/images/" & listLast(token, "/"))> 

    <cfset result.append(token)> 

    <cfset last = matcher.end()> 
</cfloop> 

<cfset result.append(
    htmlcont.substring(last) 
)> 
<cfset result = result.toString()> 

<cfdump var="#result#"> 
+0

' Mountain View Graph Note Note '' OUTPUT – IBM

+0
+0

是否有反正我只能得到图像文件名?它会返回令牌“image/image.jpg” – IBM