我有一个文件,index.html
,包含数据是这样的:如何仅从文件中的URL中去除&符号?
<li><a href="/battered-fried-chicken-breast-no-skin.html">battered fried chicken breast, no skin</a></li>
<li><a href="/bbq-short-ribs-with-sauce.html">bbq short ribs with sauce</a></li>
<li><a href="/bbq-spareribs-&-sauce-eat-lean-&-fat.html">bbq spareribs & sauce (eat lean & fat)</a></li>
<li><a href="/bbq-spareribs-&-sauce-eat-lean-only.html">bbq spareribs & sauce (eat lean only)</a></li>
我需要从网址剥离&符号,使得"/bbq-spareribs-&-sauce-eat-lean-&-fat.html"
变得"/bbq-spareribs--sauce-eat-lean--fat.html"
。但是,我不希望从文件的非URL部分(如链接文本bbq spareribs & sauce (eat lean & fat)
)中删除&符号。
我该如何在标准的Linux安装上完成此操作?只要它有效,使用什么特定的工具/语言来实现结果并不重要。