2015-07-13
0

I am using the techniques from these questions to convert =?utf-8?B?...?= into a readable string:

How convert email subject from “?UTF-8?…?=” to readable string?

string encode/decode

This works for simple inputs, but some of my inputs contain several =?utf-8?B?...?= sections, for example:

"=?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2YbYr9is?=" 

I know that the part between =?UTF-8?B? and ?= is a base64-encoded string, but in this case I don't know how to extract each of them.

+1

Split the input and parse each component individually. – dreamlax

Answers

2

You can use a regular expression to extract each string between =?UTF-8?B? and ?=, then base64-decode it. Here is an example:

string input = "=?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2YbYr9is?="; 
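// Match each encoded word; the lazy group captures the base64 payload.
// IgnoreCase lets the pattern match "=?UTF-8?B?" as well as "=?utf-8?B?".
Regex regex = new Regex(string.Format("{0}(.*?){1}", Regex.Escape("=?utf-8?B?"), Regex.Escape("?=")), RegexOptions.IgnoreCase); 
var matches = regex.Matches(input); 
foreach (Match match in matches) 
{ 
    // Decode the base64 payload and interpret the bytes as UTF-8.
    Console.WriteLine(
        Encoding.UTF8.GetString(Convert.FromBase64String(match.Groups[1].Value)) 
    ); 
} 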

This prints:

این یک متن ساده است
این یک متن ساده است
ندج

Don't forget these using directives:

using System; 
using System.Text.RegularExpressions; 
using System.Text; 

A working example is available here.
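
If you need one string rather than one line per match, the decoded pieces can simply be concatenated. Here is a minimal sketch, assuming every encoded word is UTF-8/base64 as in the question. The names `SubjectDecoder` and `DecodeUtf8Base64Words` are made up for illustration and are not part of any library:

```csharp
using System;
using System.Linq;
using System.Text;
using System.Text.RegularExpressions;

public static class SubjectDecoder
{
    // Hypothetical helper: finds every "=?utf-8?B?...?=" word, decodes it,
    // and joins the decoded pieces without the whitespace that separated them.
    public static string DecodeUtf8Base64Words(string input)
    {
        var regex = new Regex(@"=\?utf-8\?B\?(.*?)\?=", RegexOptions.IgnoreCase);
        var pieces = regex.Matches(input)
            .Cast<Match>()
            .Select(m => Encoding.UTF8.GetString(Convert.FromBase64String(m.Groups[1].Value)));
        return string.Concat(pieces);
    }
}
```

Anything outside the encoded words is dropped by this sketch, so it only suits inputs that consist entirely of encoded words, like the one in the question.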

+0

Wow! Thank you very much :)

2

Try something like:

string str = "=?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2KfbjNmGINuM2qkg2YXYqtmGINiz2KfYr9mHINin2LPYqg==?= =?utf-8?B?2YbYr9is?="; 

const string utf8b = "=?utf-8?B?"; 

// Each encoded word ends with "?=", so splitting on it leaves "=?utf-8?B?<payload>" pieces.
var parts = str.Split(new[] { "?=" }, StringSplitOptions.None); 

foreach (var part in parts) 
{ 
    string str2 = part.Trim(); 

    if (str2.StartsWith(utf8b, StringComparison.OrdinalIgnoreCase)) 
    { 
        // Strip the prefix, then decode the base64 payload as UTF-8.
        str2 = str2.Substring(utf8b.Length); 
        byte[] bytes = Convert.FromBase64String(str2); 
        string final = Encoding.UTF8.GetString(bytes); 
        Console.WriteLine(final); 
    } 
    else if (str2 == string.Empty) 
    { 
        // Nothing to do here 
    } 
    else 
    { 
        Console.WriteLine("Not recognized {0}", str2); 
    } 
} 

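Note that, technically, RFC 1342 (since superseded by RFC 2047) is a bit more complex: instead of UTF-8 you can have any charset, and instead of B you can have Q (for quoted-printable).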
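
To illustrate that note, here is a rough sketch of a decoder that reads the charset and the B/Q flag from each encoded word instead of hard-coding UTF-8 and base64. It is not a full implementation of the RFC. The names `EncodedWordDecoder`, `Decode` and `DecodeQ` are made up for this example. It assumes the charset name is one that `Encoding.GetEncoding` recognizes; on .NET Core many legacy code pages need `CodePagesEncodingProvider` registered first, and unknown names throw.

```csharp
using System;
using System.Collections.Generic;
using System.Globalization;
using System.Text;
using System.Text.RegularExpressions;

public static class EncodedWordDecoder
{
    // Shape of one encoded word: =?charset?encoding?payload?=  (encoding is B or Q).
    private static readonly Regex EncodedWord = new Regex(
        @"=\?(?<charset>[^?\s]+)\?(?<enc>[bBqQ])\?(?<text>[^?\s]*)\?=");

    // Whitespace that only separates two adjacent encoded words is dropped.
    private static readonly Regex GapBetweenWords = new Regex(@"(?<=\?=)\s+(?==\?)");

    public static string Decode(string input)
    {
        string joined = GapBetweenWords.Replace(input, string.Empty);
        return EncodedWord.Replace(joined, m =>
        {
            // Assumption: the charset name is known to the runtime; otherwise this throws.
            Encoding charset = Encoding.GetEncoding(m.Groups["charset"].Value);
            string text = m.Groups["text"].Value;
            byte[] bytes = m.Groups["enc"].Value.ToUpperInvariant() == "B"
                ? Convert.FromBase64String(text)
                : DecodeQ(text);
            return charset.GetString(bytes);
        });
    }

    // Q encoding: "_" stands for a space, "=XX" is a byte in hex, anything else is literal.
    private static byte[] DecodeQ(string text)
    {
        var bytes = new List<byte>();
        for (int i = 0; i < text.Length; i++)
        {
            char c = text[i];
            if (c == '_')
            {
                bytes.Add(0x20);
            }
            else if (c == '=' && i + 2 < text.Length
                     && byte.TryParse(text.Substring(i + 1, 2), NumberStyles.AllowHexSpecifier,
                                      CultureInfo.InvariantCulture, out byte value))
            {
                bytes.Add(value);
                i += 2;
            }
            else
            {
                bytes.Add((byte)c);
            }
        }
        return bytes.ToArray();
    }
}
```

Called as `EncodedWordDecoder.Decode(str)`, this leaves text outside encoded words untouched, so an input such as "Re: =?utf-8?B?...?=" keeps its "Re: " prefix. Malformed base64 still throws a `FormatException`, which a real mail parser would need to handle.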
