0
我正在使用tessearct wrapper for C# v3。需要的是获得位于图像顶部的12位数字号码。下面做工作,但速度很慢(42秒我的电脑):tesseract包装 - 性能下降
public string GetIdentityNumber(string path)
{
string identityNum = string.Empty;
Regex regex = new Regex(@"[\d]{4}\s+[\d]{4}\s+[\d]{4}");
try
{
using (var engine = new TesseractEngine(@".\tessdata", "eng", EngineMode.Default))
{
using (var img = Pix.LoadFromFile(path))
{
using (var page = engine.Process(img, PageSegMode.SingleBlock))
{
using (var iter = page.GetIterator())
{
string text;
Match match;
iter.Begin();
do
{
text = iter.GetText(PageIteratorLevel.TextLine);
match = regex.Match(text);
if (match.Success)
{
identityNum = match.ToString();
break;
}
}
while (iter.Next(PageIteratorLevel.TextLine));
}
}
}
}
}
catch
{
}
return identityNum;
}
大约需要40秒执行page.GetIterator()方法。有没有人知道任何设置或方法来提高性能?