2017-10-16 43 views
1

在我的Django项目中,我使用了字段(“title”)。在这个领域中,用户需要用俄语书写,但用户可能会误写一些拉丁字母。我想改变它们。我用PYTHON 2.7。下一个代码引发错误。如何解决这个错误?如何将文本中的字母改为其他字母| Python 2.7?

from string import maketrans 

eng = 'ETOPAHKXCBMetopahkxcbm' # English letters 
rus = 'ЕТОРАНКХСВМеторанкхсвм' # Russian letters 

def form_valid(self, form): 
    form.cleaned_data['title'].translate(maketrans(dict(zip(eng, rus)))) 
    form.save() 

错误

Traceback (most recent call last): 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/exception.py", line 41, in inner 
    response = get_response(request) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/base.py", line 187, in _get_response 
    response = self.process_exception_by_middleware(e, request) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/base.py", line 185, in _get_response 
    response = wrapped_callback(request, *callback_args, **callback_kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/base.py", line 68, in view 
    return self.dispatch(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/base.py", line 88, in dispatch 
    return handler(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/edit.py", line 217, in post 
    return super(BaseCreateView, self).post(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/edit.py", line 183, in post 
    return self.form_valid(form) 
    File "/home/nurzhan/dashboard.kase.kz/static_pages/views.py", line 54, in form_valid 
    form.cleaned_data['title'].translate(maketrans(dict(zip(eng, rus)))) 
TypeError: maketrans() takes exactly 2 arguments (1 given) 

当我只使用form.cleaned_data['title'].translate(maketrans(eng, rus))它引发错误:

Internal Server Error: /static_page/create/ 
Traceback (most recent call last): 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/exception.py", line 41, in inner 
    response = get_response(request) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/base.py", line 187, in _get_response 
    response = self.process_exception_by_middleware(e, request) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/core/handlers/base.py", line 185, in _get_response 
    response = wrapped_callback(request, *callback_args, **callback_kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/base.py", line 68, in view 
    return self.dispatch(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/base.py", line 88, in dispatch 
    return handler(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/edit.py", line 217, in post 
    return super(BaseCreateView, self).post(request, *args, **kwargs) 
    File "/srv/envs/py27/lib/python2.7/site-packages/django/views/generic/edit.py", line 183, in post 
    return self.form_valid(form) 
    File "/home/nurzhan/dashboard.kase.kz/static_pages/views.py", line 54, in form_valid 
    form.cleaned_data['title'].translate(maketrans(eng, rus)) 
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-21: ordinal not in range(128) 
+0

从你的函数定义中删除自己。 –

+0

线索在'maketrans()需要2个参数(1给出)' - 请参见[string.maketrans(from,to)](https://docs.python.org/2/library/string.html#string .maketrans) –

+0

当我使用form.cleaned_data ['title']。translate(maketrans(eng,rus))'它会引发错误'UnicodeEncodeError:'ascii'编解码器不能编码0-21位置的字符:ordinal不在范围内(128)'。你有什么想法吗? –

回答

1

您可以手动像这样做:

a_string = u"abcdeFghijklmn" # Let say it is cyrillic with one latin letter represented here by 'F' (uppercase) 
# To change it to cyrillic i.e. here to lower latin you would do: 
l2c = {u"F": u"f"} # Map all latin to cyrillic for real 
l2c_get = l2c.get # Faster access but not necessary 
a_string = u"".join(
    l2c_get(x, x) for x in a_string) 

当然,你可以使用unicode.translate应该做同样的()方法。

当你的'a_string'被处理时,只需将它推回到django。所有映射的拉丁到西里尔都将被替换。请注意,映射中的所有内容都是unicode。如果把翻译字符串的Django引发UnicodeDecode /编码错误,这意味着你应该先试:

a_string = a_string.encode("utf-8", "ignore") 

UTF-8二进制应该被接受。

2

这会让事情变得更简单,如果你在你的代码中使用Unicode文本同样,这样你就不必担心str vs unicode类型问题。这与将所有字符串文字写为u'stuff'而不仅仅是'stuff'相同。

下一页 - 的unicodestringtranslate功能需要不同的参数 - 你需要一个dict为Unicode映射Unicode码,以序数或Unicode字符串,maketrans这是只为str翻译。

您可以使用内置的ord从Unicode字符中获取序号。

试试这个:

from __future__ import unicode_literals # Put this at the top of your source 

eng = 'ETOPAHKXCBMetopahkxcbm' # English letters 
rus = 'ЕТОРАНКХСВМеторанкхсвм' # Russian letters 

table = dict(zip(map(ord, eng), map(ord, rus))) 
assert eng.translate(table) == rus # Verifying translation 
+0

你好!我试图测试你的代码,但不幸的是它不为我工作。你的代码不会引发错误,但它不会将拉丁字母改为西里尔文。你还有其他建议吗? –

相关问题