2015-07-04 38 views
2

WordNet中的感应偏移量是一个8位数字,后跟一个POS标签。例如,synset'dog.n.01'的偏移量是'02084071-n'。我曾尝试下面的代码:如何提取WordNet synset的偏移量在Python NLTK中提供synset?

from nltk.corpus import wordnet as wn 

    ss = wn.synset('dog.n.01') 
    offset = str(ss.offset) 
    print (offset) 

不过,我得到这样的输出:

<bound method Synset.offset of Synset('dog.n.01')> 

如何获得实际在这个幅面胶印机: '02084071-N'?

+0

您会得到[这里]你的答案(http://stackoverflow.com/questions/27091571/get-synonyms-from-synset-returns-error-python)和[这里](HTTP:/ /stackoverflow.com/questions/8077641/how-to-get-the-wordnet-synset-given-an-offset-id)和一点点实验 – Vaulstein

回答

4
>>> from nltk.corpus import wordnet as wn 
>>> ss = wn.synset('dog.n.01') 
>>> offset = str(ss.offset()).zfill(8) + '-' + ss.pos() 
>>> offset 
u'02084071-n' 
+0

谢谢!,请检查我的其他问题:http://stackoverflow.com/questions/31234168/how-do-i-calculate-the-shortest-path-geodesic-distance-between-two-adjectives @alvas – modarwish

相关问题