2012-06-26 17 views
9

你能否介绍一下哈希函数/算法Perl用来将字符串映射到索引?任何相关的阅读?Perl使用什么哈希函数/算法?

+0

你到底想干什么?你能提供一些不起作用的代码的例子吗? –

+0

任何会造成冲突的键:) – Jean

+0

没有两个键总是会碰撞。散列在必要时被随机扰动。 – ikegami

回答

15

PERL_HASH_INTERNAL_,在hv.h定义,下面复制:

/* hash a key */ 
/* FYI: This is the "One-at-a-Time" algorithm by Bob Jenkins 
* from requirements by Colin Plumb. 
* (http://burtleburtle.net/bob/hash/doobs.html) */ 
/* The use of a temporary pointer and the casting games 
* is needed to serve the dual purposes of 
* (a) the hashed data being interpreted as "unsigned char" (new since 5.8, 
*  a "char" can be either signed or unsigned, depending on the compiler) 
* (b) catering for old code that uses a "char" 
* 
* The "hash seed" feature was added in Perl 5.8.1 to perturb the results 
* to avoid "algorithmic complexity attacks". 
* 
* If USE_HASH_SEED is defined, hash randomisation is done by default 
* If USE_HASH_SEED_EXPLICIT is defined, hash randomisation is done 
* only if the environment variable PERL_HASH_SEED is set. 
* For maximal control, one can define PERL_HASH_SEED. 
* (see also perl.c:perl_parse()). 
*/ 

#define PERL_HASH_INTERNAL_(hash,str,len,internal) \ 
    STMT_START { \ 
     register const char * const s_PeRlHaSh_tmp = str; \ 
     register const unsigned char *s_PeRlHaSh = (const unsigned char *)s_PeRlHaSh_tmp; \ 
     register I32 i_PeRlHaSh = len; \ 
     register U32 hash_PeRlHaSh = (internal ? PL_rehash_seed : PERL_HASH_SEED); \ 
     while (i_PeRlHaSh--) { \ 
      hash_PeRlHaSh += *s_PeRlHaSh++; \ 
      hash_PeRlHaSh += (hash_PeRlHaSh << 10); \ 
      hash_PeRlHaSh ^= (hash_PeRlHaSh >> 6); \ 
     } \ 
     hash_PeRlHaSh += (hash_PeRlHaSh << 3); \ 
     hash_PeRlHaSh ^= (hash_PeRlHaSh >> 11); \ 
     (hash) = (hash_PeRlHaSh + (hash_PeRlHaSh << 15)); \ 
    } STMT_END 
+0

这是一些通用算法的实现吗? – Jean

+3

@alertjean是的,代码的评论说它是http://burtleburtle.net/bob/hash/doobs.html的一个版本 – Schwern