Channel: Tweetable hash function challenge - Code Golf Stack Exchange

↧

Answer by Paul Chernoch for Tweetable hash function challenge

April 1, 2016, 5:41 pm

≫ Next: Answer by bmm6o for Tweetable hash function challenge

≪ Previous: Answer by jose_castro_arnaud for Tweetable hash function challenge

Ruby, 6473 collisions, 129 bytes

h=->(w){@p=@p||(2..999).select{|i|(2..i**0.5).select{|j|i%j==0}==[]};c=w.chars.reduce(1){|a,s|(a*@p[s.ord%92]+179)%((1<<24)-3)}}

The @p variable is filled with all the primes below 999.

This converts ascii values into primes numbers and takes their product modulo a large prime. The fudge factor of 179 deals with the fact that the original algorithm was for use in finding anagrams, where all words that are rearrangements of the same letters get the same hash. By adding the factor in the loop, it makes anagrams have distinct codes.

I could remove the **0.5 (sqrt test for prime) at the expense of poorer performance to shorten the code. I could even make the prime number finder execute in the loop to remove nine more characters, leaving 115 bytes.

To test, the following tries to find the best value for the fudge factor in the range 1 to 300. It assume that the word file in in the /tmp directory:

h=->(w,y){
  @p=@p||(2..999).
    select{|i|(2..i**0.5). 
    select{|j|i%j==0}==[]};
  c=w.chars.reduce(1){|a,s|(a*@p[s.ord%92]+y)%((1<<24)-3)}
}

american_dictionary = "/usr/share/dict/words"
british_dictionary = "/tmp/british-english-huge.txt"
words = (IO.readlines british_dictionary).map{|word| word.chomp}.uniq
wordcount = words.size

fewest_collisions = 9999
(1..300).each do |y|
  whash = Hash.new(0)
  words.each do |w|
    code=h.call(w,y)
    whash[code] += 1
  end
  hashcount = whash.size
  collisions = whash.values.select{|count| count > 1}.inject(:+)
  if (collisions < fewest_collisions)
    puts "y = #{y}. #{collisions} Collisions. #{wordcount} Unique words. #{hashcount} Unique hash values"
    fewest_collisions = collisions
  end
end

↧

Trending Articles

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Sarah Samis, Emil Bove III

November 17, 2012, 9:36 pm

David Perell - Write of Passage 2025

January 24, 2025, 12:37 am

Kodad Mandal Sarpanch Wardmumber Mobile Numbers List Part II Nalgonda...

April 19, 2017, 7:20 am

99 God Status for Whatsapp, Facebook

June 5, 2016, 11:46 pm

Firefighters attend car crash in Melton Mowbray

August 31, 2014, 2:35 am

CAMDEN CAMPERS SALE IS ON NOW THIS CRACKING VW AUTOHOMES KOMET HAS BEEN...

September 9, 2014, 1:24 am

Why do I get 'Access is Denied' when using Set-Service with Admin privileges?

October 3, 2014, 2:03 pm

Missing man located Bayview Avenue and Wilket Road area, Alexander Klopot, 31

July 12, 2020, 6:15 pm

Outlook.com issue with window 8

December 25, 2013, 7:54 pm

Praye – Wodin (Throwback Music)

November 13, 2016, 2:05 pm

charmilles roboform E998

May 12, 2015, 7:00 am

Three walk

November 18, 2016, 2:00 am

Shatta Wale – You Shock Me (Prod. by Willis Beatz)

May 11, 2017, 9:28 pm

Mp3 Download: Mdu - Mazola

December 7, 2017, 8:15 am

Ek Bar Baby Selfish Hoke Apne Liye Jiyo Na Lyrics Translation | Race 3

May 25, 2018, 4:00 am

The Who – Who’s Next (1971/2023) [High Fidelity Pure Audio Blu-Ray Disc]

April 13, 2025, 12:21 am

Maryland: State Police report 416 DWI / DUI drivers during December 2014;...

January 7, 2015, 6:37 pm

Java error when using Sky Go app

June 18, 2018, 7:49 am

Final Purple Gang-Related Indictment Ensnared ‘Candy’ Davidson In Drug Bust...

April 14, 2017, 10:24 pm

© 2025 //www.rssing.com