Difference between revisions of "Chained Dictionary Assignment"

From CSE425S Wiki
Jump to navigation Jump to search
Line 171: Line 171:
 
[https://en.wikipedia.org/wiki/Hash_table Hash table]
 
[https://en.wikipedia.org/wiki/Hash_table Hash table]
  
=== signature HASHED_DICTIONARY ===
 
We can see that <code>signature HASHED_DICTIONARY</code> includes <code>DICTIONARY</code> and adds a <code>create</code> function which accepts the number of buckets to create along with a hash function.
 
  
  <nowiki>structure HashedDictionary :> HASHED_DICTIONARY</nowiki>
+
  <nowiki>structure HashedDictionary = DictionaryFn(struct
 +
type ''k hash_function = ''k -> int
  
<nowiki>signature HASHED_DICTIONARY = sig include DICTIONARY
+
    type ''k hash_function = ''k -> int
 
    val create : (int * ''k hash_function) -> (''k,'v) dictionary
 
end</nowiki>
 
 
 
=== structure TypeHolder ===
 
[[File:Hash table 5 0 1 1 1 1 1 LL.svg]]
 
 
 
[https://smlfamily.github.io/Basis/array.html SML Array Structure]
 
 
 
Change the definition of <code>type (''k,'v) dictionary</code> to support HashedDictionary.
 
 
 
<nowiki>structure TypeHolder = struct
 
 
(* TODO: replace unit with the type you decide upon *)
 
(* TODO: replace unit with the type you decide upon *)
 
type (''k,'v) dictionary = unit
 
type (''k,'v) dictionary = unit
end</nowiki>
+
  
=== structure HashedHasEntries ===
+
    type ''k create_parameter_type = (int * (''k hash_function))
<nowiki>structure HashedHasEntries = HasEntriesFn (struct
 
type (''k,'v) dictionary = (''k,'v) TypeHolder.dictionary
 
fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
 
raise Fail "NotYetImplemented"
 
end)</nowiki>
 
====entries====
 
Define the entries method so we can take advantage of the <code>keys</code> and <code>values</code> functions you implemented on <code>functor HasEntriesFn</code>.
 
  
=== structure HashedHasChaining ===
+
    fun create(bucket_count_request : int, hash : ''k hash_function) : (''k,'v) dictionary =
 
+
        raise Fail "NotYetImplemented"
<nowiki>structure HashedHasChaining = HasChainingFn (struct
 
type (''k,'v) dictionary = (''k,'v) TypeHolder.dictionary
 
  
 
fun positive_remainder(v : int, n : int) : int =  
 
fun positive_remainder(v : int, n : int) : int =  
Line 214: Line 192:
 
end  
 
end  
  
fun getChainOfEntriesForKey(dict : (''k,'v) dictionary, key : ''k) : (''k*'v) list =
+
 +
 
 +
fun getChainForKey(dict : (''k,'v) dictionary, key : ''k) : (''k*'v) list =
 +
raise Fail "NotYetImplemented"
 +
 
 +
fun updateChainForKey(dict : (''k,'v) dictionary, key : ''k, updater_function) : (''k,'v) dictionary * 'v option =
 +
raise Fail "NotYetImplemented"
 +
 
 +
 
 +
    fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option =
 +
        raise Fail "NotYetImplemented"
 +
 
 +
    fun put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
 +
        raise Fail "NotYetImplemented"
 +
 
 +
    fun remove(dict : (''k,'v) dictionary, key : ''k) : (''k,'v) dictionary * 'v option =
 
raise Fail "NotYetImplemented"
 
raise Fail "NotYetImplemented"
  
fun setChainOfEntriesForKey(dict : (''k,'v) dictionary, key : ''k, nextEntries : (''k*'v) list) : unit =
+
fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
 
raise Fail "NotYetImplemented"
 
raise Fail "NotYetImplemented"
 +
 
end)</nowiki>
 
end)</nowiki>
  
====getChainOfEntriesForKey====
+
[[File:Hash table 5 0 1 1 1 1 1 LL.svg]]
====setChainOfEntriesForKey====
+
 
 +
===type dictionary===
 +
Change the definition of <nowiki>type (''k,'v) dictionary</nowiki> to support HashedDictionary.
 +
 
 +
(* TODO: replace unit with the type you decide upon *)
 +
type (''k,'v) dictionary = unit
 +
 
 +
===type create_parameter_type===
 +
Leave create_parameter_type unchanged.  HashedDictionary's create method accepts two parameters: an int for the requested bucket count and a hash_function.
 +
 
 +
<nowiki>type ''k create_parameter_type = (int * (''k hash_function))</nowiki>
 +
 
 +
=== create ===
 +
<nowiki>fun create() : (''k,'v) dictionary =  
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
=== getChainForKey ===
 +
<nowiki>fun getChainForKey(dict : (''k,'v) dictionary, key : ''k) : (''k*'v) list =
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
=== updateChainForKey ===
 +
<nowiki>fun updateChainForKey(dict : (''k,'v) dictionary, key : ''k, updater_function) : (''k,'v) dictionary * 'v option =
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
=== get ===
 +
<nowiki>fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option =
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
remember to use the Chain structure.
 +
 
 +
=== put ===
 +
<nowiki>put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
remember to use the Chain structure.
 +
 
 +
=== remove ===
 +
<nowiki>fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
 +
raise Fail "NotYetImplemented"</nowiki>
 +
 
 +
remember to use the Chain structure.
 +
 
 +
=== entries ===
 +
Define the entries method so we can take advantage of the <code>keys</code> and <code>values</code> functions you implemented on <code>functor DictionaryFn</code>.
  
=== create function ===
+
  <nowiki>fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
  <nowiki>fun create(bucket_count_request : int, hash : ''k hash_function) : (''k,'v) dictionary =  
 
 
raise Fail "NotYetImplemented"</nowiki>
 
raise Fail "NotYetImplemented"</nowiki>
  

Revision as of 01:13, 12 October 2022

Motivation

In this and the follow up Sorted Dictionary studio, you will build three implementations of a dictionary. Each will be a persistent, mutable data structure, so you can expect to use the ref feature of SML, either directly or via the mutable Array structure.

Background

SML structure Vector

tabulate : int * (int -> 'a) -> 'a vector
length : 'a vector -> int
sub : a vector * int -> 'a
update : 'a vector * int * 'a -> 'a vector
foldl : ('a * 'b -> 'b) -> 'b -> 'a vector -> 'b

SML structure Option

datatype 'a option = NONE | SOME of 'a

HashTable

HashTable on Wikipedia

Code To Investigate

signature Dictionary

Note the include in the DICTIONARY signature.

signature DICTIONARY_FUNCTOR_PARAMETER = sig
    type (''k,'v) dictionary
    type ''k create_parameter_type
    val create : ''k create_parameter_type -> (''k,'v) dictionary
    val get : ((''k,'v) dictionary *''k) -> 'v option
    val put : ((''k,'v) dictionary *''k *'v) -> (''k,'v) dictionary * 'v option
    val remove : ((''k,'v) dictionary *''k) -> (''k,'v) dictionary * 'v option
    val entries : (''k,'v) dictionary -> (''k*'v) list
end

signature DICTIONARY = sig include DICTIONARY_FUNCTOR_PARAMETER
    val keys : (''k,'v) dictionary -> ''k list
    val values : (''k,'v) dictionary -> 'v list
end

create

Specific to each structure. Creates an empty immutable Dictionary.

get

Behaves much like java.util.Map<K,V>'s get(key) method except instead of returning null or the associated value, it returns an option.

put

Behaves much like java.util.Map<K,V>'s put(key,value) method except

  1. instead of mutating the provided dictionary, it returns an immutable updated dictionary.
  2. instead of returning null or the previously associated value, it returns an option of the associate value (along with the updated dictionary in a 2-tuple).

remove

Behaves much like java.util.Map<K,V>'s remove(key) method except instead of returning null or the previously associated value, it returns an option.

  1. instead of mutating the provided dictionary, it returns an immutable updated dictionary.
  2. instead of returning null or the previously associated value, it returns an option of the associate value (along with the updated dictionary in a 2-tuple).

entries

Behaves much like java.util.Map<K,V>'s entrySet() method.

keys

Behaves much like java.util.Map<K,V>'s keySet() method.

values

Behaves much like java.util.Map<K,V>'s values() method.

Code To Implement

functor DictionaryFn

Each implementation of dictionary can reuse the same functions which given a list of entries produce the keys and values. functor DictionaryFn accepts a signature parameter which defines all of the necessary functions besides keys and values. Critically, this includes the entries function which can be employed to support the keys and values functions.

keys

One of List's higher order functions can be useful here. Which one is it?

fun keys(dict : (''k,'v) dictionary) : ''k list = 
	raise Fail "NotYetImplemented"

values

One of List's higher-order functions can be useful here. Which one is it?

fun values(dict : (''k,'v) dictionary) : 'v list = 
	raise Fail "NotYetImplemented"

structure Chain

Both the SingleChainedDictionary and the HashedDictionary will leverage chaining to deal with collisions. A collision is where more than one (key, value) entry wants to be store in the same chain (a.k.a. list of (key, value) entries). For the SingleChainedDictionary, every unique key will "collide" with every other one since all of them will be stored in the single chain of the SingleChainedDictionary. For the HashedDictionary, there should be fewer collisions since there will be a multiple chains to spread the load, but handling collisions should be similar to the SingleChainedDictionary once the appropriate chain is determined.

We implement structure Chain so we can use the code in both SingleChainedDictionary and HashedDictionary.

signature CHAIN = sig
	val get : (''k*'v) list * ''k -> 'v option
	val put : (''k*'v) list * ''k * 'v -> ((''k*'v) list * 'v option)
	val remove : (''k*'v) list * ''k -> ((''k*'v) list * 'v option)
end

get

fun get(chain : (''k*'v) list, key:''k) : 'v option =
    raise Fail "NotYetImplemented"

put

fun put(chain : (''k*'v) list, key:''k, value:'v) : (''k*'v) list * 'v option =
    raise Fail "NotYetImplemented"

remove

fun remove(chain : (''k*'v) list, key : ''k) : (''k*'v) list * 'v option =
    raise Fail "NotYetImplemented"

structure SingleChainedDictionary

One can imagine the utility of having a simple Dictionary without the overhead of a HashTable. Perhaps, the number of anticipated entries is small. Perhaps, its performance is not critical. Maybe the client wants to use a mutable key which would create an invalid HashedDictionary. Whatever the reason, it seems like a reasonable and straightforward implementation of Dictionary to provide.

structure SingleChainedDictionary = DictionaryFn(struct
    
    (* TODO: replace unit with the type you decide upon *)
    type (''k,'v) dictionary = unit

    type ''k create_parameter_type = unit
    
    fun create() : (''k,'v) dictionary = 
        raise Fail "NotYetImplemented"

    fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option = 
        raise Fail "NotYetImplemented"

    fun put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
        raise Fail "NotYetImplemented"
	
    fun remove(dict : (''k,'v) dictionary, key : ''k) : (''k,'v) dictionary * 'v option =
        raise Fail "NotYetImplemented"

    fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
        raise Fail "NotYetImplemented"

end)

Single chained dictionary.svg

type dictionary

Change the definition of type (''k,'v) dictionary to support SingleChainedDictionary.

(* TODO: replace unit with the type you decide upon *) type (k,'v) dictionary = unit

type create_parameter_type

Leave create_parameter_type unchanged. SingleChainedDictionary's create method accepts no parameters, so unit is the correct type.

type ''k create_parameter_type = unit

create function

fun create() : (''k,'v) dictionary = 
	raise Fail "NotYetImplemented"

get

fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option = 
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

put

put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

remove

fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

entries

Define the entries method so we can take advantage of the keys and values functions you implemented on functor DictionaryFn.

fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
	raise Fail "NotYetImplemented"

structure HashedDictionary

Hash table


structure HashedDictionary = DictionaryFn(struct
	type ''k hash_function = ''k -> int

	
	(* TODO: replace unit with the type you decide upon *)
	type (''k,'v) dictionary = unit
	

    type ''k create_parameter_type = (int * (''k hash_function))

    fun create(bucket_count_request : int, hash : ''k hash_function) : (''k,'v) dictionary = 
        raise Fail "NotYetImplemented"

	fun positive_remainder(v : int, n : int) : int = 
		let
			val result = v mod n 
		in 
			if result >= 0 then result else result+n
		end 

	

	fun getChainForKey(dict : (''k,'v) dictionary, key : ''k) : (''k*'v) list =
		raise Fail "NotYetImplemented"

	fun updateChainForKey(dict : (''k,'v) dictionary, key : ''k, updater_function) : (''k,'v) dictionary * 'v option =
		raise Fail "NotYetImplemented"


    fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option = 
        raise Fail "NotYetImplemented"

    fun put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
        raise Fail "NotYetImplemented"

    fun remove(dict : (''k,'v) dictionary, key : ''k) : (''k,'v) dictionary * 'v option =
		raise Fail "NotYetImplemented"

	fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
		raise Fail "NotYetImplemented"

end)

Hash table 5 0 1 1 1 1 1 LL.svg

type dictionary

Change the definition of type (''k,'v) dictionary to support HashedDictionary.

(* TODO: replace unit with the type you decide upon *) type (k,'v) dictionary = unit

type create_parameter_type

Leave create_parameter_type unchanged. HashedDictionary's create method accepts two parameters: an int for the requested bucket count and a hash_function.

type ''k create_parameter_type = (int * (''k hash_function))

create

fun create() : (''k,'v) dictionary = 
	raise Fail "NotYetImplemented"

getChainForKey

fun getChainForKey(dict : (''k,'v) dictionary, key : ''k) : (''k*'v) list =
	raise Fail "NotYetImplemented"

updateChainForKey

fun updateChainForKey(dict : (''k,'v) dictionary, key : ''k, updater_function) : (''k,'v) dictionary * 'v option =
	raise Fail "NotYetImplemented"

get

fun get(dict : (''k,'v) dictionary, key : ''k) : 'v option = 
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

put

put(dict : (''k,'v) dictionary, key : ''k , value : 'v) : (''k,'v) dictionary * 'v option =
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

remove

fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
	raise Fail "NotYetImplemented"

remember to use the Chain structure.

entries

Define the entries method so we can take advantage of the keys and values functions you implemented on functor DictionaryFn.

fun entries(dict : (''k,'v) dictionary) : (''k*'v) list =
	raise Fail "NotYetImplemented"

Testing

source folder: src/test/sml/dictionary/chained
how to run with CM.make verbosity off: sml -Ccm.verbose=false run_chained_testing.sml
how to run with CM.make verbosity on: sml run_chained_testing.sml

note: ensure that you have removed all printing to receive credit for any assignment.

SML Error Messages

Pledge, Acknowledgments, Citations

file: exercise-chained-dictionary-pledge-acknowledgments-citations.txt

More info about the Honor Pledge