Vybrané partie z dátových štruktúr
2-INF-237, LS 2016/17
Perzistentné dátové štruktúry
Z VPDS
Sources
- lectures L01, L02, L03 by Erika Demaina from MIT: http://courses.csail.mit.edu/6.851/spring12/lectures/
Persistent data structures
- keep old versions of the data structure
- partial persistence: update only current version, query any old version, linearly ordered versions
- full persistence: update any version, versions form a tree
- confluent persistence: combine versions together, versions form a DAG
- fully functional: never modify nodes, only create new
- backtracking: query and update only current version, revert to an older version
- retroactive: insert updates to the past, delete past updates, query at any past time relative to current set of updates
Pointer machine
- model of computation, without arrays
- constant-size nodes, holding data (numbers, characters, etc) and pointers to other nodes
- pointers can be assigned: address of a newly created node, copy of another pointer, null
- no pointer arithmetic
- root node with all global variables, all variable names in the program of the form root.current.left.x
General transformation with node copying
- Algorithm see slides
First, let us prove that the algorithm for the update will terminate
- update on one node may trigger other updates etc, in the worst case the each newest node might be replaced by another
- However, no node is replaced by a new node twice within one recursive update
- Assume by contradiction that such anode exists and consider the first node that is rebuilt second time. Between its first and second rebuild it mist have acquired 2p mods, but these mods must have been caused by nodes that point to this node being rebuilt. There are p such nodes and none of them was rebuilt more than once because we consider the first node which is rebuilt twice.
- If we had only p-1 modes in each node, we could cause an infinite computation (in a complete graph with p nodes and all mods full)
Notes for the amortized analysis:
- as real cost, count the number of recursive calls (all other computation proportional to that)
- want to prove that the amortized cost of one update is 2
- if there is space in the node for another mod, real cost is 1 (only one call to modify), change of potential is 1, so amortized cost is 2
- if the node is full, real cost is 1, change of potential is -2p plus amortized cost of all subsequent recursive calls. By induction, each of these calls has amortized cost 2, so overall amortized cost is 1
- what is the induction on? The depth in the tree of recursive calls (leaves are the base cases where node not full). We have already proved that this node is finite.