Try to extend the functions in `TwoKeyPrp` to use `expand_8to16`, and check the performance of ggm tree generation.