Pseudo Random Number Generation: a Reinforcement Learning Approach

Luca Pasqualini, Maurizio Parton

GitHub for a reinforcement learning research project consisting in simulating a novel Random Number Generator (RNG) by Deep Reinforcement Learning.

A RNG is an algorithm generating pseudo-random numbers and in this research project it is approximated by a Deep Neural Network using Reinforcement Learning. The network is trained to "randomly" generate a novel algorithm by Reinforcement Learning, using a deep agent to solve a navigation problem. This navigation task is defined by an N-dimensional environment in which said agent can "move". Starting from a seed state the agent learns how to "move" in the N-dimensional environment in order to reach state with high rewards. The reward is given by the result of the NIST test battery on the sequence at each time step or only at the last time step.

Additional information are given in the related arXiv article.

Link to the published article.

The algorithms used are:

Dueling Double DQN (DDDQN) with Prioritized Experience Replay and Gradient-Clipping by using Huber loss
Vanilla Policy Gradient (VPG) with rewards-to-go and Generalized Advantage Estimation (GAE-Lambda) buffer
Proximal Policy Optimization (PPO) with rewards-to-go and Generalized Advantage Estimation (GAE-Lambda) buffer

License

The same of the article.

Framework used

To run the NIST test battery NistRng (nistrng package) python implementation framework is used.
To execute reinforcement learning the framework USienaRL (usienarl package) is used.

Compatible with Usienarl v0.5.0

Backend

Python 3.6
Tensorflow 1.10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pseudo Random Number Generation: a Reinforcement Learning Approach

Luca Pasqualini, Maurizio Parton

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
experiments		experiments
src		src
README.rst		README.rst
rng_discrete_binary_ppo_e_s25_r127-128_ss.py		rng_discrete_binary_ppo_e_s25_r127-128_ss.py
rng_discrete_binary_ppo_e_s25_r127-128_ss_th.py		rng_discrete_binary_ppo_e_s25_r127-128_ss_th.py
rng_discrete_binary_ppo_e_s25_r127-128_ss_th_hr.py		rng_discrete_binary_ppo_e_s25_r127-128_ss_th_hr.py
rng_discrete_binary_vpg_e_s10_r127-128.py		rng_discrete_binary_vpg_e_s10_r127-128.py
rng_discrete_binary_vpg_e_s10_r127-128_ss.py		rng_discrete_binary_vpg_e_s10_r127-128_ss.py
rng_discrete_binary_vpg_e_s25_r127-128_ss.py		rng_discrete_binary_vpg_e_s25_r127-128_ss.py
rng_discrete_binary_vpg_e_s50_r127-128_ss.py		rng_discrete_binary_vpg_e_s50_r127-128_ss.py
rng_discrete_numeric_dddql_ne_s10_r5-5.py		rng_discrete_numeric_dddql_ne_s10_r5-5.py
rng_discrete_numeric_dddql_ne_s10_r5-5_a035.py		rng_discrete_numeric_dddql_ne_s10_r5-5_a035.py
rng_discrete_numeric_vpg_ne_s10_r5-5.py		rng_discrete_numeric_vpg_ne_s10_r5-5.py
rng_discrete_numeric_vpg_ne_s10_r5-5_a035.py		rng_discrete_numeric_vpg_ne_s10_r5-5_a035.py
rng_discrete_numeric_vpg_ne_s25_r127-128.py		rng_discrete_numeric_vpg_ne_s25_r127-128.py
rng_discrete_numeric_vpg_ne_s25_r25-25.py		rng_discrete_numeric_vpg_ne_s25_r25-25.py
rng_discrete_numeric_vpg_ne_s25_r50-50.py		rng_discrete_numeric_vpg_ne_s25_r50-50.py
rng_discrete_numeric_vpg_ne_s50_r127-128.py		rng_discrete_numeric_vpg_ne_s50_r127-128.py

Folders and files

Latest commit

History

Repository files navigation

Pseudo Random Number Generation: a Reinforcement Learning Approach

Luca Pasqualini, Maurizio Parton

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages