Minimalistic Adaptive Dynamic-Programming Agents for Memory-Driven Exploration