Reinforcement Learning For Continuing Problems Using Average Reward