Function approximation-based reinforcement learning for large-scale problem domains