Credit assignment in multiagent reinforcement learning for large agent population