Reinforcement learning approach to product allocation and storage