Model-free learning with imitation