Satellite Reorientation Using Reinforcement Learning Under Unknown Attitude Failure