Reinforcement learning (RL) represents a paradigm shift in smart building energy management: it enables control systems to adapt dynamically to changing environmental conditions and occupant behaviours.