Abstract
This study presents an energy manager (EM) based on reinforcement learning (RL) for low-power wearable IoT devices with solar energy harvesting used in precision livestock farming (PLF). The system utilizes battery level, device orientation, weather forecasts, and time of day to dynamically adjust the device’s energy consumption by modifying the monitoring duty cycle, the number of transmissions, and GPS executions to balance the energy harvested from flexible solar panels with energy consumption. Using the twin delayed deep deterministic policy gradient (TD3) algorithm, the EM learns how to perform these adjustments. Model-based simulations over extended periods, along with real-world deployments, demonstrate the system’s ability to achieve energy neutral operation (ENO), outperforming state-of-the-art algorithm in terms of harvested energy utilization. Additionally, a statistical equivalence test (TOST) is also applied to formally validate that the system maintains energy neutrality. The lightweight neural network design ensures compatibility with resource-constrained devices. This work addresses the scarcity of RL-based EM implementations in real-world wearables and PLF applications and demonstrates their potential for use in such scenarios.
Links
Citation
Iglesias, D., & Muñoz-Poblete, C. (2025). Energy Manager Based on Reinforcement Learning for a Low-Power IoT Collar Worn by Dairy Cows. IEEE Internet of Things Journal, 12(16), 34374–34391. doi:10.1109/JIOT.2025.3577898
@ARTICLE{11028906,
author={Iglesias, Daniel and Muñoz-Poblete, Carlos},
journal={IEEE Internet of Things Journal},
title={Energy Manager Based on Reinforcement Learning for a Low-Power IoT Collar Worn by Dairy Cows},
year={2025},
volume={12},
number={16},
pages={34374-34391},
keywords={Batteries;Internet of Things;Wearable devices;Energy harvesting;Wireless communication;Reinforcement learning;Body area networks;Sensors;Energy consumption;Biomedical monitoring;Energy harvesting (EH);energy manager (EM);Internet of Things (IoT);precision livestock farming (PLF);reinforcement learning (RL);twin delayed deep deterministic policy gradient (TD3);wearable},
doi={10.1109/JIOT.2025.3577898}}