Learning-based energy management in multi-cell interference networks