introduce extra register of delay to split combinatorial loops