So basically I\'ve spent the past two weeks trying to figure out why my multithreaded command buffer recording has been so slow, and I\'m completely stumped. The problem is,