I've got an embedded routine that copies a few bytes into a buffer, and it is really slow even with optimizations enabled. What is the best place to ask for guidance on making it faster? (I really don't want to rewrite it in C or assembly) ``` test ```