I've got an embedded routine that copies a few bytes into a buffer, and it is really slow even with optimizations enabled.

What is the best place to ask for guidance on making it faster? (I really don't want to rewrite it in C or assembly.)

```
test
```