This is (AFAIK) a specific question within this general topic.
Here\'s the situation:
I have an embedded system (a video game console) based on a 32-bit RISC
GCC has such a routine for many processors, named _divdi3 (usually implemented using a common divmod call). Here's one. Some Unix kernels have an implementation too, e.g. FreeBSD.