profiling: Build profiling libraries we distribute with -fprof-late
The profiling libraries we currently build are pretty useless for profiling because they don't contain any cost centres. We should add cost centres using -fprof-late and distribute those so profile outputs are more useful for anyone trying to profile.