The Evil Mangler (EM) is a Perl script invoked by the Glorious Driver after the C compiler (gcc) has
      translated the GHC-produced C code into assembly.  Consequently, it is
      only of interest if -fvia-C is in effect (either explicitly
      or implicitly).
    
The EM reads the assembly produced by gcc and re-arranges code blocks as well as nukes instructions that it considers non-essential. It derives it evilness from its utterly ad hoc, machine, compiler, and whatnot dependent design and implementation. More precisely, the EM performs the following tasks:
      The EM is located in the Perl script ghc-asm.lprl.
      The script reads the .s file and chops it up into
      chunks (that's how they are actually called in the script) that
      roughly correspond to basic blocks.  Each chunk is annotated with an
      educated guess about what kind of code it contains (e.g., infotable,
      fast entry point, slow entry point, etc.).  The annotations also contain
      the symbol introducing the chunk of assembly and whether that chunk has
      already been processed or not.
    
The parsing of the input into chunks as well as recognising assembly instructions that are to be removed or altered is based on a large number of Perl regular expressions sprinkled over the whole code. These expressions are rather fragile as they heavily rely on the structure of the generated code - in fact, they even rely on the right amount of white space and thus on the formatting of the assembly.
Afterwards, the chunks are reordered, some of them purged, and some stripped of some useless instructions. Moreover, some instructions are manipulated (eg, loads of fast entry points followed by indirect jumps are replaced by direct jumps to the fast entry point).
      The EM knows which part of the code belongs to function prologues and
      epilogues as STG C adds tags of the
      form --- BEGIN --- and --- END --- the
      assembler just before and after the code proper of a function starts.
      It adds these tags using gcc's __asm__ feature.
    
      Update: Gcc 2.96 upwards performs more aggressive basic
      block re-ordering and dead code elimination.  This seems to make the
      whole --- END --- tag business redundant -- in fact, if
      proper code is generated, no --- END --- tags survive gcc
      optimiser. 
    
Last modified: Sun Feb 17 17:55:47 EST 2002