separate_compilation.rst 45.9 KB
Newer Older
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
.. _separate-compilation:

Filenames and separate compilation
==================================

.. index::
   single: separate compilation
   single: recompilation checker
   single: make and recompilation

This section describes what files GHC expects to find, what files it
creates, where these files are stored, and what options affect this
behaviour.

Pathname conventions vary from system to system. In particular, the
directory separator is "``/``" on Unix systems and "``\``" on
Windows systems. In the sections that follow, we shall consistently use
"``/``" as the directory separator; substitute this for the
appropriate character for your system.

.. _source-files:

Haskell source files
--------------------

.. index::
   single: file names; of source files

Each Haskell source module should be placed in a file on its own.

Usually, the file should be named after the module name, replacing dots
in the module name by directory separators. For example, on a Unix
system, the module ``A.B.C`` should be placed in the file ``A/B/C.hs``,
relative to some base directory. If the module is not going to be
imported by another module (``Main``, for example), then you are free to
use any filename for it.

.. index::
   single: Unicode
   single: UTF-8
   single: ASCII
   single: Latin-1
   single: encodings; of source files

GHC assumes that source files are ASCII or UTF-8 only, other
encoding are not recognised. However, invalid UTF-8 sequences
will be ignored in comments, so it is possible to use other encodings
such as Latin-1, as long as the non-comment source code is ASCII
only.

.. _output-files:

Output files
------------

.. index::
   single: interface files
   single: .hi files
   single: object files
   single: .o files

When asked to compile a source file, GHC normally generates two files:
an object file, and an interface file.

The object file, which normally ends in a ``.o`` suffix, contains the
compiled code for the module.

The interface file, which normally ends in a ``.hi`` suffix, contains
the information that GHC needs in order to compile further modules that
depend on this module. It contains things like the types of exported
functions, definitions of data types, and so on. It is stored in a
72
binary format, so don't try to read one; use the :ghc-flag:`--show-iface` option
73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101
instead (see :ref:`hi-options`).

You should think of the object file and the interface file as a pair,
since the interface file is in a sense a compiler-readable description
of the contents of the object file. If the interface file and object
file get out of sync for any reason, then the compiler may end up making
assumptions about the object file that aren't true; trouble will almost
certainly follow. For this reason, we recommend keeping object files and
interface files in the same place (GHC does this by default, but it is
possible to override the defaults as we'll explain shortly).

Every module has a *module name* defined in its source code
(``module A.B.C where ...``).

The name of the object file generated by GHC is derived according to the
following rules, where ⟨osuf⟩ is the object-file suffix (this can be
changed with the ``-osuf`` option).

-  If there is no ``-odir`` option (the default), then the object
   filename is derived from the source filename (ignoring the module
   name) by replacing the suffix with ⟨osuf⟩.

-  If ``-odir ⟨dir⟩`` has been specified, then the object filename is
   ⟨dir⟩/⟨mod⟩.⟨osuf⟩, where ⟨mod⟩ is the module name with dots replaced
   by slashes. GHC will silently create the necessary directory
   structure underneath ⟨dir⟩, if it does not already exist.

The name of the interface file is derived using the same rules, except
that the suffix is ⟨hisuf⟩ (``.hi`` by default) instead of ⟨osuf⟩, and
102 103
the relevant options are :ghc-flag:`-hidir` and :ghc-flag:`-hisuf` instead of
:ghc-flag:`-odir` and :ghc-flag:`-osuf` respectively.
104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122

For example, if GHC compiles the module ``A.B.C`` in the file
``src/A/B/C.hs``, with no ``-odir`` or ``-hidir`` flags, the interface
file will be put in ``src/A/B/C.hi`` and the object file in
``src/A/B/C.o``.

For any module that is imported, GHC requires that the name of the
module in the import statement exactly matches the name of the module in
the interface file (or source file) found using the strategy specified
in :ref:`search-path`. This means that for most modules, the source file
name should match the module name.

However, note that it is reasonable to have a module ``Main`` in a file
named ``foo.hs``, but this only works because GHC never needs to search
for the interface for module ``Main`` (because it is never imported). It
is therefore possible to have several ``Main`` modules in separate
source files in the same directory, and GHC will not get confused.

In batch compilation mode, the name of the object file can also be
123 124
overridden using the :ghc-flag:`-o` option, and the name of the interface file
can be specified directly using the :ghc-flag:`-ohi` option.
125 126 127 128 129 130 131 132 133 134 135 136

.. _search-path:

The search path
---------------

.. index::
   single: search path
   single: interface files, finding them
   single: finding interface files

In your program, you import a module ``Foo`` by saying ``import Foo``.
137 138
In :ghc-flag:`--make` mode or GHCi, GHC will look for a source file for ``Foo``
and arrange to compile it first. Without :ghc-flag:`--make`, GHC will look for
139 140 141 142 143 144 145 146 147 148
the interface file for ``Foo``, which should have been created by an
earlier compilation of ``Foo``. GHC uses the same strategy in each of
these cases for finding the appropriate file.

This strategy is as follows: GHC keeps a list of directories called the
search path. For each of these directories, it tries appending
``⟨basename⟩.⟨extension⟩`` to the directory, and checks whether the
file exists. The value of ⟨basename⟩ is the module name with dots
replaced by the directory separator ("``/``" or "``\\"``, depending on the
system), and ⟨extension⟩ is a source extension (``hs``, ``lhs``) if we
149
are in :ghc-flag:`--make` mode or GHCi, or ⟨hisuf⟩ otherwise.
150 151

For example, suppose the search path contains directories ``d1``,
152
``d2``, and ``d3``, and we are in :ghc-flag:`--make` mode looking for the source
153 154 155 156 157 158 159
file for a module ``A.B.C``. GHC will look in ``d1/A/B/C.hs``,
``d1/A/B/C.lhs``, ``d2/A/B/C.hs``, and so on.

The search path by default contains a single directory: "``.``" (i.e. the
current directory). The following options can be used to add to or change the
contents of the search path:

160 161
.. ghc-flag:: -i⟨dir⟩[:⟨dir⟩]*

162
    .. index::
163
       single: search path; source code
164 165 166 167

    This flag appends a colon-separated list of ``dirs`` to
    the search path.

168
.. ghc-flag:: -i
169 170 171 172 173 174 175 176 177 178 179 180 181 182 183
    resets the search path back to nothing.

This isn't the whole story: GHC also looks for modules in pre-compiled
libraries, known as packages. See the section on packages
(:ref:`packages`) for details.

.. _options-output:

Redirecting the compilation output(s)
-------------------------------------

.. index::
   single: output-directing options
   single: redirecting compilation output

184
.. ghc-flag:: -o file
185 186 187 188 189 190 191 192 193 194 195 196 197

    GHC's compiled output normally goes into a ``.hc``, ``.o``, etc.,
    file, depending on the last-run compilation phase. The option
    ``-o file`` re-directs the output of that last-run phase to ⟨file⟩.

    .. note::
       This “feature” can be counterintuitive: ``ghc -C -o foo.o foo.hs``
       will put the intermediate C code in the file ``foo.o``, name
       notwithstanding!

    This option is most often used when creating an executable file, to
    set the filename of the executable. For example:

198
    .. code-block:: none
199 200 201 202 203 204 205 206 207 208

        ghc -o prog --make Main

    will compile the program starting with module ``Main`` and put the
    executable in the file ``prog``.

    Note: on Windows, if the result is an executable file, the extension
    "``.exe``" is added if the specified filename does not already have
    an extension. Thus

209
    .. code-block:: none
210 211 212 213 214 215 216 217 218 219 220

        ghc -o foo Main.hs

    will compile and link the module ``Main.hs``, and put the resulting
    executable in ``foo.exe`` (not ``foo``).

    If you use ``ghc --make`` and you don't use the ``-o``, the name GHC
    will choose for the executable will be based on the name of the file
    containing the module ``Main``. Note that with GHC the ``Main``
    module doesn't have to be put in file ``Main.hs``. Thus both

221
    .. code-block:: none
222 223 224 225 226

        ghc --make Prog

    and

227
    .. code-block:: none
228 229 230 231 232

        ghc --make Prog.hs

    will produce ``Prog`` (or ``Prog.exe`` if you are on Windows).

233
.. ghc-flag:: -odir ⟨dir⟩
234 235 236

    Redirects object files to directory ⟨dir⟩. For example:

237
    .. code-block:: none
238 239 240 241 242 243 244 245 246 247 248 249

        $ ghc -c parse/Foo.hs parse/Bar.hs gurgle/Bumble.hs -odir `uname -m`

    The object files, ``Foo.o``, ``Bar.o``, and ``Bumble.o`` would be
    put into a subdirectory named after the architecture of the
    executing machine (``x86``, ``mips``, etc).

    Note that the ``-odir`` option does *not* affect where the interface
    files are put; use the ``-hidir`` option for that. In the above
    example, they would still be put in ``parse/Foo.hi``,
    ``parse/Bar.hi``, and ``gurgle/Bumble.hi``.

250
.. ghc-flag:: -ohi ⟨file⟩
251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266

    The interface output may be directed to another file
    ``bar2/Wurble.iface`` with the option ``-ohi bar2/Wurble.iface``
    (not recommended).

    .. warning::
       If you redirect the interface file somewhere that GHC can't
       find it, then the recompilation checker may get confused (at the
       least, you won't get any recompilation avoidance). We recommend
       using a combination of ``-hidir`` and ``-hisuf`` options instead, if
       possible.

    To avoid generating an interface at all, you could use this option
    to redirect the interface into the bit bucket: ``-ohi /dev/null``,
    for example.

267
.. ghc-flag:: -hidir ⟨dir⟩
268 269 270 271

    Redirects all generated interface files into ⟨dir⟩, instead of the
    default.

272
.. ghc-flag:: -stubdir ⟨dir⟩
273 274 275 276 277 278 279 280

    Redirects all generated FFI stub files into ⟨dir⟩. Stub files are
    generated when the Haskell source contains a ``foreign export`` or
    ``foreign import "&wrapper"`` declaration (see
    :ref:`foreign-export-ghc`). The ``-stubdir`` option behaves in
    exactly the same way as ``-odir`` and ``-hidir`` with respect to
    hierarchical modules.

281
.. ghc-flag:: -dumpdir ⟨dir⟩
282 283 284 285

    Redirects all dump files into ⟨dir⟩. Dump files are generated when
    ``-ddump-to-file`` is used with other ``-ddump-*`` flags.

286
.. ghc-flag:: -outputdir ⟨dir⟩
287 288

    The ``-outputdir`` option is shorthand for the combination of
289
    :ghc-flag:`-odir`, :ghc-flag:`-hidir`, :ghc-flag:`-stubdir` and :ghc-flag:`-dumpdir`.
290

291 292 293
.. ghc-flag:: -osuf ⟨suffix⟩
              -hisuf ⟨suffix⟩
              -hcsuf ⟨suffix⟩
294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309

    The ``-osuf`` ⟨suffix⟩ will change the ``.o`` file suffix for object
    files to whatever you specify. We use this when compiling libraries,
    so that objects for the profiling versions of the libraries don't
    clobber the normal ones.

    Similarly, the ``-hisuf`` suffix will change the ``.hi`` file
    suffix for non-system interface files (see :ref:`hi-options`).

    Finally, the option ``-hcsuf`` suffix will change the ``.hc`` file
    suffix for compiler-generated intermediate C files.

    The ``-hisuf``/``-osuf`` game is particularly useful if you want to
    compile a program both with and without profiling, in the same
    directory. You can say:

310
    .. code-block:: none
311 312 313 314 315

        ghc ...

    to get the ordinary version, and

316
    .. code-block:: none
317

318
        ghc ... -osuf prof.o -hisuf prof.hi -prof -fprof-auto
319 320 321 322 323 324 325 326 327 328 329 330 331 332

    to get the profiled version.

.. _keeping-intermediates:

Keeping Intermediate Files
--------------------------

.. index::
   single: intermediate files, saving
   single: .hc files, saving
   single: .ll files, saving
   single: .s files, saving

333 334 335
The following options are useful for keeping (or not keeping) certain
intermediate files around, when normally GHC would throw these away after
compilation:
336

337
.. ghc-flag:: -keep-hc-file
338
              -keep-hc-files
339 340 341 342 343

    Keep intermediate ``.hc`` files when doing ``.hs``-to-``.o``
    compilations via :ref:`C <c-code-gen>` (Note: ``.hc`` files are only
    generated by :ref:`unregisterised <unreg>` compilers).

344 345 346 347 348 349 350 351
.. ghc-flag:: -keep-hi-files

    .. index::
       single: temporary files; keeping

   Keep intermediate ``.hi`` files. This is the default. You may use
   ``-no-keep-hi-files`` if you are not interested in the ``.hi`` files.

352
.. ghc-flag:: -keep-llvm-file
353 354 355
              -keep-llvm-files

    :implies: :ghc-flag:`-fllvm`
356 357 358 359

    Keep intermediate ``.ll`` files when doing ``.hs``-to-``.o``
    compilations via :ref:`LLVM <llvm-code-gen>` (Note: ``.ll`` files
    aren't generated when using the native code generator, you may need
360
    to use :ghc-flag:`-fllvm` to force them to be produced).
361

362 363 364 365 366 367 368 369
.. ghc-flag:: -keep-o-files

    .. index::
       single: temporary files; keeping

   Keep intermediate ``.o`` files. This is the default. You may use
   ``-no-keep-o-files`` if you are not interested in the ``.o`` files.

370
.. ghc-flag:: -keep-s-file
371
              -keep-s-files
372 373 374

    Keep intermediate ``.s`` files.

375 376
.. ghc-flag:: -keep-tmp-files

377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392
    .. index::
       single: temporary files; keeping

    Instructs the GHC driver not to delete any of its temporary files,
    which it normally keeps in ``/tmp`` (or possibly elsewhere; see
    :ref:`temp-files`). Running GHC with ``-v`` will show you what
    temporary files were generated along the way.

.. _temp-files:

Redirecting temporary files
---------------------------

.. index::
   single: temporary files; redirecting

393
.. ghc-flag:: -tmpdir
394 395 396 397 398 399 400

    If you have trouble because of running out of space in ``/tmp`` (or
    wherever your installation thinks temporary files should go), you
    may use the ``-tmpdir <dir>``-tmpdir <dir> option option to specify an
    alternate directory. For example, ``-tmpdir .`` says to put temporary files
    in the current working directory.

401 402 403 404 405 406
    .. index::
        single: TMPDIR environment variable

    Alternatively, use your :envvar:`TMPDIR` environment variable. Set it to the
    name of the directory where temporary files should be put. GCC and other
    programs will honour the :envvar:`TMPDIR` variable as well.
407 408 409 410 411 412 413 414 415

.. _hi-options:

Other options related to interface files
----------------------------------------

.. index::
   single: interface files, options

416
.. ghc-flag:: -ddump-hi
417 418 419

    Dumps the new interface to standard output.

420
.. ghc-flag:: -ddump-hi-diffs
421 422 423

    The compiler does not overwrite an existing ``.hi`` interface file
    if the new one is the same as the old one; this is friendly to
424 425
    :command:`make`. When an interface does change, it is often enlightening to
    be informed. The :ghc-flag:`-ddump-hi-diffs` option will make GHC report the
426 427
    differences between the old and new ``.hi`` files.

428
.. ghc-flag:: -ddump-minimal-imports
429

430
    Dump to the file :file:`{M}.imports` (where ⟨M⟩ is the name of the module
431 432
    being compiled) a "minimal" set of import declarations. The
    directory where the ``.imports`` files are created can be controlled
433
    via the :ghc-flag:`-dumpdir` option.
434

435
    You can safely replace all the import declarations in :file:`{M}.hs` with
436 437 438 439 440 441
    those found in its respective ``.imports`` file. Why would you want
    to do that? Because the "minimal" imports (a) import everything
    explicitly, by name, and (b) import nothing that is not required. It
    can be quite painful to maintain this property by hand, so this flag
    is intended to reduce the labour.

442
.. ghc-flag:: --show-iface ⟨file⟩
443 444 445 446 447 448 449 450 451 452 453 454

    where ⟨file⟩ is the name of an interface file, dumps the contents of
    that interface in a human-readable format. See :ref:`modes`.

.. _recomp:

The recompilation checker
-------------------------

.. index::
   single: recompilation checker

455
.. ghc-flag:: -fforce-recomp
456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505

    Turn off recompilation checking (which is on by default).
    Recompilation checking normally stops compilation early, leaving an
    existing ``.o`` file in place, if it can be determined that the
    module does not need to be recompiled.

In the olden days, GHC compared the newly-generated ``.hi`` file with
the previous version; if they were identical, it left the old one alone
and didn't change its modification date. In consequence, importers of a
module with an unchanged output ``.hi`` file were not recompiled.

This doesn't work any more. Suppose module ``C`` imports module ``B``,
and ``B`` imports module ``A``. So changes to module ``A`` might require
module ``C`` to be recompiled, and hence when ``A.hi`` changes we should
check whether ``C`` should be recompiled. However, the dependencies of
``C`` will only list ``B.hi``, not ``A.hi``, and some changes to ``A``
(changing the definition of a function that appears in an inlining of a
function exported by ``B``, say) may conceivably not change ``B.hi`` one
jot. So now…

GHC calculates a fingerprint (in fact an MD5 hash) of each interface
file, and of each declaration within the interface file. It also keeps
in every interface file a list of the fingerprints of everything it used
when it last compiled the file. If the source file's modification date
is earlier than the ``.o`` file's date (i.e. the source hasn't changed
since the file was last compiled), and the recompilation checking is on,
GHC will be clever. It compares the fingerprints on the things it needs
this time with the fingerprints on the things it needed last time
(gleaned from the interface file of the module being compiled); if they
are all the same it stops compiling early in the process saying
Compilation IS NOT required. What a beautiful sight!

You can read about `how all this
works <http://ghc.haskell.org/trac/ghc/wiki/Commentary/Compiler/RecompilationAvoidance>`__
in the GHC commentary.

.. _mutual-recursion:

How to compile mutually recursive modules
-----------------------------------------

.. index::
   single: module system, recursion
   single: recursion, between modules

GHC supports the compilation of mutually recursive modules. This section
explains how.

Every cycle in the module import graph must be broken by a ``hs-boot``
file. Suppose that modules ``A.hs`` and ``B.hs`` are Haskell source
506
files, thus: ::
507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531

    module A where
        import B( TB(..) )

        newtype TA = MkTA Int

        f :: TB -> TA
        f (MkTB x) = MkTA x

    module B where
        import {-# SOURCE #-} A( TA(..) )

        data TB = MkTB !Int

        g :: TA -> TB
        g (MkTA x) = MkTB x

``hs-boot`` files importing, ``hi-boot`` files Here ``A`` imports ``B``,
but ``B`` imports ``A`` with a ``{-# SOURCE #-}`` pragma, which breaks
the circular dependency. Every loop in the module import graph must be
broken by a ``{-# SOURCE #-}`` import; or, equivalently, the module
import graph must be acyclic if ``{-# SOURCE #-}`` imports are ignored.

For every module ``A.hs`` that is ``{-# SOURCE #-}``-imported in this
way there must exist a source file ``A.hs-boot``. This file contains an
532
abbreviated version of ``A.hs``, thus: ::
533 534 535 536 537 538

    module A where
        newtype TA = MkTA Int

To compile these three files, issue the following commands:

539
.. code-block:: none
540 541 542 543 544 545 546 547 548 549 550 551 552 553 554

      ghc -c A.hs-boot    -- Produces A.hi-boot, A.o-boot
      ghc -c B.hs         -- Consumes A.hi-boot, produces B.hi, B.o
      ghc -c A.hs         -- Consumes B.hi, produces A.hi, A.o
      ghc -o foo A.o B.o  -- Linking the program

There are several points to note here:

-  The file ``A.hs-boot`` is a programmer-written source file. It must
   live in the same directory as its parent source file ``A.hs``.
   Currently, if you use a literate source file ``A.lhs`` you must also
   use a literate boot file, ``A.lhs-boot``; and vice versa.

-  A ``hs-boot`` file is compiled by GHC, just like a ``hs`` file:

555
   .. code-block:: none
556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601

         ghc -c A.hs-boot

   When a hs-boot file ``A.hs-boot`` is compiled, it is checked for
   scope and type errors. When its parent module ``A.hs`` is compiled,
   the two are compared, and an error is reported if the two are
   inconsistent.

-  Just as compiling ``A.hs`` produces an interface file ``A.hi``, and
   an object file ``A.o``, so compiling ``A.hs-boot`` produces an
   interface file ``A.hi-boot``, and an pseudo-object file ``A.o-boot``:

   -  The pseudo-object file ``A.o-boot`` is empty (don't link it!), but
      it is very useful when using a Makefile, to record when the
      ``A.hi-boot`` was last brought up to date (see :ref:`using-make`).

   -  The ``hi-boot`` generated by compiling a ``hs-boot`` file is in
      the same machine-generated binary format as any other
      GHC-generated interface file (e.g. ``B.hi``). You can display its
      contents with ``ghc --show-iface``. If you specify a directory for
      interface files, the ``-ohidir`` flag, then that affects ``hi-boot`` files
      too.

-  If hs-boot files are considered distinct from their parent source
   files, and if a ``{-# SOURCE #-}`` import is considered to refer to
   the hs-boot file, then the module import graph must have no cycles.
   The command ``ghc -M`` will report an error if a cycle is found.

-  A module ``M`` that is ``{-# SOURCE #-}``\-imported in a program will
   usually also be ordinarily imported elsewhere. If not, ``ghc --make``
   automatically adds ``M`` to the set of modules it tries to compile
   and link, to ensure that ``M``\'s implementation is included in the
   final program.

A hs-boot file need only contain the bare minimum of information needed
to get the bootstrapping process started. For example, it doesn't need
to contain declarations for *everything* that module ``A`` exports, only
the things required by the module(s) that import ``A`` recursively.

A hs-boot file is written in a subset of Haskell:

-  The module header (including the export list), and import statements,
   are exactly as in Haskell, and so are the scoping rules. Hence, to
   mention a non-Prelude type or class, you must import it.

-  There must be no value declarations, but there can be type signatures
602
   for values. For example: ::
603 604 605 606 607 608 609 610 611 612

        double :: Int -> Int

-  Fixity declarations are exactly as in Haskell.

-  Vanilla type synonym declarations are exactly as in Haskell.

-  Open type and data family declarations are exactly as in Haskell.

-  A closed type family may optionally omit its equations, as in the
613
   following example: ::
614 615 616 617 618 619 620 621 622 623 624

        type family ClosedFam a where ..

   The ``..`` is meant literally -- you should write two dots in your
   file. Note that the ``where`` clause is still necessary to
   distinguish closed families from open ones. If you give any equations
   of a closed family, you must give all of them, in the same order as
   they appear in the accompanying Haskell file.

-  A data type declaration can either be given in full, exactly as in
   Haskell, or it can be given abstractly, by omitting the '=' sign and
625
   everything that follows. For example: ::
626 627 628 629 630 631 632 633 634 635 636 637 638 639

        data T a b

   In a *source* program this would declare TA to have no constructors
   (a GHC extension: see :ref:`nullary-types`), but in an hi-boot file
   it means "I don't know or care what the constructors are". This is
   the most common form of data type declaration, because it's easy to
   get right. You *can* also write out the constructors but, if you do
   so, you must write it out precisely as in its real definition.

   If you do not write out the constructors, you may need to give a kind
   annotation (:ref:`kinding`), to tell GHC the kind of the type
   variable, if it is not "\*". (In source files, this is worked out
   from the way the type variable is used in the constructors.) For
640
   example: ::
641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664

        data R (x :: * -> *) y

   You cannot use ``deriving`` on a data type declaration; write an
   ``instance`` declaration instead.

-  Class declarations is exactly as in Haskell, except that you may not
   put default method declarations. You can also omit all the
   superclasses and class methods entirely; but you must either omit
   them all or put them all in.

-  You can include instance declarations just as in Haskell; but omit
   the "where" part.

-  The default role for abstract datatype parameters is now
   representational. (An abstract datatype is one with no constructors
   listed.) To get another role, use a role annotation. (See
   :ref:`roles`.)

.. _module-signatures:

Module signatures
-----------------

665 666 667 668 669 670
GHC 8.2 supports module signatures (``hsig`` files), which allow you to
write a signature in place of a module implementation, deferring the
choice of implementation until a later point in time.  This feature is
not intended to be used without `Cabal
<http://www.haskell.org/cabal/>`__; this manual entry will focus
on the syntax and semantics of signatures.
671

672 673 674
To start with an example, suppose you had a module ``A`` which made use of some
string operations.  Using normal module imports, you would only
be able to pick a particular implementation of strings::
675

676 677
    module Str where
        type Str = String
678

679 680
        empty :: Str
        empty = ""
681

682 683
        toString :: Str -> String
        toString s = s
684 685 686 687 688

    module A where
        import Text
        z = toString empty

689 690 691 692 693 694 695 696 697 698 699 700 701 702 703 704 705 706 707 708 709 710 711 712 713 714 715 716 717 718 719 720 721 722 723 724 725 726 727 728 729 730 731 732 733 734 735 736 737 738 739 740 741 742 743 744 745 746 747 748 749 750 751 752 753 754 755 756 757 758 759 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 784 785 786 787 788 789 790 791 792 793 794 795 796 797 798 799 800 801 802 803 804 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 824 825 826 827 828 829 830 831 832 833 834 835 836 837 838 839 840 841 842 843 844 845 846 847
By replacing ``Str.hs`` with a signature ``Str.hsig``, ``A`` (and
any other modules in this package) are now parametrized by
a string implementation::

    signature Str where
        data Str
        empty :: Str
        toString :: Str -> String

We can typecheck ``A`` against this signature, or we can instantiate
``Str`` with a module that provides the following declarations.  Refer
to Cabal's documentation for a more in-depth discussion on how to
instantiate signatures.

Module signatures actually consist of two closely related features:

- The ability to define an ``hsig`` file, containing type definitions
  and type signature for values which can be used by modules that
  import the signature, and must be provided by the eventual
  implementing module, and

- The ability to *inherit* required signatures from packages we
  depend upon, combining the signatures into a single merged
  signature which reflects the requirements of any locally defined
  signature, as well as the requirements of our dependencies.

A signature file is denoted by an ``hsig`` file; every required
signature must have an ``hsig`` file (even if it is an empty one),
including required signatures inherited from dependencies.  Signatures
can be imported using an ordinary ``import Sig`` declaration.

``hsig`` files are written in a variant of Haskell similar
to ``hs-boot`` files, but with some slight changes:

- The header of a signature is ``signature A where ...`` (instead
  of the usual ``module A where ...``).

- Import statements and scoping rules are exactly as in Haskell.
  To mention a non-Prelude type or class, you must import it.

- Unlike regular modules, the exports and defined entities of
  a signature include not only those written in the local
  ``hsig`` file, but also those from inherited signatures
  (as inferred from the :ghc-flag:`-package-id` flags).
  These entities are not considered in scope when typechecking
  the local ``hsig`` file, but are available for import by
  any module or signature which imports the signature.  The
  one exception to this rule is the export list, described
  below.

  If a declaration occurs in multiple inherited signatures,
  they will be *merged* together.  For values, we require
  that the types from both signatures match exactly; however,
  other declarations may merge in more interesting ways.
  The merging operation in these cases has the effect of
  textually replacing all occurrences of the old name with
  a reference to the new, merged declaration.  For example,
  if we have the following two signatures::

    signature A where
        data T
        f :: T -> T

    signature A where
        data T = MkT
        g :: T

  the resulting merged signature would be::

    signature A where
        data T = MkT
        f :: T -> T
        g :: T

- The export list of a signature applies the final export list
  of a signature after merging inherited signatures; in particular, it
  may refer to entities which are not declared in the body of the
  local ``hsig`` file.  The set of entities that are required by a
  signature is defined exclusively by its exports; if an entity
  is not mentioned in the export list, it is not required.  This means
  that a library author can provide an omnibus signature containing the
  type of every function someone might want to use, while a client thins
  down the exports to the ones they actually require.  For example,
  supposing that you have inherited a signature for strings, you might
  write a local signature of this form, listing only the entities
  that you need::

    signature Str (Str, empty, append, concat) where
        -- empty

  A few caveats apply here.  First, it is illegal to export an entity
  which refers to a locally defined type which itself is not exported
  (GHC will report an error in this case).  Second, signatures which
  come from dependencies which expose modules cannot be thinned in this
  way (after all, the dependency itself may need the entity); these
  requirements are unconditionally exported, but are associated with
  a warning discouraging their use by a module.  To use an entity
  defined by such a signature, add its declaration to your local
  ``hsig`` file.

- A signature can reexport an entity brought into scope by an import.
  In this case, we indicate that any implementation of the module
  must export this very same entity.  For example, this signature
  must be implemented by a module which itself reexports ``Int``::

    signature A (Int) where
        import Prelude (Int)

    -- can be implemented by:
    module A (Int) where
        import Prelude (Int)

  Conversely, any entity requested by a signature can be provided
  by a reexport from the implementing module.  This is different from
  ``hs-boot`` files, which require every entity to be defined
  locally in the implementing module.

- The declarations and types from signatures of dependencies
  that will be merged in are not in scope when type checking
  an ``hsig`` file.  To refer to any such type, you must
  declare it yourself::

    -- OK, assuming we inherited an A that defines T
    signature A (T) where
        -- empty

    -- Not OK
    signature A (T, f) where
        f :: T -> T

    -- OK
    signature A (T, f) where
        data T
        f :: T -> T

- There must be no value declarations, but there can be type signatures
  for values.  For example, we might define the signature::

        signature A where
            double :: Int -> Int

  A module implementing ``A`` would have to export the function
  ``double`` with a type definitionally equal to the signature.
  Note that this means you can't implement ``double`` using
  a polymorphic function ``double :: Num a => a -> a``.

  Note that signature matching does check if *fixity* matches, so be
  sure specify fixity of ordinary identifiers if you intend to use them
  with backticks.

- Fixity, type synonym, open type/data family declarations
  are permitted as in normal Haskell.

- Closed type family declarations are permitted as in normal
  Haskell.  They can also be given abstractly, as in the
  following example::

    type family ClosedFam a where ..

848
  The ``..`` is meant literally -- you should write two dots in
849 850 851 852 853
  your file.  The ``where`` clause distinguishes closed families
  from open ones.

- A data type declaration can either be given in full, exactly
  as in Haskell, or it can be given abstractly, by omitting the '='
854
  sign and everything that follows.  For example: ::
855 856 857 858 859 860 861 862 863 864 865 866 867 868 869 870 871 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 900 901 902 903 904 905 906 907 908 909

        signature A where
            data T a b

  Abstract data types can be implemented not only with data
  declarations, but also newtypes and type synonyms (with the
  restriction that a type synonym must be fully eta-reduced,
  e.g., ``type T = ...`` to be accepted.)  For example,
  the following are all valid implementations of the T above::

        -- Algebraic data type
        data T a b = MkT a b

        -- Newtype
        newtype T a b = MkT (a, b)

        -- Type synonym
        data T2 a b = MkT2 a a b b
        type T = T2

  Data type declarations merge only with other data type
  declarations which match exactly, except abstract data,
  which can merge with ``data``, ``newtype`` or ``type``
  declarations.  Merges with type synonyms are especially useful:
  suppose you are using a package of strings which has left the type of
  characters in the string unspecified::

        signature Str where
            data Str
            data Elem
            head :: Str -> Elem

  If you locally define a signature which specifies
  ``type Elem = Char``, you can now use ``head`` from the
  inherited signature as if it returned a ``Char``.

  If you do not write out the constructors, you may need to give
  a kind and/or role annotation to tell GHC what the kinds or roles
  of the type variables are, if they are not the default (``*`` and
  representational).  It will be obvious if you've gotten it wrong when
  you try implementing the signature.

- A class declarations can either be abstract or concrete.  An
  abstract class is one with no superclasses or class methods::

    signature A where
        class Key k

  It can be implemented in any way, with any set of superclasses
  and methods; however, modules depending on an abstract class
  are not permitted to define instances (as of GHC 8.2, this
  restriction is not checked, see :ghc-ticket:`13086`.)
  These declarations can be implemented by type synonyms
  of kind ``Constraint``; this can be useful if you want to parametrize
  over a constraint in functions.  For example, with the
910
  ``ConstraintKinds`` extension, this type synonym is a valid
911
  implementation of the signature above::
912 913

    module A where
914
        type Key = Eq
915

916 917 918 919 920
  A concrete class specifies its superclasses, methods,
  default method signatures (but not their implementations)
  and a ``MINIMAL`` pragma.  Unlike regular Haskell classes,
  you don't have to explicitly declare a default for a method
  to make it optional vis-a-vis the ``MINIMAL`` pragma.
921

922 923 924 925 926 927 928 929 930
  When merging class declarations, we require that the superclasses
  and methods match exactly; however, ``MINIMAL`` pragmas are logically
  ORed together, and a method with a default signature will merge
  successfully against one that does not.

- You can include instance declarations as in Haskell; just omit the
  "where" part.  An instance declaration need not be implemented directly;
  if an instance can be derived based on instances in the environment,
  it is considered implemented.  For example, the following signature::
931

932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 948 949 950 951 952 953 954 955 956 957 958 959 960
    signature A where
        data Str
        instance Eq Str

  is considered implemented by the following module, since there
  are instances of ``Eq`` for ``[]`` and ``Char`` which can be combined
  to form an instance ``Eq [Char]``::

    module A where
        type Str = [Char]

  Unlike other declarations, for which only the entities declared
  in a signature file are brought into scope, instances from the
  implementation are always brought into scope, even if they were
  not declared in the signature file.  This means that a module may
  typecheck against a signature, but not against a matching
  implementation.  You can avoid situations like this by never
  defining orphan instances inside a package that has signatures.

  Instance declarations are only merged if their heads are exactly
  the same, so it is possible to get into a situation where GHC
  thinks that instances in a signature are overlapping, even if
  they are implemented in a non-overlapping way.  If this is
  giving you problems give us a shout.

Known limitations:

- Algebraic data types specified in a signature cannot be implemented using
  pattern synonyms.  See :ghc-ticket:`12717`
961 962 963 964 965 966 967 968 969 970 971 972

.. _using-make:

Using ``make``
--------------

.. index::
   single: make; building programs with

It is reasonably straightforward to set up a ``Makefile`` to use with
GHC, assuming you name your source files the same as your modules. Thus:

973
.. code-block:: makefile
974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 1008 1009 1010

    HC      = ghc
    HC_OPTS = -cpp $(EXTRA_HC_OPTS)

    SRCS = Main.lhs Foo.lhs Bar.lhs
    OBJS = Main.o   Foo.o   Bar.o

    .SUFFIXES : .o .hs .hi .lhs .hc .s

    cool_pgm : $(OBJS)
            rm -f $@
            $(HC) -o $@ $(HC_OPTS) $(OBJS)

    # Standard suffix rules
    .o.hi:
            @:

    .lhs.o:
            $(HC) -c $< $(HC_OPTS)

    .hs.o:
            $(HC) -c $< $(HC_OPTS)

    .o-boot.hi-boot:
            @:

    .lhs-boot.o-boot:
            $(HC) -c $< $(HC_OPTS)

    .hs-boot.o-boot:
            $(HC) -c $< $(HC_OPTS)

    # Inter-module dependencies
    Foo.o Foo.hc Foo.s    : Baz.hi          # Foo imports Baz
    Main.o Main.hc Main.s : Foo.hi Baz.hi   # Main imports Foo and Baz

.. note::
1011 1012
    Sophisticated :command:`make` variants may achieve some of the above more
    elegantly. Notably, :command:`gmake`\'s pattern rules let you write the more
1013 1014
    comprehensible:

1015
    .. code-block:: make
1016 1017 1018 1019 1020 1021 1022 1023 1024 1025 1026 1027 1028 1029 1030 1031 1032

        %.o : %.lhs
                $(HC) -c $< $(HC_OPTS)

    What we've shown should work with any ``make``.

Note the cheesy ``.o.hi`` rule: It records the dependency of the
interface (``.hi``) file on the source. The rule says a ``.hi`` file can
be made from a ``.o`` file by doing…nothing. Which is true.

Note that the suffix rules are all repeated twice, once for normal
Haskell source files, and once for ``hs-boot`` files (see
:ref:`mutual-recursion`).

Note also the inter-module dependencies at the end of the Makefile,
which take the form

1033
.. code-block:: make
1034 1035 1036 1037 1038 1039 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 1050 1051 1052 1053 1054 1055 1056 1057

    Foo.o Foo.hc Foo.s    : Baz.hi          # Foo imports Baz

They tell ``make`` that if any of ``Foo.o``, ``Foo.hc`` or ``Foo.s``
have an earlier modification date than ``Baz.hi``, then the out-of-date
file must be brought up to date. To bring it up to date, ``make`` looks
for a rule to do so; one of the preceding suffix rules does the job
nicely. These dependencies can be generated automatically by ``ghc``;
see :ref:`makefile-dependencies`

.. _makefile-dependencies:

Dependency generation
---------------------

.. index::
   single: dependencies in Makefiles
   single: Makefile dependencies

Putting inter-dependencies of the form ``Foo.o : Bar.hi`` into your
``Makefile`` by hand is rather error-prone. Don't worry, GHC has support
for automatically generating the required dependencies. Add the
following to your ``Makefile``:

1058
.. code-block:: make
1059 1060

    depend :
niteria's avatar
niteria committed
1061
            ghc -dep-suffix '' -M $(HC_OPTS) $(SRCS)
1062 1063 1064 1065 1066 1067 1068 1069 1070 1071 1072 1073

Now, before you start compiling, and any time you change the ``imports``
in your program, do ``make depend`` before you do ``make cool_pgm``. The command
``ghc -M`` will append the needed dependencies to your ``Makefile``.

In general, ``ghc -M Foo`` does the following. For each module ``M`` in
the set ``Foo`` plus all its imports (transitively), it adds to the
Makefile:

-  A line recording the dependence of the object file on the source
   file.

1074
   .. code-block:: make
1075 1076 1077 1078 1079 1080 1081 1082

       M.o : M.hs

   (or ``M.lhs`` if that is the filename you used).

-  For each import declaration ``import X`` in ``M``, a line recording
   the dependence of ``M`` on ``X``:

1083
   .. code-block:: make
1084 1085 1086 1087 1088 1089

       M.o : X.hi

-  For each import declaration ``import {-# SOURCE #-} X`` in ``M``, a
   line recording the dependence of ``M`` on ``X``:

1090
   .. code-block:: make
1091 1092 1093 1094 1095 1096 1097 1098 1099 1100 1101 1102 1103 1104 1105 1106 1107 1108 1109 1110 1111 1112 1113 1114 1115 1116 1117 1118 1119 1120 1121 1122 1123 1124 1125 1126

       M.o : X.hi-boot

   (See :ref:`mutual-recursion` for details of ``hi-boot`` style
   interface files.)

If ``M`` imports multiple modules, then there will be multiple lines
with ``M.o`` as the target.

There is no need to list all of the source files as arguments to the
``ghc -M`` command; ``ghc`` traces the dependencies, just like
``ghc --make`` (a new feature in GHC 6.4).

Note that ``ghc -M`` needs to find a *source file* for each module in
the dependency graph, so that it can parse the import declarations and
follow dependencies. Any pre-compiled modules without source files must
therefore belong to a package [1]_.

By default, ``ghc -M`` generates all the dependencies, and then
concatenates them onto the end of ``makefile`` (or ``Makefile`` if
``makefile`` doesn't exist) bracketed by the lines
"``# DO NOT DELETE: Beginning of Haskell dependencies``" and
"``# DO NOT DELETE: End of Haskell dependencies``". If these lines
already exist in the ``makefile``, then the old dependencies are deleted
first.

Don't forget to use the same ``-package`` options on the ``ghc -M``
command line as you would when compiling; this enables the dependency
generator to locate any imported modules that come from packages. The
package modules won't be included in the dependencies generated, though
(but see the ``-include-pkg-deps`` option below).

The dependency generation phase of GHC can take some additional options,
which you may find useful. The options which affect dependency
generation are:

1127 1128
.. ghc-flag:: -ddump-mod-cycles

1129 1130 1131
    Display a list of the cycles in the module graph. This is useful
    when trying to eliminate such cycles.

1132 1133 1134
.. ghc-flag:: -v2
    :noindex:

1135 1136
    Print a full list of the module dependencies to stdout. (This is the
    standard verbosity flag, so the list will also be displayed with
1137 1138 1139
    ``-v3`` and ``-v4``; see :ref:`options-help`.)

.. ghc-flag:: -dep-makefile ⟨file⟩
1140 1141 1142 1143 1144 1145 1146

    Use ⟨file⟩ as the makefile, rather than ``makefile`` or
    ``Makefile``. If ⟨file⟩ doesn't exist, ``mkdependHS`` creates it. We
    often use ``-dep-makefile .depend`` to put the dependencies in
    ``.depend`` and then ``include`` the file ``.depend`` into
    ``Makefile``.

1147 1148
.. ghc-flag:: -dep-suffix <suf>

1149 1150 1151 1152 1153 1154 1155 1156
    Make extra dependencies that declare that files with suffix
    ``.<suf>_<osuf>`` depend on interface files with suffix
    ``.<suf>_hi``, or (for ``{-# SOURCE #-}`` imports) on ``.hi-boot``.
    Multiple ``-dep-suffix`` flags are permitted. For example,
    ``-dep-suffix a -dep-suffix b`` will make dependencies for ``.hs``
    on ``.hi``, ``.a_hs`` on ``.a_hi``, and ``.b_hs`` on ``.b_hi``.
    (Useful in conjunction with NoFib "ways".)

1157 1158
.. ghc-flag:: --exclude-module=<file>

1159 1160 1161
    Regard ``<file>`` as "stable"; i.e., exclude it from having
    dependencies on it.

1162 1163
.. ghc-flag:: -include-pkg-deps

1164 1165 1166 1167 1168 1169 1170 1171 1172 1173 1174 1175 1176 1177 1178 1179 1180 1181 1182 1183 1184 1185 1186 1187
    Regard modules imported from packages as unstable, i.e., generate
    dependencies on any imported package modules (including ``Prelude``,
    and all other standard Haskell libraries). Dependencies are not
    traced recursively into packages; dependencies are only generated
    for home-package modules on external-package modules directly
    imported by the home package module. This option is normally only
    used by the various system libraries.

.. _orphan-modules:

Orphan modules and instance declarations
----------------------------------------

Haskell specifies that when compiling module ``M``, any instance declaration
in any module "below" ``M`` is visible. (Module ``A`` is "below" ``M`` if ``A`` is
imported directly by ``M``, or if ``A`` is below a module that ``M`` imports
directly.) In principle, GHC must therefore read the interface files of
every module below ``M``, just in case they contain an instance declaration
that matters to ``M``. This would be a disaster in practice, so GHC tries to
be clever.

In particular, if an instance declaration is in the same module as the
definition of any type or class mentioned in the *head* of the instance
declaration (the part after the "``=>``"; see :ref:`instance-rules`), then GHC
1188
has to visit that interface file anyway. Example: ::
1189 1190 1191 1192 1193 1194 1195 1196 1197

      module A where
        instance C a => D (T a) where ...
        data T a = ...

The instance declaration is only relevant if the type ``T`` is in use, and
if so, GHC will have visited ``A``\'s interface file to find ``T``\'s definition.

The only problem comes when a module contains an instance declaration
1198
and GHC has no other reason for visiting the module. Example: ::
1199 1200 1201 1202 1203 1204

      module Orphan where
        instance C a => D (T a) where ...
        class C a where ...

Here, neither ``D`` nor ``T`` is declared in module ``Orphan``. We call such modules
1205
"orphan modules". GHC identifies orphan modules, and visits the
1206 1207 1208 1209
interface file of every orphan module below the module being compiled.
This is usually wasted work, but there is no avoiding it. You should
therefore do your best to have as few orphan modules as possible.

1210
Functional dependencies complicate matters. Suppose we have: ::
1211 1212 1213 1214 1215 1216

      module B where
        instance E T Int where ...
        data T = ...

Is this an orphan module? Apparently not, because ``T`` is declared in
1217
the same module. But suppose class ``E`` had a functional dependency: ::
1218 1219 1220 1221 1222 1223 1224 1225 1226 1227 1228 1229 1230 1231 1232 1233 1234 1235 1236 1237 1238 1239 1240 1241 1242 1243 1244 1245 1246 1247 1248 1249 1250

      module Lib where
        class E x y | y -> x where ...

Then in some importing module ``M``, the constraint ``(E a Int)`` should be
"improved" by setting ``a = T``, *even though there is no explicit
mention* of ``T`` in ``M``.

These considerations lead to the following definition of an orphan
module:

-  An *orphan module* orphan module contains at least one *orphan
   instance* or at least one *orphan rule*.

-  An instance declaration in a module ``M`` is an *orphan instance* if
   orphan instance

   -  The class of the instance declaration is not declared in ``M``, and

   -  *Either* the class has no functional dependencies, and none of the
      type constructors in the instance head is declared in ``M``; *or*
      there is a functional dependency for which none of the type
      constructors mentioned in the *non-determined* part of the
      instance head is defined in ``M``.

   Only the instance head counts. In the example above, it is not good
   enough for ``C``\'s declaration to be in module ``A``; it must be the
   declaration of ``D`` or ``T``.

-  A rewrite rule in a module ``M`` is an *orphan rule* orphan rule if none
   of the variables, type constructors, or classes that are free in the
   left hand side of the rule are declared in ``M``.

1251
If you use the flag :ghc-flag:`-Worphans`, GHC will warn you if you are
1252
creating an orphan module. Like any warning, you can switch the warning
1253 1254
off with :ghc-flag:`-Wno-orphans <-Worphans>`, and :ghc-flag:`-Werror` will make
the compilation fail if the warning is issued.
1255 1256

You can identify an orphan module by looking in its interface file,
1257
``M.hi``, using the :ghc-flag:`--show-iface` :ref:`mode <modes>`. If there is a
1258 1259 1260 1261 1262
``[orphan module]`` on the first line, GHC considers it an orphan
module.

.. [1]
   This is a change in behaviour relative to 6.2 and earlier.