- Jul 05, 2019
-
-
The relative URLs were a workaround to let most contributors fork from GitHub, due to a weakness in the haskell.org server. That workaround is no longer needed, and relative submodule URLs are an impediment to forking, which makes contributions harder than they should be. The URLs are chosen to clone via HTTPS, because this makes sure that anybody, even someone who is not a registered GitLab user, can clone a fork recursively.
-
Adds stripStgTicksTopE, which returns only the stripped expression. Previously we also allocated a list for the stripped ticks, which was never used. The allocation difference is, as expected, very small but present: about 0.02% when compiling with -O.
-
Ticket #16247 showed that we were discarding an implication constraint that had an empty ic_wanted, when we still needed to keep it so we could check whether it had a bad telescope. Happily it's a one-line fix. All the rest is comments!
-
In dumpCensus we switch/case on doHeapProfile twice. The second switch tries to barf on unknown doHeapProfile modes, but HEAP_BY_CLOSURE_TYPE is checked by the first switch and not included in the second. So when trying to pass -hT to the profiling RTS, it barfs. This commit simply merges the two switches into one, which fixes this problem.
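For illustration, here is a hedged sketch of the shape of that fix (not the actual ProfHeap.c code; only the RtsFlags field, the HEAP_BY_* constants, and barf are real RTS names): one switch over the profiling mode, so the -hT census gets a case of its own and only genuinely unknown modes reach the barf default.

    /* Hedged sketch, not the actual dumpCensus code. */
    switch (RtsFlags.ProfFlags.doHeapProfile) {
    case HEAP_BY_CLOSURE_TYPE:
        /* census keyed by closure type (-hT), available without profiling */
        break;
    case HEAP_BY_CCS:
    case HEAP_BY_MOD:
    case HEAP_BY_DESCR:
    case HEAP_BY_TYPE:
    case HEAP_BY_RETAINER:
    case HEAP_BY_LDV:
        /* modes only available in profiling builds */
        break;
    default:
        barf("dumpCensus; doHeapProfile");
    }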
-
In the eager unifier, when unifying (tv1 ~ tv2) and deciding to swap them over to unify (tv2 ~ tv1), I'd forgotten to ensure that tv1's kind was fully zonked, which is an invariant of uUnfilledTyVar2. That could lead us to build an infinite kind, or (in the case of #16902) update the same unification variable twice. Yikes.

Now we get an error message rather than non-termination, which is much better. The error message is not great, but it's a very strange program, and I can't see an easy way to improve it, so for now I'm just committing this fix.

Here's the decl:

    data F (a :: k) :: (a ~~ k) => Type where
      MkF :: F a

and the error message, of which I am not proud:

    T16902.hs:11:10: error:
        • Expected a type, but found something with kind ‘a1’
        • In the type ‘F a’
-
Before this refactoring:

* DerivInfo for data family instances was returned from tcTyAndClassDecls
* DerivInfo for data declarations was generated with mkDerivInfos and added at a later stage of the pipeline in tcInstDeclsDeriv

After this refactoring:

* DerivInfo for both data family instances and data declarations is returned from tcTyAndClassDecls in a single list.

This uniform treatment results in a more convenient arrangement to fix #16731.
-
-
This is to mimic what `Scav.c` does. This should fix a crash in the printer.
-
- Jul 04, 2019
-
-
Ben Gamari authored
-
- Jul 03, 2019
-
-
Commit cef80c0b debuted a breaking change to `template-haskell`, so in order to guard against it properly with CPP, we need to bump the `template-haskell` version number accordingly.
-
Previously we used the deb9-debug job, whose `validate` build flavour disabled `BUILD_SPHINX_PDF`. Fix this. Fixes #16890.
-
This adds support for constructing vector types from Float#, Double#, etc., and for performing arithmetic operations on them.

Cleaned-Up-By:
Ben Gamari <ben@well-typed.com>
-
- Jul 02, 2019
-
-
This is a temporary workaround for Shake not supporting symlinks when using cloud/cached builds.
-
-
-
-
As such, the internal linker will fail for them. The alternative would be to implement them as stubs in the linker and have them barf when called.

> Not all operations are supported by all target processors. If a particular operation cannot be implemented on the target processor, a warning is generated and a call to an external function is generated. The external function carries the same name as the built-in version, with an additional suffix ‘_n’ where n is the size of the data type.

(https://gcc.gnu.org/onlinedocs/gcc/_005f_005fsync-Builtins.html)
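For concreteness, a small compilable example (the function name is illustrative): on a target that lacks a native instruction for an atomic operation of this width, GCC emits a warning and lowers the builtin to a call to an out-of-line helper such as __sync_fetch_and_add_8, and it is such helper symbols that the internal linker would then have to resolve.

    #include <stdint.h>

    /* On targets without a native 64-bit atomic add, GCC compiles this
     * builtin into a call to an external __sync_fetch_and_add_8 helper,
     * per the documentation quoted above. */
    uint64_t bump_counter(uint64_t *counter)
    {
        return __sync_fetch_and_add(counter, 1);
    }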
-
-
This adds lookup logic for _GLOBAL_OFFSET_TABLE_ as well as relocation logic for R_ARM_BASE_PREL and R_ARM_GOT_BREL, which the GNU toolchain (gas, gcc, ...) prefers to produce. Apparently recent LLVM toolchains will produce those as well.
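For reference, a sketch of the arithmetic those two relocations require, as given in the "ELF for the ARM Architecture" ABI (the helper and parameter names below are illustrative, not GHC's linker code):

    #include <stdint.h>

    /* R_ARM_BASE_PREL resolves to B(S) + A - P, and R_ARM_GOT_BREL to
     * GOT(S) + A - GOT_ORG, where B(S)/GOT_ORG is the GOT origin (what
     * _GLOBAL_OFFSET_TABLE_ names), GOT(S) is the symbol's GOT slot,
     * P is the place being relocated, and A is the addend. */
    static uint32_t r_arm_base_prel(uint32_t got_origin, int32_t addend,
                                    uint32_t place)
    {
        return got_origin + (uint32_t)addend - place;
    }

    static uint32_t r_arm_got_brel(uint32_t got_slot, int32_t addend,
                                   uint32_t got_origin)
    {
        return got_slot + (uint32_t)addend - got_origin;
    }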
-
- Jun 28, 2019
-
-
Ben Gamari authored
I'm not entirely sure we are careful about ensuring this; this is a last-ditch check.
-
Here the following changes are introduced:

- A read barrier machine op is added to Cmm.
- The order in which a closure's fields are read and written is changed.
- Memory barriers are added to RTS code to ensure correctness on out-of-order machines with weak memory ordering.

Cmm has a new CallishMachOp called MO_ReadBarrier. On weak memory machines, this is lowered to an instruction that ensures memory reads that occur after said instruction in program order are not performed before reads coming before said instruction in program order. On machines with strong memory ordering properties (e.g. X86, SPARC in TSO mode) no such instruction is necessary, so MO_ReadBarrier is simply erased. However, such an instruction is necessary on weakly ordered machines, e.g. ARM and PowerPC.

Weak memory ordering has consequences for how closures are observed and mutated. For example, consider a closure that needs to be updated to an indirection. In order for the indirection to be safe for concurrent observers to enter, said observers must read the indirection's info table before they read the indirectee. Furthermore, the entering observer makes assumptions about the closure based on its info table contents, e.g. an INFO_TYPE of IND implies the closure has an indirectee pointer that is safe to follow.

When a closure is updated with an indirection, both its info table and its indirectee must be written. With weak memory ordering, these two writes can be arbitrarily reordered, and perhaps even interleaved with other threads' reads and writes (in the absence of memory barrier instructions). Consider this example of a bad reordering:

- An updater writes to a closure's info table (INFO_TYPE is now IND).
- A concurrent observer branches upon reading the closure's INFO_TYPE as IND.
- A concurrent observer reads the closure's indirectee and enters it. (!!!)
- An updater writes the closure's indirectee.

Here the update to the indirectee comes too late and the concurrent observer has jumped off into the abyss. Speculative execution can also cause us issues, consider:

- An observer is about to case on a value in the closure's info table.
- The observer speculatively reads one or more of the closure's fields.
- An updater writes to the closure's info table.
- The observer takes a branch based on the new info table value, but with the old closure fields!
- The updater writes to the closure's other fields, but it's too late.

Because of these effects, reads and writes to a closure's info table must be ordered carefully with respect to reads and writes to the closure's other fields, and memory barriers must be placed to ensure that reads and writes occur in program order. Specifically, updates to a closure must follow the following pattern:

- Update the closure's (non-info table) fields.
- Write barrier.
- Update the closure's info table.

Observing a closure's fields must follow the following pattern:

- Read the closure's info pointer.
- Read barrier.
- Read the closure's (non-info table) fields.

This patch updates RTS code to obey this pattern, as sketched below. This should fix long-standing SMP bugs on ARM (specifically newer aarch64 microarchitectures supporting out-of-order execution) and PowerPC. This fixes issue #15449.

Co-Authored-By:
Ben Gamari <ben@well-typed.com>
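A minimal, self-contained C11 sketch of that update/observe discipline (the struct, the dummy IND_info object, and the function names are stand-ins, not the RTS's types; the real RTS uses its own barrier primitives rather than C11 atomics):

    #include <stdatomic.h>

    struct closure {
        _Atomic(const void *) info;   /* stands in for the info pointer   */
        void *indirectee;             /* stands in for the payload fields */
    };

    static const char IND_info;       /* dummy stand-in for an IND info table */

    /* Updater: payload first, write barrier, then publish the info pointer. */
    void update_with_indirection(struct closure *c, void *value)
    {
        c->indirectee = value;                       /* 1. non-info fields */
        atomic_thread_fence(memory_order_release);   /* 2. write barrier   */
        atomic_store_explicit(&c->info, &IND_info,   /* 3. info table last */
                              memory_order_relaxed);
    }

    /* Observer: info pointer first, read barrier, then the payload fields. */
    const void *observe(struct closure *c, void **indirectee_out)
    {
        const void *info =
            atomic_load_explicit(&c->info, memory_order_relaxed);
        atomic_thread_fence(memory_order_acquire);   /* read barrier */
        *indirectee_out = c->indirectee;             /* fields read after it */
        return info;
    }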
-
-
-
- Jun 27, 2019
-
-
-
-
-
-
It is important that `heapCensus` and `LdvCensusForDead` traverse the same areas. `heapCensus` increases the `not_used` counter, which tracks how many closures are live but haven't been used yet. `LdvCensusForDead` increases the `void_total` counter, which tracks how many dead closures there are. The `LAG` is then calculated by subtracting `void_total` from `not_used`, so it is essential that `not_used >= void_total`. This fact is checked by quite a few assertions.

However, if a program has low maximum residency but allocates a lot in the nursery, these assertions were failing (see #16753 and #15903) because `LdvCensusForDead` was observing dead closures from the nursery which totalled more than `not_used`. The same closures were not counted by `heapCensus`. Therefore, it seems that the correct fix is to make `LdvCensusForDead` agree with `heapCensus` and not traverse the nursery for dead closures.

Fixes #16100 #16753 #15903 #8982
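A tiny self-contained sketch of the invariant being asserted (simplified names, not the RTS code):

    #include <assert.h>

    /* LAG is derived as not_used - void_total, so the census is only
     * meaningful when every dead closure counted by LdvCensusForDead was
     * also traversed by heapCensus -- the assertion that was tripping. */
    static long lag_total(long not_used, long void_total)
    {
        assert(not_used >= void_total);
        return not_used - void_total;
    }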
-
It is possible that void_total is exactly equal to not_used; the other assertions for this invariant check for <= rather than <.
-
This implements the correct fix for #11627 by skipping over the slop (which is zeroed) rather than adding special-case logic for LARGE ARR_WORDS, which runs the risk of not performing a correct census by ignoring any subsequent blocks. This approach implements logic similar to that in Sanity.c.
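A hedged, RTS-style fragment of what skipping the slop looks like (StgPtr, StgClosure, and closure_sizeW are real RTS names, but this is a simplified loop, not the actual census code):

    StgPtr p = block_start;
    while (p < block_end) {
        if (*p == 0) {
            /* zeroed slop between closures: just step over it */
            p++;
        } else {
            /* a real closure: account for it, then advance past its words */
            p += closure_sizeW((StgClosure *)p);
        }
    }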
-
- Jun 26, 2019
-
-
Ben Gamari authored
-
Ben Gamari authored
This allows us to run (but ignore the result of) fragile testcases. Hopefully this should allow us to more easily spot when a fragile test becomes un-fragile.
-
Ben Gamari authored
-
Ben Gamari authored
This is the same as T5611 but with an unsafe call to sleep.
-
Ben Gamari authored
The original issue, #5611, was concerned with safe calls. However, the test inexplicably used an unsafe call. Fix this.
-
-