Short benchmark runtimes

Just to hammer this home a bit, here a graph:

And you can't even avoid that by just running one category, like only "real", bc. the leftmost, i.e., shortest running benchmark is real/mkhprog. The whole of real/eff is also in the top 10 shortest benchmarks.

Shortest: real/mkhprog 1.06E-02

Longest: shootout/fannkuch-redux 4.822

changed the description

Happy to review patches here. As for measurements, I'd simply exclude benchmarks that run less than 0.1s (say).

I wanted to look at adding time-mode args to some benchmarks, when I discovered, that real/mkhprog actually does support it, but the shake nofib-run invocation is broken! The benchmark never ran! See #26 (closed)

Btw. I see that you have concluded this already 3 years ago, w.r.t. "problematic benchmarks": 8632268a

For spectral/scc: the CFGs in NCG are directed graphs and they can be printed in dot format. They can get quite large. But I guess they would have to be parsed then, which is overhead we don't want to measure. Or I hard-code them and select the appropriate graph by argument.

Quite late to this party. But you could parse them and dump using a Show instance. Show generally produces valid haskell syntax so the result can just be copied into a haskell source file.

mentioned in issue #26 (closed)

changed the description

Regarding spectral/scc:

I tried it with a randomly generated directed graph with 500 nodes. Goes from total_cpu_seconds 0.000972 to 0.037614.

But that is a lot of lines to hard code into the source file. Not sure if parsing graph from stdin wouldn't dominate runtime...

Edit: 0.054477 with parsing from stdin. Hmm

I think that's fine. In the end if either the parsing or the scc computation part regress the thing as a whole will regress.

If you still want to look into this you could compile with profiling and set cost centres around the parsing/computation steps and see which part dominates.

mentioned in merge request !62 (merged)

real/rsa:

mode	user t
fast	0.095s
norm	0.651s
slow	3.257s

Changed input size and also added argument for iteration count, bc. I didn't want to inflate input for slow further (as not to get too much I/O).

mode	user t
fast	0,145s
norm	1,551s
slow	8,064s

(See MR !62 (merged))

mentioned in merge request !63 (merged)

shootout/spectral-norm:

mode	user t	Arg
fast	0,038s	800
norm	0,310s	2500
slow	1,474s	5500

Changed to:

mode	user t	Arg
fast	0,112s	1500
norm	1,222s	5000
slow	7,051s	12000

program	Wall time
~~real/mkhprog~~	~~1.06E-02~~
spectral/pretty	1.06E-02
spectral/scc	1.06E-02
~~shootout/reverse-complement~~	~~1.07E-02~~
real/eff/VSD	2.06E-02
real/eff/CS	9.07E-02
real/eff/VSM	0.121
real/eff/FS	0.414
real/eff/VS	0.415
spectral/last-piece	0.488

program	mutator time	std.err.
~~shootout/reverse-complement~~	~~1.18E-04~~	~~7.30%~~
spectral/pretty	2.18E-04	16.60%
spectral/scc	2.25E-04	17.20%

Short benchmark runtimes

Designs

Child items ...

Activity