Advent of Code Day 11 [spoilers], Inclusion-Exclusion, and Haskell's odd design decisions by markgritter

adventofcode · @markgritter · Jan 4 '19

$2.02

Advent of Code Day 11 [spoilers], Inclusion-Exclusion, and Haskell's odd design decisions

Haskell has a [`maximum`](http://hackage.haskell.org/package/base-4.12.0.0/docs/Prelude.html#v:maximum) function and it has lazy evaluations of lists.  I come from Python that has a `max` function and list generators.  But there turns out to be a crucial difference.

[Day 11](https://adventofcode.com/2018/day/11) asks us to find maximum-value squares in a programatically defined integer array.  Part 1 asks for 3x3 squares so I (foolishly) built something that only worked for 3x3 squares.

Puzzle input is a "serial number" so I made a function that when partially applied to the serial number gives the function x, y -> value of the cell.

```
type LevelFunction = Int -> Int -> Int

fuelCellLevel :: Int -> LevelFunction
fuelCellLevel serialNumber y x =
  let rackId = (x+10)
      allDigits = (rackId * y + serialNumber) * rackId in
    ((allDigits `div` 100) `mod` 10) - 5
```

I felt the key here was going to be avoiding redundant calls to this function, as well as adding the same numbers over and over again.  My solution for part 1 was to add three rows together:

```
  1  2  3  4  5  ...
  6  7  8  9 10 ...
 11 12 13 14 15 ...
-------------------
 18 21 24 27 30 ...
```

and then take units of three to get the sum of all the 3x3 squares.  In order to keep track of where the square came from, we need to have both a sum and a label.

Generate an entire row of values:

```
gridRange = [1..300]
rowLevels fn y = (map (fn y) gridRange)
```

Some utility functions for operating on tuples:
```
sum3 (a,b,c) = a + b + c

sumColumns (y,as,bs,cs) =
    (y, map sum3 (zip3 as bs cs))

sumSquare y (x,a,b,c) = (a+b+c, x, y)
```

Take the rows three at a time and add them up in the way shown above.  The `zip` functions all use the shortest list length.  Using `zip` this way is a common Python idiom, I don't know if Haskell people do it too or if they have a different preferred way of accomplishing it.

```
threeByThreeLevels :: (Int -> Int -> Int) -> [(Int,Int,Int)]
threeByThreeLevels fn =
  let rows = [ rowLevels fn y | y <- gridRange ] :: [[Int]]
      threeRows = zip4 gridRange rows (drop 1 rows) (drop 2 rows) :: [(Int,[Int],[Int],[Int])]
      threeRowsSummed = map sumColumns threeRows in
    concat (map sumSquares threeRowsSummed)
```

Then the same pattern is used to take the columns three at a time:

```
sumSquares :: (Int,[Int]) -> [(Int,Int,Int)]
sumSquares (y,cols) =
  let threeCols = zip4 gridRange cols (drop 1 cols) (drop 2 cols) in
    map (sumSquare y) threeCols
```

We ordered things so that the sum comes first in the tuple, so we can just apply maximum to the tuples as they are:

```
maxSquare serialNumber = maximum (threeByThreeLevels (fuelCellLevel serialNumber))
```

OK, that works for part 1.  Part 2 asks us to find the maximum-valued square of any size, so all that work was wasted.

I thought about it a bit and decided the right solution was inclusion/exclusion.  Suppose we know, for every point `(m,n)` in the array, the value of the sum of all the entries between `(1,1)` and `(m,n)`.  Then we can calculate the value of any smaller rectangle by doing some math.

![inclusion-exclusion-areas.png](https://cdn.steemitimages.com/DQmdFKh9fmeJjvzaKRHmVEAAvFmjkNi4fJ9dk3SrKpfM3qo/inclusion-exclusion-areas.png)

We want the area of a small blue square not beginning at (1,1).  So, we can start with the big sum (white square), subtract off the portion on the right that we don't want (red rectangle) and the portion on the bottom that we don't want (green rectangle.)  That means part of the original area got subtracted twice, so we have to add that back in (yellow.)

This technique allows us to precompute a matrix of all the area sums that start at (1,1), and then compute any other sum with just four references into this array.

The code I wrote is a little magical, but follows one of the examples given in 
[Data.Array](http://hackage.haskell.org/package/array-0.5.3.0/docs/Data-Array.html).  We can refer back to the array in order to define it!  Here I do this twice, once to define columns in terms of earlier columns (and the previous row), and once to define the rows of the matrix in terms of its earlier rows:

```
-- Return an entire row's worth of sums 
rowPartialSums :: LevelFunction -> Int -> Array Int Int -> Array Int Int
rowPartialSums fn y prevRow =
  let a = array (1,300) ((1, (prevRow!1) + fn y 1) :
                         [(x, (a!(x-1)) + (prevRow!x) + (fn y x) - (prevRow!(x-1))) | x <- [2..300] ])  in a

-- Entire matrix of sums, (array ! y) ! x = sum from (1,1) to (y,x)
partialSums :: LevelFunction -> Array Int (Array Int Int)
partialSums fn =
  let zero = array (1,300) [(x,0) | x <- [1..300]]
      rows = array (1,300) ((1, rowPartialSums fn 1 zero) :
                            [(y, rowPartialSums fn y (rows!(y-1))) | y <- [2..300] ]) in rows

sums serialNumber = partialSums (fuelCellLevel serialNumber)
```

If you look at `rowPartialSums` it's doing inclusion-exclusion here too.  We want to define `A[x][y]` in terms of sums we already know.  So it's equal to `fn(x,y) + A[x-1][y] + A[x][y-1]`, but both those values already include the value of `A[x-1][y-1]`.

I see looking at this that I could have curried `fn` which was my intention for putting `y` first, but I didn't.

Now to do the inclusion-exclusion, we need to be careful of the edge cases, so I just wrote everything out in four big cases and didn't worry too much about making it compact:

```
areaSum :: Array Int (Array Int Int) -> Int -> Int -> Int -> Int
areaSum a 1 1 size = let
  x' = size
  y' = size in
    (a ! y') ! x'

areaSum a 1 x size = let
  x' = x + size - 1
  y' = size in
  (a ! y') ! x' - (a ! y') ! (x-1)

areaSum a y 1 size = let
  x' = size
  y' = y + size - 1 in
  (a ! y') ! x' - (a ! (y-1)) ! x'

areaSum a y x size = let
  x' = x + size - 1
  y' = y + size - 1 in
  (a ! y') ! x' - (a ! (y-1)) ! x' - (a ! y') ! (x-1) + (a ! (y-1)) ! (x-1)
```

OK, just one more step and we're done, right?  We just have to iterate over all sizes and all locations where squares of that sizes could fit, which we can do in one big list comprehension:

```
maxSquareK :: Int -> (Int,Int,Int,Int)
maxSquareK sn = let a = sums sn in
  maximum [ (areaSum a y x size, x, y, size) |
            size <- [1..300],
            x <- [1..301-size],
            y <- [1..301-size] ]
```

Oops, doesn't work: `day11.hs: stack overflow`

OK, time to try profiling.  We can compile the program with profiling enabled like this:

```
mark@ubuntu:~/aoc2018/day11$ stack ghc -- -prof -fprof-auto -fprof-cafs day11.hs
[1 of 1] Compiling Main             ( day11.hs, day11.o )
Linking day11 ...
```

And run it like this to get heap profiling:
```
mark@ubuntu:~/aoc2018/day11$ ./day11 +RTS -hc -p
```

This results in a test file full of samples like this one:

```
BEGIN_SAMPLE 0.919256
(150)GHC.IO.Handle.Text.CAF     24
(241)CAF:$dShow_r3Z2    152
(126)PINNED     36816
(249)main       120
(248)main/CAF:main      96
MAIN    160
(233)GHC.Conc.Signal.CAF        640
(212)GHC.IO.Handle.FD.CAF       704
(220)GHC.IO.Encoding.Iconv.CAF  120
(222)GHC.IO.Encoding.CAF        1096
(277)maxSquareK/main/CAF:main   301482248
END_SAMPLE 0.919256
```

OK, that's a lot of memory allocation, but why?

```
                                                                                                          individual      inherited
COST CENTRE                             MODULE                SRC                      no.     entries  %time %alloc   %time %alloc
...
   maxSquareK                           Main                  day11.hs:(91,1)-(95,32)  277          1   41.1   47.3    81.2   70.4
    areaSum                             Main                  day11.hs:(66,1)-(84,75)  278     967107   30.8   18.7    36.6   21.1
     areaSum.y'                         Main                  day11.hs:83:3-19         295     960306    2.8    1.2     2.8    1.2
     areaSum.x'                         Main                  day11.hs:82:3-19         296     960305    3.0    1.2     3.0    1.2
     areaSum.x'                         Main                  day11.hs:77:3-11         286       3522    0.0    0.0     0.0    0.0
     areaSum.y'                         Main                  day11.hs:78:3-19         283       3522    0.0    0.0     0.0    0.0
     areaSum.x'                         Main                  day11.hs:72:3-19         294       3267    0.0    0.0     0.0    0.0
     areaSum.y'                         Main                  day11.hs:73:3-11         293       3267    0.0    0.0     0.0    0.0
     areaSum.x'                         Main                  day11.hs:67:3-11         292         12    0.0    0.0     0.0    0.0
     areaSum.y'                         Main                  day11.hs:68:3-11         291         12    0.0    0.0     0.0    0.0
```

I find this a little confusing; it looks like we're accumulating a lot of memory in `areaSum`.  Actually, we're accumulating a bunch of unevaluated `areaSum` thunks.

The reason is that `maximum` doesn't do what I thought, which is to do a strict fold.  Instead it does lazy evaluation of the entire list of comparisons, as if the intermediate result was 

```
max( a, max( b, max( c, max( d, ... ) ) ) )
```

where each of the arguments is one of the `areaSum` function calls.  I have no idea why this is the preferred default behavior.  It also suggests that part 1 is using way too much memory as well.  If you plot it memory usage does start going down, eventually, when we reach the end of the large list generated by the comprehension.

![](https://cdn.steemitimages.com/DQmQMef7iqBswdXPhnrLBB8eLY5qEWTy3aqGbN9R7wwdU8y/image.png)

OK, quick hack.  We'll use `foldl'` which uses strict evaluation (doesn't defer the comparison) like this:

```
maximum' = foldl' max (0,0,0,0)

maxSquareK :: Int -> (Int,Int,Int,Int)
maxSquareK sn = let a = sums sn in
  maximum' [ (areaSum a y x size, x, y, size) |
            size <- [1..300],
            x <- [1..301-size],
            y <- [1..301-size] ]
```

That works fine; it churns away a bit with high CPU but memory usage is modest.

Full code: https://github.com/mgritter/aoc2018/blob/master/day11/day11.hs

👍 lukestokes, eonwarped, hendrikdegrote, steemstem, lafona-miner, suesa, cub1, abigail-dantes, stem-espanol, carloserp-2000, curie, sbi3, remlaps1, lemouth, eforucom, vact, derbesserwisser, mathowl, cmp2020, wackou, markgritter, irelandscape, bloom, iamphysical, anaestrada12, howo, lupafilotaxia, tomastonyperez, elvigia, lorenzor, eliaschess333, massivevibration, mahdiyari, mountain.phil28, helo, josedelacruz, amestyj, de-stem, lamouthe, birddroppings, nuthman, golbang, lisa.palmer, corsica, alexander.alexis, spbeckman, remlaps2, flugschwein, cub2, ubaldonet, rgkmb-unofficial, epicdesigns, steemmyphoto, emiliomoron, luiscd8a, azulear, alexdory, liberosist, ennyta, ulisesfl17, gra, eric-boucher, fran.frey, astronomyizfun, and 138 others

`author`	markgritter
`permlink`	advent-of-code-day-11-spoilers-inclusion-exclusion-and-haskell-s-odd-design-decisions
`category`	adventofcode
`json_metadata`	{"tags":["adventofcode","programming","haskell","functionalprogramming","puzzle"],"image":["https://cdn.steemitimages.com/DQmdFKh9fmeJjvzaKRHmVEAAvFmjkNi4fJ9dk3SrKpfM3qo/inclusion-exclusion-areas.png","https://cdn.steemitimages.com/DQmQMef7iqBswdXPhnrLBB8eLY5qEWTy3aqGbN9R7wwdU8y/image.png"],"links":["http://hackage.haskell.org/package/base-4.12.0.0/docs/Prelude.html#v:maximum","https://adventofcode.com/2018/day/11","http://hackage.haskell.org/package/array-0.5.3.0/docs/Data-Array.html","https://github.com/mgritter/aoc2018/blob/master/day11/day11.hs"],"app":"steemit/0.1","format":"markdown"}
`created`	2019-01-04 04:39:30
`last_update`	2019-01-04 04:39:30
`depth`	0
`children`	2
`last_payout`	2019-01-11 04:39:30
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	1.546 HBD
`curator_payout_value`	0.472 HBD
`pending_payout_value`	0.000 HBD
`promoted`	0.000 HBD
`body_length`	10,190
`author_reputation`	7,057,249,855,552
`root_title`	"Advent of Code Day 11 [spoilers], Inclusion-Exclusion, and Haskell's odd design decisions"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 HBD
`percent_hbd`	10,000
`post_id`	77,844,403
`net_rshares`	3,760,677,973,619
`author_curate_reward`	""

properties (23)vote details (202)

voter	rshares	pct
wackou	18,702,679,453	0.32%
lafona-miner	193,217,612,262	5%
eric-boucher	1,831,145,216	0.54%
anwenbaumeister	15,620,062	1.08%
lukestokes	1,676,944,779,271	100%
mammasitta	273,372,166	0.05%
liberosist	1,905,654,822	1.08%
cmp2020	18,809,516,714	35%
minnowsunite	199,797,171	100%
lemouth	24,050,633,334	5%
rwilday	55,323,763	100%
lamouthe	2,555,958,193	5%
lk666	100,859,043	0.54%
eforucom	21,853,114,126	1%
remlaps1	24,058,883,190	35%
curie	30,359,503,702	1.08%
cub1	94,411,767,849	35%
hendrikdegrote	339,575,712,751	1.08%
vact	21,757,708,787	1.08%
golbang	2,386,470,322	0.32%
steemstem	228,758,749,647	5%
remlaps2	2,156,776,393	35%
dna-replication	1,353,422,112	5%
lisa.palmer	2,320,883,799	35%
cub2	2,126,327,483	35%
astronomyizfun	1,804,301,023	35%
moksamol	115,367,091	0.54%
bloom	13,644,008,865	4%
epicdesigns	2,070,682,827	10%
iansart	609,780,360	0.54%
kryzsec	1,169,261,270	4%
jiujitsu	173,951,800	0.54%
helo	3,417,605,693	2.5%
samminator	1,458,071,819	2.5%
locikll	452,506,932	2.16%
mahdiyari	3,778,117,493	2.5%
lorenzor	5,303,298,272	50%
aboutyourbiz	185,947,208	1.08%
alexander.alexis	2,217,168,428	5%
suesa	125,963,898,605	25%
cryptokrieg	91,823,403	1.08%
corsica	2,296,984,470	5%
howo	10,174,153,689	2.5%
tsoldovieri	321,433,486	2.5%
nitego	83,513,305	0.32%
neumannsalva	119,656,587	0.54%
wargof	237,368,220	10%
abigail-dantes	84,876,604,506	5%
zonguin	204,013,366	1.25%
alexzicky	991,622,714	1.25%
mountain.phil28	3,577,967,737	25%
tuoficinavirtual	97,323,426	25%
iamphysical	13,070,682,450	90%
nuthman	2,443,189,729	0.48%
zest	890,992,453	2.5%
felixrodriguez	179,805,175	2.5%
azulear	1,943,465,742	100%
psicoluigi	287,345,736	50%
massivevibration	3,838,522,930	5%
fbslo	421,022,791	0.5%
nicola71	131,559,845	0.87%
erikkun28	0	1%
filipino	912,387,151	10%
birddroppings	2,451,741,988	10%
mayowadavid	175,141,592	2.5%
emdesan	135,158,735	10%
peaceandwar	127,014,247	0.54%
enzor	106,876,435	5%
jesusj1	80,420,132	100%
carloserp-2000	37,980,013,894	100%
gra	1,833,082,021	5%
eonwarped	456,753,006,117	100%
aalok	107,044,616	26%
drmake	542,272,885	0.54%
guga34	84,876,935	3.75%
amestyj	2,950,956,619	50%
mhm-philippines	616,700,078	0.54%
skycae	96,501,852	1.08%
woolnami	638,503,329	0.32%
kenadis	1,159,357,754	5%
maticpecovnik	240,458,792	2%
robotics101	438,577,623	5%
ivymalifred	1,454,728,852	50%
ennyta	1,894,208,307	50%
rharphelle	1,037,879,194	25%
stahlberg	173,901,233	0.54%
vjap55	368,509,545	100%
mangoish	86,778,935	10%
eliaschess333	4,911,041,238	50%
ydavgonzalez	1,380,434,793	5%
langford	209,880,503	5%
mattiarinaldoni	0	1%
mathowl	19,533,666,019	50%
silkroadgo	1,349,433,441	0.32%
terrylovejoy	796,783,715	2%
traviseric	220,761,467	50%
yrmaleza	873,510,217	50%
mondodidave73	174,706,881	0.75%
kingabesh	112,915,119	2.5%
miguelangel2801	788,854,424	50%
didic	551,874,154	0.54%
fcdvpds	1,637,794,517	2%
therosepatch	196,516,727	50%
emiliomoron	2,032,024,217	50%
dexterdev	519,906,664	2.5%
nwjordan	102,382,001	1.08%
oghie	587,105,138	50%
geopolis	283,934,645	5%
robertbira	482,137,392	1.25%
alexdory	1,927,204,607	2%
aotearoa	764,608,969	9%
flugschwein	2,128,704,017	4.75%
benleemusic	132,627,832	0.1%
ulisesfl17	1,855,034,599	100%
arac	1,014,092,865	100%
francostem	641,588,166	5%
ivan-g	92,221,521	0.54%
endopediatria	694,849,803	20%
croctopus	1,412,520,050	100%
sissyjill	72,314,343	7%
emmanuel293	100,214,836	25%
morbyjohn	138,700,365	7%
positiveninja	123,008,973	0.54%
tomastonyperez	6,871,236,977	50%
elvigia	6,156,974,844	50%
qberry	470,770,674	0.54%
acknowledgement	1,119,295,683	10%
lesmouths-travel	269,263,310	5%
ezravandi	397,944,176	1%
effofex	481,657,557	2.5%
luiscd8a	1,981,038,147	80%
eniolw	220,811,109	5%
de-stem	2,767,081,589	4.95%
geadriana	950,955,973	50%
elpdl	414,349,933	100%
derbesserwisser	20,468,946,536	100%
serylt	1,526,808,754	4.9%
josedelacruz	3,249,750,308	50%
viannis	1,536,636,053	50%
flores39	414,807,607	100%
majapesi	248,960,104	50%
sbi3	27,734,583,488	4.88%
irelandscape	16,876,614,634	100%
deholt	137,373,129	5%
timothyallen	826,173,371	0.54%
temitayo-pelumi	335,458,571	5%
yusvelasquez	418,787,954	50%
alexworld	58,618,049	25%
frost1903	44,807,132	50%
acont	249,282,913	50%
anaestrada12	10,710,382,039	100%
steemzeiger	228,899,754	4.95%
council	948,585,837	10%
blewitt	419,315,305	0.1%
kafupraise	81,907,365	34%
biomimi	189,707,760	40%
drsensor	651,759,290	4%
abcor	738,767,144	0.1%
reyvaj	788,041,053	2.5%
jesusfl17	415,356,690	100%
ilovecryptopl	91,122,817	0.86%
emsonic	254,363,016	0.54%
yomismosoy	192,100,216	50%
ubaldonet	2,097,029,629	80%
spbeckman	2,200,207,200	100%
yestermorrow	270,174,457	1.25%
call-me-howie	155,995,985	0.54%
mary11	293,997,605	75%
rgkmb-unofficial	2,093,140,001	35%
rgkmb	131,612,126	35%
wstanley226	247,927,544	50%
osariemen	634,705,055	90%
markgritter	17,019,250,876	100%
lupafilotaxia	9,255,935,387	100%
fran.frey	1,815,314,561	50%
alaiza	485,844,755	100%
jrevilla	168,314,952	50%
moniroy	644,388,795	25%
cmp2020-lite	109,494,373	35%
remlaps-lite	122,485,754	35%
skorup87	16,697,780	11%
swapsteem	79,895,109	2.5%
stem-espanol	38,427,203,292	100%
praditya	1,328,495,773	24%
lapp	485,844,767	100%
steemtpistia	485,306,871	100%
crassipes	485,560,767	100%
javier.dejuan	1,110,236,036	5%
agrovision	485,844,408	100%
xeliram	249,139,561	50%
giulyfarci52	323,221,903	50%
stem.witness	1,736,236,492	5%
monkeycatwithowl	553,953,218	100%
double-negative	472,606,527	20%
wilmer14molina	1,693,428,033	100%
kingnosa	57,482,706	50%
luna777	1,147,982,240	3.08%
amin-ove	80,778,447	50%
hairgistix	415,107,291	0.54%
huilco	333,373,774	100%
steemmyphoto	2,042,182,017	100%
combatsports	267,536,535	1.08%