Booleans and Conditional (R2) Continued

Select Instructions

Thanks to explicate-control and the grammar of C1, compiling if statements to x86 is straightforward. Let arg1 and arg2 be the results of translating atm1 and atm2 to x86, respectively.

if (eq? atm1 atm2)       =>      cmpq arg2, arg1
  goto l1;                       je l1
else                             jmp l2
  goto l2;

and similarly for the other comparison operators.

We only use the set and movzbq dance for comparisons in an assignment statement. Let arg1 and arg2 be the results of translating atm1 and atm2 to x86, respectively.

var = (eq? atm1 atm2);    =>     cmpq arg2, arg1
                                 sete %al
                                 mozbq %al, var

Register Allocation

Liveness Analysis

We know how to perform liveness on a basic block. But now we have a whole graph full of basic blocks. Example:

              start
             /      \
            /        \
    inner_then      inner_else
    |         \____/         |
    |         /    \         |
    outer_then      outer_else

    locals: (x y)
    start:
        callq 'read_int
        movq %rax, x
        callq 'read_int
        movq %rax, y
        cmpq $1, x
        jl inner_then
        jmp inner_else
    inner_then:
        cmpq $0, x
        je outer_then
        jmp outer_else
    inner_else:
        cmpq $2, x
                        live after ??  {y} (union)
        je outer_then
        jmp outer_else
    outer_then:           {y}
        movq y, %rax
        addq $2, %rax
        jmp conclusion
    outer_else:           {y}
        movq y, %rax
        addq $10, %rax
        jmp conclusion

Q: In what order should we process the blocks? A: Reverse topological order. In other words, first process the blocks with no out-edges, because the live-after set for the last instruction in each of those blocks is the empty set. In this example, first process outer_then and outer_else. Then imagine that those blocks are removed from the graph. Again select a block with no out-edges and repeat the process, continuing until all the blocks are gone.

Q: How do we compute the live-after set for the instruction at the end of each block? After all, we don’t know which way the conditional jumps will go. A: Take the union of the live-before set of the first instruction of every successor block. Thus we compute a conservative approximation of the real live-before set.

Build Interference

Nothing surprising. Need to give movzbq special treatment similar to the movq instruction. Also, the register al should be considered the same as rax, because it is a part of rax.

Patch Instructions

cmpq the second argument must not be an immediate.
movzbq the target argument must be a register.

Challenge: Optimize and Remove Jumps

The output of explicate-control for our running example is not quite as nice as we advertised above. It generates lots of trivial blocks that just goto another block.

block8482:
    if (eq? x8473 2)
       goto block8479;
    else
       goto block8480; // block8476
block8481:
    if (eq? x8473 0)
       goto block8477;
    else
       goto block8478;
block8480:
    goto block8476;
block8479:
    goto block8475;
block8478:
    goto block8476;
block8477:
    goto block8475;
block8476:
    return (+ y8474 10);
block8475:
    return (+ y8474 2);
start:
    x8473 = (read);
    y8474 = (read);
    if (< x8473 1)
       goto block8481;
    else
       goto block8482;

Two passes:

optimize-jumps collapses sequences of jumps through trivial blocks into a single jump. Remove the trivial blocks.
```
 B1 -> B2* -> B3* -> B4 -> B5* -> B6
 =>
 B1 -> B4 -> B6
```

remove-jumps merges a block with one that comes after if there is only one in-edge to the later one.

 B1    B2    B3         B1    B2     B3
 |      \    /          B4     \     /
 |       \  /            \      \   /
 B4       B5       =>     \       B5
   \     /                 \     /
    \   /                   \   /
      B6                      B6

 B1    B2    B3         B1    B2    B3
 |      \    /          B4     \    /
 |       \  /            \      \  /
 B4       B5*      =>     \      \/ 
   \     /                 \     /
    \   /                   \   /
      B6                      B6

R3 Language (Tuples)

(define v (vector 1 2 #t)) ;; [1,2,#t] : (Vector Integer Integer Boolean)
(vector-ref v 0) ;; 1
(vector-ref v 1) ;; 2
(vector-set! v 0 5)       ;; [5,2,#t]
(vector-ref v 0) ;; 5

type ::= Integer | Boolean | (Vector type+) | Void
exp ::= ... 
  | (vector exp+)
  | (vector-ref exp int)
  | (vector-set! exp int exp)
  | (void)
R3 ::= (program exp)

Example Programs

Tuples are first class

  (let ([t (vector 40 #t (vector 2))])
    (if (vector-ref t 1)
       (+ (vector-ref t 0)
          (vector-ref (vector-ref t 2) 0))
       44))

Tuples may be aliased

  (let ([t1 (vector 3 7)])                 ;;    t1 -> [42,7] <- t2 ***
    (let ([t2 t1])                         ;;    t2 -> [3,7] XXX
      (let ([_ (vector-set! t2 0 42)])
        (vector-ref t1 0))))

Tuples live forever (from the programmer’s view)

  (vector-ref
    (let ([t (vector 3 7)])
      t)
    0)  ;; 3

  ;; Question: How is that different from integers?

  (+
    (let ([t 5])
      t)
    0)

  ;; Answer: Ok, consider this example in which (vector 3 7)
  ;; is not returned from the `let`, but we can nevertheless
  ;; refer to it through the vector bound to `x`.

  (let ([x (vector (vector))])           ;; x ->    [ * ]
    (+                                   ;;           |
       (let ([t (vector 3 7)])           ;; t ->    [3,7]
          (vector-set! x 0 t)
          5) ;; 5
       (vector-ref (vector-ref x 0) 0)) ;; 3
  ;; 8