

## Trivial Code Generation

#### Generating Code From Trees

- To generate code from expression trees, traverse the tree and emit machine code instructions.
- For leaves (which represent operands), generate load instructions. For interior nodes, generate arithmetic instructions.
- Assume an infinite number of registers ⇒ easy algorithm!
- Each tree node N has an attribute 'R', the register into which the subtree rooted at N will be computed.



# Generating Code From Labeled Trees

#### Optimal Ordering For Trees I

- . We can generate 'optimal' code from a tree. 'Optimal' in the sense of 'smallest number of instructions generated'.
- The idea is to reorder the computations to minimize the need for register spilling.

| First Order             | Second Order       |  |
|-------------------------|--------------------|--|
|                         | $t_2 := c + d$     |  |
| t <sub>2</sub> := c + d | $t_3 := e - t_2$   |  |
|                         | $t_1 := a + b$     |  |
| $t_4 := t_1 - t_3$      | $t_4 := t_1 - t_3$ |  |
|                         |                    |  |

 Assume two registers available. The first ordering evaluates the left subtree first, and has to spill R0 to have enough registers available for the right subtree.

| First Order                                     | Second Order                                    | First Order            | Second Order           |
|-------------------------------------------------|-------------------------------------------------|------------------------|------------------------|
| t1 :=a+b                                        | t <sub>2</sub> :=c+d                            | MOV a, RO              | MOV c, RO              |
| t <sub>2</sub> :=c+d                            | t <sub>3</sub> :=e-t <sub>2</sub>               | ADD b, RO              | ADD d, RO              |
| t <sub>3</sub> :=e-t <sub>2</sub>               | t1 :=a+b                                        | MOV c, R1              | MOV e, R1              |
| t <sub>4</sub> :=t <sub>1</sub> -t <sub>3</sub> | t <sub>4</sub> :=t <sub>1</sub> -t <sub>3</sub> | ADD d, R1              | SUB RO, R1             |
|                                                 |                                                 | MOV RO, t <sub>1</sub> | MOV a, RO              |
|                                                 |                                                 | MOV e, RO              | ADD b, RO              |
|                                                 |                                                 | SUB R1, R0             | SUB RO, R1             |
|                                                 |                                                 | MOV t1, R1             | MOV RO, t <sub>4</sub> |
|                                                 |                                                 | SUB RO, R1             |                        |
|                                                 |                                                 | MOV R1, t <sub>4</sub> |                        |
|                                                 | •                                               | •                      | •                      |

2 090

D > (#) (2) (2) (2) 2 (0)

#### The Tree Labeling Phase I

#### The Tree Labeling Phase II

- If we have a node n with subtrees n and n with L=label(n1) & R=label(n2) & L<R then we can first evaluate n2 into a register Reg using R registers. Then we use R-1 registers to evaluate n1.
- Similarly, if L>R then we can first evaluate n1 into a register Reg and use the remaining R-1 registers for n2.



register to hold the result of  $n_1$  while we evaluate  $n_2$ .

The algorithm has two parts. First we label each sub-tree with the minimum number of registers needed to evaluate the subtree without any register spilling.

\_ The Labeling Algorithm: \_\_\_\_\_

- *n* is a left leaf  $\Rightarrow$  label(*n*) := 1;
- *n* is a right leaf  $\Rightarrow$  label(*n*) := 0:
- n's children have labels I<sub>1</sub> & I<sub>R</sub>:

• 
$$l_L \neq l_R \Rightarrow label(n) := max(l_L, l_R)$$

• 
$$I_L = I_R \Rightarrow \text{label}(n) := I_L + 1$$



#### The Generation Phase I

#### The Generation Phase II

- gencode (n) generates machine code for a subtree n of a labeled tree T.
- MOV M, R Load variable M into register R.
- MOV R, M Store register R into variable M.
- OP M, R Compute R := R OP M.  $OP \in ADD$ , SUB, MUL, DIV.
- OP R2, R1 Compute R1 := R1 OP R2.
  - A stack rstack initially contains all available registers. gencode(n) generates code for subtree n using the registers on rstack, computing its value into the register on the top of the stack.
  - A stack tstack of temporary memory locations is used for register spilling.



- Case 0 A leaf n is the leftmost child of its parent.
- Case 1 A leaf n2 is the rightmost child of its parent.
- Case 2 A right subtree  $n_2$  requires more registers than the left subtree  $n_1$ .
- Case 3 A left subtree  $n_1$  requires more registers than the right subtree  $n_2$ .
- Case 4 Both subtrees require registers to be spilt.



### The Generation Phase III



- Generate code for n<sub>1</sub> into register top(rstack), i.e. call gencode(n<sub>1</sub>).
- Generate OP name, top(rstack)
- (D) (B) (2) (2) (2) (2) (0)

#### The Generation Phase IV



- $n_1$  can be evaluated without spilling, but  $n_2$  requires more registers than  $n_1$ .
- We swap the two top registers on rstack, evaluate  $n_2$  into top(rstack), remove the top register, then evaluate  $n_1$  into top(rstack). Restore the stack.
- swap(rstack), gencode(n<sub>2</sub>)
- Q R := pop(rstack)
- gencode(n1)
- Generate OP R, top(rstack)
- push(rstack, R), swap(rstack)

#### The Generation Phase VI





# Summary

 This lecture is taken from the Dragon Book: Code Generation From Trees: 557–559, 561–566. Local Optimization: 530–532, 600–602.

#### Summary I





