Dream Coder - Algorithm 3

Let us say you have a sample space S and an event s ∈ S that occurs with probability P(s) = 1/2. If we observe s, we are effectively cutting the sample space in half, so this event carries 1 bit of information.
Similarly, an event p with probability P(p) = 1/4 carries two bits of information, because observing it effectively cuts the sample space to a quarter.

So less likely events carry more information when they occur. In general, if x ∈ S and P: S → [0,1] is a probability function, then I(x) = -log2(P(x)).
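This definition can be checked directly in a few lines; the function name `information` is just a placeholder for illustration:

```python
import math

def information(p: float) -> float:
    """Information content, in bits, of an event with probability p."""
    return -math.log2(p)

# Halving the sample space carries 1 bit; quartering it carries 2 bits.
print(information(0.5))   # 1.0
print(information(0.25))  # 2.0
```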

Description of the Enumerate Algorithm

Setup and Notation

Let S denote the sample space of all possible programs, and let μ: S → [0,1] be a probability distribution over programs, such that μ(x) represents the probability of program x ∈ S. The information content of a program x is defined as:

I(x) = -log2(μ(x))

This measures the amount of information carried by observing x, and less probable programs carry more information. For example, a program with μ(x) = 1/2 carries 1 bit, while one with μ(x) = 1/4 carries 2 bits.

The goal of the enumerate procedure is to generate programs in approximately decreasing order of probability (or equivalently, increasing order of information) under μ. The algorithm works in two phases: a Heap Search Phase followed by a Parallel DFS Phase.


Heap Search Phase

Initialization

  1. Define a lower bound lowerBound=0 and an upper bound upperBound=Δ, where Δ>0 is a hyperparameter controlling the granularity of the search.

  2. Initialize a max-heap, where programs are ordered by their priority, defined as priority(x)=log2(μ(x)) (equivalently, higher probabilities are prioritized).

  3. Insert the initial empty program (or syntax tree) into the heap with priority 0.

Iterative Heap Processing

While the heap size is manageable (specifically, |heap| ≤ 10×CPUs):

  1. Pop a program ρ from the heap:

    • If ρ is complete (a fully-specified program):

      • Check whether lowerBound ≤ I(ρ) ≤ upperBound. If true, yield ρ.
    • Otherwise, compute the set of children children(ρ), where each child c ∈ children(ρ) fills in the next "hole" in ρ's syntax tree.

  2. For each child c:

    • Compute its information I(c) = -log2(μ(c)).

    • If I(c) ≤ upperBound (i.e., c is not too improbable), insert c into the heap with priority log2(μ(c)).

If the heap grows too large (|heap| > 10×CPUs), transition to the Parallel DFS Phase.
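The heap phase above can be sketched as follows. This is a minimal sketch, not the DreamCoder implementation: `info`, `children`, and `is_complete` are hypothetical callbacks supplied by the grammar, and a min-heap keyed on I(p) is used, which is equivalent to a max-heap on log2(μ(p)):

```python
import heapq
import itertools

def heap_search(root, info, children, is_complete, upper_bound, max_heap_size):
    """Best-first search over partial programs, ordered by information.

    info(p) returns I(p) = -log2(mu(p)); children(p) fills p's next hole;
    is_complete(p) tests whether p is a fully specified program.
    Returns (yielded programs, partial programs left on the heap).
    """
    tie = itertools.count()                 # break ties between equal priorities
    heap = [(info(root), next(tie), root)]
    yielded = []
    while heap and len(heap) <= max_heap_size:
        i, _, prog = heapq.heappop(heap)
        if is_complete(prog):
            if 0 <= i <= upper_bound:       # lowerBound = 0 in this phase
                yielded.append(prog)
        else:
            for c in children(prog):
                if info(c) <= upper_bound:  # prune overly improbable children
                    heapq.heappush(heap, (info(c), next(tie), c))
    return yielded, [p for _, _, p in heap]
```

For a toy grammar where each hole is filled with "a" or "b" at 1 bit apiece, `heap_search("", len, lambda p: [p + "a", p + "b"], lambda p: len(p) == 2, 2, 100)` yields all four two-symbol programs.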


Parallel DFS Phase

Task Distribution

  1. Distribute up to 10×CPUs programs from the heap to the CPUs for processing. Each CPU receives approximately 10 programs (partial programs) as independent tasks.

  2. Set the initial search bounds for the DFS to lowerBound=0 and upperBound=Δ.
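The task-distribution step can be sketched as a simple round-robin split of the leftover partial programs; `distribute` and `per_cpu` are illustrative names, not part of the original algorithm description:

```python
def distribute(partials, n_cpus, per_cpu=10):
    """Spread up to n_cpus * per_cpu partial programs across CPUs,
    giving each CPU an independent list of DFS tasks."""
    tasks = [[] for _ in range(n_cpus)]
    for i, p in enumerate(partials[: n_cpus * per_cpu]):
        tasks[i % n_cpus].append(p)
    return tasks
```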

Per-CPU DFS Procedure

Each CPU processes its assigned tasks sequentially. For each partial program ρ:

  1. Perform a depth-first search (DFS) to explore all completions of ρ.

  2. When a completion c is found:

    • Check if lowerBound ≤ I(c) ≤ upperBound.

    • If true, yield c.

  3. For incomplete programs:

    • Compute their children children(ρ).

    • For each child c:

      • Compute its information I(c).

      • If I(c) ≤ upperBound, push c onto the local DFS stack for further exploration.

Each CPU finishes all of its 10 assigned jobs sequentially but operates in parallel with other CPUs.
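One CPU's bounded DFS over the completions of a single partial program can be sketched like this (same hypothetical `info`/`children`/`is_complete` callbacks as before; pruning on `info(c) <= upper` is sound because information only grows as holes are filled):

```python
def dfs_completions(root, info, children, is_complete, lower, upper):
    """Explicit-stack DFS yielding completions of root whose
    information falls inside [lower, upper]."""
    stack = [root]
    out = []
    while stack:
        prog = stack.pop()
        if is_complete(prog):
            if lower <= info(prog) <= upper:
                out.append(prog)
        else:
            for c in children(prog):
                if info(c) <= upper:   # prune: I never decreases down the tree
                    stack.append(c)
    return out
```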

Lower Bound Adjustment

Once all CPUs complete their assigned jobs:

  1. Increase the lower bound by Δ, i.e., set lowerBound ← lowerBound + Δ.

  2. Adjust the upper bound accordingly, i.e., upperBound ← upperBound + Δ.

  3. Reuse the same set of programs from the heap for the next DFS phase, but now search for completions with higher information (i.e., less probable programs).
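Putting the bound adjustment together with the DFS gives the outer loop below, a sketch under the same assumed callbacks: the same partial programs are re-searched each round while [lowerBound, upperBound] shifts by Δ, so each round surfaces progressively less probable completions:

```python
def enumerate_rounds(partials, info, children, is_complete, delta, n_rounds):
    """Generator over completions, in rounds of increasing information.

    Round k keeps completions with k*delta <= I(p) <= (k+1)*delta,
    re-running a bounded DFS over the same partial programs each round.
    """
    lower, upper = 0.0, delta
    for _ in range(n_rounds):
        for root in partials:
            stack = [root]
            while stack:
                prog = stack.pop()
                if is_complete(prog):
                    if lower <= info(prog) <= upper:
                        yield prog
                else:
                    for c in children(prog):
                        if info(c) <= upper:  # prune branches beyond this round
                            stack.append(c)
        lower += delta   # shift the information window for the next round
        upper += delta
```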


Summary of the Algorithm

  1. The algorithm starts with a Heap Search Phase, which uses a max-heap to explore programs with information I(ρ) ≤ Δ.

  2. When the heap grows too large, the algorithm transitions to a Parallel DFS Phase:

    • Each CPU explores its assigned programs in parallel, searching for completions within the current information bounds [lowerBound,upperBound].

    • The bounds are incrementally shifted by Δ after each round, allowing the algorithm to progressively explore less probable programs.

This approach efficiently enumerates programs in decreasing order of probability while balancing parallelism and memory constraints.

A note on MDL (minimum description length):

Basically, the information of a program is an estimate of its minimum description length; that is what the upper bound controls: highly informative programs are longer and have lower probability.
Support/Figures/Pasted image 20241212220318.png
Support/Figures/Pasted image 20241212220349.png