Add BeamExtract (approximate DAG extractor) and DagExpr (canonical multi-rooted DAG expression) #358

recmo · 2025-09-05T09:59:27Z

This PR adds two new structs:

DagExpr. This generalizes RecExpr by allowing multiple roots. It also maintains strong invariants on minimality and canonical order, so a given DAG with a given set of roots has exactly one representation.
BeamExtract. A DAG extractor based on DagExpr that uses beam search to approximate the optimal extraction.
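
To make the minimality invariant concrete, here is a minimal sketch of a multi-rooted DAG with a validity check. The names `Node`, `MiniDag`, and this `is_valid` are hypothetical and are not the PR's types; the check covers only topological order, absence of duplicate nodes, and reachability from the roots, and does not model the canonical ordering.

```rust
use std::collections::HashSet;

/// Hypothetical node type: an operator name plus child indices.
#[derive(Clone, Debug, PartialEq, Eq, Hash)]
struct Node {
    op: &'static str,
    children: Vec<usize>,
}

/// Hypothetical multi-rooted DAG: nodes stored children-first, plus root indices.
#[derive(Clone, Debug, PartialEq, Eq)]
struct MiniDag {
    nodes: Vec<Node>,
    roots: Vec<usize>,
}

impl MiniDag {
    /// Returns true if children precede parents, no node appears twice,
    /// and every node is reachable from at least one root.
    fn is_valid(&self) -> bool {
        let mut seen = HashSet::new();
        for (i, node) in self.nodes.iter().enumerate() {
            // Children must refer to strictly earlier nodes.
            if node.children.iter().any(|&c| c >= i) {
                return false;
            }
            // Identical nodes must be shared, not stored twice.
            if !seen.insert(node) {
                return false;
            }
        }
        if self.roots.iter().any(|&r| r >= self.nodes.len()) {
            return false;
        }
        // Parents have larger indices than children, so one reverse sweep
        // propagates reachability from the roots to every descendant.
        let mut reachable = vec![false; self.nodes.len()];
        for &r in &self.roots {
            reachable[r] = true;
        }
        for i in (0..self.nodes.len()).rev() {
            if reachable[i] {
                for &c in &self.nodes[i].children {
                    reachable[c] = true;
                }
            }
        }
        reachable.iter().all(|&b| b)
    }
}
```

With two roots sharing the leaf `x`, the shared form passes the check, while a copy that stores `x` twice or keeps a node no root reaches is rejected.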

One cool thing I realized while building this is that if we add a minor restriction on Language (the ordering of nodes is preserved under a monotonically increasing map of node Ids), then merging two DagExprs can be done with a linear-time merge-sort variant.
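
A rough sketch of how such a merge could work, under assumptions that are not taken from the PR: nodes are a hypothetical `Node` type, and each list is sorted by an assumed key `(largest child index + 1, operator, children)`, which places every node after its children. Because nodes from one input are emitted in their original order, the index map for that input is strictly increasing, so the order-preservation property keeps each remapped input sorted and a single two-pointer pass suffices.

```rust
use std::cmp::Ordering;

/// Hypothetical node: operator name plus child indices into the same list.
#[derive(Clone, Debug, PartialEq, Eq)]
struct Node {
    op: &'static str,
    children: Vec<usize>,
}

/// Assumed sort key (not the PR's ordering). Leaves get 0 in the first slot,
/// so every node sorts strictly after all of its children.
fn key(n: &Node) -> (usize, &'static str, &[usize]) {
    let first = n.children.iter().max().map_or(0, |&c| c + 1);
    (first, n.op, &n.children)
}

/// Rewrite a node's children through an old-index -> new-index map.
fn remap(n: &Node, map: &[usize]) -> Node {
    Node { op: n.op, children: n.children.iter().map(|&c| map[c]).collect() }
}

/// Merge two key-sorted node lists into one sorted, duplicate-free list.
/// Also returns the index maps from `a` and `b` into the merged list.
fn merge(a: &[Node], b: &[Node]) -> (Vec<Node>, Vec<usize>, Vec<usize>) {
    let mut out: Vec<Node> = Vec::new();
    let mut map_a: Vec<usize> = Vec::new();
    let mut map_b: Vec<usize> = Vec::new();
    let (mut i, mut j) = (0, 0);
    while i < a.len() || j < b.len() {
        // Children precede parents, so their merged indices are already known.
        let na = a.get(i).map(|n| remap(n, &map_a));
        let nb = b.get(j).map(|n| remap(n, &map_b));
        let ord = match (&na, &nb) {
            (Some(x), Some(y)) => key(x).cmp(&key(y)),
            (Some(_), None) => Ordering::Less,
            _ => Ordering::Greater,
        };
        let next = out.len();
        // On equality both inputs advance and the node is emitted once.
        if ord != Ordering::Greater {
            map_a.push(next);
            i += 1;
        }
        if ord != Ordering::Less {
            map_b.push(next);
            j += 1;
        }
        out.push(if ord == Ordering::Greater { nb.unwrap() } else { na.unwrap() });
    }
    (out, map_a, map_b)
}
```

The loop runs at most once per input node and does work proportional to the arity of the two head nodes, so the pass is linear for bounded arity. Without the order-preservation property, remapping could reorder an input and the two-pointer comparison would no longer be sound.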

I'm wondering if this minor restriction should be added to the docs of Language as a requirement, or an additional marker trait languages need to implement.
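
For illustration, the marker-trait option might look roughly like the sketch below. `Lang`, `MonotoneOrd`, `order_preserved`, and `Math` are all hypothetical names standing in for the real traits; none of this is egg's API. The marker carries no methods, only a promise, and the helper spot-checks that promise for one pair of nodes.

```rust
/// Hypothetical stand-in for a language node type with rewritable child ids.
trait Lang: Ord + Clone {
    fn children_mut(&mut self) -> &mut [usize];
}

/// Hypothetical marker trait: no methods, only a promise that `Ord` is
/// preserved when child ids go through a strictly increasing map.
trait MonotoneOrd: Lang {}

/// Spot-check the promise for one pair of nodes and one map (`map[old] = new`).
fn order_preserved<L: MonotoneOrd>(a: &L, b: &L, map: &[usize]) -> bool {
    let rewrite = |n: &L| {
        let mut n = n.clone();
        for c in n.children_mut() {
            *c = map[*c];
        }
        n
    };
    a.cmp(b) == rewrite(a).cmp(&rewrite(b))
}

/// Toy language whose derived `Ord` compares the variant first and then the
/// child ids lexicographically, which satisfies the promise.
#[derive(Clone, Debug, PartialEq, Eq, PartialOrd, Ord)]
enum Math {
    Var(&'static str),
    Add([usize; 2]),
}

impl Lang for Math {
    fn children_mut(&mut self) -> &mut [usize] {
        match self {
            Math::Var(_) => &mut [],
            Math::Add(ids) => &mut ids[..],
        }
    }
}

impl MonotoneOrd for Math {}
```

A marker trait keeps existing Language implementations compiling and makes the merge opt-in, whereas a documented requirement on Language would apply to every implementation without any compiler check.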

It may make sense to use DagExpr for the LpExtract result as well, but that would be breaking the public API.

oflatt · 2025-09-06T19:27:29Z

Hi, thanks for your PR!
The egg team has switched focus to egglog and can't keep expanding egg.

Could this PR be a separate crate?

This algorithm could be a great contribution to the extraction gym! It works over a serialized egraph that both egg and egglog export to.

https://github.com/egraphs-good/extraction-gym

recmo added 9 commits September 5, 2025 11:51

Consume self in RecExpr compaction

329df1f

Canonical, minimal, compact

3e50ed7

Shortlex order using binary heap

9d221c6

Don't need interner

debde68

Refactor to DagExpr

5007976

Use TopK struct

6a57be0

Add is_valid

0b18805

Cleanup

f8ad212

Improve docs and tests

38304ef

recmo mentioned this pull request Sep 12, 2025

Add Beam extractor and basic good_lp ILP extractor. egraphs-good/extraction-gym#48

Draft

4 tasks
