Tree @fb38023 (Download .tar.gz)
Lariat
Lariat is a project to define an abstract data type for lambda terms,
consisting of six basic operations: app
, abs
, var
, resolve
,
destruct
, and freevars
.
This repository presents the definition of these operations. It also contains implementations of this abstract data type (and possibly one day variations on it) in various programming languages, including:
The version of the Lariat defined by this document is 0.1. This version number will be promoted to 1.0 once vetted sufficiently.
Table of Contents
Background
There are several approaches for representing lambda terms in software.
The naive approach is to represent them just as they are written on paper. In this approach, whether a variable, such as x, is free or bound depends on whether it is inside a lambda abstraction λ x or not. If you need to manipulate it, you might need to rename it so that it doesn't conflict with another variable also called x that is bound to a different lambda abstraction.
This is tiresome and error-prone. So other approaches were developed.
One such approach alternate is De Bruijn indexes, where variables are represented not by names, but by numbers. The number indicates which lambda abstraction the variable is bound to, if any; a 1 indicates the immediately enclosing lambda abstraction, a 2 indicates the lambda abstraction just above that, and so on. If the number exceeds the number of enclosing lambda abstractions, then it is a free variable.
But this, too, has some drawbacks, so people have devised a number of other approaches:
- "nominal techniques" (Gabbay and Pitts)
- "locally nameless" (various?)
- "maps" (Sato et al.)
- "bound" (Kmett)
among others.
But the point we would like to make is this: At some level of abstraction it does not matter which approach is chosen as long as the approach satisfies the essential properties that we require of lambda terms.
To this end, we present this abstract data type for lambda terms, which we call Lariat, consisting of six operations. The actual, concrete data structure in which they are stored, and the actual, concrete mechanism by which names become bound to terms, are of no consequence (and may well be hidden from the programmer) so long as the implementation of the operations conforms to the stated specification.
The Operations
var(n: name): term
Given a name n, return a free variable with the name n.
Note: A free variable is a term; it can be passed to any operation that expects a term.
app(t: term, u: term): term
Given a term t and a term u, return an application term which contains t as its first subterm and u as its second subterm.
Note: An application term is a term that is an ordered pair of terms.
abs(n: name, t: term): term
Given a name n and a term t, return an abstraction term containing t', where t' is a version of t where all free variables named n inside t' have been replaced with bound variables. These bound variables are bound to the returned abstraction term.
Note: a bound variable is a term, but the user cannot work with bound variables directly. A bound variable is always bound to a particular abstraction term. In the case of
abs
, the abstraction term to which variables are bound, is the term returned by the operation.Note: an abstraction term contains one subterm. This subterm cannot be extracted directly, as it may contain bound variables, which the user cannot work with directly.
resolve(t: term, u: term): term
Given a term t and a term u, return either t (if t is not an abstraction term) or t', where t' is a version of t where all bound variables in t' that were bound to t itself have been replaced by u.
Note: as stated above, a bound variable is always bound to an abstraction term. The bound variables that are replaced by a
resolve
operation are always those that are bound to t.Note: this operation was specifically not named
subst
, assubst
is often described as replacing free variables, while this operation replaces bound ones. It was also specifically not namedbeta
because it does not require that t and u come from the same application term.
destruct(t: term, f1: fun, f2: fun, f3: fun): X
Given a term t and three functions f1, f2, and f3 (each with a different signature, described below), choose one of the three functions based on t, and evaluate it, returning what it returns.
If t is a free variable, evaluate f1(n) where n is the name of the free variable t.
If t is an application term, evaluate f2(u, v) where u is the first subterm of t and v is the second subterm of t.
If t is an abstraction term, evaluate f3(t).
Note: the
destruct
operation's signature shown above was abbreviated to make it look less intimidating. The full signature would be something more likedestruct(t: term, f1: fun(n: name): X, f2: fun(u: term, v: term): X, f3: fun(t: term): X): X
Note: while f1 and f2 "take apart" the term they are working on, f3 does not, because the only operation that is allowed to "take apart" an abstraction term, is
resolve
, and callingresolve
involves a choice (what to resolve the bound variable to) anddestruct
cannot make this choice for you.)
freevars(t: term): list(term)
Given a term t, return a list of free variables contained in t. These free variables may be located at any depth inside t, including inside any and all abstraction terms contained in t.
Stepping Back
Now that we have given the operations, we can make some comments on them.
This is an abstract data type for lambda terms, not the lambda calculus. Naturally, one ought to be able to write a lambda calculus normalizer using these operations (and this will be one of our goals in the next section), but one is not restricted to that. The terms constructed using the Lariat operations may be used for any purpose for which terms-with-name-binding might be useful.
It is destruct
that allows us to examine a lambda term. destruct
is a
"destructorizer" in the sense described in the Destructorizers article.
It is a slight variation on the conventional destructorizer pattern
described there, as it does not "take apart" abstraction terms, but it is
undoubtedly in the spirit of the thing.
It is regrettable that freevars
is an intrinsic operation rather than
something that can be built as a recursive function that uses destruct
,
but it seems it is needed in order to use destruct
with generality,
for the following reasons. The only practical way to "take apart" an abstraction
term is to resolve
it with a known free variable. But how do you
pick that free variable, so that it will not collide with any of the
free variables inside the term you are "taking apart"? Short of
devising some clever namespacing for names, we need to know what
free variables are inside the term, before resolve
-ing it.
Thus freevars
exists to fulfill that purpose.
Some Examples
Example 1
As a warm-up, suppose we want to write a function that tells us if a
lambda term contains any free variable named (for the sake of
concreteness, let's say) j
.
In this implementation and those that follow, we will assume we have
a simple functional language with the usual accoutrements (recursion,
if
, let
and so forth).
We'll also assume the existence of a value which is a
name supply; we call a function pick
on it, and it returns
a name and new name supply. Now, pick
by itself does not ensure
the name is "fresh" (that is, not already used in a lambda term
that we're interested in), so we also inform pick
of the set
of names that we don't want it to return. In this particular case,
that set is the singleton set containing only j
.
let contains_j = fun(t, ns) ->
destruct(r,
fun(n) -> n == "j",
fun(t, u) -> contains_j(t, ns) || contains_j(u, ns),
fun(t) ->
let
(n, ns') = pick(ns, ~{"j"})
t' = resolve(t, var(n))
in
contains_j(t', ns')
)
Example 2
What if we want to get the set of free variables present in a lambda term? We ought to be able to do this, and yet, it's difficult unless we know of a name that we are certain will not be among the names used by the free variables. (Because in practice we "take apart" abstraction terms by resolving them with a free variable.)
So for that task, we have freevars
as one of the intrinsic
operations.
TODO: instead have an example of using freevars
and
passing it to pick
and walking down arbitrary abstraction
terms this way.
Example 3
The next task is to write a beta-reducer. We destruct
the term twice, once to ensure it is an application term,
and second to ensure the application term's first subterm
is an abstraction term. Then we resolve
the abstraction
with the application term's second subterm.
let beta = fun(r) ->
destruct(r,
fun(n) -> var(n),
fun(u, v) ->
destruct(u,
fun(_) -> app(u, v),
fun(_, _) -> app(u, v),
fun(u) -> resolve(u, v)
),
fun(t) -> t
)
Example 4
The next task would be to search through a lambda term, looking for a candidate application term to reduce, and reducing it.
--
-- Returns [bool, term] where bool indicates "has rewritten"
-- UNTESTED
--
let reduce = fun(t) ->
if is_beta_reducible(t) then
[true, beta(t)]
else
destruct(t,
fun(n) -> [false, var(n)],
fun(u, v) ->
let
r = reduce(u)
in
if r[0] then
[true, app(r[1], v)]
else
let
s = reduce(v)
in
if s[0] then
[true, app(u, s[1])]
else
[false, app(r[1], s[1])],
fun(t) -> [false, t]
)
From there it ought to be just a hop, a skip, and a jump to a proper lambda term normalizer.
Commit History
@fb380234f6645a6f94d2c8b9cee7cfa810e09ae4
git clone https://git.catseye.tc/Lariat/
- Implemented but untested Chris Pressey 3 years ago
- Implement `idBetaReducible` to support the `reduce` example. Chris Pressey 3 years ago
- Implement "beta-reduce a term" and improve tests. Chris Pressey 3 years ago
- Implement the "contains a free variable named _j_" example. Chris Pressey 3 years ago
- Add name supply code in Haskell implementation. Chris Pressey 3 years ago
- Link implementation to documentation. Chris Pressey 3 years ago
- Implement `resolve`. Chris Pressey 3 years ago
- Implement `freevars`. Chris Pressey 3 years ago
- Implement `destruct`. Chris Pressey 3 years ago
- Implement `abs`. Chris Pressey 3 years ago