Learning by Osmosis | Letters to an Unknown Audience

letters

to an unknown audience

Learning by Osmosis/ # /February 20, 2005

On the off chance that you are trying to write a Prolog interpreter, don't do what I did.

What did I do? I tried to store a bunch of equations amongst the various variables, and used a connected-component-finding algorithm to determine whether two things are required to be equal.

Instead, do what everyone else does, which is to just make substitutions directly in the unificands (?). This way, equals will still be required to be equal, but you don't have to search the graph every time you want to check whether two things are meant to be equal. My original algorithm was 50 lines of ML, and the bloody thing actually worked, but after I did some reading about unification, I sat down and wrote this:

fun unify [] result = result
  | unify ((Leaf name, T)::others) result =
	unify (map (cross (subst(Leaf name, T))) others)
			([(Leaf name, T)] @ result)
  | unify ((T, Leaf name)::others) result =
	unify (map (cross (subst(Leaf name, T))) others) 
			([(T, Leaf name)] @ result)
  | unify ((Op(sop, sargs), Op(top, targs))::others) result =
  	if sop = top then
		unify (zip sargs targs @ others) result
	else
		unify others result;

For those at home, that's 12 lines (the utilty routines are given in a footnote, for bean counters), and it actually does more than my naive 50-line monster. I tweaked the above about three times before I got it, and then it worked: I spent about an hour. The bulk of yesterday and today got the surrounding code that does the searching and matching on full Prolog expressions.

The morning after I wrote the above (Monday), I got out of bed and pulled out Pierce, Types and Programming Languages, and here's what he has:

unify(C) = if C = âˆ…, then [ ]
           else let {S = T} âˆª C' = C in
               if S = T
                   then unify(C')
               else if S = X and X âˆ‰ FV(T)
                   then unify([X = T]C') âˆ˜ [X = T]
               else if T = X and X âˆ‰ FV(S)
                   then unify([X = S]C') âˆ˜ [X = S]
               else if S = S₁ â†’ S₂ and T = T₁ â†’ T₂
                   then unify(C' âˆª {S₁ = T₁, S₂ = T₂})
               else
                   fail

Types and Programming Languages, p. 327

Almost exactly the same thing. The key differences are that I didn't do the free-variable check (X âˆ‰ FV(T)) and I didn't compose the new substitution onto the result of the recursive call (that's the "âˆ˜ [X â†’ T]" clause). So, mine could be made to break. Still, I did pretty good for a guy who didn't really know what he was doing.

Appendix

The details.

datatype TermTree =   Var of string | Const of string
                    | Op of string * TermTree list;

fun cross f(X, Y) = (f X, f Y);

fun zip [] [] = []
  | zip (x::xs) (y::ys) = (x, y) :: zip xs ys
  | zip _ _ = raise LengthMismatch;
  
fun subst (this, that) (Const C) = Const C
  | subst (this, that) (Var X) = if this = Var X then that else Var X
  | subst (this, that) (Op(name, args)) =
                                  Op(name, map (subst (this, that)) args);

I also updated it to treat constants and variables differently:

fun unify [] result = SOME result
  | unify ((Const c, Const d)::others) result = 
  	if c = d then
  		unify others result
  	else
  		NONE
  | unify ((Var name, T)::others) result =
	unify (map (cross (subst(Var name, T))) others)
			(comp_one result (Var name, T))
  | unify ((T, Var name)::others) result =
	unify (map (cross (subst(Var name, T))) others) 
			(comp_one result (Var name, T))
  | unify ((Op(sop, sargs), Op(top, targs))::others) result =
  	if sop = top then
		unify (zip sargs targs @ others) result
	else
		NONE
  | unify _ _ = NONE;

Keep Reading >

Comments

boy, of all the random places to find a prolog interpreter algorithm...

i actually tried writing one in PHP a few years back, and was far from successful. and if i understood this, i would try again, but i can't read ML. a translation to a more common language would be nice.

—posted by scott reynen at February 23, 2005 5:19 PM

Scott—

I just got around to approving your comment; very sorry about the delay. I'm glad you weren't burned by that experience.

Unfortunately, the above algorithm isn't a complete Prolog implementation; it just does the part called "unification." That is, it finds matches between an expression like Grandfather(Ezra, x) and one like Grandfather(Ezra, Newton); in this case "unify" would spit out "x → Newton".

It would be fun to try to write this up in another language; but FWIW I think it would actually be harder to read, for one who knows the respective languages. ML has a neat trait of being very readable, for certain kinds of programs. One reason is because ML makes it easy to deal with labelled tree structures, which is what "unify" is working with here. In ML, you can say "if I have a node which is labelled TwoPartThing, let the parts be called x and y, and then do this..." very efficiently, where in many languages that requires a comparison, an assignment to the variables x and y, etc.

Also, there's a great tutorial on ML here: A Gentle Introduction to ML. I highly recommend that tutorial as it's both enlightening and fun.

—posted by Ezra at July 17, 2005 3:33 PM

Excellent blog, good to see someone actually uses em for quality posts. Keep up the good work!

—posted by female free pic at April 24, 2006 7:49 AM

Good blog! I like your posting style, so your wording. It's good that people are so different and everyone has his own story.

—posted by Cheap Calling Cards at May 6, 2006 1:59 PM

Good blog! I like your posting style, so your wording. It's good that people are so different and everyone has his own story.

—posted by calling cards sale at May 13, 2006 12:50 PM

hello! http://www.dirare.com/Sweden/ online directory. SMART Yellow Pages, About DIRare, Search in Business Category. From online directory .

—posted by online directory main at May 16, 2006 1:37 PM

Howdy, I sow Your link at one page and I want to say only - i agree with You and dont forget keep up the good working!

—posted by latina latina at May 18, 2006 9:05 PM

Others