-->

Sunday, June 14, 2015

Anonymous Wrapper Functions Revisited


Some definitions:


In [1]:
# I made the fixed-point combinator using recursion for simplicity
function fix(fn)
    Z = fn( (input...) -> Z(input...) ) # named after the strict Z combinator (which I'm using)
end




Out[1]:
fix (generic function with 1 method)



In [2]:
fac = (fac) -> (n) -> if (n < 2) 1 else n*fac(n-1) end

fib = (fib) -> (n) -> if (n < 2) n else fib(n-2)+fib(n-1) end




Out[2]:
(anonymous function)



In [3]:
fix(fac)(6)




Out[3]:
720



In [4]:
fix(fib)(5)




Out[4]:
5



More wrappers:


Now to do something interesting. I want to make a trace wrapper that will print out the function name with its arguments whenever its called. First I make a type. The name holds the function name as a string. The funcall element holds the anonymous definition. Eventually I'd like to do this with mutually recursive functions.

In [5]:
type Function
    name
    funcall
end


In [6]:
function trace(f)
    (Yf) -> (input) -> begin
        output = f.funcall(Yf)(input)
        println(f.name, "(", input, ") = ", output)
        output
    end
end




Out[6]:
trace (generic function with 1 method)



In [7]:
fix( trace( Function("factorial", fac) ) )(8)




factorial(1) = 1
factorial(2) = 2
factorial(3) = 6
factorial(4) = 24
factorial(5) = 120
factorial(6) = 720
factorial(7) = 5040
factorial(8) = 40320


Out[7]:
40320



Beautiful!

In [8]:
fix( trace( Function("fibonacci", fib) ) )(6)




fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(1) = 1
fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(3) = 2
fibonacci(4) = 3
fibonacci(1) = 1
fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(3) = 2
fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(1) = 1
fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(3) = 2
fibonacci(4) = 3
fibonacci(5) = 5
fibonacci(6) = 8


Out[8]:
8



Eh...less beautiful but still cool. Obviously memoization is needed to keep fibonacci from being exponential.

In [9]:
function memoize(f)
    inputs = []
    outputs = []

    (Yf) -> (input) -> begin
        # test to see if input already given
        match = find( (elem)->(elem == input), inputs )

        if length(match) == 0
            # result hasn't been found
            output = f(Yf)(input)
            inputs = [inputs, input]
            outputs = [outputs, output]
        else
            # result has already been memoized
            output = outputs[match[1]]
        end

        output
    end
end




Out[9]:
memoize (generic function with 1 method)



I like this version of memoization much better than my previous version. It's nowhere near as cluttered.

In [10]:
fix( memoize( trace( Function("fibonacci", fib) ) ) )(6)




fibonacci(0) = 0
fibonacci(1) = 1
fibonacci(2) = 1
fibonacci(3) = 2
fibonacci(4) = 3
fibonacci(5) = 5
fibonacci(6) = 8


Out[10]:
8



Yeeeeessssssssss!!!! Okay the fixed-point combinator has the ability to work with multiple inputs. How about a tail recursive fibonacci function?

Multiple inputs:


In [11]:
fib_tail = (fib) -> (n, a, b) -> if (n < 2) b else fib(n-1, b, a+b) end




Out[11]:
(anonymous function)



In [12]:
fix(fib_tail)(100, 0, 1)




Out[12]:
3736710778780434371



In [13]:
@time fix(fib_tail)(100, 0, 1)




elapsed time: 0.002422741 seconds (19344 bytes allocated)


Out[13]:
3736710778780434371



In [14]:
# better trace wrapper for functions with multiple inputs
function trace_(f)
    (Yf) -> (input...) -> begin
        println(f.name, input)
        f.funcall(Yf)(input...)
    end
end




Out[14]:
trace_ (generic function with 1 method)



In [15]:
fix( trace_( Function("fib", fib_tail) ) )(6, 0, 1)




fib(6,0,1)
fib(5,1,1)
fib(4,1,2)
fib(3,2,3)
fib(2,3,5)
fib(1,5,8)


Out[15]:
8



Tuesday, June 9, 2015

Playing with streams in Julia


Delays and yields:


We need two definitions to make streams work. First, delay is a macro that replaces a function with an anonymous function with it's body equal to the function. Later when we need to evaluate the function, we only have to call the anonymous function. This is what yield does. Yield isn't as necessary, but it makes it clear what's happening.

In [1]:
macro delay(f)
    return quote
        # put the function inside a lambda
        function ()
            $f
        end
    end
end


In [2]:
function yield(delayed_function)
    delayed_function() # call it
end




Out[2]:
yield (generic function with 1 method)



In [3]:
something = @delay(println("text"))




Out[3]:
(anonymous function)



In [4]:
yield(something)




text



Making a stream type:


Now with that done we create a stream type. The first element is the actual value held there. The second element is a delayed stream constructor. We then create head and tail to access elements of the stream. Nth takes successive head and tails of a stream.

In [5]:
type IntStream
    literal
    next

    function IntStream(num)
        self = new()
        self.literal = num
        self.next = @delay(IntStream(num+1))
        return self
    end
end


In [6]:
function head(item::IntStream)
    return item.literal
end




Out[6]:
head (generic function with 1 method)



In [7]:
function tail(item::IntStream)
    return yield(item.next)
end




Out[7]:
tail (generic function with 1 method)



In [8]:
integers = IntStream(1)




Out[8]:
IntStream(1,(anonymous function))



In [9]:
head(integers)




Out[9]:
1



In [10]:
head(tail(tail(integers)))




Out[10]:
3



In [11]:
function nth(stream, n)
    if n == 1
        return head(stream)
    else
        return nth(tail(stream), n-1)
    end
end




Out[11]:
nth (generic function with 1 method)



In [12]:
nth(integers, 50)




Out[12]:
50



More general definitions:


For more general streams it is simple to add a next method for each object.

In [13]:
type Stream
    obj
    next
    
    function Stream(obj, next) # next is the method for generating the next object
        self = new()
        self.obj = obj
        self.next = @delay(Stream(next(obj), next))
        return self
    end
end


In [14]:
function head(item::Stream)
    return item.obj
end




Out[14]:
head (generic function with 2 methods)



In [15]:
function tail(item::Stream)
    return yield(item.next)
end




Out[15]:
tail (generic function with 2 methods)



In [16]:
function nth(stream, n)
    if n == 1
        return head(stream)
    else
        return nth(tail(stream), n-1)
    end
end




Out[16]:
nth (generic function with 1 method)



Fibonacci list stream:


Now to try it on the Fibonacci numbers. Instead of making each element the nth Fibonacci number we'll make it the list of the first n+1 Fibonacci numbers.

In [17]:
function next_fib_list(list)
    return [ list, (list[end] + list[end-1]) ]
end




Out[17]:
next_fib_list (generic function with 1 method)



In [18]:
fib_list = Stream([0, 1], next_fib_list)




Out[18]:
Stream([0,1],(anonymous function))



In [19]:
nth(fib_list, 5)




Out[19]:
6-element Array{Int64,1}:
 0
 1
 1
 2
 3
 5



In [20]:
nth(fib_list, 10)




Out[20]:
11-element Array{Int64,1}:
  0
  1
  1
  2
  3
  5
  8
 13
 21
 34
 55



In [21]:
for idx = 1:15
    println(nth(fib_list, idx))
end




[0,1]
[0,1,1]
[0,1,1,2]
[0,1,1,2,3]
[0,1,1,2,3,5]
[0,1,1,2,3,5,8]
[0,1,1,2,3,5,8,13]
[0,1,1,2,3,5,8,13,21]
[0,1,1,2,3,5,8,13,21,34]
[0,1,1,2,3,5,8,13,21,34,55]
[0,1,1,2,3,5,8,13,21,34,55,89]
[0,1,1,2,3,5,8,13,21,34,55,89,144]
[0,1,1,2,3,5,8,13,21,34,55,89,144,233]
[0,1,1,2,3,5,8,13,21,34,55,89,144,233,377]
[0,1,1,2,3,5,8,13,21,34,55,89,144,233,377,610]



It is also possible to memoize the function calls so that expensive operations aren't run more than once.

Monday, January 19, 2015

Encapsulation Test

Lisp orginally had no object oriented features, but it was able to add them in later without changing the language itself through a clever use of macros and closures.

Julia however already has some elements of OOP built in like subtype polymorphism, multiple dispatch, container types with constructors, and methods. One thing Julia currently doesn't have is encapsulation. However, Julia's closures allow us to simulate encapsulation just like they do in Lisp.

First, we create a type with all the members we want private and another one for the ones we want public. Next, we make a class constructor with an object defined inside it with private member type. Then, we write the functions we want to be public - letting them close over the internal private variables/functions. Finally, we assign these functions to the external object.

Class Definition

In [1]:
## Private members
type Rectangle_Private
    width
    height
    area
end
In [2]:
## Public members
type Rectangle
    set_values
    get_area
end
In [3]:
## Class instance
function makeClass(object::Rectangle)
    self = Rectangle_Private(1,1,1) # constructor with default values
    
    # functions below close over internal environment
    function set_values(w, h)
        (self.width, self.height) = (w, h)
        self.area = w*h
    end
    
    function get_area()
        self.area
    end
    
    (object.set_values, object.get_area) = (set_values, get_area)
end
Out[3]:
makeClass (generic function with 1 method)

Instantiation

In [4]:
myRect1 = Rectangle(nothing, nothing) # constructor with empty args
makeClass(myRect1) # actual instantiation
Out[4]:
(set_values,get_area)
In [5]:
myRect2 = Rectangle(nothing, nothing)
makeClass(myRect2)
Out[5]:
(set_values,get_area)

Usage

In [6]:
myRect1.set_values(1, 2)
myRect2.set_values(3, 4)
Out[6]:
12
Note: Separate objects have separate internal members as we expected
In [7]:
myRect1.get_area()
Out[7]:
2
In [8]:
myRect2.get_area()
Out[8]:
12
In [9]:
myRect1 # type info
Out[9]:
Rectangle(set_values,get_area)
Technically speaking the set_values and get_area fields in type Rectangle are different names from the internal ones defined in makeClass. I named them identically for convenience. It would be distracting to see it named something else.
In [10]:
myRect1.get_area
Out[10]:
get_area (generic function with 1 method)

Thursday, January 1, 2015

Anonymous Recursive Wrappers

So I recently came across fixed point combinators in lambda calculus. Fixed point combinators allow you to implement recursion while still using anonymous functions. It's a really slick technique for implementing recursion. I decided to try it out using Julia, a dynamic programming language I've been using for a while.

To start take an anonymous function that takes another function as input as well as another variable.

fac_ = f -> n -> if (n < 2) 1 else n*f(n-1) end

In Julia's syntax f is a function to be input to the base fac_. Similarly n is to be input next. The base function is called as:

fac_( some_other_func() )(n)

where some_other_func can be literally any other function and n is an integer. To keep things simple input the null function into fac_. The null function is the function taking no inputs and providing no outputs.

() -> ()

We can now call the base function for the values of 0 and 1.

fac_( () -> () )(0) =>

1

fac_( () -> () )(1) =>

1

 If we try to input 2 or any other number we get ERROR: wrong number of arguments. This is because the if condition in the base function fails, so it tries to input n-1 into the null function - which doesn't take any arguments. The simplest fix is to input the base function into itself. Once the first if statement fails it will evaluate the second base function fac_. This allows to calculate 2 factorial. If we string together base functions we can calculate factorials of any size input.

fac_(fac_(fac_(fac_(fac_( () -> () )))))(n)

This is terribly inefficient, but it works. Each base is a crappy function that only computes so many factorials. Taking the fac_ of this chain gives us a slightly less crappy version. Obviously a perfect factorial function could self apply itself an infinite number of times if necessary. Therefore we know the factorial function is:

fac = fac_(fac_(...inifinitely many base functions...))

We also know that if we apply the base function to the regular factorial we'll just get the regular factorial right back. This means the factorial function is the solution to the fixed point problem fac_(fac) = fac. Amazingly we can construct the fixed point combinator that will immediately solve for fac.

fix(fac_) = fac_( fix(fac_) )

The fixed point combinator (also known as the Y combinator) takes the base function and a variable as input. It then self applies the base function to itself allowing you to construct the infinite tower of base functions.

U = f -> f(f) # self applicator

Y = f -> U( x -> f( U(x) ) )

While theoretically possible to implement, the standard fixed point combinator thought up by Haskell Curry is hard to implement in practice due to eager evaluation by Julia. It can be fixed by inserting an extra number to be input that prevents the Y combinator from being evaluated instantly. The simplest implementation is:

fix = f -> g = f( (x...) -> g(x...) )

fix(fac_)(5) =>

120

Now we can define fac = fix(fac_) which works for all inputs.

If that wasn't cool enough we can also define wrapper functions as described here. These wrapper functions can modify the function at runtime and at every single computational step. You can print the input values, print the output values, check the types on variables, do bounds checks, or even memoize results.

For printing results, catch all the  parameters, print what you want, and then return the result of evaluating the expression last.

printInput = base -> func -> (args...) ->
        (println(args...);
        base(func)(args...))

printOutput = base -> func -> (args...) ->
        (result = base(func)(args...);
        println(result);
        result)

fix(printOutput(fac_))(5) =>

1
2
6
24
120
120

Errors doing anonymous recursion are nearly impossible to decipher. I spent countless hours trying to track down my mistakes when first writing these functions. To do custom error checking, catch the inputs and provide the error you want.

nonnegative = base -> func -> n -> if (n < 0)
        error("input must be nonnegative")
        else base(func)(n) end

integersOnly = base -> func -> n -> if (typeof(n) != Int)
        error("input must be an integer")
        else base(func)(n) end

fix(integersOnly(nonnegative(fac_)))("hello") =>

ERROR: input must be an integer

fix(integersOnly(nonnegative(fac_)))(-1)
ERROR: input must be nonnegative

It's also possible to provide memoization for these recursive functions.  The main thing to do is create a cache that stores inputs and outputs. I chose to use an array with each input first and the result second. I first created a helper function to search the cache for previous results.

function look_up(a, n)
        if (length(a) == 0) return nothing end

        for i = 1:(length(a)/2)
                if (a[2*i-1] == n) return a[2*i] end
                # 2*i-1 gets odd indexed elements
                # 2*i gets even indexed elements
        end

        return nothing # if value not found
end

Now to carry out the memoized version I create an empty cache, lookup entries each iteration, return previous results if they're been computed, and calculate new entries.

memoize = base ->
        let cache = []
                func -> arg ->
                let val = look_up(cache,arg)
                        if (val == nothing)
                        let result = base(func)(arg)
                                if (length(cache) == 0) cache = [arg result] # first time through
                                else cache = [cache [arg result]] end
                                result # return to function above
                        end
                        else val end
                end
        end

fib_ = f -> n -> if (n < 2) n else f(n-1)+f(n-2) end

@time fix(memoize(fib_))(164) =>

elapsed time: 0.001400088 seconds (298952 bytes allocated)
7278316983633677837


(edit: I just realized that the standard int implementation here can only compute fibonacci numbers up to 92 before overflow kicks in => this answer is wrong, but the time is all I care about, and I'm too lazy to rerun it haha)

Test Post

Did this work?

test URL: www.google.com