Functional Programming ====================== Recursion --------- Defining solution of a problem in terms of the same problem, typically of smaller size, is called recursion. Recursion makes it possible to express solution of a problem very concisely and elegantly. A function is called recursive if it makes call to itself. Typically, a recursive function will have a terminating condition and one or more recursive calls to itself. Example: Computing Exponent ^^^^^^^^^^^^^^^^^^^^^^^^^^^ Mathematically we can define exponent of a number in terms of its smaller power. .. code-block:: python def exp(x, n): """ Computes the result of x raised to the power of n. >>> exp(2, 3) 8 >>> exp(3, 2) 9 """ if n == 0: return 1 else: return x * exp(x, n-1) Lets look at the execution pattern. :: exp(2, 4) +-- 2 * exp(2, 3) | +-- 2 * exp(2, 2) | | +-- 2 * exp(2, 1) | | | +-- 2 * exp(2, 0) | | | | +-- 1 | | | +-- 2 * 1 | | | +-- 2 | | +-- 2 * 2 | | +-- 4 | +-- 2 * 4 | +-- 8 +-- 2 * 8 +-- 16 Number of calls to the above ``exp`` function is proportional to size of the problem, which is ``n`` here. We can compute exponent in fewer steps if we use successive squaring. .. code-block:: python def fast_exp(x, n): if n == 0: return 1 elif n % 2 == 0: return fast_exp(x*x, n/2)) else: return x * fast_exp(x, n-1) Lets look at the execution pattern now. :: fast_exp(2, 10) +-- fast_exp(4, 5) # 2 * 2 | +-- 4 * fast_exp(4, 4) | | +-- fast_exp(16, 2) # 4 * 4 | | | +-- fast_exp(256, 1) # 16 * 16 | | | | +-- 256 * fast_exp(256, 0) | | | | +-- 1 | | | | +-- 256 * 1 | | | | +-- 256 | | | +-- 256 | | +-- 256 | +-- 4 * 256 | +-- 1024 +-- 1024 1024 .. problem:: Implement a function ``product`` to multiply 2 numbers recursively using ``+`` and ``-`` operators only. Example: Flatten a list ^^^^^^^^^^^^^^^^^^^^^^^ Supposed you have a nested list and want to flatten it. .. code-block:: python def flatten_list(a, result=None): """Flattens a nested list. >>> flatten_list([ [1, 2, [3, 4] ], [5, 6], 7]) [1, 2, 3, 4, 5, 6, 7] """ if result is None: result = [] for x in a: if isinstance(x, list): flatten_list(x, result) else: result.append(x) return result .. problem:: Write a function ``flatten_dict`` to flatten a nested dictionary by joining the keys with ``.`` character. >>> flatten_dict({'a': 1, 'b': {'x': 2, 'y': 3}, 'c': 4}) {'a': 1, 'b.x': 2, 'b.y': 3, 'c': 4} .. problem:: Write a function ``unflatten_dict`` to do reverse of ``flatten_dict``. >>> unflatten_dict({'a': 1, 'b.x': 2, 'b.y': 3, 'c': 4}) {'a': 1, 'b': {'x': 2, 'y': 3}, 'c': 4} .. problem:: Write a function ``treemap`` to map a function over nested list. >>> treemap(lambda x: x*x, [1, 2, [3, 4, [5]]]) [1, 4, [9, 16, [25]]] .. problem:: Write a function ``tree_reverse`` to reverse elements of a nested-list recursively. >>> tree_reverse([[1, 2], [3, [4, 5]], 6]) [6, [[5, 4], 3], [2, 1]] Example: JSON Encode ^^^^^^^^^^^^^^^^^^^^ Lets look at more commonly used example of serializing a python datastructure into `JSON (JavaScript Object Notation) `_. Here is an example of JSON record. :: { "name": "Advanced Python Training", "date": "October 13, 2012", "completed": false, "instructor": { "name": "Anand Chitipothu", "website": "http://anandology.com/" }, "participants": [ { "name": "Participant 1", "email": "email1@example.com" }, { "name": "Participant 2", "email": "email2@example.com" } ] } It looks very much like Python dictionaries and lists. There are some differences though. Strings are always enclosed in double quotes, booleans are represented as ``true`` and ``false``. The standard library module `json` provides functionality to work in JSON. Lets try to implement it now as it is very good example of use of recursion. For simplicity, lets assume that strings will not have any special characters and can have space, tab and newline characters. .. code-block:: python def json_encode(data): if isinstance(data, bool): if data: return "true" else: return "false" elif isinstance(data, (int, float)): return str(data) elif isinstance(data, str): return '"' + escape_string(data) + '"' elif isinstance(data, list): return "[" + ", ".join(json_encode(d) for d in data) + "]" else: raise TypeError("%s is not JSON serializable" % repr(data)) def escape_string(s): """Escapes double-quote, tab and new line characters in a string.""" s = s.replace('"', '\\"') s = s.replace("\t", "\\t") s = s.replace("\n", "\\n") return s This handles booleans, integers, strings, floats and lists, but doesn't handle dictionaries yet. That is left an exercise to the readers. If you notice the block of code that is handling lists, we are calling `json_encode` recursively for each element of the list, that is required because each element can be of any type, even a list or a dictionary. .. problem:: Complete the above implementation of ``json_encode`` by handling the case of dictionaries. .. problem:: Implement a program ``dirtree.py`` that takes a directory as argument and prints all the files in that directory recursively as a tree. Hint: Use ``os.listdir`` and ``os.path.isdir`` funtions. :: $ python dirtree.py foo/ foo/ |-- a.txt |-- b.txt |-- bar/ | |-- p.txt | `-- q.txt `-- c.txt .. problem:: Write a function ``count_change`` to count the number of ways to change any given amount. Available coins are also passed as argument to the function. >>> count_change(10, [1, 5]) 3 >>> count_change(10, [1, 2]) 6 >>> count_change(100, [1, 5, 10, 25, 50]) 292 .. problem:: Write a function ``permute`` to compute all possible permutations of elements of a given list. >>> permute([1, 2, 3]) [[1, 2, 3], [1, 3, 2], [2, 1, 3], [2, 3, 1], [3, 1, 2], [3, 2, 1]] Higher Order Functions & Decorators ----------------------------------- In Python, functions are first-class objects. They can be passed as arguments to other functions and a new functions can be returned from a function call. Example: Tracing Function Calls ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ For example, consider the following ``fib`` function. .. code-block:: python def fib(n): if n is 0 or n is 1: return 1 else: return fib(n-1) + fib(n-2) Suppose we want to trace all the calls to the ``fib`` function. We can write a higher order function to return a new function, which prints whenever ``fib`` function is called. .. code-block:: python def trace(f): def g(x): print(f.__name__, x) value = f(x) print('return', repr(value)) return value return g fib = trace(fib) print(fib(3)) This produces the following output. .. code-block:: text fib 3 fib 2 fib 1 return 1 fib 0 return 1 return 2 fib 1 return 1 return 3 3 Noticed that the trick here is at ``fib = trace(fib)``. We have replaced the function ``fib`` with a new function, so whenever that function is called recursively, it is the our new function, which prints the trace before calling the orginal function. To make the output more readable, let us indent the function calls. .. code-block:: python def trace(f): f.indent = 0 def g(x): print('| ' * f.indent + '|--', f.__name__, x) f.indent += 1 value = f(x) print('| ' * f.indent + '|--', 'return', repr(value)) f.indent -= 1 return value return g fib = trace(fib) print(fib(4)) This produces the following output. :: $ python fib.py |-- fib 4 | |-- fib 3 | | |-- fib 2 | | | |-- fib 1 | | | | |-- return 1 | | | |-- fib 0 | | | | |-- return 1 | | | |-- return 2 | | |-- fib 1 | | | |-- return 1 | | |-- return 3 | |-- fib 2 | | |-- fib 1 | | | |-- return 1 | | |-- fib 0 | | | |-- return 1 | | |-- return 2 | |-- return 5 5 This pattern is so useful that python has special syntax for specifying this concisely. .. code-block:: python @trace def fib(n): ... It is equivalant of adding ``fib = trace(fib)`` after the function definition. Example: Memoize ^^^^^^^^^^^^^^^^ In the above example, it is clear that number of function calls are growing exponentially with the size of input and there is lot of redundant computation that is done. Suppose we want to get rid of the redundant computation by caching the result of ``fib`` when it is called for the first time and reuse it when it is needed next time. Doing this is very popular in functional programming world and it is called ``memoize``. .. code-block:: python def memoize(f): cache = {} def g(x): if x not in cache: cache[x] = f(x) return cache[x] return g fib = trace(fib) fib = memoize(fib) print(fib(4)) If you notice, after ``memoize``, growth of ``fib`` has become linear. .. code-block:: text |-- fib 4 | |-- fib 3 | | |-- fib 2 | | | |-- fib 1 | | | | |-- return 1 | | | |-- fib 0 | | | | |-- return 1 | | | |-- return 2 | | |-- return 3 | |-- return 5 5 .. problem :: Write a function ``profile``, which takes a function as argument and returns a new function, which behaves exactly similar to the given function, except that it prints the time consumed in executing it. .. code-block:: python >>> fib = profile(fib) >>> fib(20) time taken: 0.1 sec 10946 .. problem :: Write a function ``vectorize`` which takes a function ``f`` and return a new function, which takes a list as argument and calls ``f`` for every element and returns the result as a list. .. code-block:: python >>> def square(x): return x * x ... >>> f = vectorize(square) >>> f([1, 2, 3]) [1, 4, 9] >>> g = vectorize(len) >>> g(["hello", "world"]) [5, 5] >>> g([[1, 2], [2, 3, 4]]) [2, 3] Example: unixcommand decorator ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Many unix commands have a typical pattern. They accept multiple filenames as arguments, does some processing and prints the lines back. Some examples of such commands are ``cat`` and ``grep``. :: def unixcommand(f): def g(filenames): printlines(out for line in readlines(filenames) for out in f(line)) return g Lets see how to use it. :: @unixcommand def cat(line): yield line @unixcommand def lowercase(line): yield line.lower() .. problem: Write a program wrap.py that'll wrap lines longer than 40 characters. :: $ python wrap.py file1 file2 file3 exec & eval ----------- Python privides the whole interpreter as a built-in function. You can pass a string and ask it is execute that piece of code at run time. For example:: >>> exec("x = 1") >>> x 1 By default ``exec`` works in the current environment, so it updated the globals in the above example. It is also possible to specify an environment to ``exec``. :: >>> env = {'a' : 42} >>> exec('x = a+1', env) >>> print(env['x']) 43 It is also possible to create functions or classes dynamically using ``exec``, though it is usually not a good idea. :: >>> code = 'def add_%d(x): return x + %d' >>> for i in range(1, 5): ... exec(code % (i, i)) ... >>> add_1(3) 4 >>> add_3(3) 6 ``eval`` is like ``exec`` but it takes an expression and returns its value. :: >>> eval("2+3") 5 >>> a = 2 >>> eval("a * a") 4 >>> env = {'x' : 42} >>> eval('x+1', env) 43