Students should use DrScheme, following the design recipe, on the exercises at their own pace, while labbies wander among the students, answering questions, bringing the more important ones to the lab's attention.
Generative Recursion Design Recipe Review
In class, we presented a new more general methodology for developing Scheme functions called generative recursion.
To develop a generative recursive, we use a generalized design recipe that incorporates the following changes:
- When making function examples, we need even more examples, as they help us to find the algorithm we will use.
- When making a template, we don't follow the structure of the data. Instead, we ask ourselves six questions:
- What is (are) the trivial case(s)?
- How do we solve the trivial case(s)?
- For the non-trivial case(s), how many subproblems do we use?
- How do we generate these subproblems?
- Can we combine the results of these subproblems to solve the given problem?
- How do we combine the results of these subproblems?
- We use the answers to the preceding questions to fill in the following very general template:
- We add a step to reason about why our algorithm terminates, since this is no longer provided by simply following the structure of the data.
Converting a token stream to a list input stream
The Scheme reader reads list inputs rather than symbols or characters where a list input is defined by the following set of mutually recursive data definitions:
An input is either:
- any built-in Scheme value that is not a list (
- a general list.
A general list is either:
(cons i agl)where
iis an input and
aglis a general list.
Note that the second clause above could be written as the following two clauses:
(cons p l)where
pis a primitive Scheme value that is not a list, or
(cons el l)where
lare general lists.
Your task is to write the core function
listify used in an idealized Scheme reader stream.
listify converts a list of tokens, where a token is defined below, to a list of inputs as defined above.
A token is either
- any built-in Scheme value (including symbol, boolean, number, string,
empty, etc.; or
- (make-open), or
given the structure definitions
The open and close parenthesis objects (not the symbols
'|)|) clearly play a critical role in token streams because they delimit general lists.
(listify empty) = empty.
(listify (list (make-open) (make-close)) = (list empty).
(listify (list (make-open) 'a (make-close)) = (list '(a))
(listify (list (make-open) 'a 'b (make-close)) = (list '(a b))
(listify (list (make-open) (make-open) 'a (make-close) 'b (make-close)) = (list '((a) b))
You will probably want to define the following predicate
in your program. As a rough guideline follow the parse.ss program that we wrote in class on Monday, Feb 2. Note that finding the first input is more involved than finding the first line. You will probably want to explicitly check for errors in the input because they correspond to natural tests on the form of the input.
Insertion Sort vs. Quicksort
In class, we've described two different ways of sorting numbers, insertion sort and quicksort.
So we have two fundamentally different approaches to sorting a list (and there are lots of others, too). It seems unsurprising that each might behave differently. Can we observe this difference? Can we provide explanations? We'll do some timing experiments, outline the theoretical analysis, and see if the two are consistent.
If your programs only sort small lists a few times, it doesn't matter; any sort that is easy to write and woks correctly is fine. However, for longer lists, the difference really is huge. In the real world, sorting is an operation often done on very long lists (repeatedly).
In DrScheme, there is an expression
(time expr) that can be used to time how long it takes the computer to evaluate something. For example,
determines how long it takes to sort
big-list using the function
isort and prints the timing results along with the evaluation result. Since we're only interested in the time, we can avoid seeing the long sorted list by writing
time operation prints three timing figures, each in milliseconds. The first is how much time elapsed while the computer was working on this computation, which is exactly what we're interested in.
Partner with someone else in lab to split the following work.
- We need some data to use for our timing experiments. Write a function
up: nat -> (list-of nat)which constructs a list of naturals starting with
0in ascending order or retrieve it from a previous lab. For example,
(list 0 1 2 3 4 5 6). You can probably write it faster using
build-listthat you can retrieve it. Similarly, write a function
down: nat -> (list-of nat)that constructs a list of naturals ending in
0in descending order. For example,
(list 6 5 4 3 2 1 0).
- Now write a function
32767. To create a single random number in the range from
(random n+1). For larger numbers in the range
[0, 32768), try: Note:
randomis a function in the Scheme library, but it is not a mathematical function, since the output is not determined by the input.
that constructs a list of random numbers in the range from
- Make sure you know how to use
timeby using it to time
isorton a list of 200 elements. Run the exact same expression again a couple times. Note that the time varies some. We typically deal with such variation by taking the average of several runs. The variation is caused by a number of low-level hardware phenomena that are beyond the scope of this course (see ELEC 220 and COMP 221).
- Use the
timeoperation to fill in the following chart.When you are done, compare your results with those generated by other pairs in the lab.
COMP 280 introduces concepts of how these algorithms behave in general.
The punchline is that we can say that both insertion sort and quicksort on lists are O(n*2) in the worst case. i.e., for a list of n elements, they each take about c n**2 evaluation steps to sort the list, for some constant c. Furthermore, in the "average" case, Quicksort does better: O(n log n). A well-coded formulation of quicksort using arrays almost never exhibits worst-case behavior. COMP 280, and later COMP 482, show precisely what big O notation means these statements mean and how to derive them.
Ackermann's Function Example
If you have time, here is the definition of a strange numerical function on natural numbers,
called Ackermann's function:
Note that this definition is not structurally recursive. In fact, it cannot be defined in a structurally recursive manner. In technical jargon, the function is not primitive recursive. See COMP 481 for what that means.
Here's the equivalent Scheme code (assuming
n are natural numbers:
Ackermann's function exercises
ackon some small inputs. Warning: try very, very small inputs for
m, like 0, 1, 2, or 3, because
(ack 4 1)takes a very, very long time to compute. You can compute
(ack 4 1)using DrScheme if you infer a simple non-recursive formula for
(ack 3 n)and add a clause containing this short-cut for computing
(ack 3 n)to the Scheme code for
ack. Do not try to compute
(ack 4 2). The answer is more than 19,000 digits long.
- Prove why
ackalways terminates for natural numbers. Hint: you need to use "double-induction".
Ackermann's function is not useful in constructing real software applications, but it is an important mathematical defintion because it is provably not primitive recursive and it plays an important role in the complexity analysis of some algorithms (notably union-find). In COMP 482 you will learn about union-find and its complexity analysis.