>> Okay, we're going to do part 2
of our talk on algorithms today.
And where we left off last time
was issues of proof, in particular
with respect to the binary search code.
So issues of proof in an algorithm are:
are we certain that the algorithm
terminates, or does it run forever?
Are we certain the algorithm delivers
the outcomes that we are asserting?
And correctness usually needs some
sort of support methodology for proving
that algorithms do what
they claim to do.
Quite often in the early going
that support methodology is some
form of mathematical induction.
Another thing is we want to be able
to analyze performance of algorithms.
How fast are they?
How much memory do they require?
And we need some basis
for comparing speeds
and space utilization
of different algorithms.
Two algorithms may
accomplish the same task
but with different degrees
of efficiency.
And we need a way to measure
that and talk about it.
And this is going to be a common
theme from now through the rest
of your time studying computer science.
So it's not like you're
expected to learn it all today.
So for correctness and loop termination,
what you typically want to do is look
at the state of the loop when you enter
the loop and the state exiting the loop.
Loop invariants are statements that
are true in each iteration of the loop
and they're very analogous to induction
steps in mathematical induction.
I'm just going to give
some examples here.
Sequential search.
We said before that sequential search
is so intuitively clear
that it obviously works.
It's a little bit silly to talk
about a form of proof that it works,
but we'll give it a try here.
So, before entering the
loop, t has not been found.
That's a statement which
we'll assert to be true.
Now a loop invariant is that the
current item has not been tested.
That invariant would be true
when you enter the loop body.
And at this juncture, after the
test, the loop invariant would be
that the current item is not t.
So we did not find t. All right,
if we go to the next item
then entering the loop body,
that item has not been tested.
And so that loop invariant is true.
Entering the next iteration of the
loop, if we get to here it means
that we did not execute
a return statement
and therefore t was not
equal to the item.
And so that loop invariant is true
for every iteration of the loop.
So those were examples of loop
invariants that can help build a sort
of framework for proving
correctness of an algorithm.
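To make those invariants concrete, here is a small C++ sketch of sequential search with the invariants written in as comments. The function name, the use of std::vector, and the variable names are my own choices, not from the slides.

```cpp
#include <vector>

// Sequential search: returns true if t occurs somewhere in v.
template <typename T>
bool sequential_search(const std::vector<T>& v, const T& t)
{
    // Before entering the loop: t has not been found.
    for (std::size_t i = 0; i < v.size(); ++i)
    {
        // Invariant on entering the loop body: the current item v[i]
        // has not been tested yet.
        if (v[i] == t)
            return true;   // found t, so we stop immediately
        // Invariant after the test: v[i] is not t, so t is still not found.
    }
    // Every item was tested and none of them equaled t.
    return false;
}
```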
Binary search, as we guessed last week,
is certainly a more complicated problem
to prove correct.
So remember this thing,
this binary search,
this happens to be the lower
bound version of binary search
that I'm talking about here.
And what I've done is repeat
that algorithmic code but insert
in red some loop invariants
to go with it.
Now notice that we have 1,
2, really 3 lines of code
in the body of the algorithm,
I'm sorry, in the body of the while
loop, but we have 7 loop invariants.
When you have more loop
invariants than you do lines
of code you know something
complicated is going on, right.
So what I want to do is just
discuss what these are with you
and then show you in the
narrative, where you can read
in much more detail how these
things lead you to a proof
of termination and a proof of
correctness for the lower bound algorithm.
And remember, the location of
each loop invariant is important.
Invariants 1, 2 and 3 are intended
to be true upon entering
the while loop body.
Four, five, six and seven
are intended to be true
after executing the loop body.
So while low is less than high; well,
invariant 1 says low is less than high,
okay, and that's going to be true
because otherwise we wouldn't
have entered the loop.
Invariant 2: v at the low minus 1 index is less than
t, assuming that index is valid.
So if low is zero, then low minus
1 is not even a valid index,
so that's vacuously true.
On the other hand, if low is a positive
index value, which could happen
because of having
executed the loop a few times,
we need to prove that on
entering the loop body v
of low minus 1 is less than t.
[ pause ]
Finally, invariant 3: if high is a valid index,
we need to show that t is less
than or equal to the vector
value at the high index.
Now you might note here that 6 and 7
are identical to 2 and 3,
but they hold after the change
has been made in the loop body.
So either the value of low has been changed
or the value of high has been changed.
And then you want to claim that
those four things, 4 through 7, are true.
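Here is a C++ sketch of the lower bound binary search with those invariants written in as comments, roughly where the red annotations sit on the slide. The index convention (high starts one past the end) and the wording of invariants 4 and 5, which the recording does not spell out, are my own reading; 1, 2, 3, 6 and 7 follow the slide.

```cpp
#include <vector>

// Lower bound binary search on a sorted vector v:
// returns the smallest index i such that t <= v[i],
// or v.size() if every element is less than t.
template <typename T>
std::size_t lower_bound_search(const std::vector<T>& v, const T& t)
{
    std::size_t low = 0;
    std::size_t high = v.size();
    while (low < high)
    {
        // (1) low < high, because we just passed the loop test.
        // (2) if low > 0, then v[low - 1] < t.
        // (3) if high < v.size(), then t <= v[high].
        std::size_t mid = low + (high - low) / 2;
        if (v[mid] < t)
            low = mid + 1;   // everything at or below mid is < t
        else
            high = mid;      // t <= v[mid], so the answer is at or below mid
        // (4), (5) my reading: low <= high, and the range [low, high) is
        //          strictly smaller than before, which gives termination.
        // (6) if low > 0, then v[low - 1] < t.
        // (7) if high < v.size(), then t <= v[high].
    }
    return low;              // low == high: the lower bound position
}
```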
[ pause ]
So let me show you in the
narrative; oops, this is a repeat
of that slide picture there.
What we have is assertion
1: low is less than high.
Then assertion 2 and assertion 3;
assertion 3 is a tricky one.
It's the one that says t
is less than or equal to v
of high if the index is valid.
[ pause ]
So, assertion 4 depends on
evaluating two cases.
If v of mid is less than t,
then low has been redefined;
otherwise, high has been redefined.
And in each of those two cases
you have to show 4, 5, 6 and 7 are true.
Once you divide it into cases,
they reduce to straightforward
substitution and algebra.
But it's logically a
little bit complicated just
to keep track of all of those cases.
So this is the second case, where v of
mid is bigger than or equal to t.
In other words, v of mid less than t
is one case in which the proof
has to be verified,
and v of mid bigger than or equal to t
is the other case
in which things have to be verified.
In the first case
low has been redefined,
and in the second case high
has been redefined.
Both need to be verified.
So it's not deep.
It's all just pretty straightforward
computation algebraically computing
things and reasoning about inequalities.
But the fact that it divides
into two cases makes it logically
a little more complicated.
The fact that you've got four
different things to prove in each
of those two cases means you've really
got eight things to prove and so forth.
So it's non-trivial to prove it.
So everyone should go to that
sometime, write it down in a notebook,
and make sure you can follow that proof
that I've outlined there
in the narrative.
[ pause ]
Now this is where we start kind
of a different parallel topic.
And I'm going to review this notion
of computational complexity here.
And what we're after is comparing the
growth rate of two different functions.
The domains of these functions
are assumed to include all
or most non-negative integers;
by most, I mean that after a certain
point they include all of them.
So all integers bigger
than 5 for example.
Typically it includes all
integers bigger than zero or 1.
And these integers, in the
algorithm setting, represent the size
of some object that you're operating on.
The values of these functions should
also be non-negative real numbers.
And typically, in our algorithm applications,
these values are going to
represent the elapsed time
of the computation,
if you're estimating speed,
or the memory footprint size,
if you're estimating the amount of
memory required to make the computation.
And of course the speed
is a non-negative number.
The size of memory is
a non-negative number.
So the other thing is we'd like for this
to be independent of initial values.
We'd like for it to be
independent of constant multipliers
and, in general, of
lower order effects.
I realize that's vague.
We'll get some examples of what that
means but the basic idea is you would
like to be able to evaluate
the algorithm
without saying what computer language
is used to implement the algorithm,
without saying what compiler is
used to create executable code,
and without specifying what
hardware you're going to run it on.
Now all of those things introduce
variables, variable amounts of time
and so forth that aren't relevant
to the basic algorithm itself.
And so you would like for
your measure of complexity
to be independent of those things.
And that's really the motivation
behind computational complexity.
All right, our notation is
slightly different from what you see
in your Discrete Mathematics book,
and also slightly different
from what you see in your textbook.
What I have done is use less than or
equal to, greater than or equal to
and equal to where in those other
sources they always use equal to.
And it's only a notational difference.
But I think it helps get the points
across about these three different
notions of asymptotics here.
The first one and the
one you see most often
in the literature especially the
popular computing literature is Big O.
So we have a function g and we
have a function f. We say that g is
less than or equal to Big O of f
if there exist constants c and n naught,
where c represents a constant in
the value set for the functions
and n naught represents a constant in
the domain set of the functions.
And what this says is that g of n, the
actual value of g is less than or equal
to f of n, the actual value of f,
multiplied by this constant as long
as we are sufficiently far out
in the numbers past n naught.
So another way to say that would be g
is asymptotically bounded above by f;
f is above g asymptotically
if this holds.
And this is why I like to use the
less than or equal to symbol here
to remind you that this
doesn't mean equality.
Now Big Omega is just
the opposite of that.
Again we've got g and f and we're
using bigger than or equal to.
And there's a constant c which
is in the range of values.
And an n naught which is
in the domain value set.
And we want c times f to be less than or
equal to g for all sufficiently large n.
So in other words, a constant multiple
of f is underneath g
for all sufficiently large n.
In other words, that says g dominates f.
The first one says g is subordinate to f;
the second one says g is
superordinate to f. Big Theta is really both.
So g is Theta of f means there are two
constants c1 and c2 and an n naught,
such that the first constant times
f is less than or equal to g,
and the second constant times
f is bigger than or equal to g,
for all sufficiently large n. So,
in other words, g is trapped
between two different
constant multiples of f
which kind of makes a band of values.
And g is trapped in between
there forever.
So that's what Big Theta means.
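Written out symbolically, using the less-than-or-equal, greater-than-or-equal and equals notation from the lecture (the rendering is mine), the three definitions are:

```latex
% Big O: g is asymptotically bounded above by f
g \le O(f) \iff \exists\, c > 0,\ n_0 \ \text{such that}\ g(n) \le c\, f(n) \ \text{for all}\ n \ge n_0

% Big Omega: g asymptotically dominates f
g \ge \Omega(f) \iff \exists\, c > 0,\ n_0 \ \text{such that}\ c\, f(n) \le g(n) \ \text{for all}\ n \ge n_0

% Big Theta: g is trapped between two constant multiples of f
g = \Theta(f) \iff \exists\, c_1, c_2 > 0,\ n_0 \ \text{such that}\ c_1 f(n) \le g(n) \le c_2 f(n) \ \text{for all}\ n \ge n_0
```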
There's good stuff in
the narrative about this.
I guess maybe I'll click
on the narrative
and just show you a couple of things.
[ pause ]
I'm maybe not dexterous
enough to do that.
So I'll, well I can just see them
through this little [inaudible].
Yes, so the theorem says that if f is
Big O of g and g is Big O of h,
then f is Big O of h.
So Big O is transitive as a relation.
If f is Big Omega of g and g is
Big Omega of h, then f is Big Omega
of h. It's transitive as a relation.
And finally the same is true for
Theta since those are all transitive.
There's an antisymmetry
that reminds you
of the inequalities for
just normal numbers.
f is Big O of g if and only if g is Big Omega
of f. That's kind of like saying if
a is less than or equal to b, then b is bigger
than or equal to a. And
there's a symmetry.
And that's for equality, which
is Theta: f is Theta of g if and only
if g is Theta of f. And
finally there's reflexivity,
which says that everything
is related to itself.
So f is Big O of f, f is Big Omega of f
and f is Big Theta of
f. No surprise there.
So there's other little factoids in
there that you should read about.
Dichotomy.
Dichotomy says, okay, for numbers that
would say if a is less than or equal to b
and b is less than or equal to
a, then a equals b, right.
Works the same for these asymptotics.
If f is less or equal to Big O of g
and g is less than or equal to Big O
of f then f is Big Theta of g.
[ pause ]
Okay so that's the asymptotics
of just functions, right,
kind of a theory that
you've seen before.
How does that relate
again to algorithms?
Well, what you do is, okay,
you've got some algorithm;
let's say sequential search.
And you want n to be some
measure of the size of the input.
So for sequential search that would be,
let's say the size of the search space.
Going back to the bookshelf,
how many books are on the shelf?
That's n. Find some atomic
notion of computational activity.
That sounds like physics almost.
But what you want is something to
count that represents the amount
of work being done by the algorithm.
And ideally you want that
to be simple to identify,
which will make your
subsequent analysis easier.
Now once you've done that then f of n
is the number of those atomic things
that get done during the run of
the algorithm on input of size n.
So you now have a function whose
asymptotics you can try to figure out.
And the complexity of the algorithm
is, by definition,
the complexity of that function.
So let's check out an example here.
So this is a straight
loop; it goes from zero to n,
and three things happen inside the loop body.
That loop body gets
executed n times, right,
so 3n atomic computations occur.
And so you can conclude that
the complexity is Theta of 3n.
But notice that Theta of 3n
is the same as Theta of n
because 3 is just constant, right.
So by just using a different constant,
3n and n have the same asymptotics
which tells you you didn't really need
to count all three of the atomics here.
You could have gotten by with just
one and you get the same result,
namely Theta of n. That's why it's
good to be careful and judicious
in choosing the atomics
that you want to count.
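Here is a sketch of the kind of loop being described; the three statements in the body are just placeholders standing in for whatever three atomic computations you choose to count.

```cpp
#include <vector>

// One straight loop, three atomic computations per iteration, n iterations:
// 3n atomics in total, and Theta(3n) is the same as Theta(n).
int simple_loop(const std::vector<int>& a)
{
    int sum = 0, count = 0, last = 0;
    const std::size_t n = a.size();
    for (std::size_t i = 0; i < n; ++i)
    {
        sum += a[i];   // atomic 1
        count += 1;    // atomic 2
        last = a[i];   // atomic 3
    }
    return sum + count + last;
}
```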
Now this is a slightly more complicated
situation because we have a loop
that runs from zero to
n with some atomics.
But the problem is there's a
conditional break in this loop.
So if some condition holds true then
we don't continue executing any more.
So this loop might run n times or
it might run less than n times.
So because of that we cannot conclude
that the complexity of this is Theta
of n. But we can conclude that
it is less than or equal to Big O
of n. The fact is, there may be
an early bailout, which would make it run
in time less than n.
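A sketch of that situation, with a made-up early-exit condition just to illustrate the point:

```cpp
#include <vector>

// Same kind of loop, but with a conditional break: it might run n times,
// or it might bail out early, so we can only say the complexity is O(n).
int loop_with_break(const std::vector<int>& a, int limit)
{
    int sum = 0;
    for (std::size_t i = 0; i < a.size(); ++i)
    {
        sum += a[i];       // atomic work each iteration
        if (sum > limit)   // hypothetical condition for stopping early
            break;         // early bailout: fewer than n iterations
    }
    return sum;
}
```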
Another example is how these things
stack up with each other.
So suppose we have one loop followed
by another loop sequentially.
The first loop runs n times
with no early bailout.
The second loop runs n
times with no early bailout.
And we count atomics here and there.
Well, the first loop is going to do
3n atomic computations,
and the second is going
to do 5n atomic computations.
So it looks like
it's 3n plus 5n, which is 8n, right.
And because there's no early bailout we
can conclude that it's Theta of that.
But really 8n and n are the same.
And so it's still just Theta of n.
So even though we've got two
loops executing in sequence,
the asymptotic complexity
hasn't changed.
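A sketch of two loops in sequence; the bodies are placeholders, chosen only so the first does three atomics per iteration and the second does five:

```cpp
#include <vector>

// First loop: 3n atomics. Second loop: 5n atomics. Total 8n,
// and Theta(8n) is still just Theta(n).
void two_loops(std::vector<int>& a)
{
    const std::size_t n = a.size();
    for (std::size_t i = 0; i < n; ++i)
    {
        a[i] += 1;   // atomic 1
        a[i] *= 2;   // atomic 2
        a[i] -= 3;   // atomic 3
    }
    for (std::size_t i = 0; i < n; ++i)
    {
        a[i] += 1;   // atomic 1
        a[i] += 2;   // atomic 2
        a[i] += 3;   // atomic 3
        a[i] += 4;   // atomic 4
        a[i] += 5;   // atomic 5
    }
}
```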
[ pause ]
This is a different situation
where you have a loop and inside
of that loop you have another loop.
So here we have a for
loop that runs n times,
and inside of that we have a
for loop that runs n times.
And let's say there's two atomics
outside the inner for loop
and three atomics inside
the inner for loop.
Well, the inner for loop does 3n atomic computations.
So one execution of the body
of the outer loop is 2 plus 3n.
The outer loop runs n times,
and so we have 2 plus 3n, times
n, atomics in the execution
of the entire nested loop structure.
And you fiddle that around and you
see that it uses 2n plus 3n squared.
And the fact is 2n plus 3n squared
has the same asymptotic run time
as just plain n squared.
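A sketch of the nested case; again the bodies are placeholders, arranged so there are two atomics outside the inner loop and three inside it:

```cpp
#include <vector>

// Each outer iteration does 2 atomics plus a full inner loop of 3n atomics,
// so it costs 2 + 3n. The outer loop runs n times: n(2 + 3n) = 2n + 3n^2,
// which has the same asymptotics as n^2.
long nested_loops(const std::vector<int>& a)
{
    const std::size_t n = a.size();
    long total = 0;
    for (std::size_t i = 0; i < n; ++i)
    {
        total += a[i];           // atomic 1, outside the inner loop
        total += 1;              // atomic 2, outside the inner loop
        for (std::size_t j = 0; j < n; ++j)
        {
            total += a[j];                       // atomic 1, inside
            total -= 1;                          // atomic 2, inside
            total += static_cast<long>(i * j);   // atomic 3, inside
        }
    }
    return total;
}
```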
There are exercises to help you
recall why that is true; some
of them are in the narrative,
and there are going to be more in your textbook.
So we can conclude that
this complexity is less than
or equal to Big O of n squared.
And by the way we can also conclude that
the complexity is equal to Big Theta
of n squared because there's no early
bailing out in any of these loops.
Those are both true statements,
I want to emphasize.
So it's true that its
complexity is Big O of n squared.
It's also true that its
complexity is Big Theta of n squared.
And by the way we could also
say Big Omega of n squared.
And those all three would be true.
The strongest statement of
course is the one about Theta.
It tells you more than the other two.
So let's take a look
at sequential search.
But I'm going to simplify it this
time where there's no early return.
So here is kind of a simplified
version of sequential search.
T is our type; item is
the first thing in the set,
on the bookshelf; and found starts as false.
While there are items left in L, we check
to see if t is equal to the item.
If it is, we set found to true.
And then we go to the next item.
And we do this for every
item on the bookshelf
and return found after the loop is over.
If found ever gets set to true,
it never gets set to false again.
And so if the item is
there true gets returned.
If the item is never found
then found never gets set
to be true and false gets returned.
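A C++ sketch of that simplified version; I'm using a std::vector for the container L, and the function name is mine.

```cpp
#include <vector>

// Sequential search with no early return: the loop always runs exactly n
// times, with one == comparison per iteration, so the complexity is Theta(n).
template <typename T>
bool sequential_search_full(const std::vector<T>& L, const T& t)
{
    bool found = false;
    for (std::size_t i = 0; i < L.size(); ++i)
    {
        if (L[i] == t)      // the atomic computation we are counting
            found = true;   // once true, it never goes back to false
    }
    return found;
}
```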
So this is performing the same
function as the sequential search we had
with the early return, but it's
a little simpler to analyze.
And it's simpler to analyze because
of this loop: how many times does this loop run?
It runs exactly as many times as
we have books on the bookshelf, or items
in the set L.
The notion of input size
over here is the size
of the container or the bookshelf.
The atomic computation we're going
to count is the comparison here.
We'll just count the number
of times we have a call
to the double equals operator.
That happens exactly once in
each trip through the loop body.
And so it's going to be one compare
times the number of iterations
which is 1 times n. No early bailouts
so the complexity of this is Theta
of n. Now let's go back to
a real sequential search
which has early bailout.
Just to remind you, the distinction
here is that instead of keeping track
of whether we found it or not
all the way through the loop,
we return true right out
of the depths of that loop
as soon as we find the item.
So functionally this
is going to be the same:
it's going to return
true if the item is in the set,
false if the item is not in the set.
But it runs slightly more efficiently,
because once we find it we don't have
to keep looking for it; that is
the basic principle here.
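And a sketch of the early-return version for comparison, again with names of my own choosing:

```cpp
#include <vector>

// Sequential search with early return: same result, but it stops as soon as
// it finds t, so the iteration count is at most n and the complexity is O(n).
template <typename T>
bool sequential_search_early(const std::vector<T>& L, const T& t)
{
    for (std::size_t i = 0; i < L.size(); ++i)
    {
        if (L[i] == t)      // the atomic computation we are counting
            return true;    // early bailout, straight out of the loop
    }
    return false;
}
```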
So the analysis goes similarly,
except that the number of iterations
of the loop body is not necessarily n.
The best we can say is that it's less
than or equal to n, because
of the potential early return.
And therefore we can conclude that
the algorithm complexity is less than
or equal to Big O of n.
But we cannot conclude
that the algorithm complexity
is equal to Theta of n.
So this is a weaker conclusion
about the algorithm.
Which is a little hard to get your arms
around because we've actually
improved the run time of the algorithm.
So if we say Theta of n, it means
that it always runs n times, right.
If we say Big O of n, it means
that it can't exceed n
but it might use less
time to complete its work.
[ pause ]
Which brings us to binary search.
And again we will use the
lower bound algorithm.
So our notion of input size
is the size of the bookshelf,
the number of items in the container.
The comparison will be
our atomic computation;
this time, though, it's the less-than
operator, not the equals operator.
That gets called exactly one time
in each execution of the loop body.
Oh, by the way, I said I'm counting
the less-than operator,
but of course what I mean
is the less-than operator
on the type t for the search space.
So I'm not counting an
integer less-than operator,
only a type t less-than operator.
The integer less-than comparison
is more of a control structure
for the loop, and
I'm not counting that.
Not that it would hurt anything,
because there's exactly one of those
and exactly one type t compare for
each run of the loop body.
So anyway, our f of n is
one compare times the number
of iterations of the loop body.
Now I'm going to just assert that
this loop body runs log n times;
we'll talk about why
that's true in a second.
And so we are going to
claim Theta of log n as
the asymptotic run time
of this algorithm.
And therefore the algorithm
complexity is Theta of log n.
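As a quick sketch of why the iteration count is logarithmic (my own summary; the full argument is in the narrative): each pass cuts the current search range roughly in half, so

```latex
\text{after } k \text{ iterations:}\quad \text{high} - \text{low} \;\le\; \frac{n}{2^{k}},
\qquad\text{and}\qquad
\frac{n}{2^{k}} < 1 \iff k > \log_2 n
```

which means the loop body runs at most about the ceiling of log base 2 of n times, and that is Theta of log n.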
And this is what we want in this case.
We do not want to try to improve
this and get Big O of log n by trying
to detect whether or not we found
it at some intermediate stage.
And I'm just going to take a
little digression to explain why.
We were sampling
only at the midpoint
of the current search range.
So the question is, how
many samplings do we make
when there are only log
n iterations of this?
We only sample log
n values in the range.
Say n is 1,000; we're
talking log base 2 here,
so log n is about 10.
We only sample 10 items,
and there are 1,000 items on the bookshelf.
The odds are we didn't sample the
one we were looking for, right,
because 10 in 1,000 is 1 in 100,
a 1% chance that we sampled
the one we were looking for.
So if we build in a test for whether we
found it, that test is going to fail
99 times out of 100.
So it's going to be essentially
wasted computation, and so why do it?
So we don't go down that
road for binary search.
For sequential search we do, because
every single item gets looked
at one at a time.
So as soon as you find it you
save yourself a lot of work,
and you know you will
find it if it's there.
In binary search you know that you
almost certainly will not find it
serendipitously when
you do calculate v of mid.
And so it doesn't make sense to
ask the question, did I find it?
The probability of success
on that is very low.
So this is something that will come
up again, probably several times.
But a key to helping you
understand the subtleties between O of n
and Theta of n, or Big O and Big Theta,
is to just remember, hold onto the desk:
our optimal sequential
search algorithm is Big O
of n, and our optimal binary
search algorithm is Big Theta
of log n. All right,
that completes the...
[ Pause ]