>> Okay, let's talk a little bit about Homework 2. It's going to relate to all of these things here, and back to the algorithms chapter. I really enjoyed putting this together, and I hope you enjoy working with it. It's been sort of coming to life over the last several weeks; I finally put it together this past weekend. There's still a draft notice up here, but it's mostly done; the stuff that's missing has to do with the nature of a report that I'm going to ask you to write. Of course that comes well after you've done the coding, so let's just talk a little bit about it. The subtitle is "Exploration of Algorithm Runtime Using a Trojan Horse Comparison Operator." At the end of this, I'd like for you to be able to: define function class, function object, predicate class, predicate object, and generic algorithm; design and implement function and predicate class templates; design and implement generic algorithms; and use function objects and predicate objects in client programs. And then comes the spy part: we're going to measure the number of calls made to an atomic operation, use those measured counts of atomic operations to empirically corroborate known theoretical runtimes of algorithms, and discuss the advantages and disadvantages of various known implementations of algorithms. So those are the kinds of things I would like you to take away from this. For the operational objectives, you need to
create a predicate class called LessThanSpy, which goes in the file compare_spy.h, and a generic algorithm called g_lower_bound, just like the g_lower_bound we talked about in the previous chapter. It's going to be in a different namespace, though, called seq; I just made that up for "sequential." You're going to put it in the file gssearch.h. If you replace that first s with a b, you get gbsearch.h, the file where your generic binary search already is. Your deliverables are three files: compare_spy.h, gssearch.h, and report.txt. Procedurally, copy all the files out of
the hw2 directory in the library, as usual. What you get out of that is some client programs: one is called sort_spy and one is called search_spy. sort_spy is a client of your LessThanSpy class; search_spy is a client of your LessThanSpy class and your seq search algorithms. [Inaudible] I wonder if I stated this correctly... yeah, you've got a lower bound and an upper bound too. Okay, and then you've got a helper up here called ranuint; you just compile that, and it can generate files of random numbers. And there's an h_sort.cpp, which is a sorter: if you have a bunch of numbers in a file and you want them sorted, you just use it with redirection, h_sort.x < file1 > file2, and file2 will be file1, sorted. And of course there's the submit script. So you've got to create the files,
compare_spy.h and gssearch.h, test them, and make sure they work correctly. Use the supplied spy clients; if you choose to make modifications, that's fine. Generate some runtime data, and write a little report about the discoveries you make with that runtime data. Now, code requirements. LessThanSpy is going to be a predicate class
template. Its object is going to maintain a count of the number of times operator() is called since the object was created, or since the last time Reset was called. The API for LessThanSpy looks like this: it's got your operator(), which returns a bool; that makes it a predicate class. It takes a t1 and a t2 and returns true if and only if t1 is less than t2, so its behavior in that respect is exactly the same as the LessThan function object. But it has a method called Reset, which sets the internal counter to zero, and a method called Count, which is const and just returns the value of your counter. The default constructor also sets the internal counter to zero, so the internal counter is initialized to zero on startup and can be reinitialized to zero by calling Reset. This is going to be a very interesting little object to have, because where we use the LessThan object in a generic algorithm, you can drop this in and the functionality of the generic algorithm will be exactly the same. But this little Trojan horse gets dropped in and collects data on how many times that algorithm calls the less-than operator. So that's why it's kind of a Trojan horse: it goes in and behaves just like a less-than operator. And it's a spy because it's returning information about the internal workings of that generic algorithm.
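From that description, a minimal sketch of such a class might look like the following. This is only an illustration of the API just described, not the course's actual compare_spy.h; the counter type and member layout here are my own assumptions.

```cpp
#include <cstddef>

// Sketch of a counting predicate class along the lines described above.
// It orders objects exactly like operator<, but counts its own calls.
template <typename T>
class LessThanSpy
{
public:
  LessThanSpy() : count_(0) {}              // counter starts at zero
  bool operator()(const T& t1, const T& t2) // the predicate itself
  {
    ++count_;                               // the "spy" part: record the call
    return t1 < t2;                         // the ordinary less-than part
  }
  void   Reset()       { count_ = 0; }      // re-zero the counter
  size_t Count() const { return count_; }   // read the counter
private:
  size_t count_;
};
```

Because operator() updates the counter, the object has state; a generic algorithm that takes the predicate by reference lets you read the count afterward.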
Now, the code requirements for sequential lower bound and upper bound. They're both generic algorithms, and both operate on forward iterators, a much weaker class of iterators than random access iterators. A precondition for successful operation is that the range to which they are applied is sorted using the same predicate as is used in the algorithm call. g_lower_bound returns the lower bound of t in the range, and the lower bound is defined exactly the same way as it is for the generic binary search lower bound: basically, it's the iterator pointing to the first place in the range that's greater than or equal to t. Upper bound is the first place in the range that is greater than t. So remember, if t is not in there, then "equal" isn't going to happen, and lower bound and upper bound return the same thing, namely the iterator to the first place that's greater than t. Now, that could be anywhere from the middle of the range to the end of the range, because t could be greater than everything in the range; so it could return end, but it will always return an iterator into the range or to the end of the range. These are in the namespace seq, and both of them go in the file gssearch.h. You're going to write a report about your findings with this, but we'll talk about that report later; actually, I think it will be pretty self-explanatory.
Here are some hints. First, all of these things can be compiled with one command; you don't even need a makefile. You're welcome to make one if you want, but I just compile them with my c4530 macro. If you haven't gotten your c4530 macro up and running, you can copy it out of examples/scripts (there's a cl4530 in there as well), put it in your script directory, change it to executable, and call rehash, or log out and log back in, and it will be available for you everywhere in your environment. Second hint: some models. You've got compare.h, which is obviously a pretty good model to start with for compare_spy.h, and you've got gbsearch.h, which is a good model for seeing how to create gssearch.h. The code in the implementations is going to be different, but the prototypes can look quite similar. One thing you want to be sure of is that you can use compare.h and compare_spy.h in the same program, which means that your protection against multiple includes for compare_spy.h can't be the same as it is for compare.h. Don't forget your best practices, particularly when you're talking about constructors for classes.
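That multiple-inclusion remark is about include guards: if compare_spy.h reused compare.h's guard symbol, whichever header was included second would be silently skipped. The guard symbols below are illustrative; use whatever convention the course files actually follow.

```cpp
// In compare.h (existing, shown for contrast):
//   #ifndef _COMPARE_H
//   #define _COMPARE_H
//   ...
//   #endif

// compare_spy.h needs a DIFFERENT symbol so both headers can coexist:
#ifndef _COMPARE_SPY_H
#define _COMPARE_SPY_H

// ... LessThanSpy definition goes here ...

#endif
```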
Here's a hint about the naming: g_lower_bound and g_upper_bound are used in at least three namespaces in this course. They all have the same name, and they all accomplish the same thing, really, but in different ways. In namespace fsu, they're implemented iteratively using the binary search idea, just as in our lecture notes and just as in the Standard Template Library; these require random access iterators, and they're in the file gbsearch.h. In namespace alt, they're implemented recursively, as divide and conquer algorithms; they still require random access iterators, and these are also in your library, implemented in rbsearch.h. And finally, in namespace seq (those are the ones you're building), they're implemented iteratively, but they use sequential search and operate under the less restrictive forward iterator assumptions. These go in your file gssearch.h. Now, sort_spy and search_spy: that's source code that
Now sort_spy and search_spy that's source code that
you're being given, I'm still tinkering with a little bit so be sure to get
fresh copies whenever you restart and of course as usual you have
the executable file [inaudible]. So the next thing I will do
is log back in linprog here. [ Typing ] There's a couple of sticky keys in this
machine, so let's see. I can do c4530 sort_spy... I can't have you use that yet; I've still got to distribute gsort.h. So let's see, we'll just... I need to do search_spy too, so both of them will compile. So, sort_spy: of course for sorting, you know, you've got a file with stuff that needs sorting, and then you sort it, right? So it's going to require an input file and an output file, and that means I will need to make myself some files of data. So I just compile ranuint.x; it wants some information, so for ranuint.x I'm going to put n.100 (that will be the name of the file), the upper bound will be 1000, and the number of items will be 100. So that's going to give me 100 numbers between 0 and 1000 (probably 999, actually), and it will put them in the file n.100. I'm going to go ahead and up the ante on all this: get me 1,000 numbers, and maybe 10,000 numbers, and maybe 100,000 numbers. So now I can run sort_spy, and it will remind me of what it wants: an input file and an output file. Now, because this is sort_spy, it's not actually going to write the sorted data; if you want to sort the data, just use g_sort. What it's going to write to the file is the measurements that it's making. So let's see... what was my first one? n.100, 100 numbers. Oh, it wants an output file too. It basically wrote the same
one, n.100, 100 numbers oh, it wants an output file too. It basically wrote this same
little [inaudible] to output, so what we see is there is that it has
passed in, it's used your comp count, it's used your spy, your
last end spy object to pass to these generic algorithms and deduced
from that how many pairs are called up. So here's your selection_sort
which called on 1000 objects, 5050
calls to less than. Insertion_sort, 2579, heap_sort
1034, merge_sort 582 and list_sort which is the implementation
of merge_sort list 558. These are just calculated for
your visual convenience, that's n, that's the main number put in and that's
n log n. This is just a calculation, n log n is 664, noticed for example
merge_sort and list_sort come in under that part, n times n plus 1 over
2 that's your n of n squared. I use n of n plus 1 over 2 instead
of n squared because it just happens to be exactly how many times comparison
operator gets called by selection_sort. No matter what, so we could up
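Those convenience columns are just arithmetic, and it's easy to reproduce them yourself. A quick sketch (the helper names here are mine; this is not part of the assignment code):

```cpp
#include <cmath>

// n(n+1)/2 is the exact comparison count of selection_sort on n items;
// n * log2(n) is the benchmark the efficient sorts are measured against.
long selection_comps(long n) { return n * (n + 1) / 2; }
double nlogn(long n)         { return n * std::log2(double(n)); }

// Note: multiplying n by 10 multiplies n(n+1)/2 by roughly 100, which
// is why the O(n^2) sorts fall so far behind as the input grows.
```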
So we could up the ante here and go for 1,000, and you'll see... you begin to notice, for example, that insertion_sort is still taking a whopping amount of time, but only about half as much as selection_sort. I'm using "time" in a little bit of an elevated sense; it's not actual clock time. But it's time in the sense that we've got an atomic operation; imagine the number of times the lowest-level details of the algorithm execute, and think of that as the measure. So insertion_sort on this round runs about twice as fast as selection_sort. But when you get down into merge_sort, in its two different forms, it's looking real good; it's on the order of n log n instead of n squared.
So we crank this up some more; I believe this was the 100,000 mark. This may take a little while, because, you know, if you have an O(n squared) algorithm and your input size goes up by a factor of 10, your runtime is going to go up by a factor of 100, which is 10 squared. So if it took one second for, say, insertion_sort to sort 10,000 data, it's going to take 100 seconds to sort 100,000: the size went up by 10, the time goes up by 100. So this is a pretty good lesson in why you want efficient algorithms. We're still going... well, we finished the selection_sort, so that's clearly the most time consuming. insertion_sort again takes about half as long, but then the rest of them just come out of there in no time. So this is beginning to be a rather dramatic difference. Again, notice that merge_sort is coming in at something under n log n here, and heap_sort is still in that ballpark; just by looking at the number of digits, insertion_sort is about 1000 times more. So okay, onward, let's
check out search_spy. With search_spy, you can enter a question mark; you have to put it in single quotes, because otherwise UNIX intercepts it, but if you put it in single quotes it will just regard that question mark as an actual command line argument for search_spy. If you put that in, it tells you how to operate the program and the possible command line arguments. Your first command line argument may be a command file, in which case everything is processed in batch mode. Your second command line argument, the truth is, can be anything; if there is any second command line argument at all, it will operate in batch mode silently. When you have very, very large amounts of data, that's what you're going to want. But let me see what we've got here: we're
in the uint32 version; I need to change this. Notice how... I'm going to change types here: I'm going to uncomment char and comment out uint32. Char is good for testing the functionality of your algorithms, and for display purposes. But because there are only 26 letters, you can't really do massive searches with char, so we switch to the unsigned ints when we want massive amounts of data. So now we have type char, and I'm just going to run this thing, you
know, right here, and it's going to look similar to something we were looking at before. So we'll enter some characters; I'll put some in, and, you know, it doesn't ask you to enter them in alphabetic order; it stores them for you. So what we've got here is a deque and a list. In the deque we have random access iterators, and so we can call the binary search versions of lower bound and upper bound; but in the list we have to call the sequential search versions, which you have to write. That's why those are both there. So now, what do we want to
look for? Let's look for e. Okay, so there's the lower bound for e in the deque, and the upper bound for e in the deque. There's alt; remember, alt is the recursive implementation, which is also in your library. Notice it is, of course, coming up with the same answers, as you would hope it would. And finally, the lower bound and upper bound in seq, the sequential namespace; we're doing this search in a list just to prove the point that these algorithms only require forward iterators. But of course they come out with the same thing. So let's look for a; they should all come up with the same place. If you look for v: v is in there, so the lower bound will be the first occurrence of v, and the upper bound will be one past the last, because v is the biggest item in there. If we look for z, everything points past the last element, to the end of the range, and then we can do some more
searches. I want to do a few more here; I don't really care about looking at them too much now. Then I'm going to enter a dot here, and then we've got a little find report, and the find report tells us the minimum comps, the average number of comps, and the maximum comps for each call to the search algorithms, okay. These are the search algorithms that got called: lower bound and upper bound in the fsu namespace, and lower bound and upper bound in the alt namespace; all four of those are in your library. Then lower bound and upper bound in the seq namespace; those are the ones you supply, your deliverables. Then it computes some interesting numbers: that's the size of the search space, this is the floor of the log of the size of the search space, and the ceiling of the log of the size of the search space, which is of
course relevant to the binary search algorithm. So notice that in the lower bound and upper bound, the minimum number of comps is 4 and the maximum number is 5, and the average is in between, 5 being the ceiling and 4 being the floor of the log of the size. So what that tells you, and it kind of reminds you of a point we made early on, is that those binary-search-implemented lower and upper bounds run to completion every time. They go all the way to the bottom of the search every time; there's no test for early termination in those things, and there's a good reason for that, which we talked about earlier.
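Those 4s and 5s are no accident: with a search space of size n, a binary search that runs to completion costs between floor(log2 n) and ceiling(log2 n) comparisons. A quick check against the sizes used in these demo runs (helper names are mine):

```cpp
#include <cmath>

// Binary-search lower/upper bound always runs to completion, so its
// comparison count lands between floor(log2 n) and ceil(log2 n).
// Demo sizes: 20 items -> 4..5 comps, 10,000 -> 13..14, 100,000 -> 16..17.
int floor_log2(int n) { return (int)std::floor(std::log2((double)n)); }
int ceil_log2 (int n) { return (int)std::ceil (std::log2((double)n)); }
```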
Sequential search, of course: sometimes you find the item at the beginning, so there's only one comparison; sometimes you find it at the end, or you don't find it at all, in which case you've got to look at everybody, so there are 20 comparisons; and the average is about halfway in between. So there's a much bigger range in how many comparisons you make with the sequential-search-implemented algorithms than there is with the binary search ones. So I want to just... I've pre-made some data files; I need to go back and compile this and get into some big search spaces. [Inaudible] Both of these will run; I'll make a copy, search_spy.x2, and if we recompile it again for the characters I'll make a copy of that too, so I can do both. Okay, so we've made some
command files here. It's a little more complicated than just reading a file and executing on the file, like for the sorts; there, you just read the data in and that's the end of it. But here, what we've got to do is first read in the search space and then call a bunch of individual search commands, and I've done some of that in i1.com, i2.com, and i3.com; i for integer, I guess. I'll show you what i1.com looks like: this first block will be the search space, so it will read all of those in; when you hit the separator a second time, it goes into search mode, and then it calls search for those lines; when you hit it the last time, the program quits executing. So we will do i1.com. This is slightly
verbose; it's not actually showing the picture of the search, because integers don't have constant width, so there really isn't a good way to paint a picture of that action, and that part is omitted. But it does give you the data, like before, and it's very similar to what we did. Now, i2, though: i2 is much bigger, and you want this one to be silent; you don't want to see all of that stuff. So when I say silent, it just tells me when it has loaded the search space, and then it goes through the searches, right. So that's the size of the search space, and this is the ceiling of
the log of the size, 14. The search space is 10,000. Notice that the lower bound and upper bound for both the fsu and the alt namespaces, which require random access iterators (that's the algorithm we talked about the very first week of class), take between the floor and the ceiling of the log of the size in comps in every case: the smallest is 13, the maximum is 14, and it kind of leans more toward 13. The sequential lower bound, where you're doing sequential search over 10,000 items: the average is 5,000 comps, the smallest is 1, and the largest is nearly 10,000. So notice that, for a search space
the size of 10,000, the difference between taking log n time and taking n time is beginning to be huge. Okay, I made up an even bigger one here, but you can put these together yourself quite easily with those random number generators. Okay, so the search space is loaded, and now we're doing a bunch of searches, quite a few, and these sequential searches are going to take quite a bit of time; when we're all done, it will give us a report on that. So while we're waiting here, this is drumming into us the importance of using an efficient algorithm when you can; of course, if you need to search a list, you have to use sequential search. We are fortunate that for lists there are efficient sort algorithms, namely merge sort, but searching is sequential. So again, the sizes here: 100,000 is the size of the search space, the floor of the log is 16, and the ceiling is 17. Between 16 and 17 is how many comps are required by each of the binary search computations; half of 100,000 is the average cost of the sequential searches, with a minimum of 3 and a maximum of nearly 100,000. So I've kind of shown you how
the tests went, and given you, I think, at least a little bit of an informal flavor of the kinds of things you should be concluding when you do your experiments and put them in your report. Of course, to make all this work, you've got to write LessThanSpy, and you've got to write g_lower_bound and g_upper_bound in the seq namespace, the sequential namespace. I've talked at length about how sequential search works, so I think you'll enjoy that experience. After that, you will enjoy playing with these programs that help you analyze algorithms. So that's the end of what I have to say about that.