Assignment #5 - Text File Analyzer

Due Date: Mon, Nov 7

Objectives

Task

Write a program that reads a text file, then prints some statistical results about its contents to an output file. The results will be primarily an analysis of the characters in the file, which will include the frequency of occurrence of certain categories of characters, as well as the frequency of occurrences of each of the letters of the alphabet.

Note that in most standard English text, certain letters typically appear with a higher frequency than others. This kind of information has been used in setting up games like Scrabble (letter frequencies and point values, etc), as well as more serious applications like decoding encrypted messages and cyphers.

Details and Requirements

General Requirements


Sample input files and results

Example 1

Sample input file - file1.txt:

Hello.  How are you?
I am 4 years old.  My favorite color is blue.
Next year I will be 5.

Sample program execution

(underline denotes user input)
Please enter the name of the input file.
Filename:  file1.txt 
Please enter the name of the output file.
Filename:  out1.txt
Processing complete

Contents of out1.txt

Statistics for file: file1.txt
-------------------------------------------------

Total # of characters in file:        90

Category                How many in file           % of file
----------------------------------------------------------------------
Letters                               61               67.78 %
White space                           22               24.44 %
Digits                                 2                2.22 %
Other characters                       5                5.56 %


LETTER STATISTICS

Category                How many in file    % of all letters
----------------------------------------------------------------------
Uppercase                              6                9.84 %
Lowercase                             55               90.16 %
a                                      5                8.20 %
b                                      2                3.28 %
c                                      1                1.64 %
d                                      1                1.64 %
e                                      8               13.11 %
f                                      1                1.64 %
g                                      0                0.00 %
h                                      2                3.28 %
i                                      5                8.20 %
j                                      0                0.00 %
k                                      0                0.00 %
l                                      7               11.48 %
m                                      2                3.28 %
n                                      1                1.64 %
o                                      7               11.48 %
p                                      0                0.00 %
q                                      0                0.00 %
r                                      5                8.20 %
s                                      2                3.28 %
t                                      2                3.28 %
u                                      2                3.28 %
v                                      1                1.64 %
w                                      2                3.28 %
x                                      1                1.64 %
y                                      4                6.56 %
z                                      0                0.00 %

Example 2

Sample input file - file2.txt:

The quick brown fox jumped over the lazy dog.
How lazy was he?  Well, he was pretty darn lazy, I'll tell you.
And speaking of the fox, why was he jumping over dogs anyways?
Didn't he have anything better to do with his time?
When he checked his watch at 12:45 PM, 15 more foxes jumped over
the lazy dog, who so far had counted 100 sheep, 17 cats, and 2 rabbits.

Sample program execution

(underline denotes user input)
Please enter the name of the input file.
Filename:  file2.txt 
Please enter the name of the output file.
Filename:  out2.txt
Processing complete

Contents of out2.txt

Statistics for file: file2.txt
-------------------------------------------------

Total # of characters in file:       362

Category                How many in file           % of file
----------------------------------------------------------------------
Letters                              261               72.10 %
White space                           73               20.17 %
Digits                                12                3.31 %
Other characters                      16                4.42 %


LETTER STATISTICS

Category                How many in file    % of all letters
----------------------------------------------------------------------
Uppercase                              9                3.45 %
Lowercase                            252               96.55 %
a                                     21                8.05 %
b                                      4                1.53 %
c                                      6                2.30 %
d                                     14                5.36 %
e                                     30               11.49 %
f                                      5                1.92 %
g                                      6                2.30 %
h                                     22                8.43 %
i                                     11                4.21 %
j                                      3                1.15 %
k                                      3                1.15 %
l                                     10                3.83 %
m                                      6                2.30 %
n                                     12                4.60 %
o                                     19                7.28 %
p                                      7                2.68 %
q                                      1                0.38 %
r                                     10                3.83 %
s                                     13                4.98 %
t                                     19                7.28 %
u                                      6                2.30 %
v                                      4                1.53 %
w                                     12                4.60 %
x                                      3                1.15 %
y                                     10                3.83 %
z                                      4                1.53 %

Submitting

Submit your program (Use the filename prog5.cpp) in the usual way, through the web page.