frequency counter in text using struct

Completato Pubblicato Sep 24, 2015 Pagato alla consegna
Completato Pagato alla consegna

A Frequency Counter

Our word frequency counter allows you to count the frequency usage of each word in your text.

That MS Word add-on created a list of all the words in a document, ordered by frequency. It made it easy to

detect overuse and/or abuse of a certain word or expression. The little used words were also of help, because it

may find errors that the spelling checker does not detect.

Automated authorship detection is the process of using a computer program to analyze a large collection of

texts, one of which has an unknown author, and making guesses about the author of that unattributed text. The

basic idea is to use different statistics from the text -- called "features" in the machine learning community -- to

form a linguistic "signature" for each text. One example of a simple feature is Type-Token Ratio - the number

of different words used in a text divided by the total number of words. It's a measure of how repetitive the

vocabulary is.

The documents to be checked is just regular text files (i.e., sequences of characters) and it has one word per line

--- you do not have to extract a word. It means this project does not depend on previous lab projects. The

frequency counter reads in the file to get words from the file. Each word in the file is counted and sorted

properly to produce output:

 Word list in frequency order -- high to low (first 10 highest and last 10 lowest)

 Word list in alphabetical order -- high to low (first 10 in alphabetical order and last 10 in alphabetical

order)

 Type-Token Ratio

input:

frequency.txt.

output:

 Word list in frequency order -- high to low (first 10 highest and last 10 lowest)

 Word list in alphabetical order -- high to low (first 10 in alphabetical order and last 10 in alphabetical

order)

 Type-Token Ratio

Programmazione C++

Rif. progetto: #8549076

Info sul progetto

5 proposte Progetto a distanza Attivo Sep 24, 2015

Assegnato a:

vano101

Hello, This is easy assignment, it can be completed in about 1-2 hrs .........................................

$10 USD in 1 giorno
(568 valutazioni)
6.7

5 freelance hanno fatto un'offerta media di $18 per questo lavoro

it2051229

Hi, I have read the requirements and I would like to do this project if given the opportunity. I am a computer science practitioner and having the "C++ and Algorithms Certification Badge" in my profile gives me confide Altro

$25 USD in 1 giorno
(303 valutazioni)
6.3
urmate

A proposal has not yet been provided

$20 USD in 0 giorni
(84 valutazioni)
5.4
ithuang2014

A proposal has not yet been provided

$24 USD in 1 giorno
(27 valutazioni)
4.1
rinabuoy13

A proposal has not yet been provided

$25 USD in 1 giorno
(7 valutazioni)
2.9
ajaynishad

A proposal has not yet been provided

$10 USD in 1 giorno
(0 valutazioni)
0.0