norbert-acedanski/document-statistics

document-statistics

About The Project

The script plots a Zipf graph for a given text file, that is, a graph of the number of occurrences of each word in the text.
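As a rough illustration of the data behind such a graph, the sketch below counts word occurrences and orders them from most to least frequent. It is not the repository's code: the function name rank_words and the tokenisation rule (lowercased runs of letters and apostrophes) are assumptions made for this example.

```python
import re
from collections import Counter


def rank_words(text):
    """Return (word, count) pairs ordered from most to least frequent."""
    # Assumed tokenisation: lowercase the text, keep runs of letters/apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    # most_common() sorts by count, descending; ties keep first-seen order.
    return Counter(words).most_common()


if __name__ == "__main__":
    sample = "the cat and the dog and the bird"
    for rank, (word, count) in enumerate(rank_words(sample), start=1):
        print(rank, word, count)
```

Plotting the counts against their rank gives the Zipf graph; the plotting step is left out here to keep the sketch free of third-party packages.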

Built With

Python 3.9.10

Getting started

Requirements

All required packages are listed in the requirements.txt file.

To install them, run the following command in the terminal:

pip install -r requirements.txt

Working with document-statistics:

  1. Copy the text you want to process into the processFile.txt file and save it.
  2. Set the start_index and amount_of_words_to_display variables to select the range of words to plot.
  3. Run the script.
  4. After successful execution, two files should be created, number_of_word_instances_list.txt and words_list.txt, and a graph of the selected range of words should appear.
  5. While the graph is visible, type y in the terminal to see the next chunk of the same size, or type n to close the graph and quit the program.
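The steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the script itself: the helper names select_chunk, write_outputs and page_through are hypothetical, the one-entry-per-line layout of the two output files is a guess, and show stands in for drawing the graph. Only the file names and the start_index and amount_of_words_to_display variables come from the steps.

```python
def select_chunk(ranked, start_index, amount_of_words_to_display):
    """Return the slice of ranked (word, count) pairs to display."""
    return ranked[start_index:start_index + amount_of_words_to_display]


def write_outputs(ranked, words_path="words_list.txt",
                  counts_path="number_of_word_instances_list.txt"):
    """Write words and counts to two files, one entry per line (assumed layout)."""
    with open(words_path, "w", encoding="utf-8") as words_file:
        words_file.write("\n".join(word for word, _ in ranked))
    with open(counts_path, "w", encoding="utf-8") as counts_file:
        counts_file.write("\n".join(str(count) for _, count in ranked))


def page_through(ranked, start_index, amount_of_words_to_display,
                 ask=input, show=print):
    """Show one chunk, then advance by the same size while the answer is 'y'."""
    while True:
        chunk = select_chunk(ranked, start_index, amount_of_words_to_display)
        if not chunk:
            break  # Nothing left to display.
        show(chunk)  # Stand-in for drawing the graph of this chunk.
        if ask("Show next chunk? (y/n): ").strip().lower() != "y":
            break
        start_index += amount_of_words_to_display
    # Return the start index of the last chunk considered.
    return start_index
```

In this sketch any answer other than y ends the loop, which is slightly more permissive than the y/n prompt described in step 5.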

Usage

The script lets you quickly check the frequency of words in a given file.

The project also contains a sample .txt file for processing.

Licence

Distributed under the MIT License. See the LICENSE file for more information.
