diff --git a/README.md b/README.md
new file mode 100644
index 0000000..32b5778
--- /dev/null
+++ b/README.md
@@ -0,0 +1,42 @@
+# Word Analyzer
+
+Extracts text from PDFs using Apache PDFBox and analyzes word frequency with customizable filters.
+
+
+ Screenshot of program
+
+
+## Features
+
+- Scans all PDF files in a given folder
+- Counts and displays word frequency
+- Filters results by minimum and maximum frequency
+- Optional maximum file count limit
+- Shows scan logs and results in separate windows
+- Displays total scan time
+
+## Requirements
+
+- Java JRE 8 or higher
+- Apache PDFBox 1.8.16 (bundled, no download needed)
+
+## Usage
+
+```bash
+java -cp WordAnalyzer.jar:pdfbox-1_8_16.jar wordanalyzer.WordAnalyzer
+```
+
+1. Enter the folder path containing your PDF files
+2. Set the minimum/maximum frequency filter
+3. Optionally set a maximum file count
+4. Click **Confirm Folder Path** to start the scan
+5. Results will appear in the results window when the scan is complete
+
+## Building from Source
+
+```bash
+mkdir out
+javac -cp pdfbox-1_8_16.jar -d out src/wordanalyzer/WordAnalyzer.java
+jar cfe WordAnalyzer.jar wordanalyzer.WordAnalyzer -C out .
+```
+
diff --git a/WordAnalyzer.jar b/WordAnalyzer.jar
new file mode 100644
index 0000000..0810f04
Binary files /dev/null and b/WordAnalyzer.jar differ
diff --git a/docs/assets/img/preview.jpg b/docs/assets/img/preview.jpg
new file mode 100644
index 0000000..70c652e
Binary files /dev/null and b/docs/assets/img/preview.jpg differ
diff --git a/out/wordanalyzer/WordAnalyzer$1.class b/out/wordanalyzer/WordAnalyzer$1.class
new file mode 100644
index 0000000..d606bdd
Binary files /dev/null and b/out/wordanalyzer/WordAnalyzer$1.class differ
diff --git a/out/wordanalyzer/WordAnalyzer$2.class b/out/wordanalyzer/WordAnalyzer$2.class
new file mode 100644
index 0000000..d919052
Binary files /dev/null and b/out/wordanalyzer/WordAnalyzer$2.class differ
diff --git a/out/wordanalyzer/WordAnalyzer.class b/out/wordanalyzer/WordAnalyzer.class
new file mode 100644
index 0000000..0bff6f7
Binary files /dev/null and b/out/wordanalyzer/WordAnalyzer.class differ
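
Note on the README's feature list: it describes counting word frequency and filtering the results by a minimum and maximum frequency. The sketch below illustrates one way that step could look in Java. It is not taken from `WordAnalyzer.java`. The class name `WordFrequencySketch`, both method names, and the tokenization rule are hypothetical, and it works on text that has already been extracted rather than calling PDFBox.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: names are illustrative and not taken from WordAnalyzer.java.
class WordFrequencySketch {

    // Count case-insensitive word occurrences in already-extracted text.
    static Map<String, Integer> countWords(String text) {
        Map<String, Integer> counts = new TreeMap<>(); // sorted by word for stable output
        // Assumed tokenization: split on runs of characters that are neither letters nor digits.
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Keep only the words whose count lies in [min, max], both bounds inclusive.
    static Map<String, Integer> filterByFrequency(Map<String, Integer> counts, int min, int max) {
        Map<String, Integer> kept = new LinkedHashMap<>(); // preserves the input order
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            int n = entry.getValue();
            if (n >= min && n <= max) {
                kept.put(entry.getKey(), n);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        String text = "The scan reads the PDF, then the scan counts words.";
        Map<String, Integer> counts = countWords(text);
        System.out.println(filterByFrequency(counts, 2, 3)); // prints {scan=2, the=3}
    }
}
```

Treating both bounds as inclusive is an assumption; the README does not say whether the application's filter includes or excludes the limits.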