dupeGuru: Find Duplicate Files Across Platforms

dupeGuru: The Ultimate Cross-Platform Duplicate File Finder

Storage fills up faster than most of us can manage it, and duplicate files are a common culprit. dupeGuru, a battle-tested open-source tool with 7.5k GitHub stars, helps you reclaim disk space by finding duplicate files in the folders you choose to scan.

What Makes dupeGuru Special?

Unlike basic duplicate finders, dupeGuru offers cross-platform perfection:

  • Linux, macOS, Windows: one tool, all platforms
  • Python 3 + Qt: modern, maintainable codebase
  • 95.5% Python: easy to understand and contribute
  • Active development: latest commits in 2026

Key Features

  • Advanced duplicate matching algorithms
  • Visual results with preview capabilities
  • Safe deletion with trash integration
  • Customizable scanning criteria
  • Translation support via Transifex
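
To make the first bullet more concrete, here is a minimal sketch of one common content-matching approach: group files by size, then confirm matches by hashing their contents. It illustrates the general technique only and is not dupeGuru's actual implementation. The names `file_digest` and `find_duplicates` and the `min_size` parameter are made up for this example.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root, min_size=1):
    """Return sorted groups of paths under `root` with identical contents.

    Hypothetical helper for illustration; `min_size` (in bytes) stands in
    for the kind of scanning criteria a real tool would expose.
    """
    # Pass 1: group by size, since files of different sizes cannot match.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            size = os.path.getsize(path)
            if size >= min_size:
                by_size[size].append(path)

    # Pass 2: hash only the files that share a size with another file.
    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)
```

Calling `find_duplicates` on a folder returns a list of groups, each holding two or more paths with byte-identical contents. Grouping by size first avoids hashing files that cannot possibly match.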

Quick Start: Build from Source

Prerequisites

Python 3.7+ & PyQt5
# Linux: python3-pyqt5, pyqt5-dev-tools, python3-dev, build-essential
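
Before building, it can help to confirm that the interpreter and the Qt bindings are visible to Python. The following is a small sketch using only the standard library. The function name `check_prerequisites` is hypothetical, and the check covers only the Python version and module availability, not the Linux system packages listed above.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 7)  # minimum version quoted in the prerequisites


def check_prerequisites(modules=("PyQt5",)):
    """Return a list of problems found; an empty list means none."""
    problems = []
    if sys.version_info < MIN_PYTHON:
        found = ".".join(str(part) for part in sys.version_info[:3])
        problems.append(f"Python 3.7+ required, found {found}")
    for name in modules:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing module: {name}")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print(problem)
```

If the script prints nothing, both checks passed for the interpreter that ran it. Run it with the same `python3` you plan to build with.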

One-Command Setup

make && make run

Virtual Environment Method

python3 -m venv --system-site-packages ./env
source ./env/bin/activate
pip install -r requirements.txt
python build.py
python run.py

Packaging for Distribution

Debian/Ubuntu one-liner:

bash -c "python3 -m venv --system-site-packages env && source env/bin/activate && pip install -r requirements.txt -r requirements-extra.txt && python build.py --clean && python package.py"

Running Tests

# With tox (recommended)
tox

# Or manually
pip install -r requirements-extra.txt
pytest core hscommon

Project Structure

dupeGuru/
├── core/          # Matching algorithms
├── qt/            # Qt UI implementation
├── help/          # Sphinx documentation
├── locale/        # Translations (.po files)
├── pkg/           # Packaging templates
└── hscommon/      # Shared utilities

Active Maintenance Needs

The project seeks help with:

  • macOS packaging & bug reproduction
  • Linux packaging (PPA, Debian, RPM)
  • Translations (Transifex project active)
  • Documentation updates

Why Choose dupeGuru?

With 2,086 commits, 32 contributors, and regular releases (latest 4.3.1), this isn't a dead project. It's a production-ready tool that powers real disk cleanup workflows.

Topics: python, deduplication
License: GPL-3.0
Website: dupeguru.voltaicideas.net

Ready to clean up your storage? Clone the repo and start scanning today!

git clone https://github.com/arsenetar/dupeguru.git
cd dupeguru
make && make run