dupeGuru: Find Duplicate Files Across Platforms
dupeGuru: The Ultimate Cross-Platform Duplicate File Finder
In a world where storage fills up faster than we can manage, dupeGuru emerges as a lifesaver. This battle-tested, open-source tool with 7.5k GitHub stars helps you reclaim precious disk space by identifying duplicate files across your entire system.
What Makes dupeGuru Special?
Unlike basic duplicate finders, dupeGuru offers cross-platform perfection: - Linux, macOS, Windows - One tool, all platforms - Python 3 + Qt - Modern, maintainable codebase - 95.5% Python - Easy to understand and contribute - Active development - Latest commits in 2026
Key Features
- Advanced duplicate matching algorithms
- Visual results with preview capabilities
- Safe deletion with trash integration
- Customizable scanning criteria
- Translation support via Transifex
Quick Start: Build from Source
Prerequisites
Python 3.7+ & PyQt5
# Linux: python3-pyqt5, pyqt5-dev-tools, python3-dev, build-essential
One-Command Setup
make && make run
Virtual Environment Method
python3 -m venv --system-site-packages ./env
source ./env/bin/activate
pip install -r requirements.txt
python build.py
python run.py
Packaging for Distribution
Debian/Ubuntu one-liner:
bash -c "python3 -m venv --system-site-packages env && source env/bin/activate && pip install -r requirements.txt -r requirements-extra.txt && python build.py --clean && python package.py"
Running Tests
# With tox (recommended)
tox
# Or manually
pip install -r requirements-extra.txt
py.test core hscommon
Project Structure
dupeGuru/
βββ core/ # Matching algorithms
βββ qt/ # Qt UI implementation
βββ help/ # Sphinx documentation
βββ locale/ # Translations (.po files)
βββ pkg/ # Packaging templates
βββ hscommon/ # Shared utilities
Active Maintenance Needs
The project seeks help with: - macOS packaging & bug reproduction - Linux packaging (PPA, Debian, RPM) - Translations (Transifex project active) - Documentation updates
Why Choose dupeGuru?
With 2,086 commits, 32 contributors, and regular releases (latest 4.3.1), this isn't a dead project. It's a production-ready tool that powers real disk cleanup workflows.
Topics: python, deduplication License: GPL-3.0 Website: dupeguru.voltaicideas.net
Ready to clean up your storage? Clone the repo and start scanning today!
git clone https://github.com/arsenetar/dupeguru.git
cd dupeguru
make run