fdupes Command: Tutorial & Examples

Finding and Managing Duplicate Files

Have you ever found yourself struggling to free up disk space on your Linux server? Or maybe you've encountered issues caused by duplicate files scattered across your directories. Well, fear not! In the vast realm of Linux, there's a powerful command-line tool called fdupes that can help you identify and manage those pesky duplicates. In this guide, we'll explore what fdupes does, how it works, and why it's an invaluable asset for your Linux server.

What Does fdupes Do?

Simply put, fdupes helps you locate duplicate files on your Linux server. By scanning through directories, fdupes compares files based on their size and content, allowing it to identify identical files even if they have different names. It helps you reclaim storage space and maintain a more organized file system by identifying and handling duplicate files efficiently.

The accumulation of duplicate files can be problematic for several reasons. First and foremost, duplicate files consume valuable storage space, leading to disk capacity issues. Additionally, duplicated files may cause confusion and inefficiency when searching for specific documents. They can also create unnecessary redundancy, resulting in increased backup times and resource usage.

By leveraging fdupes, you can address these challenges effectively. It allows you to identify and eliminate duplicate files, ensuring a more streamlined and efficient file system.

How Does fdupes Work?

fdupes utilizes a clever algorithm to detect duplicate files. It compares the files in a selected directory or a set of directories and checks for similarities in both file size and content. By performing a binary comparison, fdupes can quickly identify duplicates, even if they're scattered across various locations.

Once fdupes finds duplicates, it presents a list of files that match, making it easier for you to decide what actions to take. You can choose to delete or move duplicates, preserve specific versions, or create hard links to save disk space while maintaining file accessibility.

Using fdupes - Examples

Let's dive into some practical examples to grasp the versatility of fdupes and how it can be utilized in different scenarios:

Example 1: Scanning a Directory

To scan a directory and find duplicate files, you can use the following command:

fdupes /path/to/directory

Replace /path/to/directory with the actual path to the directory you want to scan. fdupes will analyze the contents and display a list of duplicate files it finds within that directory. You can also include the sub folders like this:

fdupes --recurse /path/to/directory

Example 2: Scanning Multiple Directories

To scan multiple directories simultaneously, specify each directory as an argument:

fdupes --recurse /path/to/directory1 /path/to/directory2 /path/to/directory3

By providing multiple directory paths, fdupes will search for duplicates across all specified directories.

Example 3: Deleting Duplicate Files

If you want to delete duplicate files directly, you can utilize the --delete option:

fdupes --recurse --delete /path/to/directory

This command will prompt you to select which duplicates to keep and which to delete. If you don't want to be prompted, use this command instead:

fdupes --recurse --delete --noprompt /path/to/directory

Or short:

fdupes -rdN /path/to/directory

Exercise caution when using this option, as deleted files cannot be easily recovered.

Except where otherwise noted, content on this site is licensed under a CC BY-SA 4.0 license CC BY SA