Flexense Data Management Software

Duplicate Files Search Performance

The DiskBoss duplicate files finder is optimized for multi-core and multi-CPU servers and can scan directories and search for duplicate files using multiple parallel processing threads. A set of advanced performance options lets power users tune duplicate files search operations for their specific hardware and software configurations.

Duplicate Files Search Performance Options

To configure a duplicate files search operation to use multiple parallel processing threads, open the duplicate files search options dialog, select the 'Advanced' tab, and specify the maximum number of parallel directory scanning threads and the number of parallel duplicate files search threads.
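To illustrate what a configurable number of search threads means in practice, here is a minimal Python sketch of a parallel duplicate search. It is not DiskBoss's implementation, which this page does not describe. The function names, the size-then-SHA-256 strategy, and the `search_threads` parameter are assumptions chosen for illustration, and the directory scan in this sketch runs on a single thread.

```python
import hashlib
import os
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root, search_threads=8):
    """Return groups of paths under root whose contents are identical."""
    # Pass 1 (single-threaded here): group files by size, because files
    # of different sizes cannot be duplicates.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    candidates = [p for paths in by_size.values() if len(paths) > 1 for p in paths]

    # Pass 2: hash the remaining candidates on a pool of worker threads.
    by_digest = defaultdict(list)
    with ThreadPoolExecutor(max_workers=search_threads) as pool:
        for path, digest in zip(candidates, pool.map(file_digest, candidates)):
            by_digest[digest].append(path)

    return [sorted(paths) for paths in by_digest.values() if len(paths) > 1]
```

Calling `find_duplicates("/some/dir", search_threads=4)` returns a list of groups, each a sorted list of paths with identical content. Threads help here because most of the time is spent waiting on file reads rather than on computation.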

Duplicate Files Search Performance SSD Disks

For example, when searching for duplicate files located on SSD disks, a duplicate files search operation reaches up to 3,100 Files/Sec with a single processing thread and scales to 14,000 Files/Sec with 8 parallel processing threads, a speedup of roughly 4.5x.
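The scaling behind these two figures can be checked with a few lines of Python. The `scaling` helper is a hypothetical name used only for this illustration. It computes the speedup over a single thread and the parallel efficiency, which is the speedup divided by the thread count.

```python
def scaling(single_thread_rate, parallel_rate, threads):
    """Return (speedup, efficiency) of a parallel run versus one thread."""
    speedup = parallel_rate / single_thread_rate
    efficiency = speedup / threads
    return speedup, efficiency


# SSD figures quoted above: 3,100 Files/Sec on 1 thread, 14,000 on 8 threads.
speedup, efficiency = scaling(3100, 14000, 8)
print(f"speedup: {speedup:.2f}x, efficiency: {efficiency:.0%}")
# prints "speedup: 4.52x, efficiency: 56%"
```

An efficiency below 100% is expected: the threads share the same disk and memory bandwidth, so each added thread contributes less than the first one.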

Duplicate Files Search Show User Names

To categorize and filter duplicate files by user name, and to display the amount of duplicate disk space and the number of duplicate files per user, enable the option to process and display duplicate files user names. This option significantly reduces the performance of the duplicate files search operation. To mitigate the performance degradation, configure the duplicate files search command to use 4-8 parallel processing threads.

Duplicate Files Search Performance SSD Disks With User Names

For example, when searching for duplicate files located on SSD disks with the option to process and display user names enabled, a duplicate files search operation reaches up to 2,400 Files/Sec with a single processing thread and scales to 10,800 Files/Sec with 8 parallel processing threads, a 4.5x speedup.
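The cost of the user names option on SSD disks can be estimated by comparing these rates with the SSD rates measured without the option (3,100 and 14,000 Files/Sec). The short Python sketch below does that arithmetic. `throughput_drop` is a hypothetical helper name, and the result is only as precise as the "up to" figures it is based on.

```python
def throughput_drop(baseline_rate, reduced_rate):
    """Return the fractional loss in throughput relative to the baseline."""
    return (baseline_rate - reduced_rate) / baseline_rate


# (threads, rate without user names, rate with user names), in Files/Sec.
SSD_RATES = [(1, 3100, 2400), (8, 14000, 10800)]

for threads, without_names, with_names in SSD_RATES:
    drop = throughput_drop(without_names, with_names)
    print(f"{threads} thread(s): {drop:.1%} slower with user names")
```

This prints a drop of 22.6% for one thread and 22.9% for eight threads, so on SSD disks the relative cost of the option is about the same at either thread count.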

Searching Duplicate Files Over the Network

Searching for duplicate files over the network is a demanding operation. To be effective, it requires a high-speed, low-latency network, and the duplicate files search command should be configured to use multiple parallel directory scanning and processing threads. For example, when searching for duplicate files over the network, a duplicate files search operation reaches up to 320 Files/Sec with a single processing thread and scales to 1,144 Files/Sec with 8 parallel processing threads, a speedup of roughly 3.6x.

Duplicate Files Search Performance Network

Searching for duplicate files over the network with the option to process and display user names enabled is a time-consuming operation, because querying file user names from remote servers takes a significant amount of time. To mitigate the performance degradation, configure the duplicate files search command to use at least 4-8 parallel processing threads, so that user names are queried for multiple files simultaneously.

Duplicate Files Search Performance Network With User Names

For example, when searching for duplicate files over the network with the option to process and display duplicate files user names enabled, a duplicate files search operation reaches up to 110 Files/Sec with a single processing thread and scales to 592 Files/Sec with 8 parallel processing threads, a speedup of roughly 5.4x.
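To put the network rates in practical terms, the following Python sketch converts them into best-case scan times for a hypothetical share of one million files. The file count and the `estimated_duration` helper are assumptions made for illustration. Because the rates are quoted as "up to" values, the resulting times are optimistic lower bounds and not predictions.

```python
from datetime import timedelta

# "Up to" network rates quoted above, in Files/Sec, keyed by
# (user names enabled, processing threads).
NETWORK_RATES = {
    (False, 1): 320,
    (False, 8): 1144,
    (True, 1): 110,
    (True, 8): 592,
}


def estimated_duration(file_count, files_per_sec):
    """Best-case scan time for file_count files at a constant rate."""
    return timedelta(seconds=round(file_count / files_per_sec))


for (user_names, threads), rate in NETWORK_RATES.items():
    label = "with user names" if user_names else "without user names"
    duration = estimated_duration(1_000_000, rate)
    print(f"{threads} thread(s), {label}: {duration}")
```

At these rates, the single-threaded search with user names takes several times longer than any other configuration, which is why this is the case where adding threads matters most.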