Major Geeks, providing free tested downloads.
advertisement


  Drive Cleaners | Dupseek 1.3
Author: Antonio Bellezza
Date: 2005-12-05
Size: 14 Kb
License: Freeware
OS: OSX 10.3+
Downloaded: 1761 Times
Download Locations:
Author's Site Author's Site
Download from USA MajorGeeks TX - |USA|
Download from USA MajorGeeks TX - |USA|
Download from USA MajorGeeks FL - |USA|
Download from Internode Internode - |Australia|
Australia Planet Mirror - |Australia|

>> Report A Bad Link<<
Rating: N/A (0 votes)

CompatDB.org Project:
Submit Results

Check Results



A command-line interactive perl program to find and remove duplicate files. A few strategies are possible for finding duplicate files in a big set, such as a heavily populated directory. One of the most widely used consists of grouping files by size (because files of different size can't be identical) and then computing a short digital fingerprint (such as a md5 checksum) for the files. Files with a different fingerprint are different, and files with the same digital fingerprint are very probably the same. Just to be sure, one can further check possible duplicates.

Dupseek does something different:
-It starts by grouping files by size.
-Then it starts reading small chunks of the files of the same size and comparing them. It creates smaller groups depending on these comparisons.
-It goes on with bigger and bigger chunks (of size up to a hard-coded limit).
-It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates).

This algorithm is much more efficient than competitors when dealing with large files of the same size. When files differ, reading usually stops after very few reads.
Partial execution

Dupseek (and destroy) can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis. Since subsequent reads happen sparsely in the file, if some files are still in the same group after many iterations, they are most probably identical, unless the differences are very small.
Platforms

Dupseek was reported to run on the following platforms:
-Debian GNU/Linux "Woody" and "Sarge"
-Mac OS X v10.2.6
-Freebsd 4.7

Dupseek was developed with perl 5.6.1 and was also tested with perl 5.8.4. It relies on the following modules:
-File::Find directory recursion;
-IO::File object-oriented file handles;
-Getopt::Std option parsing
 




  Support Forum
  NEW Geek-Wear
  Free Magazines
  Geek Shopping
  Geektionary
  About
  Links
  Folding@Home Team
  FAQ

advertisement
    
< Copyright & Disclaimer | Privacy Statement >
There are currently 74 Mac Geeks online
Copyright ©2000-2005 MajorGeeks.Com
Powered by Esselbach Storyteller CMS System Version 1.8