![Joey Hess](/assets/img/avatar_default.png)
* unannex, uninit: Avoid committing after every file is unannexed, for massive speedup. * --notify-finish switch will cause desktop notifications after each file upload/download/drop completes (using the dbus Desktop Notifications Specification) * --notify-start switch will show desktop notifications when each file upload/download starts. * webapp: Automatically install Nautilus integration scripts to get and drop files. * tahoe: Pass -d parameter before subcommand; putting it after the subcommand no longer works with tahoe-lafs version 1.10. (Thanks, Alberto Berti) * forget --drop-dead: Avoid removing the dead remote from the trust.log, so that if git remotes for it still exist anywhere, git annex info will still know it's dead and not show it. * git-annex-shell: Make configlist automatically initialize a remote git repository, as long as a git-annex branch has been pushed to it, to simplify setup of remote git repositories, including via gitolite. * add --include-dotfiles: New option, perhaps useful for backups. * Version 5.20140227 broke creation of glacier repositories, not including the datacenter and vault in their configuration. This bug is fixed, but glacier repositories set up with the broken version of git-annex need to have the datacenter and vault set in order to be usable. This can be done using git annex enableremote to add the missing settings. For details, see http://git-annex.branchable.com/bugs/problems_with_glacier/ * Added required content configuration. * assistant: Improve ssh authorized keys line generated in local pairing or for a remote ssh server to set environment variables in an alternative way that works with the non-POSIX fish shell, as well as POSIX shells. # imported from the archive
21 lines
797 B
Markdown
21 lines
797 B
Markdown
Maybe you had a lot of files scattered around on different drives, and you
|
|
added them all into a single git-annex repository. Some of the files are
|
|
surely duplicates of others.
|
|
|
|
While git-annex stores the file contents efficiently, it would still
|
|
help in cleaning up this mess if you could find, and perhaps remove
|
|
the duplicate files.
|
|
|
|
Here's a command line that will show duplicate sets of files grouped together:
|
|
|
|
git annex find --include '*' --format='${file} ${escaped_key}\n' | \
|
|
sort -k2 | uniq --all-repeated=separate -f1 | \
|
|
sed 's/ [^ ]*$//'
|
|
|
|
Here's a command line that will remove one of each duplicate set of files:
|
|
|
|
git annex find --include '*' --format='${file} ${escaped_key}\n' | \
|
|
sort -k2 | uniq --repeated -f1 | sed 's/ [^ ]*$//' | \
|
|
xargs -d '\n' git rm
|
|
|
|
--[[Joey]]
|