Commit graph

24498 commits

Author SHA1 Message Date
CandyAngel
ea60ce4066 Initial writeup of tips for repos with large file count 2015-06-17 08:28:14 +00:00
CandyAngel
d096cd5c40 Added a comment 2015-06-17 07:51:53 +00:00
graviton31415@3f1790ceea563b1bc76c318c7bfd6d4e817ecc5d
a558e91fc7 2015-06-17 07:12:26 +00:00
Joey Hess
4d7683b10c man page points to this page now, remove comment requesting it 2015-06-16 20:35:53 -04:00
Joey Hess
3dc82d0b78 add inpreferreddir 2015-06-16 20:33:52 -04:00
Joey Hess
648601fb99 reorg 2015-06-16 20:30:48 -04:00
Joey Hess
f20a8f88de prep release 2015-06-16 20:29:09 -04:00
Joey Hess
c217dbf2a5 reorder 2015-06-16 20:28:32 -04:00
Joey Hess
396049fc64 comment weeding 2015-06-16 20:26:43 -04:00
Joey Hess
b9d49b6831 clarify 2015-06-16 20:24:35 -04:00
Joey Hess
cc175a69ef Revert "box"
This reverts commit c36e1f2bdd.
2015-06-16 20:23:57 -04:00
Joey Hess
c36e1f2bdd box 2015-06-16 20:23:30 -04:00
Joey Hess
575e7a5271 fmt 2015-06-16 20:22:09 -04:00
Joey Hess
76f7c345b6 typography 2015-06-16 20:21:50 -04:00
Joey Hess
8296f90112 typography 2015-06-16 20:20:52 -04:00
Joey Hess
74f54b7593 fmt 2015-06-16 20:20:03 -04:00
Joey Hess
b8ccf28756 fmt 2015-06-16 20:19:31 -04:00
Joey Hess
b0c5ed47a4 rewrite so it's understandable without knowing about the related command-line options 2015-06-16 20:17:17 -04:00
Joey Hess
f77d485915 move comment 2015-06-16 19:07:14 -04:00
Joey Hess
be9d9cb5ad avoid building unused bloomfilter when run without --all 2015-06-16 19:04:20 -04:00
Joey Hess
5b801fcad9 on second thought, sync --content --unused is probably not useful, remove 2015-06-16 19:01:06 -04:00
Joey Hess
8b0549b408 Merge branch 'master' of ssh://git-annex.branchable.com 2015-06-16 18:56:20 -04:00
Joey Hess
2c77fb5cae devblog 2015-06-16 18:54:35 -04:00
Joey Hess
adba0595bd use bloom filter in second pass of sync --all --content
This is needed because when preferred content matches on files,
the second pass would otherwise want to drop all keys. Using a bloom filter
avoids this, and in the case of a false positive, a key will be left
undropped that preferred content would allow dropping. Chances of that
happening are a mere 1 in 1 million.
2015-06-16 18:50:13 -04:00
dev@c21308d8de79665e508a8f95f6f68ef82d56f698
0c69e6055d 2015-06-16 22:43:15 +00:00
Joey Hess
a0a8127956 instance Hashable Key for bloomfilter 2015-06-16 18:37:41 -04:00
Joey Hess
8b74aec3ea Increased the default annex.bloomaccuracy from 1000 to 10000000
This makes git annex unused use around 48 mb more memory than it did before,
but the massive increase in accuracy makes this worthwhile for all but the
smallest systems.

Also, I want to use the bloom filter for sync --all --content, to avoid
dropping files that the preferred content doesn't want, and 1/1000
false positives would be far too many in that use case, even if it were
acceptable for unused.

Actual memory use numbers:

1000: 21.06user 3.42system 0:26.40elapsed 92%CPU (0avgtext+0avgdata 501552maxresident)k
1000000: 21.41user 3.55system 0:26.84elapsed 93%CPU (0avgtext+0avgdata 549496maxresident)k
10000000: 21.84user 3.52system 0:27.89elapsed 90%CPU (0avgtext+0avgdata 549920maxresident)k

Based on these numbers, 10 million seemed a better pick than 1 million.
2015-06-16 18:12:00 -04:00
Joey Hess
f7350b7c33 wording 2015-06-16 17:32:41 -04:00
Joey Hess
8268f7951e adjust standard preferred content to work better with git annex sync --all --content
backup: Use new "anything" terminal. This means that content that
is not unused, but has no associated file will be wanted by backup repos.

unwanted: "not anything" will result in any and all content moving
off of these repos.

incremental backup: Remove the "(include=* or unused)",
so it matches content that has no associated files
but is not unused.

client: Add a include=* to the expression. This limits it to matching
only files in the work tree. Without this change, sync --all --content
would match a key against the expression, and since it matches
exclude=archive/*, the client repo would have wanted the file content.
The "and not unused" would have kept unused objects out, but not
objects that were not known to be unused, or objects that another branch
referred to. In practice, everything would have flooded into client repos
without this change.
2015-06-16 17:18:53 -04:00
Joey Hess
a4955542a3 Fix incremental backup standard preferred content expression to match its documentation, which says it does not want files that have reached a backup repository.
Checked history and these have been out of sync from the very beginning!
2015-06-16 17:10:10 -04:00
anarcat
da60a29e56 sign and split out 2015-06-16 21:06:14 +00:00
anarcat
c508c3472a first python implementation of this 2015-06-16 21:03:48 +00:00
Joey Hess
8c46ea22c2 Added new "anything" preferred content expression, which matches all versions of all files. 2015-06-16 17:03:34 -04:00
Joey Hess
29c03145e6 sync: Add support for --all and --unused. 2015-06-16 16:50:03 -04:00
anarcat
f5d84ac62e Added a comment 2015-06-16 20:10:50 +00:00
Joey Hess
58e6f033b9 update 2015-06-16 16:04:13 -04:00
Joey Hess
99a1113461 switch code to using associated files 2015-06-16 15:07:03 -04:00
Joey Hess
32adb5f0e0 actually.. 2015-06-16 14:03:13 -04:00
Joey Hess
fbc06b3d1f Merge branch 'master' of ssh://git-annex.branchable.com 2015-06-16 13:50:48 -04:00
Joey Hess
67f7f1b1cb info: Added json output for "backend usage", "numcopies stats", "repositories containing these files", and "transfers in progress". 2015-06-16 13:50:28 -04:00
eigengrau
5e9684436e Added a comment 2015-06-16 13:20:07 +00:00
https://id.koumbit.net/anarcat
bc87ed040e neat checksumming api at s3 that could be leveraged 2015-06-16 00:50:06 +00:00
anarcat
911054dbb8 Added a comment 2015-06-15 20:05:13 +00:00
anarcat
4f427c64f9 Added a comment 2015-06-15 20:02:56 +00:00
anarcat
003e979576 Added a comment 2015-06-15 19:48:46 +00:00
Joey Hess
c96b333869 clarify 2015-06-15 15:30:00 -04:00
Joey Hess
f62138b9c5 add basic progress 2015-06-15 15:27:17 -04:00
anarcat
009e961eca workaround: restarting the assistant 2015-06-15 19:16:18 +00:00
anarcat
5a8de788b0 Added a comment 2015-06-15 19:15:12 +00:00
Joey Hess
18a3d1b100 followup 2015-06-15 15:00:02 -04:00