Merge branch 'master' of ssh://git-annex.branchable.com

This commit is contained in:
Joey Hess 2015-04-16 11:30:53 -04:00
commit 4836979518
6 changed files with 102 additions and 0 deletions

View file

@ -0,0 +1,24 @@
### Please describe the problem.
I think this is what happened; I need to go back and check this again (maybe I was just misreading something) but I want to get it written down first.
I've got a git repository, and I just ran git annex init. Then I ran git annex addurl a bunch of times, followed by git annex sync. The result was apparently a repository where the files downloaded by addurl were added using the SHA256 backend rather than the URL backend. I deleted the branches and tried again, but this time after calling git annex addurl a bunch of times I did a normal git commit. This time everything looked fine; the files were all listed in as present in the web remote.
### What steps will reproduce the problem?
[[!format sh """
git annex init
git annex addurl "https://archive.org/download/emularity_engine_jsmess/messnapple2e.js.gz" --file "messnapple2e.js.gz"
git annex sync
"""]]
### What version of git-annex are you using? On what operating system?
git-annex version: 5.20150412-g2be4834
build flags: Assistant Webapp Webapp-secure Pairing Testsuite S3 WebDAV Inotify DBus DesktopNotify XMPP DNS Feeds Quvi TDFA TorrentParser
key/value backends: SHA256E SHA1E SHA512E SHA224E SHA384E SKEIN256E SKEIN512E MD5E SHA256 SHA1 SHA512 SHA224 SHA384 SKEIN256 SKEIN512 MD5 WORM URL
remote types: git gcrypt S3 bup directory rsync web bittorrent webdav tahoe glacier ddar hook external
local repository version: 5
supported repository version: 5
upgrade supported from repository versions: 0 1 2 4
This is on Linux.

View file

@ -0,0 +1,2 @@
Files only present in remotes show up as broken symlinks. That's great for knowing what files exist, but sometimes I just want to browse the files that are actually present. In this case, the many broken symlinks are just clutter.
Is there a straightforward way to switch to a view that shows only locally present files?

View file

@ -0,0 +1,9 @@
<video src=http://s.natalian.org/2015-04-15/git-annex-issues.mp4></video>
I removed some archived directories perhaps foolishly with `rm -rf`. How do I find the files that I've had deleted?
I also have an issue where by I want one command to sync between two hardrives and [github](https://github.com/kaihendry/uploadme). Or do I have to: `git-annex move --to {foo,bar}; git-annex drop; git-annex sync`? Basically I want copies everywhere except on my laptop (X1C3).
I also expected my git dir to be much smaller than 1.4GB after dropping everything. Thanks!

View file

@ -0,0 +1,37 @@
[[!comment format=mdwn
username="https://www.google.com/accounts/o8/id?id=AItOawlXt6nnNs-3uw61EGYtxr_AVhJqXybwLR8"
nickname="Bruno"
subject="comment 2"
date="2015-04-15T17:51:18Z"
content="""
@CandyAngel Thank you for your **git annex find** tips. But for git gc, it seem not working fine :)
After i have executed the **git gc**, the **git annex info** return the result after **1h 45m**
% time git annex info
repository mode: indirect
trusted repositories: 0
semitrusted repositories: 5
00000000-0000-0000-0000-000000000001 -- web
00000000-0000-0000-0000-000000000002 -- bittorrent
181d4dae-2131-435e-9c00-b8c7f1bfc332 -- [sbackup]
2db1f8e7-0b29-4d61-8875-a4a4a42a79dd -- [dellcomputer]
703df355-73a6-4487-97fd-a3a5d6ae034e -- usbhomebackup [here]
untrusted repositories: 0
transfers in progress: none
available local disk space: 135.24 gigabytes (+1 megabyte reserved)
local annex keys: 275416
local annex size: 780.55 gigabytes
annexed files in working tree: 265888
size of annexed files in working tree: 751.49 gigabytes
bloom filter size: 16 mebibytes (55.1% full)
backend usage:
SHA256E: 541304
git annex info 83,95s user 50,68s system 2% **cpu 1:45:01,70 total**
Can you explain exactly the git gc or git repack parameters that you use for optimizing git annex performance ?
Thanks
"""]]

View file

@ -0,0 +1,16 @@
[[!comment format=mdwn
username="CandyAngel"
subject="comment 3"
date="2015-04-16T07:41:07Z"
content="""
*git annex info* has check every file (not sure if it traverses *.git/annex/objects* specifically or not) to get \"local annex\" information. You can improve its performance by improving directory traversal in general (different filesystem or [changing the hashing method so it isn't Xx/Yy/KEY/FILE](https://github.com/datalad/datalad/issues/32)).
The repack/gc speeds up operations for the git side of things, like syncing (pull/push), cloning and committing.
Here's what I used:
git repack -ad
git gc
This took git actions down from 1 hour+ to ~10 minutes (for a repo with 5.6 million objects).
"""]]

View file

@ -0,0 +1,14 @@
[[!comment format=mdwn
username="https://www.google.com/accounts/o8/id?id=AItOawlXt6nnNs-3uw61EGYtxr_AVhJqXybwLR8"
nickname="Bruno"
subject="comment 4"
date="2015-04-16T11:47:50Z"
content="""
Thanks @CandyAngle,
Effectively, your tips for reduce a time for some git-annex commands if works fine, i will see in the long term if that is work perfectly
ex:, now **git annex sync** it work in **45s** ! :)
Thanks
"""]]