Commit graph

31776 commits

Author SHA1 Message Date
dev@49d24b8461d78227a0ef1e366862734efda0b362
45a013eda6 2018-04-07 15:33:17 +00:00
Joey Hess
61aa56465b
fix pattern match 2018-04-06 23:11:20 -04:00
Joey Hess
bd45129c27
always poll file
This is now only used when downloading an url, and polling is always
needed when using curl, no matter how the output is configured.
2018-04-06 23:09:19 -04:00
Joey Hess
c8f2d302dc
run curl when configured to do it at runtime, even if not available at build time 2018-04-06 21:17:36 -04:00
Joey Hess
db9524bfcc
update for wget removal 2018-04-06 21:15:31 -04:00
Joey Hess
76f352a9a9
Merge branch 'master' of ssh://git-annex.branchable.com 2018-04-06 17:38:52 -04:00
Joey Hess
388eb4135f
devblog 2018-04-06 17:38:30 -04:00
Joey Hess
c34152777b
Use http-conduit for url downloads by default, annex.web-options enables curl
* For url downloads, git-annex now defaults to using a http library,
  rather than wget or curl. But, if annex.web-options is set, it will
  use curl. To use the .netrc file, run:
    git config annex.web-options --netrc
* git-annex no longer uses wget (and wget is no longer shipped with
  git-annex builds).

Note that curl is always run in silent mode, since the new API for
download has a MeterUpdate and doesn't make way for curl progress
output. It might be worth writing a parser for curl's progress output
to update the meter when using it, but I didn't bother with this edge
case for now.

This commit was supported by the NSF-funded DataLad project.
2018-04-06 17:36:20 -04:00
Joey Hess
0791c24221
fix bad refactoring
Reponse BodyReader is not a conduit thing, so can't use the refactored
function here after all. Oops. Put it back how it was.
2018-04-06 16:59:14 -04:00
Joey Hess
36e6b8abbf
Fix resuming a download when using curl.
Noticed a bug; when using curl a workaround for its empty file behavior
overwrote the file content, so it never resumed and always started over.
2018-04-06 16:09:53 -04:00
Joey Hess
0f6775f1ff
refactor sinkResponseFile and add downloadC
Remote.S3 and Remote.Helper.Http both had similar code to sink a
http-conduit Response to a file; refactor out sinkResponseFile.

downloadC downloads an url to a file using http-conduit, and supports
resuming. Falls back to curl to handle urls that http-conduit does not
support. This is not used yet, but the goal is to replace download with
it.

git-annex.cabal: conduit-extra was not actually used for a long time,
remove the dep. conduit moves into the main dependency list, but since
http-conduit was already in there, and it depends on conduit, that's not
really adding a new build dep.

This commit was supported by the NSF-funded DataLad project.
2018-04-06 16:07:08 -04:00
Joey Hess
0e8564201e
link to recastex 2018-04-06 11:27:18 -04:00
Joey Hess
d2ad5ebe4a
gitlab shell's git-annex support was removed 2018-04-06 11:21:44 -04:00
svw
f96de42138 2018-04-06 06:51:36 +00:00
vrs+annex@ea5fa24dbb279be61a8e50adb638bf8366300717
d92ec5c1d6 Added a comment 2018-04-05 21:01:51 +00:00
Joey Hess
6cb5b7294f
info: Changed sorting of numcopies stats table, so it's ordered by the variance from the desired number of copies.
Compare these...

numcopies stats:
	numcopies -1: 1986
	numcopies +0: 1170
	numcopies -2: 769
	numcopies +1: 716
	numcopies -4: 696
	numcopies -3: 485
	numcopies -6: 230
	numcopies -5: 111
	numcopies -7: 91
	numcopies -9: 9

numcopies stats:
	numcopies +1: 716
	numcopies +0: 1170
	numcopies -1: 1986
	numcopies -2: 769
	numcopies -3: 485
	numcopies -4: 696
	numcopies -5: 111
	numcopies -6: 230
	numcopies -7: 91
	numcopies -9: 9

I feel that the former is a jumbled mess that doesn't tell much overall,
while the second shows pretty clearly that most files are within 1 degree
of the desired number of copies, with some outliers without enough.
2018-04-05 14:54:39 -04:00
Joey Hess
817ebb5765
info: Added "combined size of repositories containing these files" stat
when run on a directory

This commit was sponsored by andrea rota.
2018-04-05 14:44:58 -04:00
CandyAngel
dc6d1ad00f Added a comment 2018-04-05 15:45:52 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
1377e75970 Added a comment: I also have this problem 2018-04-05 15:25:22 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
6c43b6084f removed 2018-04-05 15:24:31 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
39bd152085 Added a comment 2018-04-05 15:23:42 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
e257e4b6a3 removed 2018-04-05 15:19:51 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
26b603418f removed 2018-04-05 15:18:14 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
8c989eca91 Added a comment: I also can't add remotes through the web assistant 2018-04-05 15:17:53 +00:00
jhnichol@cce81d2a480707652a3340ea2f24b3dc4b1f808c
5f5d64b4d2 Added a comment: I also can't add remotes through the web assistant 2018-04-05 15:17:44 +00:00
vrs+annex@ea5fa24dbb279be61a8e50adb638bf8366300717
896f42ea15 Added a comment 2018-04-05 02:11:41 +00:00
vrs+annex@ea5fa24dbb279be61a8e50adb638bf8366300717
d8af254515 Added a comment 2018-04-05 01:32:33 +00:00
Joey Hess
9b98d3f630
better HTTP connection reuse
Enable HTTP connection reuse across multiple files, when git-annex
uses http-conduit. Before, a new Manager was created each time
Utility.Url used it. Now, a single Manager gets created the first time,
so connections are reused.

Doesn't help when external programs are used for url download,
but does speed up addurl --fast, fsck --from web, etc.

Testing fsck --fast --from web with 3 files, over high-latency
satellite internet, it sped up from 19.37s to 14.96s.

This commit was supported by the NSF-funded DataLad project.
2018-04-04 15:39:40 -04:00
Joey Hess
2ec07bc29f
Avoid running annex.http-headers-command more than once. 2018-04-04 15:15:08 -04:00
Joey Hess
98cc34c211
Merge branch 'master' of ssh://git-annex.branchable.com 2018-04-04 14:33:03 -04:00
Joey Hess
0783352fae
todo 2018-04-04 14:32:32 -04:00
Joey Hess
72014550f8
comment 2018-04-04 14:19:22 -04:00
Joey Hess
c769d359a7
response 2018-04-04 14:09:30 -04:00
Joey Hess
3ce13f7a2a
thoughts 2018-04-04 14:00:55 -04:00
Joey Hess
ef389722ae
don't copy old date metadata when adding new version of a file
When adding a new version of a file, and annex.genmetadata is enabled,
don't copy the data metadata from the old version of the file, instead use
the mtime of the file. Rationalle being that the user has requested to
generate metadata and so would expect to get the new mtime into metadata.

Also, avoid warning about copying metadata when all the old metadata is
date metadata. Which was rather the harder part.

This commit was sponsored by Boyd Stephen Smith Jr. on Patreon.
2018-04-04 13:58:16 -04:00
Joey Hess
c6252018fa
thoughrs 2018-04-04 12:50:09 -04:00
Joey Hess
c14638886f
response 2018-04-04 12:37:43 -04:00
achilleas.k@14be77d42a1252fab5ec9dbf4e5ea03c5833e8c8
4b522713f0 Added a comment 2018-04-04 16:27:54 +00:00
Joey Hess
b63c42bd9e
response 2018-04-04 12:24:32 -04:00
Joey Hess
2b55c90da7
remove spam 2018-04-04 12:21:04 -04:00
Joey Hess
9a42c32a5a
response 2018-04-04 12:20:44 -04:00
Joey Hess
4ee317b531
kudos 2018-04-04 12:17:42 -04:00
Joey Hess
d6f3034761
Merge branch 'master' of ssh://git-annex.branchable.com 2018-04-04 12:08:33 -04:00
Joey Hess
f856b15f27
response 2018-04-04 12:08:10 -04:00
Joey Hess
300e277c6a
response 2018-04-04 12:00:50 -04:00
Joey Hess
7b967c3800
close 2018-04-04 11:59:15 -04:00
Joey Hess
18c3d9ea55
response 2018-04-04 11:57:56 -04:00
CandyAngel
97117d6b49 fix plan list and add sorting annex diagram/explanation 2018-04-04 08:53:37 +00:00
Joey Hess
28c7e2cf61
document the -A shorthand for --all 2018-04-03 23:12:04 -04:00
Joey Hess
9f33225ad0
note on efficiency 2018-04-03 22:58:34 -04:00