Commit graph

44054 commits

Author SHA1 Message Date
Joey Hess
1d020df896
git-annex branch size when storing migration information
Sponsored-by: Jack Hill on Patreon
2023-12-01 13:09:52 -04:00
Joey Hess
d37219e3e5
comment 2023-11-30 17:07:17 -04:00
Joey Hess
3e8618fed3
comment 2023-11-30 16:49:48 -04:00
Joey Hess
093110d997
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-30 16:36:50 -04:00
Joey Hess
1e31bf8122
copy/move --from-anywhere --to remote
Implementation was simple because it's equivilant to
--from=foo --to remote for each other remote, followed by
--to remote when there's a local copy.

(Or, in the edge case of --from-anywhere --to=here,
it's the same as --to=here.)

Note that, when the local repo does not have a copy,
fromToPerform gets it from a remote, sends it to the destination,
and drops the local copy. Another call to that for a second remote
will notice that the dest now has a copy, and simply drop from the
second remote, avoiding a second transfer.

Also note that, when numcopies doesn't allow dropping it from
everywhere, it will drop it from the cheapest remotes first
(maybe not ideal) up to more expensive remotes, and finally from the local
repo. So the local repo will generally end up holding a copy. Maybe not
ideal in all cases either, but it seems no worse to do that than to end up
with a copy undropped from a remote.

And I'm not entirely happy with the output, eg:

	copy bigfile (from r3...) ok
	copy bigfile ok

That makes sense if you think of the second line as being
the same as what is output by `git-annex copy bigfile --to bar`,
but it's less clear in this context. Maybe add "(from here...)"?
Also the --json output doesn't have a machine-readable field for
the "from" uuid, and maybe it should?

Sponsored-by: Dartmouth College's DANDI project
2023-11-30 16:34:30 -04:00
Joey Hess
1654572bc1
fix --from overriding annex-ignore
Make git-annex get/copy/move --from foo override configuration of
remote.foo.annex-ignore, as documented.

This already worked for remotes supporting hasKeyCheap. For others though,
git-annex copy --from foo would silently not do anything, while
git-annex copy --to foo would use the annex-ignored remote.

Also improved the annex-ignore docs, to reflect that `git-annex get`
without --from will skip using annex-ignored remotes, for example.

Sponsored-by: Dartmouth College's DANDI project
2023-11-30 15:12:07 -04:00
nobodyinperson
1bb1a66255 Added a comment 2023-11-30 06:51:54 +00:00
unqueued
f89def74e4 2023-11-30 05:06:06 +00:00
unqueued
c108fe91bb 2023-11-30 04:52:51 +00:00
Joey Hess
060259b750
add manual: true to ParallelBuild flag
hackage demands that -j be gated behind a build flag with manual: true
set.
2023-11-29 16:05:03 -04:00
Joey Hess
7310ec897e
add news item for git-annex 10.20231129 2023-11-29 16:01:10 -04:00
Joey Hess
bacd781c4f
releasing package git-annex version 10.20231129 2023-11-29 16:01:01 -04:00
Joey Hess
c1037da2e5
attribution armoring
Read a fun paper where they got chatgpt to emit large chunks of code
https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html
"ChatGPT memorized significant fractions of its training dataset"
2023-11-29 14:42:44 -04:00
Joey Hess
bb9ba8dd94
comment 2023-11-29 13:42:12 -04:00
Joey Hess
e66a082b4e
close as fixed 2023-11-29 13:32:19 -04:00
Joey Hess
9ce39d1d2a
response 2023-11-29 13:26:24 -04:00
Joey Hess
af5ee7e951
comment 2023-11-29 12:45:32 -04:00
sng@353ca358075d9aa328f60a5439a3cee10f8301fe
0dce6e5608 Added a comment: corrupted bare repo 2023-11-29 15:34:30 +00:00
Joey Hess
241a344128
comment 2023-11-28 12:23:35 -04:00
Joey Hess
a237f98a70
dial back addition, but keep it
It's a semi-common point of confusion that numcopies is not something
these commands go out and copy files around specifically to satisfy,
without further configuration in preferred content. So this is a good
addition, but it also seemed too long and too specific to the user's
particular situation.
2023-11-28 12:03:55 -04:00
Joey Hess
668db84e51
fix some language 2023-11-28 12:03:49 -04:00
Joey Hess
f3f864fc6d
findkeys: Support --largerthan and --smallerthan
Sponsored-by: Brett Eisenberg on Patreon
2023-11-28 11:51:43 -04:00
Joey Hess
ab59b618c2
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-28 11:26:02 -04:00
lell
6707534edb 2023-11-28 10:41:33 +00:00
lell
7d2300e74e 2023-11-28 10:41:15 +00:00
lell
110fa43ea3 2023-11-28 10:41:02 +00:00
lell
33119339aa 2023-11-28 10:39:30 +00:00
lell
bc6a16813d 2023-11-28 10:37:35 +00:00
lell
4d9751f95f 2023-11-28 10:35:34 +00:00
Joey Hess
c18c353a76
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-26 13:00:35 -04:00
Yann Büchau
0e4fc0f399
Add note to 'satisfy' manpage about it not satisfying numcopies and how to set it up to pass files between remotes that are not present locally.
I got bitten several times in the past by the fact that local preferred
content expressions are not violated (even temporarily) in order to
satisfy numcopies or other remotes' preferred content expressions.
Mostly in the form of the local repo not allowing arbitrary files in
(e.g. because it's set to only want `present` files). This note I add
here explains how to get out of this situation with
`approxlackingcopies=1`.

It might be too specific for this manpage, but I didn't find a better
place to put it.
2023-11-24 12:08:09 +01:00
branch
8525019a06 2023-11-22 22:02:40 +00:00
Joey Hess
c9b766b914
comment 2023-11-21 16:45:03 -04:00
nobodyinperson
574196b887 Added a comment 2023-11-21 20:20:14 +00:00
Joey Hess
c3dd670609
comment 2023-11-21 16:16:27 -04:00
Joey Hess
e08f170b0e
followup and wontfix this 2023-11-21 16:11:10 -04:00
Joey Hess
96c269e087
comment 2023-11-21 16:02:21 -04:00
Joey Hess
a16ad2096b
close 2023-11-21 15:59:18 -04:00
Joey Hess
7ff40bc5be
move a comment that is a bug report 2023-11-21 15:57:10 -04:00
Joey Hess
9e6cee2aa3
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-21 12:05:47 -04:00
Joey Hess
f1c2e18b8d
improve attribution armoring
Split out an author parameter, will make it easier to add authors and
reads better.

Got rid of the function without the copyright year, because an adversary
could have mechanically changed the function with a copyright year to
the one without, and so bypassed the protection of LLM copyright
year hallucination.

Sponsored-by: Luke T. Shumaker on Patreon
2023-11-21 11:34:21 -04:00
beryllium@5bc3c32eb8156390f96e363e4ba38976567425ec
30bc55e55e Added a comment 2023-11-21 09:25:33 +00:00
Joey Hess
e901d31feb
exhaustiveness check fix 2023-11-20 21:34:29 -04:00
Joey Hess
06fa528408
golfing 2023-11-20 21:33:09 -04:00
Joey Hess
63c79dceef
avoid displicate of year range 2023-11-20 21:29:19 -04:00
Joey Hess
dab9687184
improve attribution armoring 2023-11-20 21:20:37 -04:00
Joey Hess
d5d570a96c
avoid replacing otherwise
While authorJoeyHess is True same as otherwise, ghc's exhastiveness
checker turns out to special case otherwise. So this avoids warnings.
2023-11-20 20:25:51 -04:00
Joey Hess
00933f69df
fix build 2023-11-20 20:08:35 -04:00
Joey Hess
70f7982ef8
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-20 12:29:44 -04:00
Joey Hess
cda3e85164
make my authorship explicit in the code
This is intended to guard against LLM code theft, which is the current
bubble technology de jour.

Note that authorJoeyHess' with a year older than the year I began
developing git-annex will behave badly, by intention. Eg, it will spin
and eventually crash.

This is not the first anti-LLM protection in git-annex. For example see
9562da790f. That method, while much harder
for an adversary to detect and remove, also complicates code somewhat
significantly, and needs extensions to be enabled. There are also
probably significantly fewer ways to implement that method in Haskell.
This new approach, by contrast, will be easy to add throughout the code
base, with very little effort, and without complicating reading or
maintaining it any more than noticing that yes, I am the author of this
code.

An adversary could of course remove all calls to these functions
before feeding code into their LLM-based laundry facility. I think this
would need to be done manually, or with the help of some fairly advanced
Haskell parsing though. In some cases, authorJoeyHess needs to be
removed, while in other places it needs to be replaced with a value.
Also a monadic use of authorJoeyHess' may involve other added monadic
machinery which would need to be eliminated to keep the code compiling.

Alternatively, an adversary could replace my name with something
innocuous. This would be clear intent to remove author attribution
from my code, even more than running it through an LLM laundry is.

If you work for a large company that is laundering my code through an
LLM, please do us a favor and use your immense privilege to quit and go
do something socially beneficial. I will not explain further
developments of this code in such detail, and you have better things to
do than playing cat and mouse with me as I explore directions such as
extending this approach to the type level.

Sponsored-by: k0ld on Patreon
2023-11-20 12:29:12 -04:00