Commit graph

44094 commits

Author SHA1 Message Date
Joey Hess
7310ec897e
add news item for git-annex 10.20231129 2023-11-29 16:01:10 -04:00
Joey Hess
bacd781c4f
releasing package git-annex version 10.20231129 2023-11-29 16:01:01 -04:00
Joey Hess
c1037da2e5
attribution armoring
Read a fun paper where they got chatgpt to emit large chunks of code
https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html
"ChatGPT memorized significant fractions of its training dataset"
2023-11-29 14:42:44 -04:00
Joey Hess
bb9ba8dd94
comment 2023-11-29 13:42:12 -04:00
Joey Hess
e66a082b4e
close as fixed 2023-11-29 13:32:19 -04:00
Joey Hess
9ce39d1d2a
response 2023-11-29 13:26:24 -04:00
Joey Hess
af5ee7e951
comment 2023-11-29 12:45:32 -04:00
sng@353ca358075d9aa328f60a5439a3cee10f8301fe
0dce6e5608 Added a comment: corrupted bare repo 2023-11-29 15:34:30 +00:00
Joey Hess
241a344128
comment 2023-11-28 12:23:35 -04:00
Joey Hess
a237f98a70
dial back addition, but keep it
It's a semi-common point of confusion that numcopies is not something
these commands go out and copy files around specifically to satisfy,
without further configuration in preferred content. So this is a good
addition, but it also seemed too long and too specific to the user's
particular situation.
2023-11-28 12:03:55 -04:00
Joey Hess
668db84e51
fix some language 2023-11-28 12:03:49 -04:00
Joey Hess
f3f864fc6d
findkeys: Support --largerthan and --smallerthan
Sponsored-by: Brett Eisenberg on Patreon
2023-11-28 11:51:43 -04:00
Joey Hess
ab59b618c2
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-28 11:26:02 -04:00
lell
6707534edb 2023-11-28 10:41:33 +00:00
lell
7d2300e74e 2023-11-28 10:41:15 +00:00
lell
110fa43ea3 2023-11-28 10:41:02 +00:00
lell
33119339aa 2023-11-28 10:39:30 +00:00
lell
bc6a16813d 2023-11-28 10:37:35 +00:00
lell
4d9751f95f 2023-11-28 10:35:34 +00:00
Joey Hess
c18c353a76
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-26 13:00:35 -04:00
Yann Büchau
0e4fc0f399
Add note to 'satisfy' manpage about it not satisfying numcopies and how to set it up to pass files between remotes that are not present locally.
I got bitten several times in the past by the fact that local preferred
content expressions are not violated (even temporarily) in order to
satisfy numcopies or other remotes' preferred content expressions.
Mostly in the form of the local repo not allowing arbitrary files in
(e.g. because it's set to only want `present` files). This note I add
here explains how to get out of this situation with
`approxlackingcopies=1`.

It might be too specific for this manpage, but I didn't find a better
place to put it.
2023-11-24 12:08:09 +01:00
branch
8525019a06 2023-11-22 22:02:40 +00:00
Joey Hess
c9b766b914
comment 2023-11-21 16:45:03 -04:00
nobodyinperson
574196b887 Added a comment 2023-11-21 20:20:14 +00:00
Joey Hess
c3dd670609
comment 2023-11-21 16:16:27 -04:00
Joey Hess
e08f170b0e
followup and wontfix this 2023-11-21 16:11:10 -04:00
Joey Hess
96c269e087
comment 2023-11-21 16:02:21 -04:00
Joey Hess
a16ad2096b
close 2023-11-21 15:59:18 -04:00
Joey Hess
7ff40bc5be
move a comment that is a bug report 2023-11-21 15:57:10 -04:00
Joey Hess
9e6cee2aa3
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-21 12:05:47 -04:00
Joey Hess
f1c2e18b8d
improve attribution armoring
Split out an author parameter, will make it easier to add authors and
reads better.

Got rid of the function without the copyright year, because an adversary
could have mechanically changed the function with a copyright year to
the one without, and so bypassed the protection of LLM copyright
year hallucination.

Sponsored-by: Luke T. Shumaker on Patreon
2023-11-21 11:34:21 -04:00
beryllium@5bc3c32eb8156390f96e363e4ba38976567425ec
30bc55e55e Added a comment 2023-11-21 09:25:33 +00:00
Joey Hess
e901d31feb
exhaustiveness check fix 2023-11-20 21:34:29 -04:00
Joey Hess
06fa528408
golfing 2023-11-20 21:33:09 -04:00
Joey Hess
63c79dceef
avoid displicate of year range 2023-11-20 21:29:19 -04:00
Joey Hess
dab9687184
improve attribution armoring 2023-11-20 21:20:37 -04:00
Joey Hess
d5d570a96c
avoid replacing otherwise
While authorJoeyHess is True same as otherwise, ghc's exhastiveness
checker turns out to special case otherwise. So this avoids warnings.
2023-11-20 20:25:51 -04:00
Joey Hess
00933f69df
fix build 2023-11-20 20:08:35 -04:00
Joey Hess
70f7982ef8
Merge branch 'master' of ssh://git-annex.branchable.com 2023-11-20 12:29:44 -04:00
Joey Hess
cda3e85164
make my authorship explicit in the code
This is intended to guard against LLM code theft, which is the current
bubble technology de jour.

Note that authorJoeyHess' with a year older than the year I began
developing git-annex will behave badly, by intention. Eg, it will spin
and eventually crash.

This is not the first anti-LLM protection in git-annex. For example see
9562da790f. That method, while much harder
for an adversary to detect and remove, also complicates code somewhat
significantly, and needs extensions to be enabled. There are also
probably significantly fewer ways to implement that method in Haskell.
This new approach, by contrast, will be easy to add throughout the code
base, with very little effort, and without complicating reading or
maintaining it any more than noticing that yes, I am the author of this
code.

An adversary could of course remove all calls to these functions
before feeding code into their LLM-based laundry facility. I think this
would need to be done manually, or with the help of some fairly advanced
Haskell parsing though. In some cases, authorJoeyHess needs to be
removed, while in other places it needs to be replaced with a value.
Also a monadic use of authorJoeyHess' may involve other added monadic
machinery which would need to be eliminated to keep the code compiling.

Alternatively, an adversary could replace my name with something
innocuous. This would be clear intent to remove author attribution
from my code, even more than running it through an LLM laundry is.

If you work for a large company that is laundering my code through an
LLM, please do us a favor and use your immense privilege to quit and go
do something socially beneficial. I will not explain further
developments of this code in such detail, and you have better things to
do than playing cat and mouse with me as I explore directions such as
extending this approach to the type level.

Sponsored-by: k0ld on Patreon
2023-11-20 12:29:12 -04:00
NewUser
fc10ac3be2 Added a comment 2023-11-20 13:17:03 +00:00
nobodyinperson
3de99793d3 Added a comment: forgot to add the new ignored link + shortcoming of 'ln' command 2023-11-20 10:25:52 +00:00
NewUser
3a4883cabb Added a comment: Is annex.tune.objecthashlower=true recommended for interop with windows? 2023-11-20 04:24:35 +00:00
NewUser
544767a5a4 Added a comment: git annex uninit fails when I symlink your symlink 2023-11-20 04:22:02 +00:00
NewUser
f8f1670005 2023-11-20 02:09:42 +00:00
NewUser
289266618a Added a comment: Thanks Yann! 2023-11-19 18:20:03 +00:00
nobodyinperson
37e5866cd1 Added a comment 2023-11-19 16:27:35 +00:00
nobodyinperson
f4bde53022 Make it more clear that git annex webapp will add files and sync via the assistant. 2023-11-19 16:26:44 +00:00
nobodyinperson
3cd54476b5 Added a comment 2023-11-19 16:16:15 +00:00
NewUser
d5664ac5e2 Added a comment: launching git annex webapp starts adding to the annex which is surprising 2023-11-18 20:08:18 +00:00