git-annex

Author	SHA1	Message	Date
lykos@d125a37d89b1cfac20829f12911656c40cb70018	bc451b6aa8		2024-06-27 10:47:43 +00:00
Joey Hess	effaf51b1f	avoid loop between cluster gateways The VIA extension is still needed to avoid some extra work and ugly messages, but this is enough that it actually works. This filters out the RemoteSides that are a proxied connection via a remote gateway to the cluster. The VIA extension will not filter those out, but will send VIA to them on connect, which will cause the ones that are accessed via the listed gateways to be filtered out.	2024-06-26 15:29:59 -04:00
Joey Hess	4172109c8d	support multi-gateway clusters VIA extension still needed otherwise a copy to a cluster can loop forever.	2024-06-26 15:07:03 -04:00
Joey Hess	8b6708e745	update for multi-gateway clusters	2024-06-26 14:40:25 -04:00
Joey Hess	07e899c9d3	git-annex-shell: proxy nodes located beyond remote cluster gateways Walking a tightrope between security and convenience here, because git-annex-shell needs to only proxy for things when there has been an explicit, local action to configure them. In this case, the user has to have run `git-annex extendcluster`, which now sets annex-cluster-gateway on the remote. Note that any repositories that the gateway is recorded to proxy for will be proxied onward. This is not limited to cluster nodes, because checking the node log would not add any security; someone could add any uuid to it. The gateway of course then does its own checking to determine if it will allow proxying for the remote.	2024-06-26 12:56:16 -04:00
Joey Hess	1ec2fecf3f	set up proxies for cluster nodes that are themselves proxied via a remote When there are multiple gateways to a cluster, this sets up proxying for nodes that are accessed via a remote gateway. Eg, when running in nyc and amsterdam is the remote gateway, and it has node1 and node2, this sets up proxying for amsterdam-node1 and amsterdam-node2. A client that has nyc as a remote will see proxied remotes nyc-amsterdam-node1 and nyc-amsterdam-node2.	2024-06-26 11:24:55 -04:00
Joey Hess	02bf3ddc3f	updatecluster: support multiple gateways Just look at the existing proxied remotes that correspond to already existing nodes of the cluster, and keep those nodes in the cluster. While adding any remotes of the local repo that are configured as cluster nodes. This allows removing cluster nodes from the local repo and updating, without it also removing nodes provided by other gateways.	2024-06-26 10:51:14 -04:00
Joey Hess	0b72b85df5	added git-annex extendcluster This works, but updatecluster does not work yet in multi-gateway clusters, nor do gateways relay to other gateways.	2024-06-26 10:26:54 -04:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	f9ce7a452c	Added a comment	2024-06-26 10:20:29 +00:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	6e6811c72f	Do checkpresentkey with --debug set	2024-06-26 10:11:58 +00:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	b1e36c5ddf		2024-06-26 08:06:37 +00:00
Joey Hess	798d6f6a46	todo	2024-06-25 17:58:45 -04:00
Joey Hess	e3dd29409b	improve docs	2024-06-25 17:50:22 -04:00
Joey Hess	0a1001dbfb	update	2024-06-25 17:26:26 -04:00
Joey Hess	9a8dcb58cd	design for distributed clusters	2024-06-25 17:20:49 -04:00
Joey Hess	b9889917a3	thoughts on cycles Rejected the idea of automatically instantiating remotes for proxies-of-proxies. That needs cycle protection, while the current behavior, which happened for free, is that running git-annex updateproxy on the proxy can be used to configure it, but only for topologies that actually exist.	2024-06-25 15:32:11 -04:00
Joey Hess	cec2848e8a	support annex.jobs for clusters	2024-06-25 14:54:20 -04:00
Joey Hess	5ede109ae5	gave up on upload fanout to cluster's proxy The problem with that idea is that the cluster's proxy is necessarily a remote, and necessarily one that we'll want to sync with, since the git repository is stored there. So when its preferred content wants a file, and the cluster does too, the file will get uploaded to it as well as to the cluster. With fanout, the upload to the cluster will populate the proxy as well, avoiding a second upload. But only if the file is sent to the cluster first. If it's sent to the proxy first, there will be two uploads. Another, lesser problem is that a repository can proxy for more than one cluster. So when does it make sense to drop content from the repository? It could be done when dropping from one cluster, but what of the other one? This complication was not necessary anyway. Instead, if it's desirable to have some content accessed from close to the proxy, one of the cluster nodes can just be put on the same filesystem as it. That will be just as fast as storing the content on the proxy.	2024-06-25 13:35:12 -04:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	bbdfe6b910		2024-06-25 15:59:36 +00:00
Joey Hess	1bfe7f8a53	honor preferred content settings of cluster nodes Except when no nodes want a file, it has to be stored somewhere, so store it on all. Which is not really desirable, but neither is having to pick one. ProtoAssociatedFile deserialization is rather broken, and this could possibly affect preferred content expressions that match on filenames. The inability to roundtrip whitespace like tabs and newlines through is not a problem because preferred content expressions can't be written that match on whitespace such as a tab. For example: joey@darkstar:~/tmp/bench/z>git-annex wanted origin-node2 'exclude=CTRL-VTab' wanted origin-node2 git-annex: Parse error: Parse failure: near "*" But, the filtering of control characters could perhaps be a problem. I think that filtering is now obsolete, git-annex has comprehensive filtering of control characters when displaying filenames, that happens at a higher level. However, I don't want to risk a security hole so am leaving in that filtering in ProtoAssociatedFile deserialization for now.	2024-06-25 11:43:09 -04:00
Joey Hess	202ea3ff2a	don't sync with cluster nodes by default Avoid `git-annex sync --content` etc from operating on cluster nodes by default since syncing with a cluster implicitly syncs with its nodes. This avoids a lot of unncessary work when a cluster has a lot of nodes just in checking if each node's preferred content is satisfied. And it avoids content being sent to nodes individually, so instead syncing with clusters always fanout uploads to nodes. The downside is that there are situations where a cluster's preferred content settings can be met, but those of its nodes are not. Or where a node does not contain a key, but the cluster does, and there are not enough copies of the key yet, so it would be desirable the send it there. I think that's an acceptable tradeoff. These kind of situations are ones where the cluster itself should probably be responsible for copying content to the node. Which it can do much less expensively than a client can. Part of the balanced preferred content design that I will be working on in a couple of months involves rebalancing clusters, so I expect to revisit this. The use of annex-sync config does allow running git-annex sync with a specific node, or nodes, and it will sync with it. And it's also possible to set annex-sync git configs to make it sync with a node by default. (Although that will require setting up an explicit git remote for the node rather than relying on the proxied remote.) Logs.Cluster.Basic is needed because Remote.Git cannot import Logs.Cluster due to a cycle. And the Annex.Startup load of clusters happens too late for Remote.Git to use that. This does mean one redundant load of the cluster log, though only when there is a proxy.	2024-06-25 10:24:38 -04:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	65f5a0f228		2024-06-25 10:46:13 +00:00
Joey Hess	b8016eeb65	add annex-proxied This makes git-annex sync and similar not treat proxied remotes as git syncable remotes. Also, display in git-annex info remote when the remote is proxied.	2024-06-24 10:16:59 -04:00
Joey Hess	0c111fc96a	fix git-annex sync --content with proxied remotes Loading the remote list a second time was removing all proxied remotes. That happened because setting up the proxied remote added some config fields to the in-memory git config, and on the second load, it saw those configs and decided not to overwrite them with the proxy. Now on the second load, that still happens. But now, the proxied git configs are used to generate a remote same as if those configs were all set. The reason that didn't happen before was twofold, the gitremotes cache was not dropped, and the remote's url field was not set correctly. The problem with the remote's url field is that while it was marked as proxy inherited, all other proxy inherited fields are annex- configs. And the code to inherit didn't work for the url field. Now it all works, but git-annex sync is left running git push/pull on the proxied remote, which doesn't work. That still needs to be fixed.	2024-06-24 09:45:51 -04:00
Joey Hess	60413a2557	update	2024-06-23 16:38:01 -04:00
Joey Hess	5d8bdac38e	upload fanout resume seems free of fenceposts Tested it with small chunk sizes (like 2) and resumes that were eg 1 byte from the end of the file or beginning of file. Also, git-annex testremote passes now against a cluster!	2024-06-23 16:22:39 -04:00
Joey Hess	9e070470f4	update	2024-06-23 12:48:22 -04:00
Joey Hess	3cd7969823	update	2024-06-23 12:31:00 -04:00
Joey Hess	d0aec8f623	always check numcopies when moving from cluster When the destination does not start with a copy, the cluster has one or more copies. If more, dropping would reduce the number of copies, so numcopies must be checked. Considered checking how many nodes of the cluster contain a copy. If only 1 node does, it could allow a move without checking numcopies. The problem with that, though, is that other nodes of the cluster could have copies that we don't know about. And dropping from a cluster tries to drop from all nodes, so will drop even from those. So any drop from a cluster can remove more than 1 copy.	2024-06-23 12:00:50 -04:00
Joey Hess	ec5b6454f4	todo	2024-06-23 10:09:35 -04:00
Joey Hess	2762f9c4ce	fix location log update for copy to 1-node cluster	2024-06-23 09:53:33 -04:00
Joey Hess	5b332a87be	dropping from clusters Dropping from a cluster drops from every node of the cluster. Including nodes that the cluster does not think have the content. This is different from GET and CHECKPRESENT, which do trust the cluster's location log. The difference is that removing from a cluster should make 100% the content is gone from every node. So doing extra work is ok. Compare with CHECKPRESENT where checking every node could make it very expensive, and the worst that can happen in a false negative is extra work being done. Extended the P2P protocol with FAILURE-PLUS to handle the case where a drop from one node succeeds, but a drop from another node fails. In that case the entire cluster drop has failed. Note that SUCCESS-PLUS is returned when dropping from a proxied remote that is not a cluster, when the protocol version supports it. This is because P2P.Proxy does not know when it's proxying for a single node cluster vs for a remote that is not a cluster.	2024-06-23 09:43:40 -04:00
Joey Hess	7bbd822a17	avoid using cluster nodes in drop proof when dropping from cluster This is obviously necessary in order for dropping from a cluster to be able to drop from all nodes. It also avoids violating numcopies when a cluster node is a special remote. If it were used in the drop proof, nothing would prevent the cluster from dropping from it.	2024-06-23 06:20:11 -04:00
Joey Hess	5a4b4b59b9	update	2024-06-23 05:26:45 -04:00
nobodyinperson	724eb8a369	Suggest that 'git annex unused' reports total unused size	2024-06-21 16:30:09 +00:00
Joey Hess	53674e8abb	Merge branch 'master' into proxy	2024-06-20 11:20:26 -04:00
Joey Hess	53598e5154	merge from proxy branch	2024-06-20 11:20:16 -04:00
Joey Hess	d89ac8c6ee	Merge branch 'master' of ssh://git-annex.branchable.com	2024-06-20 11:03:30 -04:00
Joey Hess	9173095d11	add my distribits talk	2024-06-20 11:03:19 -04:00
Joey Hess	ff5fe4e759	clusters documentation	2024-06-20 10:57:43 -04:00
Joey Hess	032d3902d8	wording	2024-06-20 10:15:24 -04:00
joris	b35be4b656	Added a comment	2024-06-20 09:58:05 +00:00
jochen.keil@38b1f86ab65128dab3e62e726403ceee4f5141bf	4da453e30c		2024-06-19 15:46:26 +00:00
Joey Hess	54307af8c0	more on proxying special remotes	2024-06-19 06:40:19 -04:00
Joey Hess	097ef9979c	towards a design for proxying to special remotes	2024-06-19 06:15:03 -04:00
Joey Hess	f18740699e	P2P protocol version 2, adding SUCCESS-PLUS and ALREADY-HAVE-PLUS Client side support for SUCCESS-PLUS and ALREADY-HAVE-PLUS is complete, when a PUT stores to additional repositories than the expected on, the location log is updated with the additional UUIDs that contain the content. Started implementing PUT fanout to multiple remotes for clusters. It is untested, and I fear fencepost errors in the relative offset calculations. And it is missing proxying for the protocol after DATA.	2024-06-18 16:21:40 -04:00
Joey Hess	fb0fd78485	only use a remote as a node when git configuration is set Avoids someone writing to cluster.log and nominating remotes of someone else's repository as a cluster.	2024-06-18 11:37:38 -04:00
Joey Hess	f049156a03	checkpresent support for clusters This assumes that the proxy for a cluster has up-to-date location logs. If it didn't, it might proxy the checkpresent to a node that no longer has the content, while some other node still does, and so it would incorrectly appear that the cluster no longer contains the content. Since cluster UUIDs are not stored to location logs, git-annex fsck --fast when claiming to fix a location log when that occurred would not cause any problems. And presumably the location tracking would later get sorted out. At least usually, changes to the content of nodes goes via the proxy, and it will update its location logs, so they will be accurate. However, if there were multiple proxies to the same cluster, or nodes were accessed directly (or via proxy to the node and not the cluster), the proxy's location log could certainly be wrong. (The location log access for GET has the same issues.)	2024-06-18 11:16:16 -04:00
Joey Hess	88d9a02f7c	initial, working support for getting from clusters Currently tends to put all the load on a single node, which will need to be improved.	2024-06-18 11:01:10 -04:00
Joey Hess	8290f70978	update	2024-06-18 10:08:15 -04:00
yarikoptic	28029d6668	original report / question	2024-06-18 13:57:23 +00:00
Joey Hess	e2fd2ee2bd	update	2024-06-17 09:31:44 -04:00
Joey Hess	3970bbb03b	Merge branch 'master' into proxy	2024-06-17 09:29:34 -04:00
Joey Hess	64afbb0b93	don't count clusters as copies, continued Handled limitCopies, as well as everything using fromNumCopies and fromMinCopies. This should be everything, probably. Note that, git-annex info displays a count of repositories, which still includes cluster. I think that's ok. It would be possible to filter out clusters there, but to the user they're pretty much just another repository. The numcopies displayed by eg `git-annex info .` does not include clusters.	2024-06-16 15:14:53 -04:00
Joey Hess	780367200b	remove dead nodes when loading the cluster log This is to avoid inserting a cluster uuid into the location log when only dead nodes in the cluster contain the content of a key. One reason why this is necessary is Remote.keyLocations, which excludes dead repositories from the list. But there are probably many more. Implementing this was challenging, because Logs.Location importing Logs.Cluster which imports Logs.Trust which imports Remote.List resulted in an import cycle through several other modules. Resorted to making Logs.Location not import Logs.Cluster, and instead it assumes that Annex.clusters gets populated when necessary before it's called. That's done in Annex.Startup, which is run by the git-annex command (but not other commands) at early startup in initialized repos. Or, is run after initialization. Note that is Remote.Git, it is unable to import Annex.Startup, because Remote.Git importing Logs.Cluster leads the the same import cycle. So ensureInitialized is not passed annexStartup in there. Other commands, like git-annex-shell currently don't run annexStartup either. So there are cases where Logs.Location will not see clusters. So it won't add any cluster UUIDs when loading the log. That's ok, the only reason to do that is to make display of where objects are located include clusters, and to make commands like git-annex get --from treat keys as being located in a cluster. git-annex-shell certainly does not do anything like that, and I'm pretty sure Remote.Git (and callers to Remote.Git.onLocalRepo) don't either.	2024-06-16 14:39:44 -04:00
beryllium@5bc3c32eb8156390f96e363e4ba38976567425ec	f707baf908	Added a comment	2024-06-15 07:37:07 +00:00
beryllium@5bc3c32eb8156390f96e363e4ba38976567425ec	0062ac1b49	Added a comment: Grafting? a special remote for tuned migration	2024-06-15 00:57:27 +00:00
Joey Hess	b3370a191c	insert cluster UUIDs when loading location logs, and omit when saving Inline isClusterUUID for speed.	2024-06-14 18:06:28 -04:00
Joey Hess	570ceffe8d	broke out initcluster One benefit of this is that a typo in annex-cluster-node config won't init a new cluster. Also it gets the cluster description set and is consistent with initremote.	2024-06-14 17:23:11 -04:00
Joey Hess	846903e9bb	update todo list for this month whew that's gonna be a lot	2024-06-14 15:23:43 -04:00
Joey Hess	bbf261487d	add git-annex updatecluster command Seems to work fine, making the right changes to the git-annex branch.	2024-06-14 15:02:01 -04:00
Joey Hess	2844230dfe	add git configs for clusters	2024-06-14 12:20:17 -04:00
Joey Hess	de1d795dfe	cache getClusters in Annex state	2024-06-14 11:16:01 -04:00
Joey Hess	9895e6659d	update	2024-06-13 19:08:04 -04:00
Joey Hess	6d59118b29	unique uuid namespace for clusters	2024-06-13 17:56:53 -04:00
Joey Hess	aa56d433d5	implement cluster.log Not used yet. (Or tested.) I did consider making the log start with the uuid of the node, followed by the cluster uuid (or uuids). That would perhaps mean a smaller write to the git-annex branch when adding a node, but overall the log file would be larger, and it will be read and cached near to startup on most git-annex runs.	2024-06-13 16:00:58 -04:00
Joey Hess	d16e19b8ca	comment	2024-06-13 14:30:32 -04:00
Joey Hess	ebebc04273	comment	2024-06-13 13:40:04 -04:00
Joey Hess	6ea78ec867	partial reproducer	2024-06-13 13:03:38 -04:00
Joey Hess	01f5015f30	update	2024-06-13 11:44:39 -04:00
Joey Hess	5e0acd1842	more cluster thoughts	2024-06-13 10:48:31 -04:00
Joey Hess	90e3b8b44f	avoided the strangeness of the cluster's proxy location tracking being wrong	2024-06-13 10:34:19 -04:00
Joey Hess	ffd7c745ff	update	2024-06-13 06:49:36 -04:00
Joey Hess	d8daabe9ec	Merge branch 'master' of ssh://git-annex.branchable.com	2024-06-13 06:44:22 -04:00
Joey Hess	22a329c57e	copied over some changes from proxy branch	2024-06-13 06:43:59 -04:00
Joey Hess	3cc48279ad	more thoughts on clusters	2024-06-13 06:41:42 -04:00
Joey Hess	555d7e52d3	more thoughts on clusters	2024-06-12 17:30:55 -04:00
Joey Hess	0ebb107974	update	2024-06-12 15:21:23 -04:00
Joey Hess	46a1fcb3ea	avoid git syncing with instantiate proxied remotes These remotes have no url configured, so git pull and push will fail. git-annex sync --content etc can still sync with them otherwise. Also, avoid git syncing twice with the same url. This is for cases where a proxied remote has been manually configured and so does have a url. Or perhaps proxied remotes will get configured like that automatically later.	2024-06-12 15:10:03 -04:00
Joey Hess	a986a20034	designing clusters	2024-06-12 14:57:26 -04:00
Joey Hess	e70e3473b3	on cycles	2024-06-12 13:52:17 -04:00
Joey Hess	44464e4410	update	2024-06-12 12:37:14 -04:00
Joey Hess	67d1e2a459	updates	2024-06-12 12:02:25 -04:00
m.risse@77eac2c22d673d5f10305c0bade738ad74055f92	c855b50f04		2024-06-12 15:42:42 +00:00
Joey Hess	dfdda95053	proxy updates location tracking information This does mean a redundant write to the git-annex branch. But, it means that two clients can be using the same proxy, and after one sends a file to a proxied remote, the other only has to pull from the proxy to learn about that. It does not need to pull from every remote behind the proxy (which it couldn't do anyway as git repo access is not currently proxied). Anyway, the overhead of this in git-annex branch writes is no worse than eg, sending a file to a repository where git-annex assistant is running, which then sends the file on to a remote, and updates the git-annex branch then. Indeed, when the assistant also drops the local copy, that results in more writes to the git-annex branch.	2024-06-12 11:37:14 -04:00
Joey Hess	96853cd833	finish P2P protocol proxying CONNECT is not supported by git-annex-shell p2pstdio, but for proxying to tor-annex remotes, it will be supported, and will make a git pull/push to a proxied remote work the same with that as it does over ssh, eg it accesses the proxy's git repo not the proxied remote's git repo. The p2p protocol docs say that NOTIFYCHANGES is not always supported, and it looked annoying to implement it for this, and it also seems pretty useless, so make it be a protocol error. git-annex remotedaemon will already be getting change notifications from the proxy's git repo, so there's no need to get additional redundant change notifications for proxied remotes that would be for changes to the same git repo.	2024-06-12 10:40:51 -04:00
Joey Hess	f98605bce7	a local git remote cannot proxy Prevent listProxied from listing anything when the proxy remote's url is a local directory. Proxying does not work in that situation, because the proxied remotes have the same url, and so git-annex-shell is not run when accessing them, instead the proxy remote is accessed directly. I don't think there is any good way to support this. Even if the instantiated git repos for the proxied remotes somehow used an url that caused it to use git-annex-shell to access them, planned features like `git-annex copy --to proxy` accepting a key and sending it on to nodes behind the proxy would not work, since git-annex-shell is not used to access the proxy. So it would need to use something to access the proxy that causes git-annex-shell to be run and speaks P2P protocol over it. And we have that. It's a ssh connection to localhost. Of course, it would be possible to take ssh out of that mix, and swap in something that does not have encryption overhead and authentication complications, but otherwise behaves the same as ssh. And if the user wants to do that, GIT_SSH does exist.	2024-06-12 10:16:04 -04:00
Joey Hess	c6e0710281	proxying to local git remotes works This just happened to work correctly. Rather surprisingly. It turns out that openP2PSshConnection actually also supports local git remotes, by just running git-annex-shell with the path to the remote. Renamed "P2PSsh" to "P2PShell" to make this clear.	2024-06-12 10:10:11 -04:00
Joey Hess	178da0dc99	Merge branch 'master' into proxy	2024-06-12 09:49:30 -04:00
Joey Hess	345494e3b4	expanding on the exporttree=yes design	2024-06-12 09:43:59 -04:00
yarikoptic	c6f2a5d372	TODO for log --key	2024-06-12 13:20:29 +00:00
Joey Hess	5beaffb412	proxying PUT now working The almost identical code duplication between relayDATA and relayDATA' is very annoying. I tried quite a few things to parameterize them, but the type checker is having fits when I try it.	2024-06-11 16:56:52 -04:00
Joey Hess	ed4fda098b	todo	2024-06-11 15:15:58 -04:00
Joey Hess	a2f4a8eddf	proxying GET now working Memory use is small and constant; receiveBytes returns a lazy bytestring and it does stream. Comparing speed of a get of a 500 mb file over proxy from origin-origin, vs from the same remote over a direct ssh: joey@darkstar:~/tmp/bench/client>/usr/bin/time git-annex get bigfile --from origin-origin get bigfile (from origin-origin...) ok (recording state in git...) 1.89user 0.67system 0:10.79elapsed 23%CPU (0avgtext+0avgdata 68716maxresident)k 0inputs+984320outputs (0major+10779minor)pagefaults 0swaps joey@darkstar:~/tmp/bench/client>/usr/bin/time git-annex get bigfile --from direct-ssh get bigfile (from direct-ssh...) ok 1.79user 0.63system 0:10.49elapsed 23%CPU (0avgtext+0avgdata 65776maxresident)k 0inputs+1024312outputs (0major+9773minor)pagefaults 0swaps So the proxy doesn't add much overhead even when run on the same machine as the client and remote. Still, piping receiveBytes into sendBytes like this does suggest that the proxy could be made to use less CPU resouces by using `sendfile()`.	2024-06-11 15:09:43 -04:00
Joey Hess	09b5e53f49	set annex.uuid in proxy's Repo getRepoUUID looks at that, and was seeing the annex.uuid of the proxy. Which caused it to unncessarily set the git config. Probably also would have led to other problems.	2024-06-11 13:40:50 -04:00
yarikoptic	b96ff82871	Added a comment	2024-06-11 17:36:51 +00:00
Joey Hess	657a91527a	update	2024-06-11 13:22:03 -04:00
Joey Hess	dd429ba8fe	Merge branch 'master' of ssh://git-annex.branchable.com	2024-06-11 13:08:45 -04:00
Joey Hess	5bb7f8cd64	Merge branch 'master' into proxy	2024-06-11 13:08:23 -04:00
Joey Hess	d2e3c5c89f	update	2024-06-11 13:07:53 -04:00
NewUser	124c1313bb		2024-06-11 13:31:01 +00:00
Joey Hess	501d65eeab	started implementing git-annex-shell proxy So far, it negotiates VERSION with both parties. This is a tricky dance. Untested.	2024-06-10 18:01:36 -04:00
Joey Hess	7b1548dbfa	correct AUTH-SUCCESS and AUTH-FAILURE It's AUTH_SUCCESS internally in git-annex, but the line based serialization uses AUTH-SUCCESS.	2024-06-10 15:06:27 -04:00
Joey Hess	649b87bedd	Merge branch 'master' into proxy	2024-06-10 14:26:18 -04:00
Joey Hess	d2576e5f1a	git-annex-shell: accept uuid of remote that proxying is enabled for For NotifyChanges and also for the fallthrough case where git-annex-shell passes a command off to git-shell, proxying is currently ignored. So every remote that is accessed via a proxy will be treated as the same git repository. Every other command listed in cmdsMap will need to check if Annex.proxyremote is set, and if so handle the proxying appropriately. Probably only P2PStdio will need to support proxying. For now, everything else refuses to work when proxying. The part of that I don't like is that there's the possibility a command later gets added to the list that doesn't check proxying. When proxying is not enabled, it's important that git-annex-shell not leak information that it would not have exposed before. Such as the names or uuids of remotes. I decided that, in the case where a repository used to have proxying enabled, but no longer supports any proxies, it's ok to give the user a clear error message indicating that proxying is not configured, rather than a confusing uuid mismatch message. Similarly, if a repository has proxying enabled, but not for the requested repository, give a clear error message. A tricky thing here is how to handle the case where there is more than one remote, with proxying enabled, with the specified uuid. One way to handle that would be to plumb the proxyRemoteName all the way through from the remote git-annex to git-annex-shell, eg as a field, and use only a remote with the same name. That would be very intrusive though. Instead, I decided to let the proxy pick which remote it uses to access a given Remote. And so it picks the least expensive one. The client after all doesn't necessarily know any details about the proxy's configuration. This does mean though, that if the least expensive remote is not accessible, but another remote would have worked, an access via the proxy will fail.	2024-06-10 12:44:35 -04:00
Joey Hess	783eb8879a	notes on behavior	2024-06-10 11:07:04 -04:00
jlueters@79a910340cdff27611c6a650c108afbe2f61c5f6	daa2c6cce1		2024-06-10 14:24:34 +00:00
Joey Hess	25a6ab6f11	Avoid grafting in export tree objects that are missing They could be missing due to an interrupted git-annex at just the wrong time during a prior graft, after which the tree objects got garbage collected. Or they could be missing because of manual messing with the git-annex branch, eg resetting it to back before the graft commit. Sponsored-by: Dartmouth College's OpenNeuro project	2024-06-07 16:51:50 -04:00
Joey Hess	b32c4c2e98	atomic git-annex branch update when regrafting in transition Fix a bug where interrupting git-annex while it is updating the git-annex branch could lead to git fsck complaining about missing tree objects. Interrupting git-annex while regraftexports is running in a transition that is forgetting git-annex branch history would leave the repository with a git-annex branch that did not contain the tree shas listed in export.log. That lets those trees be garbage collected. A subsequent run of the same transition then regrafts the trees listed in export.log into the git-annex branch. But those trees have been lost. Note that both sides of `if neednewlocalbranch` are atomic now. I had thought only the True side needed to be, but I do think there may be cases where the False side needs to be as well. Sponsored-by: Dartmouth College's OpenNeuro project	2024-06-07 16:34:10 -04:00
Joey Hess	6568ba4904	Merge branch 'master' into proxy	2024-06-07 12:35:47 -04:00
Joey Hess	43ff697f25	update status and design work on proxy encryption and chunking	2024-06-07 12:35:04 -04:00
Joey Hess	a0e59c1d17	comment	2024-06-07 12:35:00 -04:00
Joey Hess	5aaa285083	Merge branch 'master' into proxy	2024-06-07 10:43:13 -04:00
Joey Hess	058726ee86	next step identified	2024-06-06 18:06:45 -04:00
Joey Hess	d59383beaf	update	2024-06-06 17:25:22 -04:00
Joey Hess	9bc4dd635c	update	2024-06-06 17:23:51 -04:00
Joey Hess	a72d0f69d0	filter out illegal remote names when reading proxy log	2024-06-06 12:51:30 -04:00
Joey Hess	d208b03e5d	Merge branch 'master' into proxy	2024-06-06 12:42:18 -04:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	1e6b4f324a	removed	2024-06-06 13:40:26 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	6274d16102	Added a comment	2024-06-06 11:23:55 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	d4993248eb	Added a comment	2024-06-06 11:23:34 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	a1e1af35af		2024-06-06 10:29:21 +00:00
nobodyinperson	6985c62a47	Added a comment	2024-06-06 09:09:03 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	7dbfb16415		2024-06-05 17:45:49 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	93b11da4db	Added a comment	2024-06-05 17:34:32 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	6b4ae7b635		2024-06-05 17:22:04 +00:00
ruslan@302cb7f8d398fcce72f88b26b0c2f3a53aaf0bcd	ca687413ef	Added a comment	2024-06-05 16:53:51 +00:00
Joey Hess	1761e971ee	status update after day 1 of new project	2024-06-04 14:55:54 -04:00
Joey Hess	f97f4b8bdb	Added updateproxy command and remote.name.annex-proxy configuration So far this only records proxy information on the git-annex branch.	2024-06-04 14:52:03 -04:00
Joey Hess	3df70c5c0c	implementation plan	2024-06-04 07:51:33 -04:00
Joey Hess	6375e3be3b	recieved funding to work on this, which comes with a schedule	2024-06-04 06:53:59 -04:00
Joey Hess	ac3fe92956	comment	2024-06-04 06:41:14 -04:00
Joey Hess	3db94f1b71	Merge branch 'master' of ssh://git-annex.branchable.com	2024-06-04 06:40:08 -04:00
Joey Hess	3be7163771	update	2024-06-04 06:40:04 -04:00
Joey Hess	5992e1729a	fixed by git release	2024-06-04 06:39:08 -04:00
nobodyinperson	c606b6a35d	Added a comment: Yes, GitLab fixed!	2024-06-04 07:38:47 +00:00
datamanager	82b891de7a	Added a comment: GitLab fixed?	2024-06-04 01:18:25 +00:00
Joey Hess	61ed0b3f03	root cause analysis	2024-06-03 13:56:43 -04:00
yarikoptic	4a48933867	Added a comment	2024-06-03 17:54:43 +00:00
Joey Hess	c382555cf8	comment	2024-06-03 12:31:55 -04:00
jkniiv	313a0285e5	a small clarification	2024-06-01 22:11:32 +00:00
jkniiv	5badd2ae4e	report on git-remote-annex on Windows not quite working	2024-06-01 21:59:27 +00:00
Joey Hess	0e96f0acd8	add news item for git-annex 10.20240531	2024-05-31 12:32:42 -04:00
Joey Hess	a51c5d1cde	some analysis	2024-05-31 11:47:59 -04:00
yarikoptic	8706a6faf1	report on git repo getting broken	2024-05-31 14:38:58 +00:00
yarikoptic	d313dc22e3	reporting that annex merge should not merge into main branch	2024-05-31 13:49:17 +00:00
Joey Hess	d8cf23ffdb	tweak	2024-05-30 13:31:49 -04:00
Joey Hess	69c9e8c11c	tweak	2024-05-30 13:30:57 -04:00
Joey Hess	19454917eb	tweak	2024-05-30 13:30:33 -04:00
Joey Hess	3a48eafce4	tweaks	2024-05-30 13:30:10 -04:00

1 2 3 4 5 ...

34419 commits