git-annex

Author	SHA1	Message	Date
Joey Hess	6a3f755bfa	add common parameters to generic get API Honestly this was just done to make the documentation correct. There's no point in using these parameters. And they're optional.	2024-07-24 20:55:58 -04:00
Joey Hess	ab22938c0b	fix build without servant	2024-07-24 15:13:02 -04:00
Joey Hess	515c42e1e3	testremote passes on p2phttp remote	2024-07-24 14:42:24 -04:00
Joey Hess	f5624a69e3	expire lock after 10 minutes initially Once keeplocked is called, the lock will expire at the end of that call. But if keeplocked never gets called, this avoids the lock persisting forever.	2024-07-24 14:25:40 -04:00
Joey Hess	97836aafba	Remote.Git lockContent works with annex+http urls	2024-07-24 13:42:57 -04:00
Joey Hess	9fa9678585	Remote.Git removeKey works with annex+http urls Does not yet handle drop proof lock timestamp checking.	2024-07-24 12:33:26 -04:00
Joey Hess	cfdb80cd05	progress meter for p2phttp storeKey	2024-07-24 12:14:56 -04:00
Joey Hess	b3915b88ba	Remote.Git storeKey works with annex+http urls Does not yet update progress meter.	2024-07-24 12:05:10 -04:00
Joey Hess	5b1ac1a313	more generic clientGet	2024-07-24 11:10:19 -04:00
Joey Hess	10f2c23fd7	fix slowloris timeout in hashing resume of download of large file Hash the data that is already present in the file before connecting to the http server.	2024-07-24 11:03:59 -04:00
Joey Hess	7bd616e169	Remote.Git retrieveKeyFile works with annex+http urls This includes a bugfix to serveGet, it hung at the end.	2024-07-24 10:28:44 -04:00
Joey Hess	a2d1844292	factor out resumeVerifyFromOffset	2024-07-24 09:29:44 -04:00
Joey Hess	ad945896c9	avoid needing ifdefs when using P2P.Http.Client	2024-07-24 08:33:59 -04:00
Joey Hess	48657405c6	cache credentials for p2phttp in memory	2024-07-23 18:45:02 -04:00
Joey Hess	75771772ec	remove uuid from url when calling git credential The user doesn't want to see a uuid in the prompt. Also, when a http server is proxying, multiple remotes will have the same base url, and the same password will work for them all.	2024-07-23 18:21:01 -04:00
Joey Hess	b89c784a9b	use git credential when p2phttp needs auth	2024-07-23 18:11:15 -04:00
Joey Hess	b7454f1eeb	protocol version fallback on 404 and prettified errors	2024-07-23 14:58:49 -04:00
Joey Hess	4e15b786ca	Remote.Git checkpresent works with annex+http urls.	2024-07-23 14:31:32 -04:00
Joey Hess	b0eed55d4f	factor out http server and client into own modules To avoid a cycle when Remote.Git uses the client.	2024-07-23 14:12:38 -04:00
Joey Hess	6bbc4565e6	started wiring p2phttp into Remote.Git but we have a cycle, ugh	2024-07-23 13:53:10 -04:00
Joey Hess	2aa9154b1f	require a valid uuid at the end of an annex+http url	2024-07-23 12:30:27 -04:00
Joey Hess	75b1d50b99	add remoteAnnexP2PHttpUrl to RemoveGitConfig This is always parsed, when building without servant, a Baseurl is not generated, and users of it will need to fail.	2024-07-23 09:57:01 -04:00
Joey Hess	5c39652235	starting support for remote.name.annexUrl set to annex+http In this case, Remote.Git should not use that url for all access to the repository. It will only be used for annex operations, which isn't done yet.	2024-07-23 09:12:21 -04:00
Joey Hess	37373fad38	refactor	2024-07-22 21:19:32 -04:00
Joey Hess	06de2ad972	change default port to 9417 Port 80 would need root, not a good idea, so pick something that might work by default. 9418 is git protocol's port. 9419 is used by something, but nothing known uses 9417, so it's as good a default as any.	2024-07-22 20:52:17 -04:00
Joey Hess	53000cf06e	exit cleanly on eg, failure to bind socket	2024-07-22 20:37:37 -04:00
Joey Hess	163a7e91c4	make keeplocked client UUID optional It's not used, but it makes the protocol simpler if this parameter can always be sent.	2024-07-22 19:48:54 -04:00
Joey Hess	7f4cff7ae9	locking over http basically working	2024-07-22 19:44:26 -04:00
Joey Hess	e979e85bff	make serveKeepLocked check auth just to be safe	2024-07-22 19:15:52 -04:00
Joey Hess	63e42aa1bc	implement clientLockContent	2024-07-22 18:55:12 -04:00
Joey Hess	f5dd7a8bc0	implemented serveLockContent (untested)	2024-07-22 17:38:42 -04:00
Joey Hess	d5eaf0f567	improve clientKeepLocked	2024-07-22 16:56:44 -04:00
Joey Hess	10682eb882	variable name	2024-07-22 16:50:11 -04:00
Joey Hess	8a48d08abd	refactor	2024-07-22 16:49:05 -04:00
Joey Hess	8d36e597f1	finished clientGet validity checking	2024-07-22 16:31:39 -04:00
Joey Hess	48eb6671e4	improve clientGet types	2024-07-22 16:23:08 -04:00
Joey Hess	b697c6b9da	fix TMVar left full crash affecting servePutOffset Problem is that whatever is reading from the TMVar may not have read from it yet before the client writes the next thing to it.	2024-07-22 15:48:46 -04:00
Joey Hess	3069e28dd8	implemented servePutOffset and clientPutOffset But, it's buggy: the server hangs without processing the VALIDITY, and I can't seem to work out why. As far as I can see, storefile is getting as far as running the validitycheck, which is supposed to read that, but never does. This is especially strange because what seems like the same protocol doesn't hang when servePut runs it. This made me think that it needed to use inAnnexWorker to be more like servePut, but that didn't help. Another small problem with this is that it does create an empty .git/annex/tmp/ file for the key. Since this will usually be used in combination with servePut, that doesn't seem worth worrying about much.	2024-07-22 15:04:10 -04:00
Joey Hess	c36b9cd1fc	fix crash in servePut when client sends from too high offset Have to consume all the content from the client, but force an Invalid result, so it will just get discarded.	2024-07-22 13:12:15 -04:00
Joey Hess	b240a11b79	clientPut seeking to offset	2024-07-22 12:50:21 -04:00
Joey Hess	a01426b713	avoid padding in servePut This means that when the client sends a truncated data to indicate invalidity, DATA is not passed the full expected data. That leaves the P2P connection in a state where it cannot be reused. While so far, they are not reused, they will be later when proxies are supported. So, have to close the P2P connection in this situation.	2024-07-22 12:30:30 -04:00
Joey Hess	efa0efdc44	avoid padding in clientPut Instead truncate when necessary to indicate invalid content was sent. Very similar to how serveGet handles it.	2024-07-22 11:47:24 -04:00
Joey Hess	726c815a7f	fix releasing of p2p connection	2024-07-22 11:26:22 -04:00
Joey Hess	72d0769ca5	avoid padding content in serveGet Always truncate instead. The padding risked something not noticing the content was bad and getting a file that was corrupted in a novel way with the padding "X" at the end. A truncated file is better.	2024-07-22 11:19:52 -04:00
Joey Hess	4826a3745d	servePut and clientPut implementation Made the data-length header required even for v0. This simplifies the implementation, and doesn't preclude extra verification being done for v0. The connectionWaitVar is an ugly hack. In servePut, nothing waits on the waitvar, and I could not find a good way to make anything wait on it.	2024-07-22 10:27:44 -04:00
Joey Hess	eb4fb388bd	only base64 non-utf8	2024-07-11 15:47:16 -04:00
Joey Hess	97a2d0e4fb	use worker pool in withLocalP2PConnections This allows multiple clients to be handled at the same time.	2024-07-11 14:37:52 -04:00
Joey Hess	68227154fb	switch HTTP P2P protocol to base64url Base64 can include '/', and with UUIDs and keys both used in routes, the encoding needs to avoid that. Use base64url everywhere in the HTTP protocol for consistency.	2024-07-11 12:31:41 -04:00
Joey Hess	fc90270ba0	roughed in servePut	2024-07-11 12:20:07 -04:00
Joey Hess	1cff4c9f5b	oops, 2 datalength headers for put route	2024-07-11 12:12:33 -04:00
Joey Hess	3a55f0eec3	use DataLength	2024-07-11 11:57:55 -04:00
Joey Hess	14e0f778b7	simplify	2024-07-11 11:50:44 -04:00
Joey Hess	2228d56db3	serveGet invalidation	2024-07-11 11:42:32 -04:00
Joey Hess	80d2ffc79a	implement generic get server	2024-07-11 11:26:03 -04:00
Joey Hess	a7383b5c59	move serveruuid into routes In particular the generic get route needs it, so that when a single http server is serving multiple repositories, it knows what repository to use.	2024-07-11 11:19:20 -04:00
Joey Hess	74c6175795	fix serveGet early handle close Needed that waitv after all..	2024-07-11 09:55:17 -04:00
Joey Hess	2c13e6c165	fix annexworker shutdown on early client disconnect	2024-07-11 09:15:52 -04:00
Joey Hess	6e3d35744d	cleanup	2024-07-11 07:56:55 -04:00
Joey Hess	3b37b9e53f	fix serveGet hang This came down to SendBytes waiting on the waitv. Nothing ever filled it. Only Annex.Proxy needs the waitv, and it handles filling it. So make it optional.	2024-07-11 07:46:52 -04:00
Joey Hess	80fb5445b5	a little progress on serveGet hang Now it gets to the validity checker, but it seems it never runs it.	2024-07-10 17:48:48 -04:00
Joey Hess	1e0f92a5a1	implemented serveGet and clientGet Both are only at bare proof of concept stage. Still need to deal with signaling validity and invalidity, and checking it. And there's a bad bug: After -JN*2 requests, another request hangs! So, I think it's failing to free up the Annex worker and end of request lifetime. Perhaps I need to use this: https://docs.servant.dev/en/stable/cookbook/managed-resource/ManagedResource.html	2024-07-10 16:06:39 -04:00
Joey Hess	f9b7ce7224	add Annex worker pool to P2PHttp This will be needed for get and store, since those need to run Annex actions. withLocalP2PConnections will also probably use it.	2024-07-10 12:19:47 -04:00
Joey Hess	d4b9aea87b	implement gettimestamp	2024-07-10 10:23:10 -04:00
Joey Hess	7c588a5791	implement remove-before The reason to use removeBeforeRemoteEndTime is twofold. First, removeBefore sends two protocol commands. Currently, the HTTP protocol runner only supports sending a single command per invocation. Secondly, the http server gets a monotonic timestamp from the client. So translating back to a POSIXTime would be annoying. The timestamp flow with a proxy will be: - client gets timestamp, which gets the monotonic timestamp from the proxied remote via the proxy. The timestamp is currently not proxied when there is a single proxy. - client calls remove-before - http server calls removeBeforeRemoteEndTime which sends REMOVE-BEFORE to the proxied remote.	2024-07-10 10:03:26 -04:00
Joey Hess	e9cba0a580	Revert "proxy local timestamps in single proxy case as well as cluster case" Turns out not to be necessary. I think. This reverts commit `81e11efda1`.	2024-07-10 09:45:23 -04:00
Joey Hess	81e11efda1	proxy local timestamps in single proxy case as well as cluster case This is less efficient, but it guarantees that the timestamps that the client is sending are local timestamps, which turns out to be necessary for the HTTP PTP protocol server.	2024-07-10 09:40:13 -04:00
Joey Hess	518248f009	remove debug print	2024-07-10 09:37:43 -04:00
Joey Hess	b8a26712c6	implement clientRemove Tested removal.	2024-07-10 09:20:13 -04:00
Joey Hess	48f76cb3e8	implement serveRemove and send WWW-Authenticate header on auth failure	2024-07-10 09:13:01 -04:00
Joey Hess	97d0fc9b65	git-annex p2phttp options	2024-07-10 00:01:55 -04:00
Joey Hess	6a8a4d1775	authentication is implemented just need to make Command.P2PHttp generate a GetServerMode from options	2024-07-09 20:54:47 -04:00
Joey Hess	e5bf49b879	http basic authorization header parsing Sadly servant does not expose this though it also implements it.	2024-07-09 20:07:53 -04:00
Joey Hess	08371c3745	started on auth	2024-07-09 17:30:55 -04:00
Joey Hess	dcd77ee555	fix p2phttp server to not get stuck Process 1 command, then stop. Hopefully each of the Handlers will only need 1 command.	2024-07-09 14:26:30 -04:00
Joey Hess	3d13521479	set up handles for p2phttp Now it fully works.. for the first request. But then it gets stuck waiting for the P2P protocol runner to shut down.	2024-07-09 13:50:42 -04:00
Joey Hess	edf8a3df2d	p2phttp is almost working for checkpresent The server is fully running annex actions, only the P2PConnection is wrong, currently using stdio.	2024-07-09 13:37:55 -04:00
Joey Hess	a3dd8b4bcb	capture API version in routes Needed so the client can send it.	2024-07-09 12:04:29 -04:00
Joey Hess	751b8e0baf	implemented serveCheckPresent Still need a way to run Proto though	2024-07-09 09:08:42 -04:00
Joey Hess	9a592f946f	split module	2024-07-08 21:12:23 -04:00
Joey Hess	3f402a20a8	implement Locker	2024-07-08 21:00:10 -04:00
Joey Hess	b758b01692	add lockids to http p2p protocol	2024-07-08 20:18:55 -04:00
Joey Hess	58031455dc	add lock map	2024-07-08 14:20:30 -04:00
Joey Hess	0bdee626ad	thread in a state	2024-07-08 14:00:23 -04:00
Joey Hess	69c4f07ab0	finish get API	2024-07-08 13:27:50 -04:00
Joey Hess	82d66ede5e	convert lockcontent api to http long polling Websockets would work, but the problem with using them for this is that each lockcontent call is a separate websocket connection. And that's an actual TCP connection. One TCP connection per file dropped would be too expensive. With http long polling, regular http pipelining can be used, so it will reuse a TCP connection. Unfortunately, at least with servant, bi-directional streams with long polling don't result in true bidirectional full duplex communication. Servant processes the whole client body stream before generating the server body stream. I think it's entirely possible to do full bi-directional communication over http, but it would need changes to servant. And, there's no way for the client to tell if the server successfully locked the content, since the server will keep processing the client stream no matter what.: So, added a new api endpoint, keeplocked. lockcontent will lock the key for 10 minutes with retention lock, and then a call to keeplocked will keep it locked for as long as needed. This does mean that there will need to be a Map of locks by key, and I will probably want to add some kind of lock identifier that lockcontent returns.	2024-07-08 12:57:46 -04:00
Joey Hess	522700d1c4	implemented servant-client support for websockets	2024-07-08 07:44:59 -04:00
Joey Hess	392b15d5c3	may have found a way to make a request for a websocket?! dunno, it compiles anyway	2024-07-07 21:51:30 -04:00
Joey Hess	9ee005e49a	dummy HasClient ClientM WebSocket Enough to let lockcontent routes be included and servant-client be used. But not enough to use servant-client with those routes. May need to implement a separate runner for that part of the protocol? Also some misc other stuff needed to use servant-client. And fix exposing of UUID in the JSON types. UUID does actually have aeson instances, but they're used elsewhere (metadata --batch, although only included to get it to compile, not actually used in there) and not suitable for use here since this must work with every possible UUID.	2024-07-07 21:21:45 -04:00
Joey Hess	bfa8c39adb	servant client mostly implemented lockcontent had to be disabled until I can implement HasClient ClientM WebSocket and in clientGet, it's not clear how to use the v1 and v0 versions, which don't have a DataLengthHeader	2024-07-07 16:08:05 -04:00
Joey Hess	9a726cedf6	servant server now compiling Just need to fill in some undefined	2024-07-07 14:48:20 -04:00
Joey Hess	1dbb5ec70d	servant API type is complete	2024-07-07 12:59:12 -04:00
Joey Hess	f452bd448a	REMOVE-BEFORE and GETTIMESTAMP proxying For clusters, the timestamps have to be translated, since each node can have its own idea about what time it is. To translate a timestamp, the proxy remembers what time it asked the node for a timestamp in GETTIMESTAMP, and applies the delta as an offset in REMOVE-BEFORE. This does mean that a remove from a cluster has to call GETTIMESTAMP on every node before dropping from nodes. Not very efficient. Although currently it tries to drop from every single node anyway, which is also not very efficient. I thought about caching the GETTIMESTAMP from the nodes on the first call. That would improve efficiency. But, since monotonic clocks on !Linux don't advance when the computer is suspended, consider what might happen if one node was suspended for a while, then came back. Its monotonic timestamp would end up behind where the proxying expects it to be. Would that result in removing when it shouldn't, or refusing to remove when it should? Have not thought it through. Either way, a cluster behaving strangly for an extended period of time because one of its nodes was briefly asleep doesn't seem like good behavior.	2024-07-04 15:09:34 -04:00
Joey Hess	99b7a0cfe9	use REMOVE-BEFORE in P2P protocol Only clusters still need to be fixed to close this todo.	2024-07-04 13:47:38 -04:00
Joey Hess	1243af4a18	toward SafeDropProof expiry checking Added Maybe POSIXTime to SafeDropProof, which gets set when the proof is based on a LockedCopy. If there are several LockedCopies, it uses the closest expiry time. That is not optimal, it may be that the proof expires based on one LockedCopy but another one has not expired. But that seems unlikely to really happen, and anyway the user can just re-run a drop if it fails due to expiry. Pass the SafeDropProof to removeKey, which is responsible for checking it for expiry in situations where that could be a problem. Which really only means in Remote.Git. Made Remote.Git check expiry when dropping from a local remote. Checking expiry when dropping from a P2P remote is not yet implemented. P2P.Protocol.remove has SafeDropProof plumbed through to it for that purpose. Fixing the remaining 2 build warnings should complete this work. Note that the use of a POSIXTime here means that if the clock gets set forward while git-annex is in the middle of a drop, it may say that dropping took too long. That seems ok. Less ok is that if the clock gets turned back a sufficient amount (eg 5 minutes), proof expiry won't be noticed. It might be better to use the Monotonic clock, but that doesn't advance when a laptop is suspended, and while there is the linux Boottime clock, that is not available on other systems. Perhaps a combination of POSIXTime and the Monotonic clock could detect laptop suspension and also detect clock being turned back? There is a potential future flag day where p2pDefaultLockContentRetentionDuration is not assumed, but is probed using the P2P protocol, and peers that don't support it can no longer produce a LockedCopy. Until that happens, when git-annex is communicating with older peers there is a risk of data loss when a ssh connection closes during LOCKCONTENT.	2024-07-04 12:39:06 -04:00
Joey Hess	5b6150e5d5	factor out Utility.MonotonicClock	2024-07-03 17:54:01 -04:00
Joey Hess	543c610a31	REMOVE-BEFORE and GETTIMESTAMP Only implemented server side, not used client side yet. And not yet implemented for proxies/clusters, for which there's a build warning about unhandled cases. This is P2P protocol version 3. Probably will be the only change in that version.. Added a dependency on clock to access a monotonic clock. On i386-ancient, that is at version 0.2.0.0.	2024-07-03 17:01:58 -04:00
Joey Hess	d2b27ca136	add content retention files This allows lockContentShared to lock content for eg, 10 minutes and if the process then gets terminated before it can unlock, the content will remain locked for that amount of time. The Windows implementation is not yet tested. In P2P.Annex, a duration of 10 minutes is used. This way, when p2pstdio or remotedaemon is serving the P2P protocol, and is asked to LOCKCONTENT, and that process gets killed, the content will not be subject to deletion. This is not a perfect solution to doc/todo/P2P_locking_connection_drop_safety.mdwn yet, but it gets most of the way there, without needing any P2P protocol changes. This is only done in v10 and higher repositories (or on Windows). It might be possible to backport it to v8 or earlier, but it would complicate locking even further, and without a separate lock file, might be hard. I think that by the time this fix reaches a given user, they will probably have been running git-annex 10.x long enough that their v8 repositories will have upgraded to v10 after the 1 year wait. And it's not as if git-annex hasn't already been subject to this problem (though I have not heard of any data loss caused by it) for 6 years already, so waiting another fraction of a year on top of however long it takes this fix to reach users is unlikely to be a problem.	2024-07-03 14:58:39 -04:00
Joey Hess	fa5e7463eb	fix display when proxied GET yields ERROR The error message is not displayed to the use, but this mirrors the behavior when a regular get from a special remote fails. At least now there is not a protocol error.	2024-07-01 11:19:02 -04:00
Joey Hess	8b5fc94d50	add optional object file location to storeKey This will be used by the next commit to simplify the proxy.	2024-07-01 10:42:27 -04:00
Joey Hess	0dfdc9f951	dup stdio handles for P2P proxy Special remotes might output to stdout, or read from stdin, which would mess up the P2P protocol. So dup the handles to avoid any such problem.	2024-07-01 10:06:29 -04:00

1 2 3 4 5 ...

304 commits