git-annex

Author	SHA1	Message	Date
Joey Hess	67f09bca6d	fully fix fsck memory use by iterative fscking Not very well tested, but I'm sure it doesn't eg, loop forever.	2014-03-12 15:18:43 -04:00
Joey Hess	9f27339e80	remove uninofrmative warning dateUnusedLog is only used to show a timestamp in the webapp, so not worth a warning	2014-03-12 12:42:51 -04:00
Joey Hess	c2e8c21ca6	view, vfilter: Add support for filtering tags and values out of a view, using !tag and field!=value. Note that negated globs are not supported. Would have complicated the code to add them, without changing the data type serialization in a non-backwards-compatable way. This commit was sponsored by Denver Gingerich.	2014-03-02 14:53:19 -04:00
Joey Hess	a1432bce2f	Put non-object tmp files in .git/annex/misctmp, leaving .git/annex/tmp for only partially transferred objects. This allows eg, putting .git/annex/tmp on a ram disk, if the disk IO of temp object files is too annoying (and if you don't want to keep partially transferred objects across reboots). .git/annex/misctmp must be on the same filesystem as the git work tree, since files are moved to there in a way that will not work cross-device, as well as symlinked into there. I first wanted to put the tmp objects in .git/annex/objects/tmp, but that would pose transition problems on upgrade when partially transferred objects existed. git annex info does not currently show the size of .git/annex/misctemp, since it should stay small. It would also be ok to make something clean it out, periodically.	2014-02-26 16:52:56 -04:00
Joey Hess	8d5158fa31	Preserve metadata when staging a new version of an annexed file. Performance impact: When adding a large tree of new files, this needs to do some git cat-file queries to check if any of the files already existed and might need a metadata copy. I tried a benchmark in a copy of my sound repository (so there was already a significant git tree to check against. Adding 10000 small files, with a cold cache: before: 1m48.539s after: 1m52.791s So, impact is 0.0004 seconds per file added. Which seems acceptable, so did not add some kind of configuration to enable/disable this. This commit was sponsored by Lisa Feilen.	2014-02-24 14:41:33 -04:00
Joey Hess	7498c5dd96	annex.genmetadata can be set to make git-annex automatically set metadata (year and month) when adding files	2014-02-23 00:08:29 -04:00
Joey Hess	bdfc8e1f44	fix build with old version of Data.Set that lacks toDescList	2014-02-21 11:30:31 -04:00
Joey Hess	cfed7f6a5d	remove special case for tags in view branch names Just having "_" for tags=* turned out to be too hard to understand. Note that this invalidaes all current views.	2014-02-19 17:38:45 -04:00
Joey Hess	c85a482136	improve view branch name when there are a list of values	2014-02-19 16:35:00 -04:00
Joey Hess	dd7b99c860	add tip about metadata driven views (and more flexible view filtering) While writing this documentation, I realized that there needed to be a way to stay in a view like tag=* while adding a filter like tag=work that applies to the same field. So, there are really two ways a view can be refined. It can have a new "field=explicitvalue" filter added to it, which does not change the "shape" of the view, but narrows the files it shows. Or, it can have a new view added, which adds another level of subdirectories. So, added a vfilter command, which takes explicit values to add to the filter, and rejects changes that would change the shape of the view. And, made vadd only accept changes that change the shape of the view. And, changed the View data type slightly; now components that can match multiple metadata values can be visible, or not visible. This commit was sponsored by Stelian Iancu.	2014-02-19 16:29:56 -04:00
Joey Hess	39ebfa1a2e	pre-commit: Update metadata when committing changes to annexed files within a view. So the user can now switch to a view and then move files around within it to manage metadata. For example, moving a file into a new directory when in the tags=* view adds a tag to it. Implementation is fairly efficient. One diff-index, which is no more expensive than the first stage of a git commit, followed by possibly some cat-file --batch traffic to find the key (when deleting a file). Very similar to what's done in direct mode when committing. And like direct mode when updating the WC after a merge, it has to buffer the diff-tree values in order to make 2 passes over them. When not in a view, pre-commit now does one extra git symbolic-ref, which is tiny overhead. This commit was sponsored by Andrew Eskridge.	2014-02-19 14:17:58 -04:00
Joey Hess	02259d2a55	speed up currentView when not in a view Avoid reading the view log when the branch is clearly not a view branch.	2014-02-19 12:52:47 -04:00
Joey Hess	4e0be2792b	remove Read instance for Ref Removed instance, got it all to build using fromRef. (With a few things that really need to show something using a ref for debugging stubbed out.) Then added back Read instance, and made Logs.View use it for serialization. This changes the view log format.	2014-02-19 01:19:57 -04:00
Joey Hess	2bf338f443	fixed vpop	2014-02-18 21:09:25 -04:00
Joey Hess	67fd06af76	add git annex view command (And a vpop command, which is still a bit buggy.) Still need to do vadd and vrm, though this also adds their documentation. Currently not very happy with the view log data serialization. I had to lose the TDFA regexps temporarily, so I can have Read/Show instances of View. I expect the view log format will change in some incompatable way later, probably adding last known refs for the parent branch to View or something like that. Anyway, it basically works, although it's a bit slow looking up the metadata. The actual git branch construction is about as fast as it can be using the current git plumbing. This commit was sponsored by Peter Hogg.	2014-02-18 18:22:20 -04:00
Joey Hess	a18eae9a0f	nice git ack space optimisation when setting the same metadata value for multiple files	2014-02-13 01:57:43 -04:00
Joey Hess	361aee0470	avoid churning in git to no benefit when optimising metadata log I think this is now optimal.	2014-02-12 23:24:04 -04:00
Joey Hess	8076530284	improve simplifier	2014-02-12 22:50:41 -04:00
Joey Hess	a05ac13e92	fix metadata log simplifier and additional quickcheck tests	2014-02-12 22:27:55 -04:00
Joey Hess	9f7e76130e	add metadata command to get/set metadata Adds metadata log, and command. Note that unsetting field values seems to currently be broken. And in general this has had all of 2 minutes worth of testing. This commit was sponsored by Julien Lefrique.	2014-02-12 21:30:33 -04:00
Joey Hess	c390e896d1	fix windows build (and make --stop work on windows, incidentially) The Utility.PID will clean up other code soon.	2014-02-11 15:25:59 -04:00
Joey Hess	4f7e72b51a	fix parsing of unused log; keys can contain spaces	2014-02-08 15:27:11 -04:00
Joey Hess	a44e01c29c	--in can now refer to files that were located in a repository at some past date. For example, --in="here@{yesterday}"	2014-02-06 12:43:56 -04:00
Joey Hess	1572c460e8	avoid using openFile when withFile can be used Potentially fixes some FD leak if an action on an opened file handle fails for some reason. There have been some hard to reproduce reports of git-annex leaking FDs, and this may solve them.	2014-02-03 10:19:06 -04:00
Joey Hess	32f1f68dc9	typo	2014-01-28 17:17:21 -04:00
Joey Hess	f0dfac4d96	fix build with old ghc that used old-time type	2014-01-28 17:14:43 -04:00
Joey Hess	eefda291c6	fix warning	2014-01-28 14:43:20 -04:00
Joey Hess	891c85cd88	use locking on Windows This is all the easy cases, where there was already a separate lock file.	2014-01-28 14:42:03 -04:00
Joey Hess	3518c586cf	fix transfers of key with no associated file Several places assumed this would not happen, and when the AssociatedFile was Nothing, did nothing. As part of this, preferred content checks pass the Key around. Note that checkMatcher is sometimes now called with Just Key and Just File. It currently constructs a FileMatcher, ignoring the Key. However, if it constructed a FileKeyMatcher, which contained both, then it might be possible to speed up parts of Limit, which currently call the somewhat expensive lookupFileKey to get the Key. I have not made this optimisation yet, because I am not sure if the key is always the same. Will need some significant checking to satisfy myself that's the case..	2014-01-23 16:44:02 -04:00
Joey Hess	e0bd088f08	add webapp UI to manage unused files	2014-01-23 15:09:43 -04:00
Joey Hess	3da0064657	assistant unused file handling Make sanity checker run git annex unused daily, and queue up transfers of unused files to any remotes that will have them. The transfer retrying code works for us here, so eg when a backup disk remote is plugged in, any transfers to it are done. Once the unused files reach a remote, they'll be removed locally as unwanted. If the setup does not cause unused files to go to a remote, they'll pile up, and the sanity checker detects this using some heuristics that are pretty good -- 1000 unused files, or 10% of disk used by unused files, or more disk wasted by unused files than is left free. Once it detects this, it pops up an alert in the webapp, with a button to take action. TODO: Webapp UI to configure this, and also the ability to launch an immediate cleanup of all unused files. This commit was sponsored by Simon Michael.	2014-01-22 22:53:18 -04:00
Joey Hess	4b55afe9e9	add "unused" preferred content expression With a really nice optimisation that keeps it from having any overhead in normal operation! This commit was sponsored by Ulises Vitulli.	2014-01-22 16:35:32 -04:00
Joey Hess	ae3cd632bd	add timestamps to unused log files This will be used in expiring old unused objects. The timestamp is when it was first noticed it was unused. Backwards compatability: It supports reading old format unused log files. The old version of git-annex will ignore lines in log files written by the new version, so the worst interop problem would be git annex dropunused not knowing some numbers that git-annex unused reported.	2014-01-22 15:33:02 -04:00
Joey Hess	f7cdc40f7b	reorg	2014-01-21 18:08:56 -04:00
Joey Hess	0ef282a116	numcopies cleanup, part 2 This includes several bug fixes.	2014-01-21 17:25:39 -04:00
Joey Hess	b40df4f0d0	reorganize numcopies code (no behavior changes) Move stuff into Logs.NumCopies. Add a NumCopies newtype. Better names for various serialization classes that are specific to one thing or another.	2014-01-21 16:08:59 -04:00
Joey Hess	d66535f065	global numcopies setting * numcopies: New command, sets global numcopies value that is seen by all clones of a repository. * The annex.numcopies git config setting is deprecated. Once the numcopies command is used to set the global number of copies, any annex.numcopies git configs will be ignored. * assistant: Make the prefs page set the global numcopies. This global numcopies setting is needed to let preferred content expressions operate on numcopies. It's also convenient, because typically if you want git-annex to preserve N copies of files in a repo, you want it to do that no matter which repo it's running in. Making it global avoids needing to warn the user about gotchas involving inconsistent annex.numcopies settings. (See changes to doc/numcopies.mdwn.) Added a new variety of git-annex branch log file, that holds only 1 value. Will probably be useful for other stuff later. This commit was sponsored by Nicolas Pouillard.	2014-01-20 16:47:56 -04:00
Joey Hess	93161d0dea	copyright year	2014-01-08 16:29:15 -04:00
Joey Hess	3e68c1c2fd	add remote state logs This allows a remote to store a piece of arbitrary state associated with a key. This is needed to support Tahoe, where the file-cap is calculated from the data stored in it, and used to retrieve a key later. Glacier also would be much improved by using this. GETSTATE and SETSTATE are added to the external special remote protocol. Note that the state is left as-is even when a key is removed from a remote. It's up to the remote to decide when it wants to clear the state. The remote state log, $KEY.log.rmt, is a UUID-based log. However, rather than using the old UUID-based log format, I created a new variant of that format. The new varient is more space efficient (since it lacks the "timestamp=" hack, and easier to parse (and the parser doesn't mess with whitespace in the value), and avoids compatability cruft in the old one. This seemed worth cleaning up for these new files, since there could be a lot of them, while before UUID-based logs were only used for a few log files at the top of the git-annex branch. The transition code has also been updated to handle these new UUID-based logs. This commit was sponsored by Daniel Hofer.	2014-01-03 16:35:57 -04:00
Joey Hess	8e3032df2d	added GETWANTED, SETWANTED for Tobias's flickr remote This was unexpectedly difficult because of a depdenency cycle. To parse a preferred content expression involves several things that need to operate on the list of remotes. Which needs Remote.External. The only way to avoid this cycle (I tried breaking it at several points) was to skip parsing the expression in SETWANTED. That's sorta ok, because git-annex already has to deal with unparsable preferred content expressions being stored, in order to handle eg, upgrades. But I'm still not very happy that I cannot check it. I feel this is a strong indication that I need to beware of further bloating the special remote protocol interface.	2014-01-01 20:12:20 -04:00
Joey Hess	f0a6de1ca2	add PreferredContentExpression type	2014-01-01 19:58:02 -04:00
Richard Hartmann	974fe009bf	Another round of s/amoung/among/	2013-12-19 12:30:53 -04:00
Joey Hess	f931272681	syntax	2013-12-11 00:18:58 -04:00
Joey Hess	011b8bc7ec	pull in Win32-extras, to be able to get current process id in Windows Fixed up a number of things that had worked around there not being a way to get that. Most notably, transfer info files on windows now include the process id, since no locking is currently done. This means the file format varies between windows and unix.	2013-12-11 00:15:10 -04:00
Joey Hess	ecd42aef8e	different PID types for Unix and Windows Windows has a larger (unsigned) PID space, so cannot use the unix CInt there. Note that TransferInfo does not yet ever get the TransferPid populated, as there is missing locking.	2013-12-10 23:48:42 -04:00
Joey Hess	6edac746f0	merge improved fsck types from git-repair and some associated changes	2013-11-30 14:29:11 -04:00
Joey Hess	53ab737723	clean up cruft left in log by bug	2013-11-09 14:30:26 -04:00
Joey Hess	8e1b8af6e7	fix crash on empty description Caused by bug fixed in `46cf00ffd8`	2013-11-09 13:50:44 -04:00
Joey Hess	049e80e865	refactor	2013-10-28 14:05:55 -04:00
Joey Hess	d345e5b52f	add git fsck to cronner, and UI for repository repair (not yet wired up)	2013-10-22 16:02:52 -04:00
Joey Hess	92d5452a19	write via temp file	2013-10-14 16:15:38 -04:00
Joey Hess	296e21b381	add schedule command Mostly because it gives me an excuse and a hook to document the schedule expression format.	2013-10-13 15:40:38 -04:00
Joey Hess	88ec6eff15	add/remove/edit schedule UI working Once I built the basic widget, it turned out to be rather easy to replicate it once per scheduled activity and wire it all up to a fully working UI. This does abuse yesod's form handling a bit, but I think it's ok. And it would be nice to have it all ajax-y, so that saving one modified form won't lose any modifications to other forms. But for now, a nice simple 115 line of code implementation is a win. This late night hack session commit was sponsored by Andrea Rota.	2013-10-11 03:04:11 -04:00
Joey Hess	af5e1d0494	half way complete cronner thread to run scheduled activities	2013-10-08 11:48:28 -04:00
Joey Hess	b9375acb18	add schedule to vicfg	2013-10-07 17:11:13 -04:00
Joey Hess	29ca49dad4	add a log file for scheduled activities	2013-10-07 16:06:34 -04:00
Joey Hess	57d49a6d04	remove >=> and >=> ; use <$$> instead I forgot I had <$$> hidden away in Utility.Applicative. It allows doing the same kind of currying as does >=> and I found using it made the code more readable for me. (>=> was not used)	2013-09-27 19:58:48 -04:00
Joey Hess	c1990702e9	hlint	2013-09-25 23:19:01 -04:00
Joey Hess	4dc4a9a385	assistant: Clear the list of failed transfers when doing a full transfer scan. This prevents repeated retries to download files that are not available, or are not referenced by the current git tree. This is motivated by a user report that the assistant was repeatedly retrying transfers of files that had been deleted (in direct mode, so removing the only copy). Note that the glacier code retries failed transfers after a while to retry downloads that have aged long enough to be available. This is ok; if we're doing a full transfer scan we'll retry on every file that is still in the git tree. Also note that this makes the assistant less likely to get every file referenced by old revs of the git tree. Not something the assistant tries to ensure anyway, so I feel this is acceptable.	2013-09-25 11:46:17 -04:00
Joey Hess	eb42bde19a	sync, pre-commit, indirect: Avoid unnecessarily catting non-symlink files from git, which can be so large it runs out of memory.	2013-09-19 14:48:42 -04:00
Joey Hess	51ce7fcaf1	fix warning	2013-09-04 21:37:13 -04:00
Joey Hess	0831e18372	forget --drop-dead: Completely removes mentions of repositories that have been marked as dead from the git-annex branch. Wrote nice pure transition calculator, and ugly code to stage its results into the git-annex branch. Also had to split up several Log modules that Annex.Branch needed to use, but that themselves used Annex.Branch. The transition calculator is limited to looking at and changing one file at a time. While this made the implementation relatively easy, it precludes transitions that do stuff like deleting old url log files for keys that are being removed because they are no longer present anywhere.	2013-08-31 17:51:13 -04:00
Joey Hess	62beaa1a86	refactor git-annex branch log filename code into central location Having one module that knows about all the filenames used on the branch allows working back from an arbitrary filename to enough information about it to implement dropping dead remotes and doing other log file compacting as part of a forget transition.	2013-08-29 19:13:00 -04:00
Joey Hess	4a915cd3cd	add forget command Works, more or less. --dead is not implemented, and so far a new branch is made, but keys no longer present anywhere are not scrubbed. git annex sync fails to push the synced/git-annex branch after a forget, because it's not a fast-forward of the existing synced branch. Could be fixed by making git-annex sync use assistant-style sync branches.	2013-08-28 16:41:13 -04:00
Joey Hess	fcd5c167ef	untested transition detection on merging, and transition running code	2013-08-28 15:57:42 -04:00
Joey Hess	511cf77b6d	add transition log	2013-08-28 13:54:51 -04:00
Joey Hess	824241b6fb	better cases	2013-08-22 23:44:13 -04:00
Joey Hess	46b6d75274	Youtube support! (And 53 other video hosts) When quvi is installed, git-annex addurl automatically uses it to detect when an page is a video, and downloads the video file. web special remote: Also support using quvi, for getting files, or checking if files exist in the web. This commit was sponsored by Mark Hepburn. Thanks!	2013-08-22 18:50:43 -04:00
Joey Hess	a3224ce35b	avoid more build warnings on Windows	2013-08-04 14:05:36 -04:00
Joey Hess	93f2371e09	get rid of __WINDOWS__, use mingw32_HOST_OS The latter is harder for me to remember, but avoids build failures in code used by the configure program.	2013-08-02 12:27:32 -04:00
Joey Hess	7e66d260ea	importfeed: git-annex becomes a podcatcher in 150 LOC	2013-07-28 16:55:42 -04:00
Joey Hess	ec8cf85fcc	display "transfer already in progress" as a note	2013-07-17 16:16:17 -04:00
Joey Hess	7afd92d083	When a transfer is already being run by another process, proceed on to the next file, rather than dying.	2013-07-17 15:54:01 -04:00
Joey Hess	7a7e426352	moved AssociatedFile definition	2013-07-04 02:36:02 -04:00
Joey Hess	04d07f2c1f	--unused: New switch that makes git-annex operate on all data found by the last run of git annex unused. Supported by fsck, get, move, copy.	2013-07-03 15:26:59 -04:00
Joey Hess	bf86b5ca16	improve robustness of fromDirect and replaceFile Made fromDirect check that a file in the tree has good content (and is not a broken symlink either) before copying it to another file that has the same key. Made replaceFile clean up the temp file if the action that creates it, or the file replacement action fails.	2013-05-25 15:06:02 -04:00
Joey Hess	25a8d4b11c	rename module	2013-05-12 19:19:28 -04:00
Joey Hess	03e8594369	fix the day's windows permissions damage	2013-05-12 19:09:48 -04:00
Joey Hess	73d2f8b280	deal with git using / internally, even on DOS	2013-05-12 17:29:49 -05:00
Joey Hess	abe8d549df	fix permission damage (thanks, Windows)	2013-05-11 23:54:25 -04:00
Joey Hess	18bdff3fae	clean up from windows porting	2013-05-11 18:23:41 -04:00
Joey Hess	3c7e30a295	git-annex now builds on Windows (doesn't work)	2013-05-11 15:03:00 -05:00
Joey Hess	0ae8c82c53	per-IA-item content directories	2013-04-25 23:44:55 -04:00
Joey Hess	49547ad32d	initremote: If two existing remotes have the same name, prefer the one with a higher trust level.	2013-04-24 21:53:58 -04:00
Joey Hess	6be815a30c	rmurl: New command, removes one of the recorded urls for a file.	2013-04-22 17:18:53 -04:00
Joey Hess	9e11699c76	connect existing meters to the transfer log for downloads Most remotes have meters in their implementations of retrieveKeyFile already. Simply hooking these up to the transfer log makes that information available. Easy peasy. This is particularly valuable information for encrypted remotes, which otherwise bypass the assistant's polling of temp files, and so don't have good progress bars yet. Still some work to do here (see progressbars.mdwn changes), but this is entirely an improvement from the lack of progress bars for encrypted downloads.	2013-04-11 17:32:31 -04:00
Joey Hess	c9e4c218a6	fix invalidating the preferred content cache when changing a group The ConfigMonitor already did this, but groups can also be changed by eg, the webapp UI, so need to do it at this deeper level.	2013-04-08 16:43:06 -04:00
Joey Hess	9a5f421768	detect when unwanted remote is empty and remove it Needs fixes to build when the webapp is disabled.	2013-04-03 17:01:40 -04:00
Joey Hess	8a5b397ac4	hlint	2013-04-03 03:52:41 -04:00
Joey Hess	7b6cf1981f	show bytesComplete	2013-04-02 16:38:47 -04:00
Joey Hess	91b7de97e8	invalidated the wrong cache when setting preferred content	2013-03-31 19:00:14 -04:00
Joey Hess	67e817c6a1	New annex.largefiles setting, which configures which files `git annex add` and the assistant add to the annex. I would have sort of liked to put this in .gitattributes, but it seems it does not support multi-word attribute values. Also, making this a single config setting makes it easy to only parse the expression once. A natural next step would be to make the assistant `git add` files that are not annex.largefiles. OTOH, I don't think `git annex add` should `git add` such files, because git-annex command line tools are not in the business of wrapping git command line tools.	2013-03-29 16:17:13 -04:00
Joey Hess	cf07a2c412	webapp: Progess bar fixes for many types of special remotes. There was confusion in different parts of the progress bar code about whether an update contained the total number of bytes transferred, or the number of bytes transferred since the last update. One way this bug showed up was progress bars that seemed to stick at zero for a long time. In order to fix it comprehensively, I add a new BytesProcessed data type, that is explicitly a total quantity of bytes, not a delta. Note that this doesn't necessarily fix every problem with progress bars. Particularly, buffering can now cause progress bars to seem to run ahead of transfers, reaching 100% when data is still being uploaded.	2013-03-28 17:04:37 -04:00
Joey Hess	e9048ecec8	get, copy, move: Display an error message when an identical transfer is already in progress, rather than failing with no indication why.	2013-03-19 13:56:20 -04:00
Joey Hess	b543842a7f	optimisation for transfers to drives that are not plugged in Rather than forking a git-annex transferkey only to have it fail, just immediately record the failed transfer (so when the drive is plugged in, the scan will retry it).	2013-03-18 20:40:24 -04:00
Joey Hess	a1b6d2e057	show an error message if garbage is provided to dropunused	2013-03-03 20:04:24 -04:00
Joey Hess	46c9cbeb1e	add additional debug info about reasons for transfers	2013-03-01 15:23:59 -04:00
Joey Hess	24316f6562	improve imports	2013-02-27 21:48:46 -04:00
Joey Hess	a2f17146fa	move Arbitrary instances out of Test and into modules that define the types This is possible now that we build-depend on QuickCheck.	2013-02-27 21:42:07 -04:00
Joey Hess	4008590c68	type based git config handling for remotes Still a couple of places that use git config ad-hoc, but this is most of it done.	2013-01-01 13:58:14 -04:00
Joey Hess	1702409f00	check	2012-12-20 00:08:30 -04:00
Joey Hess	df90a2acd5	another quickcheck	2012-12-20 00:02:33 -04:00
Joey Hess	8491917d04	more quickcheck fun and the code gets better..	2012-12-19 22:14:12 -04:00
Joey Hess	bf71d42681	quickcheck test for transfer info read/write code Fixed a bug the quickcheck turned up.	2012-12-19 16:15:39 -04:00
Joey Hess	7da2e27293	Bugfix: Fixed bug parsing transfer info files The newline after the filename was included in it. This was generally benign -- mostly these filenames are just displayed, and the newline didn't matter. But in the assistant, it caused unexpected dropping of preferred content. A characteristic of this bug is that the drop was displayed like this: drop some_file ok	2012-12-19 14:17:01 -04:00
Joey Hess	ffdd08fd2e	Merge branch 'master' into desymlink	2012-12-13 00:46:10 -04:00
Joey Hess	0d50a6105b	whitespace fixes	2012-12-13 00:45:27 -04:00
Joey Hess	e7b8cb0063	direct mode committing	2012-12-12 19:20:38 -04:00
Joey Hess	99a8a5297c	--auto fixes * get/copy --auto: Transfer data even if it would exceed numcopies, when preferred content settings want it. * drop --auto: Fix dropping content when there are no preferred content settings.	2012-12-06 13:22:16 -04:00
Joey Hess	ea5d7292e6	dropping from web	2012-11-29 17:01:07 -04:00
Joey Hess	2172cc586e	where indenting	2012-11-11 00:51:07 -04:00
Joey Hess	ec337baaee	add trustExclude	2012-11-11 00:24:32 -04:00
Joey Hess	c6fbed48a1	bugfix: Don't fail transferring content from read-only repos. Closes: #691341 This used to work, but got broken when the transfer info files were added, as it failed writing them on the readonly filesystem.	2012-10-24 10:59:25 -04:00
Joey Hess	452e6819d0	!! removal	2012-10-21 00:51:42 -04:00
Joey Hess	c7c2015435	add ConfigMonitor thread Monitors git-annex branch for changes, which are noticed by the Merger thread whenever the branch ref is changed (either due to an incoming push, or a local change), and refreshes cached config values for modified config files. Rate limited to run no more often than once per minute. This is important because frequent git-annex branch changes happen when files are being added, or transferred, etc. A primary use case is that, when preferred content changes are made, and get pushed to remotes, the remotes start honoring those settings. Other use cases include propigating repository description and trust changes to remotes, and learning when a remote has added a new special remote, so the webapp can present the GUI to enable that special remote locally. Also added a uuid.log cache. All other config files already had caches.	2012-10-20 16:43:35 -04:00
Joey Hess	40aab719df	Replace "in=" with "present" in preferred content expressions in= was problimatic in two ways. First, it referred to a remote by name, but preferred content expressions can be evaluated elsewhere, where that remote doesn't exist, or a different remote has the same name. This name lookup code could error out at runtime. Secondly, in= seemed pretty useless. in=here did not cause content to be gotten, but it did let present content be dropped. present is more useful, although "not present" is unstable and should be avoided.	2012-10-19 16:09:21 -04:00
Joey Hess	e7780a39f5	Preferred content path matching bugfix. When in a subdir, both the normal filepath, and the filepath relative to the top of the git repo are needed for matching. The former for key lookup, and the latter for include/exclude to match against. Previously, key lookup didn't work in this situation.	2012-10-17 16:01:09 -04:00
Joey Hess	c78975babb	avoid duplicate code with a more generic monadic matcher Interesting type signature ghc derived for this: forall o (m :: * -> *). Monad m => Matcher o -> (o -> m Bool) -> m Bool	2012-10-13 15:17:15 -04:00
Joey Hess	7aef34f501	implement saving of repository settings	2012-10-10 19:13:49 -04:00
Joey Hess	4e2e08b45a	ui for selecting a repository group	2012-10-10 16:23:41 -04:00
Joey Hess	39be7eea40	add standard group selector to repo edit form	2012-10-10 16:04:28 -04:00
Joey Hess	9da7dd8874	webapp: configure new repos to use the standard preferred content settings	2012-10-10 15:35:10 -04:00
Joey Hess	3490977d97	webapp: put new repos in standard groups I'm using transfer for most things, both removable drives and cloud storage, because it's the safest choice. We'll see if it makes sense to prompt for the group when setting this up, or let the user pick something else after the fact.	2012-10-10 15:27:25 -04:00
Joey Hess	f9b81c7a75	refactor	2012-10-10 15:15:56 -04:00
Joey Hess	0c88d9395d	standard preferred content settings for client, transfer, backup, and archive repositories I've designed these to work well together, I hope. If I get it wrong, I can just change the code in one place, since these expressions won't be stored in the git-annex branch.	2012-10-10 13:54:40 -04:00
Joey Hess	b6ce003843	rename --ingroup to --inallgroup	2012-10-10 12:59:45 -04:00
Joey Hess	e375b931c0	add --ingroup limit	2012-10-08 15:18:58 -04:00
Joey Hess	7cd81bd978	Added --smallerthan and --largerthan limits	2012-10-08 13:39:18 -04:00
Joey Hess	71fd18a97f	wired preferred content up to get, copy, and drop --auto	2012-10-08 13:16:53 -04:00
Joey Hess	7bb4d507ba	add AssumeNotPresent parameter to limits Solves the issue with preferred content expressions and dropping that I mentioned yesterday. My solution was to add a parameter to specify a set of repositories where content should be assumed not to be present. When deciding whether to drop, it can put the current repository in, and then if the expression fails to match, the content can be dropped. Using yesterday's example "(not copies=trusted:2) and (not in=usbdrive)", when the local repo is one of the 2 trusted copies, the drop check will see only 1 trusted copy, so the expression matches, and so the content will not be dropped.	2012-10-05 16:52:44 -04:00
Joey Hess	bc649a35ba	added preferred-content log, and allow editing it with vicfg This includes a full parser for the boolean expressions in the log, that compiles them into Matchers. Those matchers are not used yet. A complication is that matching against an expression should never crash git-annex with an error. Instead, vicfg checks that the expressions parse. If a bad expression (or an expression understood by some future git-annex version) gets into the log, it'll be ignored. Most of the code in Limit couldn't fail anyway, but I did have to make limitCopies check its parameter first, and return an error if it's bad, rather than erroring at runtime.	2012-10-04 16:00:19 -04:00
Joey Hess	7a7f63182c	vicfg: New command, allows editing (or simply viewing) most of the repository configuration settings stored in the git-annex branch. Incomplete; I need to finish parsing and saving. This will also be used for editing transfer control expresssions. Removed the group display from the status output, I didn't really like that format, and vicfg can be used to see as well as edit rempository group membership.	2012-10-03 17:04:52 -04:00
Joey Hess	717e008390	status: display repository groups	2012-10-02 13:45:30 -04:00
Joey Hess	5bd5bc094a	simplify	2012-10-01 15:17:21 -04:00
Joey Hess	2a96b1aab3	group, ungroup: New commands to indicate groups of repositories.	2012-10-01 15:12:04 -04:00
Joey Hess	3887432c54	fixes for transfer resume Fix resuming of downloads, which do not have a transfer info file to read. When checking upload progress, use the MVar, rather than re-reading the info file. Catch exceptions in the transfer action. Required a tryAnnex.	2012-09-24 13:18:16 -04:00
Joey Hess	d77ff5dadd	changelog and minor cleanup to fix mixed spaces/tabs	2012-09-23 15:42:05 -04:00
Joey Hess	0732d4c8ef	Merge remote-tracking branch 'npouillard/trustedcopies'	2012-09-23 15:35:00 -04:00
Nicolas Pouillard	f0bcc77fb2	Limiting the number of copies per trustlevel The --copies flag now takes an argument of the form: trustlevel:number or number If a trust level is specified the command is limited to files with at least 'number' copies of this 'trustlevel'.	2012-09-23 19:57:21 +02:00
Joey Hess	df07ccf404	make the assistant retry failed transfers When a transfer fails, the progress info can be used to intelligently retry it. If the transfer managed to make some progress, but did not fully complete, then there's a good chance that a retry will finish it (or at least make more progress).	2012-09-23 13:27:13 -04:00
Joey Hess	77af38ec6c	git-annex-shell transferinfo command TODO: Use this when running sendkey, to feed back transfer info from the client side rsync.	2012-09-21 16:23:25 -04:00
Joey Hess	34ca1d698c	avoid updating transfer info file until another 1% of the total has been transferred	2012-09-21 15:11:45 -04:00
Joey Hess	226781c047	unify types	2012-09-21 14:50:14 -04:00
Joey Hess	06ed6ceac4	fix reading of transfer info files with a bytesComplete value	2012-09-20 16:40:48 -04:00
Joey Hess	aff09a1f33	add a progress callback to storeKey, and threaded it all the way through Transfer info files are updated when the callback is called, updating the number of bytes transferred. Left unused p variables at every place the callback should be used. Which is rather a lot..	2012-09-19 16:08:37 -04:00
Joey Hess	18bae020ed	make other repositories list list all autostarted repos And add a form to add another, unrelated repository	2012-09-18 17:50:07 -04:00
Joey Hess	7a86dc9443	cleanup	2012-09-17 14:58:43 -04:00
Joey Hess	e8188ea611	flip catchDefaultIO	2012-09-17 00:18:07 -04:00
Joey Hess	0b12db64d8	Avoid crashing on encoding errors in filenames when writing transfer info files and reading from checksum commands.	2012-09-16 01:53:06 -04:00
Joey Hess	476d36ed16	stupid typo	2012-08-29 15:32:57 -04:00
Joey Hess	99525f8454	when canceling a transfer, also cancel all other downloads of the same key	2012-08-29 15:24:09 -04:00
Joey Hess	93037580b6	fix resume button Change alterTransferInfo to not merge in old values, including transferPaused.	2012-08-29 14:14:57 -04:00
Joey Hess	19e8f1ca0e	don't show "unknown" as the percent complete for transferinfo with no bytesComplete value	2012-08-28 14:31:30 -04:00
Joey Hess	1296cfb09a	avoid possibly re-adding a removed transfer when updating its info Doesn't fix the bug I thought it'd fix, but is clearly correct.	2012-08-28 14:19:11 -04:00
Joey Hess	ab5e409a95	keep track of which remotes have been scanned in process state Since it turned out to make sense to always scan all remotes on startup, there's no need to persist the info about which have been scanned.	2012-08-24 15:52:23 -04:00
Joey Hess	715a9a2f8e	keep logs of failed transfers, and requeue them when doing a non-full scan of a remote	2012-08-23 15:24:15 -04:00
Joey Hess	487bdf0e24	add transfer scanned flag files	2012-08-23 13:42:26 -04:00
Joey Hess	8ba9830653	implement pausing of transfers A paused transfer's thread keeps running, keeping the slot in use. This is intentional; pausing a transfer should not let other queued transfers to run in its place.	2012-08-10 18:42:44 -04:00
Joey Hess	94fcd0cf59	add routes to pause/start/cancel transfers This commit includes a paydown on technical debt incurred two years ago, when I didn't know that it was bad to make custom Read and Show instances for types. As the routes need Read and Show for Transfer, which includes a Key, and deriving my own Read instance of key was not practical, I had to finally clean that up. So the compact Key read and show functions are now file2key and key2file, and Read and Show are now derived instances. Changed all code that used the old instances, compiler checked. (There were a few places, particularly in Command.Unused, and the test suite where the Show instance continue to be used for legitimate comparisons; ie show key_x == show key_y (though really in a bloom filter))	2012-08-08 16:20:24 -04:00
Joey Hess	7e2d07484f	Merge branch 'master' into assistant	2012-08-07 13:31:43 -04:00
Joey Hess	2a9077f4e9	fix transfer log cleanup crash Avoid crashing when "git annex get" fails to download from one location, and falls back to downloading from a second location. The problem is that git annex get calls download recursively from within itself if the first download attempt fails. So the first time through, it writes a transfer info file, which is then overwritten on the second, recursive call. Then on cleanup, it tries to delete the file twice, which of course doesn't work. Fixed both by not crashing if the transfer file is removed, and by changing Get to not run download recursively like that. It's the only thing that did so, and it just seems like a bad idea.	2012-08-07 13:30:08 -04:00
Joey Hess	0f6292920a	webapp now displays the real running and queued transfers yowza!!!	2012-07-27 11:47:34 -04:00
Joey Hess	21d35f88d8	pull in transfer log code from assistant branch New log file format.	2012-07-18 21:45:41 -04:00
Joey Hess	549f861999	fix parsing of startedTime	2012-07-18 20:48:08 -04:00
Joey Hess	cf47bb3f50	run file transfers in threads, not processes This should fix OSX/BSD issues with not noticing transfer information files with kqueue. Now that threads are used, the thread can manage the transfer slot allocation and deallocation by itself; much cleaner.	2012-07-18 19:15:34 -04:00
Joey Hess	eea0a3616c	add thread id field to transferinfo Also converted its timestand to posix seconds, like is used in the other log files.	2012-07-18 18:42:41 -04:00
Joey Hess	d53f70e203	avoid parsing lock files as transfer files This seems to happen with kqueue, not inotify. The newly added lck file triggers an add event and was then parsed as a transfer file.	2012-07-17 17:26:53 -04:00
Joey Hess	b702bae950	bugfix	2012-07-17 17:22:00 -04:00
Joey Hess	9ab9ef3ebd	change transfer lock filenames to avoid ambiguity foo.lck could be a lock file for a transfer of foo, or a transfer of a key that happened to end in ".lck". Fix this by using "lck.foo" instead.	2012-07-17 17:16:30 -04:00
Joey Hess	9379c77fb3	split transfer info and lock files Since the lock file has to be kept open, this prevented the TransferWatcher from noticing when it appeared, since inotify (and more importantly kqueue) events happen when a new file is closed. Writing a separate info file fixes that problem.	2012-07-07 11:47:36 -06:00
Joey Hess	62876502c5	wait on child transfer processes, and invalidate cache There's still a bug; if the child updates its transfer info file, then the data from it will superscede the TransferInfo, losing the info that we should wait on this child.	2012-07-06 16:44:13 -06:00
Joey Hess	a92f5589fc	unfinished (and unbuildable) work toward separate transfer processes	2012-07-05 18:57:06 -06:00
Joey Hess	71b5ad8398	wrote transfer thread finally!	2012-07-05 14:34:20 -06:00
Joey Hess	4845b59413	startedTime needs to be a Maybe to handle transfers that have not started yet This changes the file format.	2012-07-02 16:17:06 -04:00
Joey Hess	c9d7e9f6bd	startedTime needs to be a Maybe to handle transfers that have not started yet This changes the file format.	2012-07-02 16:06:52 -04:00
Joey Hess	0c0fd0c54c	update	2012-07-02 13:49:27 -04:00
Joey Hess	8f6c2e6081	fix reading of empty filename from transfer info file	2012-07-02 11:02:47 -04:00
Joey Hess	9517fbb948	cleanup	2012-07-02 08:35:15 -04:00
Joey Hess	bea0ac0274	record transfers for git-annex-shell Not yet tested and places git-annex-shell is run need to be modified to pass the new field settings. Note that rsyncServerSend was changed to fork, rather than directly exec rsync, because it needs to keep the transfer lock held, and clean up the transfer log when done.	2012-07-02 01:31:10 -04:00
Joey Hess	7225c2bfc0	record transfer information on local git remotes In order to record a semi-useful filename associated with the key, this required plumbing the filename all the way through to the remotes' storeKey and retrieveKeyFile. Note that there is potential for deadlock here, narrowly avoided. Suppose the repos are A and B. A sends file foo to B, and at the same time, B gets file foo from A. So, A locks its upload transfer info file, and then locks B's download transfer info file. At the same time, B is taking the two locks in the opposite order. This is only not a deadlock because the lock code does not wait, and aborts. So one of A or B's transfers will be aborted and the other transfer will continue. Whew!	2012-07-01 17:15:11 -04:00
Joey Hess	8c10f37714	bugfixes fdToHandle seems to close the fd avoid excess trailing newline	2012-07-01 17:15:11 -04:00
Joey Hess	72988bae34	tested; bugfixes	2012-07-01 17:15:11 -04:00
Joey Hess	be0e38bcc3	add transfer information files	2012-07-01 17:15:11 -04:00
Joey Hess	29335bf326	pointlessness	2012-06-29 10:00:05 -04:00
Joey Hess	8c09c17f6b	use strict insertWith	2012-05-04 00:44:11 -04:00
Joey Hess	32de288c35	syntax tweaks Although I hate to lose one of the only places I've ever used the list monad..	2012-05-02 19:51:41 -04:00
Joey Hess	392931eca9	addunused: New command, the opposite of dropunused, it relinks unused content into the git repository.	2012-05-02 14:59:05 -04:00
Joey Hess	ed79596b75	noop	2012-04-21 23:32:33 -04:00
Joey Hess	184a69171d	removed another 10 lines via ifM	2012-03-16 01:59:07 -04:00
Joey Hess	7e17151e69	revert hlint change broke a test	2012-02-20 15:37:31 -04:00
Joey Hess	0cbbf0da79	warning	2012-02-18 11:54:47 -04:00
Joey Hess	0fada43808	avoid unnecessary log changes when re-adding the same url	2012-02-17 23:58:56 -04:00
Joey Hess	5bf07b3b5c	Store web special remote url info in a more efficient location. storing it in remotes/web/xx/yy/foo.log meant lots of extra directory objects in git. Now I use xx/yy/foo.log.web, which is just as unique, but more efficient since foo.log is there anyway. Of course, it still looks in the old location too.	2012-02-17 23:15:29 -04:00
Joey Hess	a1e52f0ce5	hlint	2012-02-16 00:44:51 -04:00
Joey Hess	abdacf58ed	tweaks	2012-01-11 00:06:54 -04:00
Joey Hess	07cacbeee9	break module dependancy loop A PITA but worth it to clean up the trust configuration code.	2012-01-10 13:32:38 -04:00
Joey Hess	0d5c402210	Add annex-trustlevel configuration settings, which can be used to override the trust level of a remote. This overrides the trust.log, and is overridden by the command-line trust parameters. It would have been nicer to have Logs.Trust.trustMap just look up the configuration for all remotes, but a dependency loop prevented that (Remotes depends on Logs.Trust in several ways). So instead, look up the configuration when building remotes, storing it in the same forcetrust field used for the command-line trust parameters.	2012-01-09 23:31:44 -04:00
Joey Hess	a3a9f87047	log: New command that displays the location log for file, showing each repository they were added to and removed from. This needs to run git log on the location log files to get at all past versions of the file, which tends to be a bit slow. It would be possible to make a version optimised for showing the location logs for every key. That would only need to run git log once, so would be faster, but it would need to process an enormous amount of data, so would not speed up the individual file case. In the future it would be nice to support log --format. log --json also doesn't work right yet.	2012-01-06 15:40:07 -04:00
Joey Hess	95d2391f58	more partial function removal Left a few Prelude.head's in where it was checked not null and too hard to remove, etc.	2011-12-15 18:19:36 -04:00
Joey Hess	b7e0d39abb	remove some partial functions A few were too hard to get rid of, and safe since the code does check for an empty line.	2011-12-15 16:59:48 -04:00
Joey Hess	d64132a43a	hslint	2011-12-09 01:57:13 -04:00
Joey Hess	f0cc42685e	fix display of dead repositories in status	2011-12-02 19:21:56 -04:00
Joey Hess	251c01d51e	dead: A command which says that a repository is gone for good and you don't want git-annex to mention it again.	2011-12-02 16:59:55 -04:00
Mark Wright	041d324125	Remove haskell98 to build with ghc 7.2.2, also built with ghc 7.0.4 Signed-off-by: Joey Hess <joey@kitenet.net>	2011-11-26 12:05:08 -04:00
Joey Hess	c50a5fbeb4	status: Include all special remotes in the list of repositories. Special remotes do not always have a description listed in uuid.log, and such ones were not listed before.	2011-11-18 13:22:48 -04:00
Joey Hess	2bb6b02948	When not run in a git repository, git-annex can still display a usage message, and "git annex version" even works. Things that sound simple, but are made hard by the Annex monad being built with the assumption that there will always be a git repo.	2011-11-16 00:49:09 -04:00
Joey Hess	9b71b5f26c	fix display of semitrusted repos in status semitrusted uuids rarely are listed in trust.log, so a special case is needed to get a list of them. Take the difference of all known uuids with non-semitrusted uuids.	2011-11-16 00:01:07 -04:00
Joey Hess	826d5887b2	Automatically fix up badly formatted uuid.log entries produced by 3.20111105, whenever the uuid.log is changed (ie, by init or describe).	2011-11-11 13:42:31 -04:00
Joey Hess	637b5feb45	lint	2011-11-11 01:52:58 -04:00
Joey Hess	56b8194470	cleanup	2011-11-09 01:33:20 -04:00
Joey Hess	b11a63a860	clean up read/show abuse Avoid ever using read to parse a non-haskell formatted input string. show :: Key is arguably still show abuse, but displaying Keys as filenames is just too useful to give up.	2011-11-08 00:17:54 -04:00
Joey Hess	63a292324d	add a UUID type Should have done this a long time ago.	2011-11-07 15:59:16 -04:00
Joey Hess	eec137f33a	Record uuid when auto-initializing a remote so it shows in status.	2011-11-02 14:18:21 -04:00
Joey Hess	2566eb85fe	fsck: Now works in bare repositories. Checks location log information, and file contents. Does not check that numcopies is satisfied, as .gitattributes information about numcopies is not available in a bare repository. In practice, that should not be a problem, since fsck is also run in a checkout and will check numcopies there.	2011-10-29 18:03:28 -04:00
Joey Hess	ab738a403a	status: Now always shows the current repository, even when it does not appear in uuid.log.	2011-10-28 19:49:01 -04:00
Joey Hess	ee9af605bc	break out non-log stuff to separate module	2011-10-15 17:47:03 -04:00
Joey Hess	ec169f84b1	migrate: Copy url logs for keys when migrating.	2011-10-15 16:36:56 -04:00
Joey Hess	b4015064e1	break web log handling into a separate module	2011-10-15 16:25:51 -04:00
Joey Hess	1a29b5b52e	reorganize log modules no code changes	2011-10-15 16:21:08 -04:00

... 3 4 5 6 7 ...

419 commits