Commit graph

35660 commits

Author SHA1 Message Date
Joey Hess
9085a2cfec
make sure all sqlite selects have indexes
Bearing in mind that these indexes are really uniqueness constraints
that just happen to also make sqlite generate indexes.

In Database.ContentIndentifier, the ContentIndentifiersKeyRemoteCidIndex
is fine as a uniqueness constraint because it contains all rows from the
table. The ContentIndentifiersCidRemoteIndex is also ok because there
can only be one key for a given (cid, uuid) combination.

In Database.Export, the new ExportTreeFileKeyIndex is the same pair as
the old ExportTreeKeyFileIndex (previously ExportTreeIndex). And
in Database.Keys.SQL, the new InodeCacheKeyIndex is the same pair as the
old KeyInodeCacheIndex.
2019-10-30 13:46:52 -04:00
Joey Hess
3732f27722
document indexes
This was really confusing, though the code was ok. I think I now
understand it fully again.
2019-10-30 13:28:00 -04:00
Joey Hess
f6cfb84dfe
rename row 2019-10-30 13:22:41 -04:00
Joey Hess
aa27969e55
improve layout 2019-10-29 17:08:36 -04:00
Joey Hess
e8437ae7a3
removed now unused import 2019-10-29 17:07:15 -04:00
Joey Hess
61c9b0945d
bump version, though there is no upgrade path yet
I just don't want this branch to accidentially run in my production repos yet.
2019-10-29 17:06:35 -04:00
Joey Hess
c35a9047d3
improve data types for sqlite
This is a non-backwards compatable change, so not suitable for merging
w/o a annex.version bump and transition code. Not yet tested.

This improves performance of git-annex benchmark --databases
across the board by 10-25%, since eg Key roundtrips as a ByteString.

(serializeKey' produces a lazy ByteString, so there is still a
copy involved in converting it to a strict ByteString. It may be faster
to switch to using bytestring-strict-builder.)

FilePath and Key are both stored as blobs. This avoids mojibake in some
situations. It would be possible to use varchar instead, if persistent
could avoid converting that to Text, but it seems there is no good
way to do so. See doc/todo/sqlite_database_improvements.mdwn

Eliminated some ugly artifacts of using Read/Show serialization;
constructors and quoted strings are no longer stored in sqlite.

Renamed SRef to SSha to reflect that it is only ever a git sha,
not a ref name. Since it is limited to the characters in a sha,
it is not affected by mojibake, so still uses String.
2019-10-29 17:05:36 -04:00
Joey Hess
e1b21a0491
benchmark: Add --databases to benchmark sqlite databases
Rescued from commit 11d6e2e260 which removed
db benchmarks in favor of benchmarking arbitrary git-annex commands. Which
is nice and general, but microbenchmarks are useful too.
2019-10-29 17:05:10 -04:00
Joey Hess
25f912de5b
benchmark: Add --databases to benchmark sqlite databases
Rescued from commit 11d6e2e260 which removed
db benchmarks in favor of benchmarking arbitrary git-annex commands. Which
is nice and general, but microbenchmarks are useful too.
2019-10-29 16:59:27 -04:00
Joey Hess
27e10fdbd7
Merge branch 'master' of ssh://git-annex.branchable.com 2019-10-28 12:24:29 -04:00
Joey Hess
8bb99ff2aa
devblog 2019-10-28 12:24:13 -04:00
Simon
b1077b417f Added a comment: Maybe this is the problem... 2019-10-28 16:13:18 +00:00
Simon
3f3e2d56ab Added a comment: Should there have been a combine step? 2019-10-28 16:06:22 +00:00
Simon
c4b628b241 Added a comment: Can you give more details? 2019-10-28 16:04:52 +00:00
nordmagnus999
bb9421bda4 Added a comment: Oh yeah, use verbose flag too. 2019-10-28 15:50:42 +00:00
nordmagnus999
54cdad7630 Added a comment: A possible solution 2019-10-28 15:49:12 +00:00
Simon
acae35d6e1 2019-10-28 15:37:04 +00:00
nordmagnus999
7fd392e584 2019-10-26 09:58:54 +00:00
anthony@ad39673d230d75cbfd19d2757d754030049c7673
e4828c7e45 Added a comment: Thank you 2019-10-25 20:29:46 +00:00
nordmagnus999
72e3faa747 2019-10-25 19:54:13 +00:00
nordmagnus999
24afa8ebe5 2019-10-25 19:37:42 +00:00
Joey Hess
772a6be526
add news item for git-annex 7.20191024 2019-10-25 13:08:24 -04:00
Joey Hess
fd96408c67
releasing package git-annex version 7.20191024 2019-10-25 13:07:58 -04:00
https://tribut.de/
d5ca57a596 Added a comment 2019-10-25 12:41:55 +00:00
https://id.andreparames.com/
32adffbfe4 Added a comment: Thank you 2019-10-24 23:56:42 +00:00
Ilya_Shlyakhter
cd482f3cd6 added todo "warn of breaking changes same way git does" 2019-10-24 22:05:46 +00:00
giuly.ippoliti@c1e2f0d5e40b128900f94f3d107d3719f87c3ff7
3d0330a555 Added a comment: Triggering which stage and when? 2019-10-24 20:30:11 +00:00
Joey Hess
59b8294b2b
prep release 2019-10-24 14:40:36 -04:00
Joey Hess
4a3f3a2cb5
make git add only annex when configured by annex.largefiles 2019-10-24 14:17:29 -04:00
Joey Hess
31a5b58b2c
documentation for making git add only annex when configured by annex.largefiles
Code change should be trvial, but not yet implemented. This
significantly complicated the task of documenting how git-annex works.

I'm not sure how useful the annex.gitaddtoannex confguration is after
this change; seems that if a user has an annex.largefiles they will want
it applied consistently. But the last thing I want to hear is more
complaining from users about git add doing something they don't want it
to.

There's a pretty high risk users who got used to the git add behavior
and don't have annex.largefiles configured will miss the NEWS and
complain bitterly about their suddenly bloated repositories. Oh well.

Removed outdated comments about the old behavior to avoid confusion.
I don't know if I've found all the places that griping spread to.
2019-10-24 14:01:54 -04:00
Joey Hess
64d4a35523
Merge branch 'master' of ssh://git-annex.branchable.com 2019-10-24 11:57:43 -04:00
Joey Hess
0f7fd008d4
fix sql syntax 2019-10-24 11:57:17 -04:00
Joey Hess
098afe144e
display sqlite error message when it crashes 2019-10-24 11:50:55 -04:00
Joey Hess
168f91efec
avoid warning over name 2019-10-24 11:46:40 -04:00
jason.dixon.email@aa0e536a2ec2877d6f666108dbbc6e39bbe87ac0
ccdb34b181 Added a comment: Friendly bump to keep on the radar 2019-10-24 09:26:23 +00:00
ghen1
6dd732f1e6 Added a comment 2019-10-23 21:03:11 +00:00
anarcat
b4a9397c4a Added a comment: right! forgot about log -S 2019-10-23 20:16:26 +00:00
anarcat
2755a22c37 update from comments 2019-10-23 20:15:40 +00:00
Joey Hess
94a7bd9873
clarify 2019-10-23 15:43:31 -04:00
Joey Hess
f1b374e44e
devblog 2019-10-23 15:33:30 -04:00
Joey Hess
8fb1350a2c
Merge branch 'master' of ssh://git-annex.branchable.com 2019-10-23 15:30:55 -04:00
Joey Hess
bd197be3ad
annex.gitaddtoannex configuration
Added annex.gitaddtoannex configuration. Setting it to false prevents
git add from usually adding files to the annex.
(Unless the file was annexed before, or a renamed annexed file is detected.)

Currently left at true; some users are encouraging it be set to false.
2019-10-23 15:29:46 -04:00
Joey Hess
ec08b66bda
shouldAnnex: check isInodeKnown
Renamed unlocked files are now detected, and will always be
annexed, unless annex.largefiles disallows it.

This allows for git add's behavior to later be changed to otherwise
not annex files (whether by default or as a config option), without
worrying about the rename case.

This is not a major behavior change; annexing is still the default. But
there is one case where the behavior is changed, I think for the better:

	touch f
	git -c annex.largefiles=nothing add f
	git add bigfile
	git commit -m ...
	mv bigfile f
	git add f

Before, git-annex would see that f was previously not annexed,
and so the renamed bigfile content gets added to git. Now, it notices
that the inode is the one that bigfile used, and so it annexes it.

This potentially slows down git add a lot in some repositories because
of the poor performance of isInodeKnown when there are a lot of unlocked
files. Configuring annex.largefiles avoids the speed hit.
2019-10-23 14:49:45 -04:00
Joey Hess
94efc400e9
horrible impementation of isInodeKnown
The only good thing about it is it does not require a major version bump
to improve the database. That will need to happen at some point though.

Potentially very very slow in a large repository.

Ugly use of raw sql.
2019-10-23 14:37:29 -04:00
Ilya_Shlyakhter
2d33c62965 Added a comment: configuring git add behavior 2019-10-23 17:57:03 +00:00
Ilya_Shlyakhter
bdf6d2eb48 Added a comment: preventing git add from annexing new files 2019-10-23 17:05:42 +00:00
Joey Hess
eebf080b33
comment typo 2019-10-23 12:32:46 -04:00
Ilya_Shlyakhter
5f7036d0a7 Added a comment: searching by inode 2019-10-23 16:32:37 +00:00
Joey Hess
ff5c0ef43f
comment 2019-10-23 12:26:55 -04:00
Joey Hess
f318743f77
rename this so it does not start with a dash 2019-10-23 12:16:12 -04:00