mercurial-scm/hg-stable: mercurial/revlog.py comparison

comparison mercurial/revlog.py @ 34898:1bde8e8e5de0

sparse-read: ignore trailing empty revs in each read chunk

An empty entry in the revlog may happen for two reasons:
- when the file is empty, and the revlog stores a snapshot;
- when there is a merge and both parents were identical.

`hg debugindex -m | awk '$3=="0"{print}' | wc -l` gives 1917 such entries in my clone of pypy, and 113 in my clone of mercurial.

These empty revisions may be located at the end of a sparse chain, and in some special cases may lead to reading relatively large amounts of data for nothing.

author	Paul Morelle <paul.morelle@octobus.net>
date	Wed, 18 Oct 2017 15:28:19 +0200
parents	8c9b08a0c48c
children	6226668a7169


-:2e350d2a0eca
+:1bde8e8e5de0
         s = hashlib.sha1(a)
         s.update(b)
     s.update(text)
     return s.digest()

+def _trimchunk(revlog, revs, startidx, endidx=None):
+    """returns revs[startidx:endidx] without empty trailing revs
+    """
+    length = revlog.length
+    if endidx is None:
+        endidx = len(revs)
+    # Trim empty revs at the end, but never the very first revision of a chain
+    while endidx > 1 and endidx > startidx and length(revs[endidx - 1]) == 0:
+        endidx -= 1
+    return revs[startidx:endidx]

 def _slicechunk(revlog, revs):
     """slice revs to reduce the amount of unrelated data to be read from disk.
     ``revs`` is sliced into groups that should be read in one time.
     Assume that revs are sorted.

     prevend = None
     for i, rev in enumerate(revs):
         revstart = start(rev)
         revlen = length(rev)
+        # Skip empty revisions to form larger holes
+        if revlen == 0:
+            continue
         if prevend is not None:
             gapsize = revstart - prevend
             # only consider holes that are large enough
             if gapsize > revlog._srmingapsize:
                 heapq.heappush(gapsheap, (-gapsize, i))

     # Cut the revs at collected indices
     previdx = 0
     while indicesheap:
         idx = heapq.heappop(indicesheap)
-        yield revs[previdx:idx]
+        chunk = _trimchunk(revlog, revs, previdx, idx)
+        if chunk:
+            yield chunk
         previdx = idx
-    yield revs[previdx:]
+    chunk = _trimchunk(revlog, revs, previdx)
+    if chunk:
+        yield chunk

 # index v0:
 #  4 bytes: offset
 #  4 bytes: compressed length
 #  4 bytes: base rev
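
The trimming rule added by this change can be tried in isolation. The following is a minimal sketch, not the code Mercurial runs: `FakeRevlog` is a hypothetical stand-in that only provides `length(rev)`, the revision lengths are made up, and `trimchunk` copies the logic of `_trimchunk` from the diff above.

```python
class FakeRevlog:
    """Hypothetical stand-in for a revlog; only length(rev) is provided."""

    def __init__(self, lengths):
        self._lengths = lengths

    def length(self, rev):
        return self._lengths[rev]


def trimchunk(revlog, revs, startidx, endidx=None):
    """Return revs[startidx:endidx] without empty trailing revs."""
    length = revlog.length
    if endidx is None:
        endidx = len(revs)
    # Drop zero-length revs from the end, but keep the first rev of the chain.
    while endidx > 1 and endidx > startidx and length(revs[endidx - 1]) == 0:
        endidx -= 1
    return revs[startidx:endidx]


# Made-up stored lengths: revs 1, 3 and 4 are empty.
rl = FakeRevlog([10, 0, 25, 0, 0])
revs = [0, 1, 2, 3, 4]

whole = trimchunk(rl, revs, 0)       # trailing revs 3 and 4 are dropped
head = trimchunk(rl, revs, 0, 2)     # rev 1 is empty and trailing in this slice
tail = trimchunk(rl, revs, 3)        # only empty revs: nothing left to read
single = trimchunk(FakeRevlog([0]), [0], 0)  # first rev of a chain is kept

print(whole, head, tail, single)
```

An empty result such as `tail` is why the patched `_slicechunk` checks `if chunk:` before yielding: a chunk that contains only empty revisions needs no disk read at all.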
