mercurial-scm/hg: mercurial/encoding.py comparison

comparison mercurial/encoding.py @ 28066:d1cc07123243

encoding: change jsonmap to a list indexed by code point This is slightly faster and convenient to implement a paranoid escaping. $ python -m timeit \ -s 'from mercurial import encoding; data = str(bytearray(xrange(128)))' \ 'encoding.jsonescape(data)' original: 100000 loops, best of 3: 15.1 usec per loop this patch: 100000 loops, best of 3: 13.7 usec per loop

author	Yuya Nishihara <yuya@tcha.org>
date	Sat, 30 Jan 2016 19:41:34 +0900
parents	ffa599f3f503
children	69a02b1e947c

comparison

equal deleted inserted replaced

-:6b1fc09c699a
+:d1cc07123243
 This should be kept in sync with normcase_spec in util.h.'''
 lower = -1
 upper = 1
 other = 0
-_jsonmap = {}
+_jsonmap = []
 def jsonescape(s):
 '''returns a string suitable for JSON
 JSON is problematic for us because it doesn't support non-Unicode
 >>> jsonescape('')
 ''
 '''
 if not _jsonmap:
-for x in xrange(32):
+_jsonmap.extend("\\u%04x" % x for x in xrange(32))
-_jsonmap[chr(x)] = "\\u%04x" % x
+_jsonmap.extend(chr(x) for x in xrange(32, 256))
-for x in xrange(32, 256):
+_jsonmap[0x7f] = '\\u007f'
-c = chr(x)
+_jsonmap[0x09] = '\\t'
-_jsonmap[c] = c
+_jsonmap[0x0a] = '\\n'
-_jsonmap['\x7f'] = '\\u007f'
+_jsonmap[0x22] = '\\"'
-_jsonmap['\t'] = '\\t'
+_jsonmap[0x5c] = '\\\\'
-_jsonmap['\n'] = '\\n'
+_jsonmap[0x08] = '\\b'
-_jsonmap['\"'] = '\\"'
+_jsonmap[0x0c] = '\\f'
-_jsonmap['\\'] = '\\\\'
+_jsonmap[0x0d] = '\\r'
-_jsonmap['\b'] = '\\b'
-_jsonmap['\f'] = '\\f'
+return ''.join(_jsonmap[x] for x in bytearray(toutf8b(s)))
-_jsonmap['\r'] = '\\r'
-return ''.join(_jsonmap[c] for c in toutf8b(s))
 _utf8len = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 3, 4]
 def getutf8char(s, pos):
 '''get the next full utf-8 character in the given string, starting at pos

Mercurial > public > mercurial-scm > hg

comparison mercurial/encoding.py @ 28066:d1cc07123243