mercurial-scm/hg: mercurial/encoding.py comparison

comparison mercurial/encoding.py @ 28507:9bcbd9412225

encoding: make HFS+ ignore code Python 3 compatible unichr() doesn't exist in Python 3. chr() is the equivalent there. Unfortunately, we can't use chr() outright because Python 2 only accepts values smaller than 256. Also, Python 3 returns an int when accessing a character of a bytes type (s[x]). So, we have to ord() the values in the assert statement.

author	Gregory Szorc <gregory.szorc@gmail.com>
date	Fri, 11 Mar 2016 21:23:34 -0800
parents	b2d24c2898f9
children	3c6e94d0811c

comparison

equal deleted inserted replaced

-:10252652c6e4
+:9bcbd9412225
 from __future__ import absolute_import
 import array
 import locale
 import os
+import sys
 import unicodedata
 from . import (
 error,
 )
+if sys.version_info[0] >= 3:
+unichr = chr
 # These unicode characters are ignored by HFS+ (Apple Technote 1150,
 # "Unicode Subtleties"), so we need to ignore them in some places for
 # sanity.
 _ignore = [unichr(int(x, 16)).encode("utf-8") for x in
 "200c 200d 200e 200f 202a 202b 202c 202d 202e "
 "206a 206b 206c 206d 206e 206f feff".split()]
 # verify the next function will work
-assert set([i[0] for i in _ignore]) == set(["\xe2", "\xef"])
+if sys.version_info[0] >= 3:
+assert set(i[0] for i in _ignore) == set([ord(b'\xe2'), ord(b'\xef')])
+else:
+assert set(i[0] for i in _ignore) == set(["\xe2", "\xef"])
 def hfsignoreclean(s):
 """Remove codepoints ignored by HFS+ from s.
 >>> hfsignoreclean(u'.h\u200cg'.encode('utf-8'))

Mercurial > public > mercurial-scm > hg

comparison mercurial/encoding.py @ 28507:9bcbd9412225