sapling/eden/scm/tests/test-encoding.t

#require py2
Test character encoding

  $ hg init t
  $ cd t

we need a repo with some legacy latin-1 changesets

  $ hg unbundle "$TESTDIR/bundles/legacy-encoding.hg"
  adding changesets
  adding manifests
  adding file changes
  added 2 changesets with 2 changes to 1 files
  $ hg co
  1 files updated, 0 files merged, 0 files removed, 0 files unresolved
  $ $PYTHON << EOF
  > f = open('latin-1', 'wb'); _ = f.write(b"latin-1 e' encoded: \xe9"); f.close()
  > f = open('utf-8', 'wb'); _ = f.write(b"utf-8 e' encoded: \xc3\xa9"); f.close()
  > f = open('latin-1-tag', 'wb'); _ = f.write(b"\xe9"); f.close()
  > EOF

should fail with encoding error

  $ echo "plain old ascii" > a
  $ hg st
  M a
  ? latin-1
  ? latin-1-tag
  ? utf-8
  $ HGENCODING=ascii hg ci -l latin-1
  abort: decoding near ' encoded: \xe9': 'utf8' codec can't decode byte 0xe9 in position 20: unexpected end of data! (esc)
  [255]

these should work

  $ echo "latin-1" > a
  $ HGENCODING=latin-1 hg ci -l latin-1
  $ echo "utf-8" > a
  $ HGENCODING=utf-8 hg ci -l utf-8

hg log (ascii)

  $ hg --encoding ascii log
  commit:      ca661e7520de
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     utf-8 e' encoded: ?
  
  commit:      650c6f3d55dd
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     latin-1 e' encoded: ?
  
  commit:      0e5b7e3f9c4a
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     koi8-r: ????? = u'\u0440\u0442\u0443\u0442\u044c'
  
  commit:      1e78a93102a3
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     latin-1 e': ? = u'\xe9'
  

hg log (latin-1)

  $ hg --encoding latin-1 log
  commit:      ca661e7520de
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     utf-8 e' encoded: \xe9 (esc)
  
  commit:      650c6f3d55dd
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     latin-1 e' encoded: \xe9 (esc)
  
  commit:      0e5b7e3f9c4a
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     koi8-r: \xd2\xd4\xd5\xd4\xd8 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)
  
  commit:      1e78a93102a3
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     latin-1 e': \xe9 = u'\\xe9' (esc)
  

hg log (utf-8)

  $ hg --encoding utf-8 log
  commit:      ca661e7520de
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     utf-8 e' encoded: \xc3\xa9 (esc)
  
  commit:      650c6f3d55dd
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     latin-1 e' encoded: \xc3\xa9 (esc)
  
  commit:      0e5b7e3f9c4a
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     koi8-r: \xc3\x92\xc3\x94\xc3\x95\xc3\x94\xc3\x98 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)
  
  commit:      1e78a93102a3
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     latin-1 e': \xc3\xa9 = u'\\xe9' (esc)
  

hg log (utf-8)

  $ HGENCODING=utf-8 hg log
  commit:      ca661e7520de
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     utf-8 e' encoded: \xc3\xa9 (esc)
  
  commit:      650c6f3d55dd
  user:        test
  date:        Thu Jan 01 00:00:00 1970 +0000
  summary:     latin-1 e' encoded: \xc3\xa9 (esc)
  
  commit:      0e5b7e3f9c4a
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     koi8-r: \xc3\x92\xc3\x94\xc3\x95\xc3\x94\xc3\x98 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)
  
  commit:      1e78a93102a3
  user:        test
  date:        Mon Jan 12 13:46:40 1970 +0000
  summary:     latin-1 e': \xc3\xa9 = u'\\xe9' (esc)
  

hg log (dolphin)

  $ HGENCODING=dolphin hg log
  abort: unknown encoding: dolphin
  (please check your locale settings)
  [255]
  $ cp latin-1-tag .hg/branch
  $ HGENCODING=latin-1 hg ci -m 'auto-promote legacy name'

  $ cd ..

Test roundtrip encoding/decoding of utf8b for generated data

#if hypothesis

  >>> from hypothesishelpers import *
  >>> from edenscm.mercurial import encoding
  >>> roundtrips(st.binary(), encoding.fromutf8b, encoding.toutf8b)
  Round trip OK

#endif
mark all tests requiring Python 2 Summary: This diff marks ALL mercurial tests requiring Python 2 feature. After you fixes some tests, simply remove the `py2` feature requirement and that tests will be continuously run after your diff is landed. To bypass this feature requirement, run the tests command with `HGTEST_FORCE_PY2=1`. For example: ``` HGTEST_FORCE_PY2=1 buck test //eden/scm/tests:hg_run_tests ``` or ``` HGTEST_FORCE_PY2=1 python run-tests.py ``` ---- Basically this diff are created with the following commands: ``` $ sed -i 's/import feature\(.\)$/import feature\1\n\nfeature.require(["py2"])/' test--t.py $ sed -i '1s/^/#require py2\n/' test-.t $ ls \| grep -P "^test.(?<\!-t)\.py$" > list && vim -p $(cat list) # manually adding feature requires for these Python tests. ``` (Note: this ignores all push blocking failures!) ignore-conflict-markers Reviewed By: singhsrb Differential Revision: D19655148 fbshipit-source-id: 985e3ccb4010cc559049f1d89f8909bc2d9b5e20 2020-01-31 05:40:49 +03:00			`#require py2`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`Test character encoding`

			`$ hg init t`
			`$ cd t`

			`we need a repo with some legacy latin-1 changesets`

tests: make tests work if directory contains special characters With this quoting tests will work e.g. in "/tmp/foo bar/mercurial/". 2012-04-03 21:06:35 +04:00			`$ hg unbundle "$TESTDIR/bundles/legacy-encoding.hg"`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`adding changesets`
			`adding manifests`
			`adding file changes`
			`added 2 changesets with 2 changes to 1 files`
			`$ hg co`
			`1 files updated, 0 files merged, 0 files removed, 0 files unresolved`
tests: replace yet more calls to `python` with $PYTHON These are some simple cases. More to come in a future change. Reviewers: krbullock Reviewed By: krbullock Differential Revision: https://phab.mercurial-scm.org/D4 2017-07-05 19:10:11 +03:00			`$ $PYTHON << EOF`
py3: supress the write return value Summary: `>>> open(file).write(data)` is a common patten for writing files across the tests. In py3 such statement return the number of bytes written in py2 the output is none. Let's make it None in py3 as well. Reviewed By: xavierd Differential Revision: D19666339 fbshipit-source-id: 5424287f85f34f3aef2d1596bb476d622464564a 2020-01-31 23:58:35 +03:00			`> f = open('latin-1', 'wb'); _ = f.write(b"latin-1 e' encoded: \xe9"); f.close()`
			`> f = open('utf-8', 'wb'); _ = f.write(b"utf-8 e' encoded: \xc3\xa9"); f.close()`
			`> f = open('latin-1-tag', 'wb'); _ = f.write(b"\xe9"); f.close()`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`> EOF`

			`should fail with encoding error`

			`$ echo "plain old ascii" > a`
			`$ hg st`
			`M a`
			`? latin-1`
			`? latin-1-tag`
			`? utf-8`
			`$ HGENCODING=ascii hg ci -l latin-1`
encoding: replace 'ascii' with 'utf-8' automatically Summary: `ascii` was used as the default / fallback, which is not a user-friendly choice. Nowadays utf-8 dominates: - Rust stdlib is utf-8. - Ruby since 1.9 is utf-8 by default. - Python 3 is unicode by default. - Windows 10 adds utf-8 code page. Given the fact that: - Our CI sets HGENCODING to utf-8 - Nuclide passes `--encoding=utf-8` to every command. - Some people have messed up with `LC_*` and complained about hg crashes. - utf-8 is a super set of ascii, nobody complains that they want `ascii` encoding and the `utf-8` encoding messed their setup up. Let's just use `utf-8` as the default encoding. More aggressively, if someone sets `ascii` as the encoding, it's almost always a mistake. Auto-correct that to `utf-8` too. This should also make future integration with Rust easier (where it's enforced utf-8 and does not have an option to change the encoding). In the future we might just drop the flexibility of choosing customized encoding, so this diff autofixes `ascii` to `utf-8`, instead of allowing `ascii` to be set. We cannot enforce `utf-8` yet, because of Windows. Here is our encoding strategy vs the upstream's: \| item \| upstream \| \| ours \| ours \| \| \| current \| ideal \| current \| ideal \| \| CLI argv \| bytes \| bytes \| utf-8 [1] \| utf-8 \| \| path \| bytes \| auto [3] \| migrating [2] \| utf-8 \| \| commit message \| utf-8 \| utf-8 \| utf-8 \| utf-8 \| \| bookmark name \| utf-8 \| utf-8 \| utf-8 \| utf-8 \| \| file content \| bytes \| bytes \| bytes \| bytes \| [1]: Argv was accidentally enforced utf-8 for command-line arguments by a Rust wrapper. But it simplified a lot of things and is kind of ok: everything that can be passed as CLI arguments are utf-8: -M commit message, -b bookmark, paths, etc. There is no "file content" passed via CLI arguments. [2]: Path is controversial, because it's possible for systems to have non-utf8 paths. The upstream behavior is incorrect if a repo gets shared among different encoding systems (ex. both Linux and Windows). We have to know the encoding of paths to be able to convert them suitable for the local system. One way is to enforce UTF-8 for paths. The other is to keep encoding information stored with individual paths (like Ruby strings). The UTF-8 approach is much simpler with the tradeoff that non-utf-8 paths become unsupported, which seems to be a reasonable trade-off. [3]: See https://www.mercurial-scm.org/wiki/WindowsUTF8Plan. Reviewed By: singhsrb Differential Revision: D17098991 fbshipit-source-id: c0ff1e586a887233bd43cdb854fb3538aa9b70c2 2019-09-13 01:05:08 +03:00			`abort: decoding near ' encoded: \xe9': 'utf8' codec can't decode byte 0xe9 in position 20: unexpected end of data! (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`[255]`

			`these should work`

			`$ echo "latin-1" > a`
			`$ HGENCODING=latin-1 hg ci -l latin-1`
			`$ echo "utf-8" > a`
			`$ HGENCODING=utf-8 hg ci -l utf-8`

			`hg log (ascii)`

			`$ hg --encoding ascii log`
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: ca661e7520de`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
			`summary: utf-8 e' encoded: ?`

tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 650c6f3d55dd`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
			`summary: latin-1 e' encoded: ?`

tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 0e5b7e3f9c4a`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
			`summary: koi8-r: ????? = u'\u0440\u0442\u0443\u0442\u044c'`

tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 1e78a93102a3`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
			`summary: latin-1 e': ? = u'\xe9'`


			`hg log (latin-1)`

			`$ hg --encoding latin-1 log`
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: ca661e7520de`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: utf-8 e' encoded: \xe9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 650c6f3d55dd`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: latin-1 e' encoded: \xe9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 0e5b7e3f9c4a`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: koi8-r: \xd2\xd4\xd5\xd4\xd8 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 1e78a93102a3`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: latin-1 e': \xe9 = u'\\xe9' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00

			`hg log (utf-8)`

			`$ hg --encoding utf-8 log`
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: ca661e7520de`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: utf-8 e' encoded: \xc3\xa9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 650c6f3d55dd`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: latin-1 e' encoded: \xc3\xa9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 0e5b7e3f9c4a`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: koi8-r: \xc3\x92\xc3\x94\xc3\x95\xc3\x94\xc3\x98 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 1e78a93102a3`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: latin-1 e': \xc3\xa9 = u'\\xe9' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00

			`hg log (utf-8)`

			`$ HGENCODING=utf-8 hg log`
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: ca661e7520de`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: utf-8 e' encoded: \xc3\xa9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 650c6f3d55dd`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Thu Jan 01 00:00:00 1970 +0000`
tests: use (esc) for all non-ASCII test output 2010-11-08 03:41:41 +03:00			`summary: latin-1 e' encoded: \xc3\xa9 (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 0e5b7e3f9c4a`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
test-encoding: avoid using named branches Summary: Make it future-proof. Reviewed By: singhsrb Differential Revision: D13978576 fbshipit-source-id: 3ac619a0856a567f34d3a78cd5c333b3a68a91d4 2019-02-13 06:48:29 +03:00			`summary: koi8-r: \xc3\x92\xc3\x94\xc3\x95\xc3\x94\xc3\x98 = u'\\u0440\\u0442\\u0443\\u0442\\u044c' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00
tests: enable template-new-builtin Summary: This makes tests depend less on revision numbers. Reviewed By: DurhamG Differential Revision: D22468669 fbshipit-source-id: 74a06930faa3e6ee9d246ecc718c2a3740f57a54 2020-07-21 03:23:52 +03:00			`commit: 1e78a93102a3`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`user: test`
			`date: Mon Jan 12 13:46:40 1970 +0000`
test-encoding: avoid using named branches Summary: Make it future-proof. Reviewed By: singhsrb Differential Revision: D13978576 fbshipit-source-id: 3ac619a0856a567f34d3a78cd5c333b3a68a91d4 2019-02-13 06:48:29 +03:00			`summary: latin-1 e': \xc3\xa9 = u'\\xe9' (esc)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00

			`hg log (dolphin)`

			`$ HGENCODING=dolphin hg log`
encoding: use hint markup for "please check your locale settings" This will also make test-encoding.t pass on windows. The test would hit some other code path that already used hint markup. 2011-12-26 18:01:06 +04:00			`abort: unknown encoding: dolphin`
			`(please check your locale settings)`
tests: unify test-encoding 2010-09-26 22:41:32 +04:00			`[255]`
			`$ cp latin-1-tag .hg/branch`
branch: operate on branch names in local string space where possible Previously, branch names were ideally manipulated as UTF-8 strings, because they were stored as UTF-8 in the dirstate and the changelog and could not be safely converted to the local encoding and back. However, only about 80% of branch name code was actually using the right encoding conventions. This patch uses the localstr addition to allow working on branch names as local strings, which simplifies handling so that the previously incorrect code becomes correct. 2010-11-25 00:56:32 +03:00			`$ HGENCODING=latin-1 hg ci -m 'auto-promote legacy name'`
encoding: avoid localstr when a string can be encoded losslessly (issue2763) localstr's hash method exists to prevent bogus matching on lossy local encodings. For instance, we don't want 'caf?' to match 'café' in an ASCII locale. But when café can be losslessly encoded in the local charset, we can simply use a normal string and avoid the hashing trick. This avoids using localstr's hash method, which would prevent a match between 2011-04-16 08:45:41 +04:00
tests: add missing trailing 'cd ..' Many tests didn't change back from subdirectories at the end of the tests ... and they don't have to. The missing 'cd ..' could always be added when another test case is added to the test file. This change do that tests (99.5%) consistently end up in $TESTDIR where they started, thus making it simpler to extend them or move them around. 2012-06-11 03:40:51 +04:00			`$ cd ..`
test-encoding: enable fuzz testing of utf8b roundtrip This test was written by David R. MacIver <david@drmaciver.com> at the London sprint. We can enable it as the bug in utf8b encoder/decoder has been fixed. 2015-11-09 16:31:01 +03:00
			`Test roundtrip encoding/decoding of utf8b for generated data`

			`#if hypothesis`

			`>>> from hypothesishelpers import *`
codemod: import from the edenscm package Summary: D13853115 adds `edenscm/` to `sys.path` and code still uses `import mercurial`. That has nasty problems if both `import mercurial` and `import edenscm.mercurial` are used, because Python would think `mercurial.foo` and `edenscm.mercurial.foo` are different modules so code like `try: ... except mercurial.error.Foo: ...`, or `isinstance(x, mercurial.foo.Bar)` would fail to handle the `edenscm.mercurial` version. There are also some module-level states (ex. `extensions._extensions`) that would cause trouble if they have multiple versions in a single process. Change imports to use the `edenscm` so ideally the `mercurial` is no longer imported at all. Add checks in extensions.py to catch unexpected extensions importing modules from the old (wrong) locations when running tests. Reviewed By: phillco Differential Revision: D13868981 fbshipit-source-id: f4e2513766957fd81d85407994f7521a08e4de48 2019-01-30 03:25:33 +03:00			`>>> from edenscm.mercurial import encoding`
test-encoding: enable fuzz testing of utf8b roundtrip This test was written by David R. MacIver <david@drmaciver.com> at the London sprint. We can enable it as the bug in utf8b encoder/decoder has been fixed. 2015-11-09 16:31:01 +03:00			`>>> roundtrips(st.binary(), encoding.fromutf8b, encoding.toutf8b)`
			`Round trip OK`

			`#endif`