Skip to content

Commit

Permalink
Merge pull request #108 from mattip/move-to-github
Browse files Browse the repository at this point in the history
add post about the move to github
  • Loading branch information
mattip authored Jan 1, 2024
2 parents b05ad30 + 771667c commit 330d5b3
Show file tree
Hide file tree
Showing 6 changed files with 211 additions and 11 deletions.
4 changes: 2 additions & 2 deletions pages/contact.rst
Original file line number Diff line number Diff line change
Expand Up @@ -17,11 +17,11 @@

* more on our `dev site`_.

* code on https://foss.heptapod.net/pypy/pypy
* code on https://github.com/pypy/pypy

* Reach out to our consultants_ for specific projects

.. __: https://mail.python.org/mailman/listinfo/pypy-dev
.. _`issue tracker`: https://foss.heptapod.net/pypy/pypy/-/issues
.. _`issue tracker`: https://github.com/pypy/pypy/issues
.. _`dev site`: https://doc.pypy.org
.. _consultants: pypy-sponsors.html
11 changes: 5 additions & 6 deletions pages/download_advanced.rst
Original file line number Diff line number Diff line change
Expand Up @@ -311,19 +311,19 @@ Building from source


1. Get the source code. The preferred way is to checkout the current
trunk using Mercurial_. The trunk usually works and is of course
trunk using git. The trunk usually works and is of course
more up-to-date:

.. code-block:: bash
hg clone https://foss.heptapod.net/pypy/pypy
git clone https://github.com/pypy/pypy
The trunk contains PyPy 2. For PyPy 3, switch to the correct branch:

.. code-block:: bash
# switch to the branch that implements Python 3.8
hg update py3.8
# switch to the branch that implements Python 3.10
git checkout branches/py3.10
Alternatively, get one of the following smaller packages for the source at
the same revision as the above binaries:
Expand Down Expand Up @@ -429,7 +429,6 @@ Notes:
.. _`greenlets`: https://pypy.readthedocs.org/en/latest/stackless.html#greenlets
.. _`Windows build instructions`: https://doc.pypy.org/en/latest/windows.html#preparing-windows-for-the-large-build
.. _`shadow stack`: https://pypy.readthedocs.org/en/latest/config/translation.gcrootfinder.html
.. _Mercurial: https://www.mercurial-scm.org/

Packaging
---------
Expand Down Expand Up @@ -467,7 +466,7 @@ pre-compiling them, normal users will get errors:
normally found: e.g. in your project's main directory, or in a
directory that you add to the env var ``PYTHONPATH``.

.. _`package.py`: https://foss.heptapod.net/pypy/pypy/-/blob/release-pypy3.9-v7.3.14/pypy/tool/release/package.py
.. _`package.py`: https://github.com/pypy/pypy/blob/release-pypy3.9-v7.3.14/pypy/tool/release/package.py

Checksums
---------
Expand Down
2 changes: 1 addition & 1 deletion pages/people.rst
Original file line number Diff line number Diff line change
Expand Up @@ -151,4 +151,4 @@ Many more people
PyPy is and has always been an effort of many volunteers. Consult the `LICENSE`_
file for details.

.. _`LICENSE`: https://foss.heptapod.net/pypy/pypy/-/blob/branch/default/LICENSE
.. _`LICENSE`: https://github.com/pypy/pypy/blob/branch/default/LICENSE
2 changes: 1 addition & 1 deletion pages/performance.rst
Original file line number Diff line number Diff line change
Expand Up @@ -371,7 +371,7 @@ We generally consider things that are slower on PyPy than CPython to be bugs
of PyPy. If you find some issue that is not documented here,
please report it to our `bug tracker`_ for investigation.

.. _`bug tracker`: https://foss.heptapod.net/pypy/pypy/-/issues
.. _`bug tracker`: https://github.com/pypy/pypy/issues
.. _`speed website`: http://speed.pypy.org
.. _CFFI: http://cffi.readthedocs.org/
.. _HPy: https://hpyproject.org/
197 changes: 197 additions & 0 deletions posts/2023/12/pypy-moved-to-git-github.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,197 @@
<!--
.. title: PyPy has moved to Git, GitHub
.. slug: pypy-moved-to-git-github
.. date: 2023-12-29 14:19:55 UTC
.. tags:
.. category:
.. link:
.. description:
.. type: text
.. author: mattip
-->

PyPy has moved its canonical repo and issue tracker from
<https://foss.heptapod.net/pypy/pypy> to <https://github.com/pypy/pypy>. Obviously,
this means development will now be tracked in Git rather than Mercurial.

## Motivation

We still feel Mercurial is a better version control system. The named branch
model and user interface are superior. But

- foss.heptapod.net is not well indexed in google/bing/duckduckgo
search, so people find it harder to search for issues in the project.

- Since Heptapod has tightened its spam control, we get reports that
users create issues only to have them flagged as spam.

- Open Source has become synonymous with GitHub, and we are too small to
change that.

- Much of the current development comes as a reaction to fixing issues.
Tracking interlocking issues is easier if all the code is on the same
platform.

- The [FAQ](https://doc.pypy.org/en/latest/faq.html#why-doesn-t-pypy-use-git-and-move-to-github)
presents two arguments against the move. [Github notes](https://git-scm.com/docs/git-notes)
solves much of point (1): the difficulty of discovering provenance of
commits, although not entirely. But the main problem is point (2), it turns
out that __not__ moving to GitHub is an impediment to contribution and issue
reporting.

- People who wish to continue to use Mercurial can use the same method below to
push to GitHub.

- GitHub is more resource rich than foss.heptapod.net. We could add CI
jobs to replace some of our aging [buildbot
infrastructure](https://buildbot.pypy.org).

## Method

The migration required two parts: migrating the code and then migrating the
issues and merge requests.

### Code migration 1: code and notes

I used a [fork of git-remote-hg](https://github.com/mnauw/git-remote-hg) to
create a local Git repo with all the changesets. Then I wanted to add a Git
note to each commit with the branch it came from. So I prepared a file with two
columns: the Git commit hash, and the corresponding branch from Mercurial.
Mercurial can describe each commit in two ways: either the commit hash or by a
number index. I used `hg log` to convert an index `i` to the Mercurial hash,
and then `git-hg-helper` from `git-remote-hg` to convert the Mercurial hash to
a Git hash:
```
$(cd pypy-git; git-hg-helper git-rev $(cd ../pypy-hg; hg log -r $i -T"{node}\n"))
```

Then I used `hg log` again to print the Mercurial branch for the index `i`:
```
$(cd pypy-hg; hg log -r $i -T'{branch}\n')
```

Putting these two together, I could loop over all the commits by their
numerical index to prepare the file. Then I iterated over each line in the
file, and added the Git note. Since the `git note add` command works on the
current HEAD, I needed to checkout each commit in turn and then add the note:
```
git checkout -q <hash> && git notes --ref refs/notes/branch add -m branch:<branch>
```

I could then use `git push --all` to push to GitHub.

### Code migration 2: prepare the branches

PyPy has almost 500 open branches. The code migration created all the branch
HEADs, but `git push --all` did not push them. I needed to check them out and
push each one. So I created a file with all the branch names
```
cd pypy-hg; hg branches | cut -f1 -d" " > branches.txt
```

and then push each one to the GitHub repo

```
while read branch; do git checkout branches/$branch && git push origin branches/$branch; done < branches.txt
```

Note that the branches were named `branches/XXX` by the migration, not `branch/XXX`. This confuses the merge request migration, more about that later.

### Issue and merge request migration

I used the solution from
[node-gitlab-2-github](https://github.com/piceaTech/node-gitlab-2-github) which
worked almost perfectly. It is important to do the conversion on a __private
repo__ otherwise every mention of a sucessfully mapped user name notifies
the user about the transfer. This can be quite annoying for a repo the size of
PyPy with 600 merge requests and over 4000 issues. Issues transfered without a
problem: the script properly retained the issue numbers. However the script
does not convert the Mercurial hashes to Git hashes, so the bare hashes in
comments show up without a link to the commit. Merge requests are more of a problem:

- The Mercurial named branch "disappears" once it is merged, so a merge request
to a merged branch does not find the target branch name in Git. The
conversion creates an issue instead with the label `gitlab merge request`.
- For some reason, the branches created by `git-remote-hg` are called
`branches/XXX` and not `branch/XXX` as expected by GitLab. This messes up the
merge request/PR conversion. For some of the branches (open PRs and main
target branches) I manually created additional branches without the `es`. The
net result is that open merge requests became open PRs, merged merge requests
became issues, and closed-not-merged merge requests were not migrated.

### Layered conversions
PyPy already migrated once from Bitbucket to Heptapod. Many of the issues
reflect the multiple transitions: they have lines like "Created originally on
Bitbucket by XXX" from the first transition, and an additional line "In
Heptapod" from this transition.

## Credits
We would like to express our gratitude to the [Octobus](https://octobus.net/)
team who support Heptapod. The transition from Bitbucket was quite an effort,
and they have generously hosted our developement since then. We wish them all
the best, and still believe that Mercurial should have "won".

## Next steps

While the repo at GitHub is live, there are still a few more things we need to
do:

- Documentation needs an update for the new repo and the build automation from
readthedocs must be adjusted.
- The wiki should be copied from Heptapod.
- buildbot.pypy.org should also look at the new repo. I hope the code is up to
the task of interacting with a Git repo.
- speed.pypy.org tracks changes, it too needs to reference the new location
- To keep tracking branches with Git notes on new commits, I activated a
[github action](https://github.com/Julian/named-branch-action) by Julian to
add a Git branch note to each commit. Please see the README there for
directions on using Git notes.
- Some of the merge requests were not migrated. If someone wants to, they could
migrate those once they figure out the branch naming problems.

Additionally, now is the time for all of you to prove the move is worthwhile:

- Star the repo, let others know how to find it,
- Help fix some of the open issues or file new ones,
- Take advantage of the more familiar workflow to get involved in the project,
- Suggest ways to improve the migration: are there things I missed or could
have done better?

## How will development change?
Heptapod did not allow personal forks, so we were generous with a commit bit to
the main repo. Additionally, we (well, me) have been using a
commit-directly-to-main workflow. We will now be adopting a more structured
workflow. Please fork the repo and submit a pull request for any changes. We
can now add some pre-merge CI to check that the PR at least passes the first
stage of translation. The live and active branches will be:

- `main`: what was "default" in Mercurial, it is the Python2.7 interpreter and
the base of the RPython interpreter,
- `py3.9`: the Python3.9 interpreter, which also includes all RPython changes
from `main`. This is exactly like on Mercurial, and
- `py3.10`: the Python3.10 interpreter, which also includes all RPython changes
from `main` and all bugfixes from `py3.9`. This is exactly like on Mercurial.

### Working between the repos

#### Finding commits

If you want to figure out how a Mercurial commit relates to a Git commit, you
can use `git-hg-helper`. You run it in the Git repo. It takes the full long
hash from one repo and gives you the corresponding hash of the other repo:
```
$ git-hg-helper git-rev d64027c4c2b903403ceeef2c301f5132454491df
4527e62ad94b0e940a5b0f9f20d29428672f93f7
$ git-hg-helper hg-rev 4527e62ad94b0e940a5b0f9f20d29428672f93f7
d64027c4c2b903403ceeef2c301f5132454491df
```

#### Finding branches

Branches migrated from Mercurial will have a `branches` prefix, not `branch`.
While GitLab uses `branch` for its prefix, the `git-remote-hg` script uses
`branches`. New work should be in a PR targeting `main`, `py3.9` or `py3.10`.

Thanks for helping to make PyPy better.

Matti
6 changes: 5 additions & 1 deletion themes/pypy/assets/css/styles.css
Original file line number Diff line number Diff line change
Expand Up @@ -53,7 +53,7 @@ pre.literal-block *:last-child {

#container {
margin: 1em auto;
max-width: 900px;
max-width: 1200px;
}

.post {
Expand Down Expand Up @@ -369,3 +369,7 @@ h5:hover .headerlink, h6:hover .headerlink {
color: #000;
text-decoration: none;
}

code {
background-color: #eeeeee;
}

0 comments on commit 330d5b3

Please sign in to comment.