-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Primary branch of development is depended on by users #43
Comments
How would we maintain a separate branch without the generated files? Everytime we merge back from master->development we would have to create a new commit deleting the generated files. I think it would be fine to have a development branch, just not sure what we gain by removing the generated C source. I feel like leaving the C source in the dev branch would be fine |
I think the topic of generated files has come up a few times at least at the tree-sitter repository. Here's one of the more notable ones: tree-sitter/tree-sitter#730 - regarding the "why", here are some points mentioned:
So some obvious questions are:
I'm not sure :) Here's some more from the tree-sitter-swift folks: https://github.com/alex-pinkus/tree-sitter-swift#where-is-your-parserc
On a related note, there was this issue at the tree-sitter-swift repository created by someone involved with helix-editor. Regarding:
Naively (perhaps?), I was thinking that stuff might go from development to master, but not the other way. May be I'm missing something... In the development -> master direction, I think it depends on how stuff is "transferred". It may be as you say though that there is some kind of friction that is a cost to be considered. |
I came across a tip about marking some files in Apparently this:
That seemed interesting. |
I also found this issue from about a year ago that touched a bit on security and generated files. The idea mentions a "central parser registry" -- I was thinking the other day that it could be handy for there to be a single site that could host per-parser discussion / announcement sections for something like what we're doing for the "potential changes announcement". I think users (e.g. nvim-treesitter, Emacs 29+, etc.) would find it much more to their liking to interface with one site rather than many different places (e.g. like we have our thing, and someone else might do some other thing, etc.). If enough of these sorts of ideas are mentioned, perhaps someone will start to think it's worth running a single location that addresses some of these points...(I considered GitHub discussions for this type of thing but I don't think it works very well from a UI / UX perspective.) |
I think merging from master->development only applies if we need to do some kind of "hotfix" to master, which I don't think is relevant. Nothing is so important here that it needs to skip the development branch. Sorry if that was confusing! |
Right now we have the pinned issue and anyone is free to submit normal github issues, which seems good enough for these purposes. Until something like that registry exists I think we're doing pretty good with what we have. |
As for the swift parser.c handling, I wonder how things like Emacs handle this. The default grammar installer now downloads a git repository and assumes the parser.c is checked in. I bet it would fail for this parser. Going against convention seems like it will making everyone else's tooling more difficult to write and maintain. That's kind of the worst part about this checked in generated C code, everyone is dependent on it right now, and it's going to take a big coordinated effort to move away from it, and I'm not sure it will be worth the trouble for the entire community to try it. A lot of things will break for users if that happens. |
I looked at one of the comments in the issue I linked at tree-sitter-swift's repository and it looks like there is a dedicated branch that provides the generated files. Here is the comment (has a link to the branch too). As I mentioned earlier (last bit), if we were to try something similar (which I'm not sure is worth it), I would prefer to do this manually. I'm not inclined to increasing reliance on this particular platform ATM. Anyway, for me, bringing some of these things up is more about increasing awareness of what's being tried and why. For this particular issue of generated files in a repository currently I agree with what you mentioned:
If the generated files in the repository haven't been bothering us, it seems that sticking with what we've been doing is fine for the moment. Regarding the original point about our primary branch of development being depended on, this also seems like it would be impractical to get others to change away from. Having a "development" branch seems convenient to me as it can signal to other folks (including us) where things might be going. What we've been doing seems to be creating separate branches for individual bits of work and then merging these to the primary branch. Folks becoming aware of these separate branches seems impractical, but one "development" branch doesn't seem too bad. |
It never really bothered me when working on the namespace tokens, and I don't see it becoming an issue anytime soon. I even managed a very lengthy rebase where I regenerated the parser.c output for every commit without much issue. I'd say having a development branch is fine and won't cause any problems between us. We can even open PRs against the development branch itself if we feel it's necessary (I probably will because I rewrite commit history on my branches very frequently). If we run into problems with the development branch I don't think we will have much issue fixing them between the two of us. |
Thanks for your comments. I've gone ahead and made a I've started the branch with what was described around here regarding using a wrapper function so that we express our regular expressions using literal strings (which can be concatenated if necessary) instead of regex literals. Other commits include:
|
May be it makes sense to mention the dev branch in #33 and possibly in the README [1]. Things that might be worth mentioning might include:
Also, I suspect we don't want to accumulate too much in the dev branch (for too long) without having the parts we want to keep make it to our primary branch. [1] Thinking to rework the README in light of the changes on dev. |
I personally don't think the permanent issue is worth mentioning the dev branch in, at least not right now. I think we just use it to say we are about to merge changes into the master branch from the dev branch when the time is ready. At that time we can say, try the changes out on the dev branch if you are interested. Does that make sense? No need to ping everyone when there isn't really anything for them act on. |
Thanks for the feedback.
Ok, let's do it that way. Over time the existence and purpose of the dev branch should percolate and the content of the updated README mentioned in #47 might help a bit along those lines too. |
Since we now have a dev branch, perhaps we can view this issue as resolved :) |
These ideas turned out to be unexpectedly useful today -- I ended up making a pre-0.0.12 branch because some the dev branch had bits on it that I didn't want to preserve (because of the Babashka tasks stuff being split off) and it was easier to create a new branch and add to existing issues than to change what was on the dev branch and go back and edit past comments. I think it might possibly be more useful to have a |
Regarding:
Another way to view the current situation is that some of our "artifacts" ( So the repository is serving more than one purpose -- as a home for the "source" (i.e. Perhaps in Clojure terms, this might be expressed as a form of complecting. On a more serious note though, this makes a grammar repository with generated files in it more of a juicy target security-wise. This point was elaborated on here in 2022-02 by theHamsta (of the nvim-treesitter project):
I had missed his point about npm dependencies when I had read the above text before, but now I'm glad we're moving in the direction of removing our dependency on it. Our I wasn't too concerned about having I think the Rust and Node.js bindings are also potential targets so it might make sense to consider phasing those out (I've added instructions in the upcoming README regarding how to generate them so people who need to do so can, in theory). Having stated these things, I imagine there are plenty of other juicier looking tree-sitter grammar repositories compared to tree-sitter-clojure. In this case, perhaps there is more safety in smaller numbers... On a side note, I started thinking it might be useful for there to be a C program that could generate I believe the current situation is not ideal (at least not for Emacs 29+), because to use the I wonder if this sort of thing couldn't be applied usefully in this situation. |
I'm going to close this for the moment, we can reopen this issue later if necessary. |
From our research, it appears that our primary branch of development is being depended on by a number of users. It's become a sort of de-facto release branch.
I don't think we ever had a stated policy regarding our branches so perhaps that's not too surprising -- though even if we did, it's not clear how much difference that would have made :)
Possibly related to this is the inclusion of generated source in this branch. I've written up some notes on that here.
I noticed this issue at the tree-sitter-swift repository regarding inclusion of generated source and passed that along to an issue at tree-sitter-sharp. There was an interesting idea mentioned in a response by damieng:
May be this sort of thing is worth considering. I would likely opt for manually doing things though -- at least initially (I think tree-sitter-swift may have automation).
The text was updated successfully, but these errors were encountered: