-
-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase the number of projects we can identify for good first issues #78
Comments
I've increased the time frame to two years now |
Another way to identify projects that welcome external developers is by looking for projects with a contribution guide. There are over 400 projects on OpenSustain.tech with a contribution guide. However, such a filter would be rather crude. Many projects have a contribution guide more for their own community and thus express less interest in welcoming external developers into the project. |
Here some repository topics to increase the numbers of projects that are welcoming developers:
Some projects use beginner* issue labels instead of Good First Issue: https://github.com/Growstuff/growstuff/issues |
We don't currently use repository topics for filtering repositories, just if they have at least one open issue for one of the following labels: @Ly0n here's a list of all the labels used across all the OST projects issues, both open and closed (lowercased to remove duplicates, count > 1): https://gist.github.com/andrew/89cabfe6adcbe136ae8cddc54f928e67 Here's a list of the labels from currently open issues: https://gist.github.com/andrew/d0369c3e1264bca51fc0c03b897a2ce0 If you want to give me a list of strings to add I can deploy that quickly. Or if you want to start adding in any open issue from repositories with certain topics we can investigate that after xmas. |
And here's a list of all project keywords (again lowercased and count > 1): https://gist.github.com/andrew/14f700b3a6e42086b778cacd60690b77 Project keywords is made up of repository topics and package manager keywords for each project. |
I found another relevant namespace that is using no standard Good First Issues: Here a better regex that should be able to match all the issues: |
The query is done as an exact match in sql rather than a regex (over 200k issues in the database!) for performance, so I'll pull a list of all labels that contain good and use that:
plus a similar list for
|
Added in a0fb5c5, the result is 18 more repos showing up in climatetriage (from 246 to 264) |
Beautiful! Thanks @andrew |
Here some thoughts to increaste the number of projects / issues we discover:
Include other projects from the same namespace. Since many name ranges are used for many general projects outside the area of sustainability, this is more complex. We would first have to filter the namespaces that have a clear reference to sustainability.
Increase the time frame to last_updated in the last two years. I think that's an easy and valid way to go. Open Source Software is often an slow staty process and older issuer are often still relevant over years.
Add popular dependencies of projects. We could create a list of the X most popular / highly used first level dependencies to the list of projects we investigate. We could still use the sustainability category of the main projects so that a dependency get's various use case labels.
Increase the total number of projects with automatic discovery based on NLP.
@andrew @Codeshark-NET
The text was updated successfully, but these errors were encountered: