Skip to content

Latest commit

 

History

History
248 lines (191 loc) · 11.8 KB

CONTRIBUTING.md

File metadata and controls

248 lines (191 loc) · 11.8 KB

Contributing

Welcome to the gReLU contributor's guide.

This document focuses on getting any potential contributor familiarized with the development processes, but other kinds of contributions are also appreciated.

If you are new to using git or have never collaborated in a project previously, please have a look at contribution-guide.org. Other resources are also listed in the excellent guide created by FreeCodeCamp.

Please notice, all users and contributors are expected to be open, considerate, reasonable, and respectful. When in doubt, Python Software Foundation's Code of Conduct is a good reference in terms of behavior guidelines.

Issue Reports

If you experience bugs or general issues with gReLU, please have a look at the issue tracker. If you don't see anything useful there, please file a new issue report.

(Don't forget to include the closed issues in your search. Sometimes a solution was already reported, and the problem is considered solved.)

You can file a new issue by clicking the "New issue" button at the top right of the issue tracker. New Issue

Your new issue report should include the following information:

  1. Information about your programming environment (e.g., operating system, Python version)
  2. Steps to reproduce the problem. Please try to simplify the reproduction steps to a very minimal example that still illustrates the problem you are facing. By removing other factors, you help us to identify the root cause of the issue.
  3. The full error message that you encountered, if any.
  4. Any steps that you took to diagnose or fix the issue, and their outcomes.
  5. Any suggestions for resolving the issue.

Making code contributions

Coding resources

gReLU uses pytorch and pytorch-lightning. The below tutorials are good starting points to become familiar with these frameworks:

PyTorch tutorials

Lightning tutorials

Understanding project structure

We welcome external contributions to gReLU. Before planning changes to the code, we suggest carefully examining the current structure and organization of the package.

The API reference lists all the modules and submodules available in gReLU. Clicking on individual modules on this list will reveal a description of the module and what kinds of functions it is meant to contain. The descriptions also contain more detailed explanations of the expected structure of each module and how to contribute to it. This will help you find the appropriate location to make changes.

For instance, the table below lists some different types of functionality that contributors may want to add or change, and the corresponding module / submodule of gReLU. Click on the name of a module for more details on its structure.

Functionality Module
Functions to read / write genomic data grelu.io
Functions to preprocess genomic data after it is loaded grelu.data.preprocess
New augmentation functions for training models grelu.data.augment
Functions to introduce various types of in silico mutations into DNA sequences grelu.sequence.mutate
Other functions to manipulate DNA sequences grelu.sequence.utils
Functions to score DNA sequences based on their content grelu.transforms.seq_transforms
Functions to transform model predictions grelu.transforms.prediction_transforms
New types of model layers grelu.model.layers
New model architectures grelu.model.models
New loss functions grelu.lightning.losses
New metrics to calculate model performance grelu.lightning.metrics
New plots and visualizations grelu.visualize

For complex changes that may not fit clearly within the established package structure, it is important to first raise an issue (see instructions below).

Step-by-step instructions to contribute new code

Submit an issue

Before you work on any non-trivial code contribution it's best to first create an issue in the issue tracker to start a discussion on the subject. This often provides additional considerations and avoids unnecessary work.

Create an environment

Before you start coding, we recommend creating an isolated virtual environment to avoid any problems with your installed Python packages. This can easily be done via either virtualenv:

    virtualenv <PATH TO VENV>
    source <PATH TO VENV>/bin/activate

or Miniconda:

    conda create -n grelu python=3 six virtualenv pytest pytest-cov
    conda activate grelu

Clone the repository

  1. Create an user account on GitHub, if you do not already have one.

  2. Fork the project repository: click on the Fork button near the top of the page. This creates a copy of the code under your account on GitHub.

    Fork

  3. Clone this copy to your local disk::

    git clone git@github.com:YourLogin/grelu.git
    cd grelu
  1. You should run::
    pip install -U pip setuptools -e .

to be able to import the package under development in the Python REPL.

  1. Install pre-commit:
    pip install pre-commit
    pre-commit install

grelu comes with a lot of hooks configured to automatically help the developer to check the code being written.

Implement your changes

  1. Create a branch to hold your changes::
    git checkout -b my-feature

and start making changes. Never work on the main branch!

  1. Implement your code changes on this branch.

  2. If you change or add any functions, modules and classes, don't forget to update or add docstrings to describe these changes.

  3. If your contribution adds an additional feature and is not just a bugfix, we suggest also adding unit tests.

  4. Add yourself to the list of contributors in AUTHORS.rst.

  5. When you’re done editing, do::

    git add <MODIFIED FILES>
    git commit

to record your changes in git. Moreover, writing a descriptive commit message is highly recommended.

Please make sure to see the validation messages from pre-commit and fix any issues. This should automatically use flake8/black to check/fix the code style in a way that is compatible with the project.

Test your changes

Please check that your changes don't break any unit tests with::

    tox

(after having installed tox with pip install tox or pipx). You can also use tox to run several other pre-configured tasks in the repository. Try tox -av to see a list of the available checks.

Submit your contribution

  1. If everything works fine, push your local branch to GitHub with:
    git push -u origin my-feature
  1. Go to the web page of your fork and click "Create pull request" to send your changes for review.

    Find more detailed information in creating a PR. You might also want to open the PR as a draft first and mark it as ready for review after the feedback from the continuous integration (CI) system or any required fixes.

Troubleshooting

The following tips can be used when facing problems to build or test the package:

  1. Make sure to fetch all the tags from the upstream repository_. The command git describe --abbrev=0 --tags should return the version you are expecting. If you are trying to run CI scripts in a fork repository, make sure to push all the tags. You can also try to remove all the egg files or the complete egg folder, i.e., .eggs, as well as the *.egg-info folders in the src folder or potentially in the root of your project.

  2. Sometimes tox misses out when new dependencies are added, especially to setup.cfg and docs/requirements.txt. If you find any problems with missing dependencies when running a command with tox, try to recreate the tox environment using the -r flag. For example, instead of::

    tox -e docs

Try running::

    tox -r -e docs
  1. Make sure to have a reliable tox installation that uses the correct Python version (e.g., 3.7+). When in doubt you can run::
    tox --version
    # OR
    which tox

If you have trouble and are seeing weird errors upon running tox, you can also try to create a dedicated virtual environment with a tox binary freshly installed. For example::

virtualenv .venv
source .venv/bin/activate
.venv/bin/pip install tox
.venv/bin/tox -e all
  1. Pytest can drop you in an interactive session in the case an error occurs. In order to do that you need to pass a --pdb option (for example by running tox -- -k <NAME OF THE FALLING TEST> --pdb). You can also setup breakpoints manually instead of using the --pdb option.

Maintainer tasks

Releases

If you are part of the group of maintainers and have correct user permissions on PyPI, the following steps can be used to release a new version for grelu:

  1. Make sure all unit tests are successful.
  2. Tag the current commit on the main branch with a release tag, e.g., v1.2.3.
  3. Push the new tag to the upstream repository_, e.g., git push upstream v1.2.3
  4. Clean up the dist and build folders with tox -e clean (or rm -rf dist build) to avoid confusion with old builds and Sphinx docs.
  5. Run tox -e build and check that the files in dist have the correct version (no .dirty or git hash) according to the git tag. Also check the sizes of the distributions, if they are too big (e.g., > 500KB), unwanted clutter may have been accidentally included.
  6. Run tox -e publish -- --repository pypi and check that everything was uploaded to PyPI_ correctly.