Contributing to LLAMATOR

Thank you for your interest in contributing to LLAMATOR!

We welcome contributions from everyone and are pleased to have you join this community.

This document provides guidelines and instructions for contributing to this project.

Getting Started

Prerequisites

Before you begin, ensure you have the following installed:

Python 3.10
Git

Setting Up Your Development Environment

Fork the Repository: Start by forking the repository on GitHub.

Clone your fork:

git clone https://github.com/RomiconEZ/llamator.git

Set Up a Virtual Environment

. ./setup_dev_env.sh
source venv/bin/activate  # On Unix or macOS

Install Dependencies

Install the project dependencies in editable mode (with the '-e' argument). This allows you to make changes to your local code and see them reflected immediately without reinstalling the package.

pip install -r requirements-dev.txt

Install pre-commit

To ensure code quality we use pre-commit hook with several checks.

pre-commit install

Run Tests

Go to tests/test_local_llamator.py.
Create .env from .env.example and fill in the necessary fields.
Run the function to perform testing depending on your LLM client.

Making Changes

Always create a new side-branch for your work from the main branch.
```
git checkout main
git checkout -b your-branch-name
```
Make your changes to the code and add or modify unit tests as necessary.
Run tests again.
Commit Your Changes.

Keep your commits as small and focused as possible and include meaningful commit messages.
```
git add .
git commit -m "Add a brief description of your change"
```
Push the changes you did to GitHub.
```
git push origin your-branch-name
```

Get Started with Your First Contribution: Adding a New Test

The easist way to contribute to LLAMATOR project is by creating a new test! This can be easily acheived by:

1. Create a test file:

Navigate to the attacks directory.
Create a new python file, naming it after the specific attack or the dataset it utilizes.

2. Set up your file.

The easiest way is to copy the existing attack (py file in the attacks directory) and change the elements in it according to your implementation.

For multi-stage attack implementation see "What Drives the Multi-stage?" notes in docs.

3. Creating datasets with texts for attacks.

All files containing attack texts or prompts must be in the .parquet format.

These files are stored in the attack_data folder.

3. Add your attack file name to the `attack_loader.py` file:

from ..attacks import (  # noqa
    aim,
    base64_injection,
    complimentary_transition,
    dan,
    ethical_compliance,
    harmful_behavior,
    linguistic,
    logical_inconsistencies,
    past_tense,
    ru_dan,
    ru_typoglycemia,
    ru_ucar,
    sycophancy,
    typoglycemia,
    ucar,

    #TODO: YOUR TEST HERE
)

4. Add your attack name to the docstring of `start_testing()` in `main.py` and `initial_validation.py` file:

AvailableTests = [
    "aim_jailbreak",
    "base64_injection",
    "bon",
    "complimentary_transition",
    "do_anything_now_jailbreak",
    "ethical_compliance",
    "harmful_behavior",
    "linguistic_evasion",
    "logical_inconsistencies",
    "past_tense",
    "RU_do_anything_now_jailbreak",
    "RU_typoglycemia_attack",
    "RU_ucar",
    "sycophancy",
    "typoglycemia_attack",
    "ucar",

    #TODO: YOUR TEST HERE (in alphabetical order!)
]

5. Add your attack to the `attack_descriptions.json` and `attack_descriptions.md` files.

Please pay attention to the attack_descriptions.md structure. Description should be the same as docstring of the attack class. If your attack has an original paper or repository, it would be nice if you mentioned it in docstring and attack_descriptions.md.

6. Open a PR! Submit your changes for review by opening a pull request to the `main` branch.

Submitting a Pull Request.

Update your branch.

Fetch any new changes from the base branch and rebase your branch.
```
git fetch origin
git rebase origin/main
```
Submit a Pull Request.

Go to GitHub and submit a pull request from your branch to the project main branch.
Request reviews.

Request reviews from other contributors listed as maintainers. If you receive a feedback - make any necessary changes and push them.
Merge.

Once your pull request is approved, it will be merged into the main branch.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CONTRIBUTING.md

CONTRIBUTING.md

Contributing to LLAMATOR

Getting Started

Prerequisites

Setting Up Your Development Environment

Set Up a Virtual Environment

Install Dependencies

Install pre-commit

Run Tests

Making Changes

Get Started with Your First Contribution: Adding a New Test

1. Create a test file:

2. Set up your file.

3. Creating datasets with texts for attacks.

3. Add your attack file name to the `attack_loader.py` file:

4. Add your attack name to the docstring of `start_testing()` in `main.py` and `initial_validation.py` file:

5. Add your attack to the `attack_descriptions.json` and `attack_descriptions.md` files.

6. Open a PR! Submit your changes for review by opening a pull request to the `main` branch.

Submitting a Pull Request.

Files

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to LLAMATOR

Getting Started

Prerequisites

Setting Up Your Development Environment

Set Up a Virtual Environment

Install Dependencies

Install pre-commit

Run Tests

Making Changes

Get Started with Your First Contribution: Adding a New Test

1. Create a test file:

2. Set up your file.

3. Creating datasets with texts for attacks.

3. Add your attack file name to the attack_loader.py file:

4. Add your attack name to the docstring of start_testing() in main.py and initial_validation.py file:

5. Add your attack to the attack_descriptions.json and attack_descriptions.md files.

6. Open a PR! Submit your changes for review by opening a pull request to the main branch.

Submitting a Pull Request.

3. Add your attack file name to the `attack_loader.py` file:

4. Add your attack name to the docstring of `start_testing()` in `main.py` and `initial_validation.py` file:

5. Add your attack to the `attack_descriptions.json` and `attack_descriptions.md` files.

6. Open a PR! Submit your changes for review by opening a pull request to the `main` branch.