FEAT: Gradio HiTL Scorer #722

mart123p · 2025-02-18T15:42:56Z

Description

This PR introduces a new UI for scoring using Gradio, which improves the user experience by running in a separate process.

UI Implementation

The new UI is implemented using Gradio and runs in a different process. This allows the UI to wait on scoring tasks and respond accordingly, preventing the UI from flashing and ensuring it remains on screen throughout the scoring process.

Communication Mechanism

The communication between the UI and the scoring process is done using rpyc, which implements an RPC mechanism on localhost.

Gradio Features

Gradio allows us to display more than just messages. We can display entire conversations or even multimedia content. By default, the Gradio HiTL will be launched in a web view, but the web browser can be opened by setting the argument open_browser to true.

Platform Support

Currently, this implementation supports Windows and is limited to string scoring. Future updates will include support for other platforms and content types.

Dependency Changes

The dependency aiofiles was downgraded from 24.1.0 to 23.2.1 to resolve a compatibility issue with Gradio. No issues were identified with this downgrade in PyRIT.

Tests and Documentation

I am looking for feedback on the types of tests (unit, integration, etc.) and documentation (user guide, API reference, etc.) that should be added for this PR. This scorer is meant as a drop-in replacement for the current HiTL scorer.

romanlutz

Aside from the comments, I have two more questions:

Is there any way to test this in an automated fashion?
Do we want to document this already or keep it "soft launched"/experimental until some other features are added or we get feedback? This is pretty cool and warrants a blog at some point for sure!

romanlutz · 2025-02-25T00:06:05Z

pyproject.toml

@@ -37,7 +37,7 @@ classifiers = [
 requires-python = ">=3.10, <3.13"
 dependencies = [
    "aioconsole>=0.7.1",
-    "aiofiles>=24.1.0",
+    "aiofiles>=23.2.1",


Is this downgrade necessary?

Yes there's an issue with Gradio

Then we need to document that here otherwise we'll accidentally break it soon.

Also this just allows more versions and doesn't actually force downgrading

romanlutz · 2025-02-25T00:07:11Z

pyrit/score/human_in_the_loop_gradio.py

+from typing import Optional
+
+class HumanInTheLoopScorerGradio(Scorer):
+


The class or constructor needs a docstring

pyrit/ui/app.py

romanlutz · 2025-02-25T00:09:46Z

pyrit/ui/app.py

+        if len(sys.argv) > 1:
+            open_browser = sys.argv[1] == "True"
+
+        from scorer import GradioApp


I'm guessing this is here so that we don't import it if the gradio extra isn't installed?

If so, we import gradio in the scorer file. So that won't help, right?

pyrit/ui/rpc_client.py

pyrit/ui/rpc.py

romanlutz · 2025-02-25T00:20:42Z

pyrit/ui/rpc.py

+        """
+        RPC service is the service that RPyC is using
+        """
+        def __init__(self, score_received_sem: Semaphore, client_ready_sem: Semaphore):


Two things:

I'd recommend spelling it out. Characters are free 🙂

In most cases, I tell people to prefer kw-only args, so that would be __init__(self, *, ...)

romanlutz · 2025-02-25T00:21:19Z

pyrit/ui/rpc.py

+
+        # Start the RPC server.
+        self.__rpc_service = self.RpcService(self.__score_received_sem, self.__client_ready_sem)
+        self.__server = self.rpyc.ThreadedServer(self.__rpc_service, port=DEFAULT_PORT, protocol_config={"allow_all_attrs": True})


Should probably be configurable in case there's a conflict. A default value is reasonable, of course.

mart123p · 2025-03-03T21:46:33Z

Aside from the comments, I have two more questions:

Is there any way to test this in an automated fashion?

Do we want to document this already or keep it "soft launched"/experimental until some other features are added or we get feedback? This is pretty cool and warrants a blog at some point for sure!

Automated testing could be done at the RPC level. Testing E2E would require a UI testing framework. I would rather go the low hanging fruit to start testing first.
I agree it should definitely be labeled as experimental, and not be ready for mass adoption yet. This will give us some time to review the architecture, maybe add supports for other platforms.

Martin Pouliot added 4 commits February 17, 2025 17:33

Added gradio scorer implementation

f62e992

Added missing export

c5a24d5

Fixed a few issues with PyRIT integration

b1df059

Fixed a typo

829532d

mart123p changed the title ~~FEAT: Gradio HiTL Scorer~~ [DRAFT] FEAT: Gradio HiTL Scorer Feb 18, 2025

Martin Pouliot added 5 commits February 18, 2025 10:52

Fixed a typing issue

1499dc2

Fixed import errors

2557cfd

Changed global import to scoped import

3ba7ca6

Fixed an import issue

53b9b9c

Added HumanInTheLoopScorerGradio to doc

4e279b1

mart123p changed the title ~~[DRAFT] FEAT: Gradio HiTL Scorer~~ FEAT: Gradio HiTL Scorer Feb 19, 2025

romanlutz reviewed Feb 25, 2025

View reviewed changes

mart123p added 8 commits March 3, 2025 09:47

Added missing copyright

6616eee

Added docstring to constructor

0970314

Changed RPC capitalization

9a45e55

Added RPC code description

bf0931e

Added a comment about Gradio aiofiles dependency

e9959be

Changed coding style for private members

ff7b5fb

Extracted button click logic

4941211

Changed functions to use kw-only args

b4eb898

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: Gradio HiTL Scorer #722

FEAT: Gradio HiTL Scorer #722

mart123p commented Feb 18, 2025

romanlutz left a comment

romanlutz Feb 25, 2025

mart123p Mar 3, 2025

romanlutz Mar 3, 2025

romanlutz Mar 3, 2025

romanlutz Feb 25, 2025

romanlutz Feb 25, 2025

romanlutz Feb 25, 2025

romanlutz Feb 25, 2025

mart123p commented Mar 3, 2025

		from typing import Optional

		class HumanInTheLoopScorerGradio(Scorer):

FEAT: Gradio HiTL Scorer #722

Are you sure you want to change the base?

FEAT: Gradio HiTL Scorer #722

Conversation

mart123p commented Feb 18, 2025

Description

UI Implementation

Communication Mechanism

Gradio Features

Platform Support

Dependency Changes

Tests and Documentation

romanlutz left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mart123p commented Mar 3, 2025