Medium to large transfers fail in relay only mode #3092

Closed
rklaehn opened this issue Jan 3, 2025 · 3 comments
Labels
bug Something isn't working c-iroh
Milestone

Comments

@rklaehn
Contributor

rklaehn commented Jan 3, 2025

Doing a medium to large data transfer using the transfer example fails on main, with a timeout.

This fails even when running a local relay.

Steps to reproduce: you need three terminals, one each for the sender, the receiver, and the relay. Both sender and receiver must run in relay-only mode, which is currently enabled by setting DEV_RELAY_ONLY=1 at compile time.

Relay: RUST_LOG=error cargo run --release -p iroh-relay --features server -- --dev
Sender: DEV_RELAY_ONLY=1 cargo run --release --example transfer --all-features -- provide --size=1g --relay-url http://127.0.0.1:3340
Receiver: DEV_RELAY_ONLY=1 cargo run --release --example transfer --all-features -- fetch --relay-url http://127.0.0.1:3340 <ticket_from_sender>

The transfer will slow down and then fail after a few megabytes.

For a detailed discussion, see https://discord.com/channels/949724860232392765/1324353518017974354

The issue appears to be fixed on #3077, so hopefully it stays fixed once that code lands on main.

@rklaehn rklaehn added c-iroh bug Something isn't working labels Jan 3, 2025
@dignifiedquire dignifiedquire added this to the v0.31.0 milestone Jan 3, 2025
@flub
Contributor

flub commented Jan 3, 2025

I strongly suspect this is a duplicate of #2951, which #3062 aims to address.

@Arqu
Collaborator

Arqu commented Jan 7, 2025

This seems to be fixed for me (running with #3099).

@flub
Contributor

flub commented Jan 7, 2025

Let's close this for now then. My hand-wavy explanation is that flows through the magic socket actors on the way to the relay server either effectively deadlocked or slowed to a crawl, hence the connection timeouts.
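To make that hand-wavy explanation slightly more concrete, here is a minimal sketch of how a bounded queue between a fast producer and a slow consumer stalls. It is only an illustration under assumptions: it is not iroh's magic socket code, and `count_rejected` and its parameters are hypothetical names. It uses only the standard library's `sync_channel`.

```rust
use std::sync::mpsc::{sync_channel, TrySendError};

/// Hypothetical model, not iroh code: a producer pushes `packets` items into
/// a bounded queue of size `capacity`. After every `burst` sends, the consumer
/// drains at most `drained_per_round` items. Returns how many sends were
/// rejected because the queue was full.
fn count_rejected(
    capacity: usize,
    packets: usize,
    burst: usize,
    drained_per_round: usize,
) -> usize {
    let (tx, rx) = sync_channel::<usize>(capacity);
    let mut rejected = 0;
    for i in 0..packets {
        match tx.try_send(i) {
            Ok(()) => {}
            // The queue is full: the consumer is not keeping up.
            Err(TrySendError::Full(_)) => rejected += 1,
            Err(TrySendError::Disconnected(_)) => break,
        }
        // The slow consumer only gets a turn after each burst of sends.
        if (i + 1) % burst == 0 {
            for _ in 0..drained_per_round {
                if rx.try_recv().is_err() {
                    break;
                }
            }
        }
    }
    rejected
}

fn main() {
    let slow = count_rejected(4, 100, 10, 2);
    let balanced = count_rejected(4, 100, 1, 1);
    println!("slow consumer: {slow} of 100 sends rejected");
    println!("balanced consumer: {balanced} of 100 sends rejected");
}
```

When the consumer drains far less than the producer offers, most sends never get through. On a real connection, a stall of that kind could plausibly surface as a timeout like the one reported here.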

@flub flub closed this as completed Jan 7, 2025
@github-project-automation github-project-automation bot moved this to ✅ Done in iroh Jan 7, 2025
4 participants