
Tailscale step runs successfully but subsequent steps to connect to DB fail #130

Open
khernandezrt opened this issue Jun 4, 2024 · 11 comments


@khernandezrt

We created the correct tags and set the scope to device.
The Tailscale step runs (I don't see any confirmation that we are connected), but the step that runs my tests fails with:

ERROR tests/mycode/code/test_my_code.py - sqlalchemy.exc.OperationalError: (pymysql.err.OperationalError) (2003, "Can't connect to MySQL server on 'mysqlserver.us-east-1.rds.amazonaws.com' (timed out)")

We also see the node being created in the Tailscale UI, but I keep getting a timeout when I run pytest.

```yaml
name: Python application

on:
  push:
    branches: [ "feature/github-actions" ]
  pull_request:
    branches: [ "feature/github-actions" ]

env:
  AWS_CONFIG_FILE: .github/workflows/aws_config
  DB_NAME: "mydbname"
  DB_READ_SERVER: "mysqlserver.us-east-1.rds.amazonaws.com"
  DB_USERNAME: "root"
  DB_PASSWORD: ${{secrets.DB_PASSWORD}}

  AWS_PROFILE: "dev"
  API_VERSION: "v1"
  FRONT_END_KEY: ${{secrets.FRONT_END_KEY}}

  LOG_LEVEL: "INFO"
  DB_USER_ID: 32
  SENTRY_SAMPLE_RATE: 1
  NUMEXPR_MAX_THREADS: "8"

  LOG_LEVEL_CONSOLE: True
  LOG_LEVEL_ALGORITHM: "INFO"
  LOG_LEVEL_DB: "WARNING"

permissions:
  contents: read

jobs:
  build:
    runs-on: ubuntu-latest

    steps:
      - name: Tailscale
        uses: tailscale/github-action@v2
        with:
          oauth-client-id: ${{ secrets.TS_OAUTH_CLIENT_ID }}
          oauth-secret: ${{ secrets.TS_OAUTH_SECRET }}
          tags: tag:cicd
      - uses: actions/checkout@v4
      - name: Set up Python 3.12
        uses: actions/setup-python@v3
        with:
          python-version: "3.12"
      - name: Install dependencies
        run: |
          pip install -r requirements-dev.txt
      - name: Test with pytest
        env: 
          PYTHONPATH: ${{github.workspace}}/src
        run: |
          pytest
```
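
A minimal diagnostic sketch (not part of the original workflow) that could sit between the Tailscale step and the pytest step, assuming the tailscale CLI installed by the action is on the runner's PATH; it only shows whether the node actually joined the tailnet and whether the DB hostname resolves:

```yaml
      # Hypothetical diagnostic step: confirm the runner joined the tailnet and
      # that the DB hostname resolves before the tests run.
      - name: Check Tailscale connectivity
        run: |
          tailscale status    # lists tailnet peers visible to this node
          tailscale netcheck  # reports UDP reachability and the nearest DERP relay
          getent hosts "$DB_READ_SERVER" || echo "DNS lookup failed for $DB_READ_SERVER"
```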
@khernandezrt
Author

Switching the URL to a direct IP did the trick. Looks like a DNS issue.
I will leave this issue open, as I'd prefer not to use a direct IP.
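
For staying on the hostname instead of a direct IP, one route (which a later comment in this thread ends up using) is Split DNS for the RDS domain in the Tailscale admin console, plus making sure the runner accepts tailnet DNS. A sketch, under the assumption that the action's `args` input is how extra `tailscale up` flags get passed:

```yaml
      - name: Tailscale
        uses: tailscale/github-action@v2
        with:
          oauth-client-id: ${{ secrets.TS_OAUTH_CLIENT_ID }}
          oauth-secret: ${{ secrets.TS_OAUTH_SECRET }}
          tags: tag:cicd
          # Assumption: `args` is forwarded to `tailscale up`. --accept-dns is
          # normally already the default on Linux; it is spelled out here so the
          # runner uses the tailnet's Split DNS entry for the RDS domain.
          args: --accept-dns=true
```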

@henworth

I'm encountering a similar timeout error, although it doesn't seem to be DNS in my case, as the IP is resolved properly:

Error: Error connecting to PostgreSQL server database.us-east-1.rds.amazonaws.com (scheme: awspostgres): dial tcp correct.ip.address:5432: connect: connection timed out

@khernandezrt
Author

@henworth Have you set up your security policies correctly for your Tailscale instance?

@henworth

henworth commented Jun 10, 2024

Yep, I've done all this. It was working fine and now I'm not sure what's wrong.

Connectivity to this DB works fine from other non-GitHub nodes, using either the hostname or the IP.

@talha5389-teraception

I also started having issues two weeks ago. I have also verified that things work fine outside of GitHub Actions using the same configuration.

@ebarriosjr

I am having the same issue. It had been working perfectly so far, but today I am getting random i/o timeouts.

@ericpollmann

ericpollmann commented Jul 3, 2024

Same here! I had random failures, especially on the first connection to our RDS instance (running in AWS) from a GitHub Actions worker (running in Azure). Subsequent connections after the first failure would succeed. I did some debugging and found that the connection was going through DERP despite the inbound WireGuard port being open for IPv4/v6 on the AWS side.

I changed our setup to first run a single ping to the subnet router's DNS hostname after bringing up Tailscale, and that seemed to dramatically improve reliability, though it still failed 1 time in 10 (that time it was the ping itself that failed).

I then set up Split DNS and haven't had a failure since, though I've only had 10 or so runs since then.
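
A sketch of that warm-up ordering, with subnet-router-1.example.ts.net standing in as a placeholder for the subnet router's DNS hostname:

```yaml
      # Placeholder hostname: replace subnet-router-1.example.ts.net with the
      # subnet router's actual DNS name.
      - name: Warm up the Tailscale path
        run: |
          # One ping right after Tailscale comes up, so the first real connection
          # (from pytest or the app) is not the one paying the setup cost.
          # "|| true" keeps a flaky probe from failing the whole job.
          ping -c 1 subnet-router-1.example.ts.net || true
```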

@henworth

henworth commented Jul 4, 2024

My issue turned out to be related to the stateful filtering added in v1.66.0. Once I disabled that on my subnet routers, the problem disappeared.

@aaomidi

aaomidi commented Nov 25, 2024

I wonder if there's a propagation delay here? E.g. a new node comes up but doesn't propagate fast enough. I wonder if adding a wait of 5 seconds or so would help. Maybe that's why pinging may have helped?

The stateful filtering is interesting, but it seems to be disabled by default.
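
If the propagation-delay guess above is right, the workaround would be as small as this (the 5 seconds is the figure suggested above, not a measured value):

```yaml
      # Hypothetical settle delay between the Tailscale step and anything that
      # dials through the subnet router.
      - name: Wait for the tailnet to settle
        run: sleep 5
```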

@aaomidi

aaomidi commented Dec 5, 2024

@henworth can you describe what flags you changed? I think I'm seeing something similar to this, but in the Helm world this time.

Update:

> --stateful-filtering: Enable stateful filtering for [subnet routers](https://tailscale.com/kb/1019/subnets) and [exit nodes](https://tailscale.com/kb/1103/exit-nodes). When enabled, inbound packets with another node's destination IP are dropped, unless they are a part of a tracked outbound connection from that node. Defaults to disabled.

Seems like the default is false?

@henworth

henworth commented Dec 5, 2024

At the time I wrote that comment the default was true; it has since been changed to false in a subsequent release.
