Skip to content

Latest commit

 

History

History
187 lines (118 loc) · 9.21 KB

README.md

File metadata and controls

187 lines (118 loc) · 9.21 KB

pie.gd Mastodon Instance

pie.gd is my private Mastodon instance hosted on Fly.io.

This repository contains the various config files, scripts, and documentation that I use to run pie.gd. I'm sharing it because other people running Mastodon instances might find it helpful, but this isn't intended to be a general purpose plug-and-play Mastodon setup; it's very specific to my instance.

I've enjoyed learning more about Rails, Sidekiq, and Postgres, which I hadn't previously used much.

Acknowledgments

When I first set out to run a Mastodon instance on Fly.io I did some searching to see if there was prior art and I found Aman Gupta Karmani's tmm1/flyapp-mastodon repo, which was a tremendous help in getting started.

In the weeks since I first began running this server I've seen a few other people post about their own Mastodon instances on Fly.io, and I was fascinated to see some of the similarities and differences in how we solved the same problems.

André Arko's fantastic pull request against tmm1/flyapp-mastodon — which I discovered while writing this readme! — both validates some of the solutions I wasn't quite sure about (like using an in-VM reverse proxy and Hivemind as a process manager) and offers new solutions to problems I haven't solved yet (like how to scale Sidekiq and the Rails app).

I'm grateful to everyone who has shared their work, and I've tried to give credit in this readme and in commit messages where appropriate (and will continue to do so). In that spirit, I hope you'll also feel free to use anything you learn from this repo.

Overview

I use Fly.io to run a small private Mastodon instance using the official Mastodon Docker image. Currently, this involves running a number of Fly.io machines divided across four apps:

  • pie-gd-mastodon-v2: This app uses the same Mastodon Docker image to run different processes in two separate machines:

    • mastodon (shared-cpu-1x, 512MB RAM): Nginx reverse proxy, Mastodon Rails app, and Mastodon Node.js streaming server.

    • sidekiq (shared-cpu-1x, 1GB RAM): Sidekiq job processor.

  • pie-gd-postgres15 (shared-cpu-1x, 512GB RAM x 3): Postgres cluster created using fly pg create and later scaled up.

  • pie-gd-redis (shared-cpu-1x, 256MB RAM x 2): Redis. This is a primary in the sea region and a read-only replica in the sjc region, each with a persistent disk volume.

  • pie-gd-elasticsearch (shared-cpu-1x, 768MB RAM): Single-node Elasticsearch service. Optional.

Media is stored in Backblaze B2, which has an S3-compatible API that Mastodon can use (and lower storage fees than S3).

I use Cloudflare as a CDN in front of the Backblaze B2 bucket. Since Backblaze is part of Cloudflare's Bandwidth Alliance, egress charges from B2 are waived, which means serving media files costs me nothing.

Why use Nginx in the mastodon VM when Fly.io already provides a reverse proxy? Three reasons:

  1. We need to be able to forward requests to both the Mastodon Rails app and the Node.js streaming server.

  2. Mastodon's rate limiting and abuse prevention features rely on being able to trust the x-forwarded-for header to determine the client's IP address, and Fly.io happily passes through any x-forwarded-for value the client sends, which makes it spoofable.

    Nginx is configured to replace the x-forwarded-for header with the value of the fly-client-ip header, which can't be spoofed by the client.

  3. Many Mastodon responses are cacheable. When used as a caching proxy, Nginx easily handles the load spikes that can occur when lots of fediverse instances suddenly fetch post and user data after a post goes viral. These spikes would otherwise have to be handled by Rails and could easily overwhelm a small server (ask me how I know).

Another option for caching would be to use a CDN, but the CDNs that are good for this kind of thing typically cost money and I'm doing this on the cheap.

Cloudflare, while free, is a poor choice here because it not only requires custom configuration to get it to cache these responses, it also always ignores the Vary header, which means it will actually cache these responses incorrectly and very bad things will happen.

Initial Setup

This was adapted from tmm1/flyapp-mastodon, which I found extremely helpful!

I only did this once, and these steps are included here mainly for reference. Fly.io may introduce changes over time, so these steps may or may not continue to work the way they did when I initially set up pie.gd.

App

Note

These commands are what I used to create the original Apps v1 (aka Nomad) app for pie.gd. Apps created on Fly.io now default to the machine-based Apps v2 architecture. Some of these commands (such as setting the region and allocating ips) may no longer be needed, and you may need to use fly machine update to scale memory up after deploying.

fly apps create --org pie-gd --name pie-gd-mastodon
fly regions add sea
fly ips allocate-v4 --region sea
fly ips allocate-v6

After pointing DNS at the app, create a TLS certificate:

fly certs create pie.gd

Secrets

SECRET_KEY_BASE=$(docker run --rm -it tootsuite/mastodon:latest bin/rake secret)
OTP_SECRET=$(docker run --rm -it tootsuite/mastodon:latest bin/rake secret)

fly secrets set \
  OTP_SECRET=$OTP_SECRET \
  SECRET_KEY_BASE=$SECRET_KEY_BASE

docker run \
  --rm \
  -it \
  -e OTP_SECRET=$OTP_SECRET \
  -e SECRET_KEY_BASE=$SECRET_KEY_BASE \
  tootsuite/mastodon:latest \
  bin/rake mastodon:webpush:generate_vapid_key \
| sed 's/\r//' \
| fly secrets import

Redis

fly apps create --org pie-gd --name pie-gd-redis
fly regions add sea --config apps/redis/fly.toml
fly volumes create mastodon_redis --config apps/redis/fly.toml --region sea --size 1
fly deploy apps/redis

To add a read-only replica in sjc:

fly machines list --config apps/redis/fly.toml
fly machines clone <machine id from list> --config apps/redis/fly.toml --region sjc

Postgres

Create a Postgres flex cluster. We'll create two machines in the sea region, scale them up from the default 256MB of RAM to 512MB, and then add a machine in the sjc region to get cross-region replication.

fly pg create --org pie-gd --name pie-gd-postgres15 --initial-cluster-size 3 --region sea --flex

List the machines to get their IDs:

fly machines list --app pie-gd-postgres15

Scale up the machines to increase their RAM (do this for all machines in the list):

fly machine update <machine id from list> --memory 512 --app pie-gd-postgres15

Attach the cluster to the pie-gd-mastodon app:

fly pg attach --app pie-gd-mastodon pie-gd-postgres15

Elasticsearch

Optional. If you choose not to deploy Elasticsearch, be sure to comment out or remove the ES_ENABLED="true" env var in apps/mastodon/Dockerfile.

fly apps create --org pie-gd --name pie-gd-elasticsearch
fly regions add sea --app pie-gd-elasticsearch
fly volumes create mastodon_elasticsearch --app pie-gd-elasticsearch --region sea --size 1
fly deploy apps/elasticsearch

Upload Storage

Uploads are stored in Backblaze B2 in the pie-gd-uploads bucket.

The access key id and secret are set as Fly secrets in the AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY environment variables.

Cloudflare is configured to proxy the files.pie.gd subdomain to the pie-gd-uploads bucket, and there's a URL rewrite transform rule that adds a /file/pie-gd-uploads prefix to the path of each request so that it maps to Backblaze's path structure.

Email

Mastodon is configured to use Postmark to send emails via SMTP. Setting up a Postmark account is out of scope for this readme, but it's not hard.

This is probably not actually necessary for a private single-user Mastodon server, since the only thing you really need email for is the confirmation email when you create your account. You could just skip this and use tootctl to manually create the account. But it'll be necessary if you ever invite other users to your server, or if you're running a public instance.

Initial Deployment

To create the DB schema and run the initial migrations, I temporarily added the following release command to fly.toml:

[deploy]
  release_command = "bundle exec rails db:setup"

Then I deployed the app:

fly deploy

After the initial deployment, I removed the [deploy] section from fly.toml, since the DB setup only needs to happen once.

Once the server is running, you can use fly ssh console to SSH into the running app and run tootctl commands to grant the Owner role to your account. See the Mastodon documentation for details.

Maintenance

See MAINTENANCE.md for documentation on maintenance tasks like backing up and restoring the database.