# Transformer-based Recommender Models in PyTorch for MovieLens
This repository provides modular implementations of recommender systems using transformer architectures, matrix factorization, and sequential models. It is designed for research and experimentation on MovieLens data, with scalable data access and experiment tracking.
- Core package (`xfmr_rec/`):
  - `data.py`: Data loading and preprocessing (MovieLens, LanceDB)
  - `models.py`, `mf/`, `seq/`, `seq_embedded/`: Model architectures (MF, sequential, transformer)
  - `losses.py`: Custom loss functions (BPR, CCL, SSM, etc.; a minimal BPR sketch follows this list)
  - `metrics.py`: Evaluation metrics
  - `trainer.py`: Training loop and experiment management (PyTorch Lightning)
  - `service.py`, `deploy.py`: Model serving and deployment utilities
- Data:
  - `data/`: Raw and processed MovieLens datasets (Parquet format)
  - `lance_db/`: LanceDB format for fast retrieval
- Experiment logs:
  - `lightning_logs/`, `mlruns/`: Model checkpoints and experiment tracking (MLflow)
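As a concrete illustration of the pairwise losses implemented in `losses.py`, here is a minimal BPR sketch in plain PyTorch; the actual signatures and reduction choices in the repo may differ:

```python
import torch
import torch.nn.functional as F


def bpr_loss(pos_scores: torch.Tensor, neg_scores: torch.Tensor) -> torch.Tensor:
    """Bayesian Personalized Ranking (Rendle et al., arXiv:1205.2618).

    Pushes scores of observed (positive) items above sampled negatives:
    loss = -log sigmoid(s_pos - s_neg), averaged over the batch.
    """
    return -F.logsigmoid(pos_scores - neg_scores).mean()


# Toy usage: scores for 4 (user, pos_item, neg_item) triples
pos = torch.tensor([2.0, 1.5, 0.3, 1.0])
neg = torch.tensor([0.5, 1.0, 0.8, -0.2])
print(bpr_loss(pos, neg))  # scalar loss, differentiable w.r.t. the scores
```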
## Requirements
- Python 3.12+ (the project is developed and tested on 3.12)
- The repository uses `uv` to manage virtual environments and tasks. See `pyproject.toml` for pinned dependencies.
Install dependencies with `uv` (recommended):

```bash
# set up the environment and install pinned deps
uv sync
```

This repo ships helper scripts to download and convert MovieLens datasets into Parquet and LanceDB formats.
Example: prepare MovieLens 1M (ml-1m) and write Parquet files into `data/`:

```bash
# fetch, extract and convert to parquet
uv run data
```

If you already have the original files (for example `ml-1m.zip`), place them under `data/` and `uv run data` will pick them up. Otherwise the script will download and extract the dataset.
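Assuming the conversion step writes standard Parquet files under `data/` (the exact file names below are hypothetical; check the directory after running `uv run data`), the output can be inspected with pandas:

```python
import pandas as pd

# Hypothetical path: the actual file layout is whatever the conversion
# script produces under data/.
ratings = pd.read_parquet("data/ml-1m/ratings.parquet")
print(ratings.head())   # peek at the first interactions
print(ratings.dtypes)   # confirm column types survived the conversion
```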
Training is implemented with PyTorch Lightning. The repository exposes several task entrypoints.
Common training commands:

```bash
# Train a sequential transformer model for 16 epochs
uv run seq_train fit --trainer.max_epochs 16

# Train a matrix factorization model
uv run mf_train fit --trainer.max_epochs 10
```

Check the `pyproject.toml` entrypoints for available tasks and the `xfmr_rec/` modules for model and trainer configuration.
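The `fit --trainer.max_epochs ...` syntax suggests these entrypoints are built on Lightning's `LightningCLI`. A minimal sketch of how such an entrypoint is typically wired is shown below; the class names are hypothetical stand-ins for whatever lives under `xfmr_rec/seq/`:

```python
from lightning.pytorch.cli import LightningCLI

# Hypothetical names: substitute the real LightningModule / LightningDataModule
# classes defined in xfmr_rec/seq/.
from xfmr_rec.seq import SeqRecModule, SeqRecDataModule


def main() -> None:
    # LightningCLI exposes subcommands such as `fit` and maps CLI flags like
    # --trainer.max_epochs onto the Trainer configuration.
    LightningCLI(SeqRecModule, SeqRecDataModule)


if __name__ == "__main__":
    main()
```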
The repository contains lightweight deployment utilities to run a retrieval service from a Lightning checkpoint.
```bash
# Serve a sequential model checkpoint on localhost
uv run python -m xfmr_rec.seq.deploy --ckpt_path <path/to/checkpoint.ckpt>
```

See `xfmr_rec/service.py` and `xfmr_rec/deploy.py` for convenience functions that load a checkpoint and expose a simple predict/retrieve API. The code uses LanceDB or Parquet data for fast lookups when available.
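When LanceDB is available, retrieval typically reduces to a vector search against a table of item embeddings. The sketch below shows the general LanceDB access pattern; the table name, embedding dimension, and query construction are assumptions, so see `xfmr_rec/service.py` for the repo's actual implementation:

```python
import lancedb

# Connect to the on-disk LanceDB directory used by the repo.
db = lancedb.connect("lance_db")

# Hypothetical table name; list the real ones with db.table_names().
items = db.open_table("items")

# In practice the query vector comes from the trained model's user/sequence
# encoder; a zero vector of assumed dimension 64 stands in here.
query_vector = [0.0] * 64
top_k = items.search(query_vector).limit(10).to_pandas()
print(top_k)  # nearest items by vector distance
```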
- Models are organized by type in subfolders (`mf/`, `seq/`, `seq_embedded/`) for extensibility.
- Custom loss functions live in `xfmr_rec/losses.py` and are referenced by trainer hooks.
- Experiment tracking is handled by PyTorch Lightning and MLflow; checkpoints and logs are stored in `lightning_logs/` and `mlruns/`.
- Data access is optimized using Parquet and (optionally) LanceDB for retrieval workloads.
Task entrypoints are defined in `pyproject.toml` and wired to `uv` tasks. Typical entrypoints include:
- `data`: dataset download and conversion utilities
- `mf_train`, `mf_deploy`, `mf_tune`: matrix factorization training / deployment / tuning workflows
- `seq_train`, `seq_deploy`, `seq_tune`: sequential / transformer training / deployment / tuning workflows
- `seq_embedded_train`, `seq_embedded_deploy`: embedded sequential transformer workflows
Run `uv run` (without arguments) to list available tasks, or inspect `pyproject.toml` for concrete command mappings.
- If you see dependency or Python version errors, confirm you are using Python 3.12 and run `uv sync` to recreate the virtual environment.
- If training fails with out-of-memory errors, reduce `trainer.batch_size` or enable gradient accumulation via `trainer.accumulate_grad_batches`.
- Use the Lightning logs folder (`lightning_logs/`) to inspect checkpoints and TensorBoard summaries.
- Google Slides: Collaborative Filtering with Implicit Feedback
- [2101.08769] Item Recommendation from Implicit Feedback
- TensorFlow Recommenders Retrieval
- BPR: [1205.2618] Bayesian Personalized Ranking from Implicit Feedback
- CCL: [2109.12613] SimpleX: A Simple and Strong Baseline for Collaborative Filtering
- SSM: [2201.02327] On the Effectiveness of Sampled Softmax Loss for Item Recommendation
- DirectAU: [2206.12811] Towards Representation Alignment and Uniformity in Collaborative Filtering
- MAWU: [2308.06091] Toward a Better Understanding of Loss Functions for Collaborative Filtering
- InfoNCE+, MINE+: [2312.08520] Revisiting Recommendation Loss Functions through Contrastive Learning
- LogQ correction: Sampling-Bias-Corrected Neural Modeling for Large Corpus Item Recommendations
- MNS: Mixed Negative Sampling for Learning Two-tower Neural Networks in Recommendations
- Hashing Trick: [0902.2206] Feature Hashing for Large Scale Multitask Learning
- Hash Embeddings: [1709.03933] Hash Embeddings for Efficient Word Representations
- Bloom embeddings: Compact word vectors with Bloom embeddings