Graphormer is a graph representation learning model designed to encode structural graph features, including node centrality, shortest path distances, and Laplacian eigenvectors/eigenvalues. This repository contains an implementation of the Graphormer model and its components, including a sign-invariant Laplacian positional encoding.
The model is based on the papers *Do Transformers Really Perform Bad for Graph Representation?* and *Sign and Basis Invariant Networks for Spectral Graph Representation Learning*.
- Installation
- Usage
- Configuration
- Components
- Model Architecture
- Training
- Plots
- Results
- Test MAE
- Caution
- License
To install the required dependencies, run:
```bash
pip install -r requirements.txt
```
To use the Graphormer model, you need to prepare your dataset and configuration. Below is an example of how to initialize and use the model:
```python
from configuration import Config
from phormerModel import Graphormer

cfg = Config()
model = Graphormer(cfg)

# Example input data (see zincdata.py for how these tensors are built)
node_feat = ...    # node (atom) features
in_degree = ...    # per-node in-degrees
out_degree = ...   # per-node out-degrees
path_data = ...    # edge features along the shortest paths
dist = ...         # shortest path distance matrix
eigenvecs = ...    # Laplacian eigenvectors
eigen_value = ...  # Laplacian eigenvalues

output = model(node_feat, in_degree, out_degree, path_data, dist, eigenvecs, eigen_value)
```
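For a quick smoke test, the placeholders can be filled with dummy tensors. The shapes below are hypothetical (a batch of `B` graphs with `N` nodes and `K` eigenvectors); consult `zincdata.py` for the actual layout:

```python
import torch

# Hypothetical shapes -- the real layout is defined by the dataset code.
B, N, K = 2, 16, 4
node_feat   = torch.randint(0, 10, (B, N))        # categorical atom features
in_degree   = torch.randint(0, 8, (B, N))         # per-node in-degrees
out_degree  = torch.randint(0, 8, (B, N))         # per-node out-degrees
path_data   = torch.randint(0, 4, (B, N, N, 1))   # edge features along shortest paths
dist        = torch.randint(0, 10, (B, N, N))     # shortest path distances
eigenvecs   = torch.randn(B, N, K)                # Laplacian eigenvectors
eigen_value = torch.randn(B, K)                   # Laplacian eigenvalues
```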
The configuration for the Graphormer model is defined in the `Config` class in `configuration.py`. You can customize various parameters such as model dimensions, number of layers, dropout rates, and more.
```python
from configuration import Config

cfg = Config()
print(cfg)
```
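Parameters can be overridden after constructing the config; the attribute names below (other than `lr`, which the training example uses) are illustrative guesses, so check `configuration.py` for the real fields:

```python
from configuration import Config

cfg = Config()
cfg.lr = 3e-4      # learning rate, referenced by the training example as cfg.lr
cfg.dropout = 0.1  # hypothetical attribute name -- consult configuration.py
print(cfg)
```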
- **Spatial Encoder**: Encodes shortest path distances using an embedding table.
- **Centrality Encoder**: Encodes node centrality features based on in-degrees and out-degrees.
- **Edge Encoder**: Encodes edge features along the shortest path.
- **Encoder Layer**: Consists of multi-head attention and feed-forward network layers.
- **SignNet**: A neural architecture that is invariant to a key symmetry of eigenvectors, sign flips: if v is an eigenvector then so is −v. It encodes eigenvectors and eigenvalues using ρ and φ neural networks: f(v1, ..., vk) = ρ(φ1(v1), ..., φk(vk)). See the sketch below.
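As a minimal sketch of the sign-invariance idea (not the exact implementation in this repository; layer sizes and names are illustrative), applying φ to both v and −v and summing makes the output unchanged under sign flips:

```python
import torch
import torch.nn as nn

class SignInvariantEncoder(nn.Module):
    """Illustrative SignNet-style sketch: phi(v) + phi(-v) is sign-invariant,
    and rho mixes the k per-eigenvector embeddings. Dimensions are assumed."""
    def __init__(self, k: int, hidden: int = 16, out_dim: int = 32):
        super().__init__()
        self.phi = nn.Sequential(nn.Linear(1, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
        self.rho = nn.Sequential(nn.Linear(k * hidden, out_dim), nn.ReLU())

    def forward(self, eigenvecs: torch.Tensor) -> torch.Tensor:
        # eigenvecs: [num_nodes, k]; each eigenvector entry is a scalar feature
        v = eigenvecs.unsqueeze(-1)              # [num_nodes, k, 1]
        h = self.phi(v) + self.phi(-v)           # sign-invariant per eigenvector
        return self.rho(h.flatten(start_dim=1))  # [num_nodes, out_dim]

enc = SignInvariantEncoder(k=4)
vecs = torch.randn(10, 4)
assert torch.allclose(enc(vecs), enc(-vecs))     # sign flips do not change the output
```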
The Graphormer model consists of several key components:
- Atom Encoder: Embeds node features.
- Bond Encoder: Embeds edge features (if edge encoding is enabled).
- Centrality Encoder: Encodes node centrality features based on in-degrees and out-degrees.
- Spatial Encoder: Encodes shortest path distances.
- SignNet: Encodes eigenvectors and eigenvalues.
- Encoder Layers: Stacked layers of multi-head attention and feed-forward networks.
- Output Layer: Produces the final graph representation.
The model takes various graph features as input and processes them through these components to generate a graph representation.
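As an illustration of how the Centrality Encoder can work, here is a minimal sketch that adds degree embeddings to the node features; the maximum degree and embedding size are assumptions, not the values used in this repository:

```python
import torch
import torch.nn as nn

class CentralityEncoder(nn.Module):
    """Illustrative sketch: look up one embedding per in-degree and one per
    out-degree, and add both to the node features."""
    def __init__(self, max_degree: int = 64, dim: int = 32):
        super().__init__()
        self.in_deg_emb = nn.Embedding(max_degree + 1, dim)
        self.out_deg_emb = nn.Embedding(max_degree + 1, dim)

    def forward(self, node_feat, in_degree, out_degree):
        # node_feat: [num_nodes, dim]; degrees: [num_nodes] integer tensors
        return node_feat + self.in_deg_emb(in_degree) + self.out_deg_emb(out_degree)

enc = CentralityEncoder()
x = torch.randn(5, 32)
out = enc(x, torch.tensor([1, 2, 0, 3, 1]), torch.tensor([2, 1, 1, 0, 3]))
```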
To train the Graphormer model, you need to prepare your dataset and define the training loop. Below is a simplified example:
```python
import torch

from zincdata import ZincDataset
from configuration import Config
from phormerModel import Graphormer

cfg = Config()
dataset = ZincDataset(cfg=cfg)
model = Graphormer(cfg)

# Define your optimizer and loss function
optimizer = torch.optim.Adam(model.parameters(), lr=cfg.lr)
loss_fn = torch.nn.MSELoss()

# Training loop
num_epochs = 300
for epoch in range(num_epochs):
    for batch in dataset.train_loader:
        # Prepare input data (the target is assumed to be the last element of the batch)
        node_feat, in_degree, out_degree, path_data, dist, eigenvecs, eigen_value, target = batch

        # Forward pass
        output = model(node_feat, in_degree, out_degree, path_data, dist, eigenvecs, eigen_value)

        # Compute loss
        loss = loss_fn(output, target)

        # Backward pass and optimization
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```
After training, the training and validation losses are plotted and saved as `losses_plot.png`.
The model's results are plotted and saved as `result.png`.
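A minimal matplotlib sketch of how such a plot can be produced, assuming per-epoch loss lists were collected during training (the helper name is hypothetical):

```python
import matplotlib.pyplot as plt

def plot_losses(train_losses, val_losses, path="losses_plot.png"):
    # train_losses / val_losses: lists of per-epoch losses from the training loop
    plt.figure()
    plt.plot(train_losses, label="train")
    plt.plot(val_losses, label="validation")
    plt.xlabel("epoch")
    plt.ylabel("loss")
    plt.legend()
    plt.savefig(path)
```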
After training the model for 300 epochs, the test Mean Absolute Error (MAE) achieved is 0.222. The configuration used is defined in `configuration.py`.
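For reference, a sketch of how test MAE can be computed with `torch.nn.L1Loss`, assuming the same batch layout as in the training example:

```python
import torch

def evaluate_mae(model, loader):
    l1 = torch.nn.L1Loss()
    model.eval()
    total, count = 0.0, 0
    with torch.no_grad():
        for batch in loader:
            *inputs, target = batch  # assumed layout, as in the training loop
            output = model(*inputs)
            total += l1(output, target).item() * target.numel()
            count += target.numel()
    return total / count  # mean absolute error over the whole test set
```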
Due to hardware constraints, training was limited to 300 epochs; further training on more powerful hardware could improve the model's performance.
Do not set K (the number of eigenvalues/eigenvectors) above 10 when using eigenvalues: if a graph has fewer than K nodes, the eigendecomposition produces NaN eigenvalues. You can either pick a valid K as the graph size varies, or use all eigenvalues by summing over the per-eigenvector embeddings: f(v1, ..., vk) = ρ(φ1(v1) + ... + φk(vk)).
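A minimal sketch of that summed variant: because the φ embeddings are summed over eigenvectors before ρ is applied, the output no longer depends on K, so graphs with different numbers of eigenvectors pass through the same module (illustrative code, not this repository's implementation):

```python
import torch
import torch.nn as nn

class SummedSignNet(nn.Module):
    """Sketch of f(v1..vk) = rho(sum_i phi(vi) + phi(-vi)); handles any k."""
    def __init__(self, hidden: int = 16, out_dim: int = 32):
        super().__init__()
        self.phi = nn.Sequential(nn.Linear(1, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
        self.rho = nn.Linear(hidden, out_dim)

    def forward(self, eigenvecs):                    # [num_nodes, k] for any k
        v = eigenvecs.unsqueeze(-1)                  # [num_nodes, k, 1]
        h = (self.phi(v) + self.phi(-v)).sum(dim=1)  # sum over eigenvectors: K drops out
        return self.rho(h)                           # [num_nodes, out_dim]

enc = SummedSignNet()
out4 = enc(torch.randn(10, 4))  # k = 4
out7 = enc(torch.randn(10, 7))  # k = 7 -- same module handles both
```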
To run the full training pipeline:

```bash
python trainpipeline.py
```
This project is licensed under the MIT License.