Hybrid GNN + BiLSTM for Predictive Human Activity and Time Modeling

Human Activity Recognition (HAR) in smart environments has traditionally focused on identifying the current activity of a user based on sensor data. However, many real-world applications, such as assisted living, healthcare monitoring, and smart automation, require systems that go beyond recognition and enable anticipation of future actions.

In a smart home setup, ambient sensors continuously generate event streams reflecting user interactions with the environment. While it is feasible to infer the current activity from these signals, it remains significantly more challenging to:

Predict the next activity a user is likely to perform
Estimate when that activity will begin

These two tasks - activity forecasting and time-to-event prediction, are critical for building proactive systems.

Core Challenges

Several challenges make this problem non-trivial:

Temporal Dependency

Human activities are inherently sequential. The next activity depends not only on the current state but also on historical context over time.

Sensor Interaction Complexity

Smart home environments involve multiple sensors whose interactions form complex patterns. These relationships are better modeled as a graph structure rather than independent signals.

Uncertain Time Intervals

The time gap between activities is highly variable and often skewed, making continuous time prediction unstable without proper modeling.

Class Imbalance and Noise

Certain activities occur far more frequently than others, and sensor data may include noise or missing values, affecting model robustness.

Objectives

This project aims to design a system that:

Predicts the next human activity from a sequence of sensor events
Estimates the time until the next activity begins
Effectively captures both:
1. Spatial relationships between sensors
2. Temporal dependencies across event sequences

Approach Overview

To address these challenges, the problem is formulated as a multi-task learning problem combining:

Activity classification (what happens next)
Time prediction (when it happens)

The system leverages:

Graph-based modeling to capture sensor relationships
Sequence modeling to learn temporal patterns
Discrete time binning to stabilize time prediction

Pipeline

The system is designed as a fully modular end-to-end pipeline:

Data Processing
- Load raw sensor logs (CSV / TXT)
- Clean and normalize sensor values
- Encode categorical variables (sensor IDs, activities)
- Generate temporal features:
  - time gaps (delta_t)
  - cyclic encoding (hour/day sin-cos)
  - previous activity context
Graph Construction
- Build a sensor interaction graph using transition probabilities
- Nodes = sensors
- Edges = co-occurrence / transition strength
- Graph captures spatial relationships between sensors
Sequence Generation
- Convert event stream → sliding window sequences
- Each window → graph representation
- Target:
  - next activity label
  - time until next activity
Time Modeling
- Continuous time → quantile-based bins
- Solves skewed distribution problem
- Enables stable classification-based prediction
Dataset + Dataloader
- Custom SequenceDataset
- Custom collate_fn for graph sequences
Training Pipeline
- Two-stage training:
- Activity prediction
- Time prediction (conditioned on activity)
- Class imbalance handling
- Learning rate scheduling + early stopping

Model Architecture

Figure: Hybrid GNN + BiLSTM architecture for activity and time prediction

The model combines graph learning + temporal modeling + multi-task prediction.

Graph Encoder (GNN)
- Uses Graph Attention Network (GAT)
- Captures:
  - sensor relationships
  - interaction patterns
- Enhancements:
  - Sensor embeddings
  - Previous activity embeddings
- Output: graph-level embedding per timestep
Temporal Model (BiLSTM)
- Processes sequence of graph embeddings
- Learns:
  - temporal dependencies
  - activity transitions
- Output: contextual sequence representation
Activity Prediction Head
- Fully connected layer
- Predicts next activity
Time Prediction (Key Innovation)
- Instead of a single shared head - Activity-conditioned time prediction
- Each activity has its own prediction head
- Learns different temporal patterns per activity
- Additional signals:
  - elapsed time
  - expected duration
  - dynamic progress
Multi-task Learning
- Simultaneously learns:
  - Activity classification
  - Time prediction

Results

Key Results

Activity Accuracy: ~98%
Macro F1 Score: ~0.93
Weighted F1 Score: ~0.98
Time Prediction MAE: ~10-13 minutes
Time NMAE: 0.59
Time Bin Accuracy: ~32%

The model is evaluated on:

Activity Prediction
- Accuracy
- Macro F1
- Weighted F1
Time Prediction
- Time bin accuracy
- Mean Absolute Error (MAE)
- Normalized MAE (NMAE)
Visualization
- Confusion matrix (normalized)
- Classification report

Activity Classification Performance

Figure: Confusion Matrix

Figure: Classification Report

Figure: Activity Metrics

Time Prediction Performance

Figure: Time Metrics

Installation

Follow the steps below to set up the project locally.

1. Clone the repository

git clone https://github.com/your-username/your-repo-name.git
cd your-repo-name

2. Create a virtual environment

python -m venv venv
source venv/bin/activate      # Mac/Linux
venv\Scripts\activate         # Windows

3. Install dependencies

pip install -r requirements.txt

4. Install PyTorch Geometric

PyTorch Geometric requires a separate installation depending on your system.

CPU version:

pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric \
-f https://data.pyg.org/whl/torch-2.0.0+cpu.html

GPU version (example: CUDA 11.8):

pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric \
-f https://data.pyg.org/whl/torch-2.0.0+cu118.html

For other CUDA versions, refer to the official PyTorch Geometric installation guide.

Verify installation

import torch
import torch_geometric

print(torch.__version__)

📂 Project Structure

project/
├── src/ # Core source code
│ ├── models/ # GNN, LSTM, time head
│ ├── data/ # preprocessing & loaders
│ ├── training/ # train & evaluation logic
│ ├── utils/ # helper functions
│ ├── features/ # feature engineering
│ ├── sequences/ # sequence building
│ ├── graph/ # graph construction
│
├── data/ # dataset (CSV files)
├── scripts/ # helper scripts 
├── configs/ # configuration files
├── outputs/ # results (metrics, plots)
├── assets/ # images (architecture diagrams)
├── streamlit_app/ # demo UI 
│
├── main.py # entry point
├── requirements.txt # dependencies
├── README.md

Usage

Run the training pipeline:

python main.py --path data/cairo_labeled.csv --input_type csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid GNN + BiLSTM for Predictive Human Activity and Time Modeling

Core Challenges

Objectives

Approach Overview

Pipeline

Model Architecture

Results

Key Results

Activity Classification Performance

Time Prediction Performance

Installation

1. Clone the repository

2. Create a virtual environment

3. Install dependencies

4. Install PyTorch Geometric

Verify installation

📂 Project Structure

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
assets		assets
configs		configs
data		data
outputs		outputs
scripts		scripts
src		src
streamlit_app		streamlit_app
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Hybrid GNN + BiLSTM for Predictive Human Activity and Time Modeling

Core Challenges

Objectives

Approach Overview

Pipeline

Model Architecture

Results

Key Results

Activity Classification Performance

Time Prediction Performance

Installation

1. Clone the repository

2. Create a virtual environment

3. Install dependencies

4. Install PyTorch Geometric

Verify installation

📂 Project Structure

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages