Implementation of the 2003 paper "A Neural Probabilistic Language Model" by Yoshua Bengio et al. Inspired by Andrej Karpathy's "Neural Networks: Zero to Hero" lecture series.
This project is a name generator built on the architecture from "A Neural Probabilistic Language Model". It generates names from either no prompt or a custom user prompt. I made this project to learn more about language modeling, experiment with Xavier initialization, and implement weight decay in a stochastic gradient descent (SGD) optimizer with momentum.
- Features
- Setup
- Usage
- Configuration
- Model Training
- Name Generation App
- Hyperparameter Tuning
- File Structure
## Features

- Generate names given a short prompt.
- Custom Optimizer: SGD + Momentum + Weight Decay (see the optimizer sketch after this list)
  - Weight decay applied on update
  - Momentum tracking
  - Custom `step` function for SGD
- Custom Trainer
  - Automatic device discovery
  - Linear learning rate decay
  - Fit function with a `tqdm` progress bar
  - Evaluation for model tuning using the perplexity metric, `torch.exp(F.cross_entropy(logits, targets))`, as described in the paper (see the perplexity sketch after this list)
- Model Implementation
  - Custom serialization
  - Custom model loading
  - Custom linear layer
  - Weight initialization using custom Xavier implementations, uniform & normal (see the layer sketch after this list)
- Data Loaders (see the dataset sketch after this list)
  - Parse the text dataset
  - Random split into train, test, and dev sets
- Minimal Use of PyTorch
  - Only uses PyTorch for `CrossEntropyLoss` and autograd for backpropagation.
  - Utilizes `torch.Generator()` for reproducibility.
- Custom Hyperparameter Tuning
  - Uses random sampling over the hyperparameter space.
  - Test NLL: 1.9883
  - Test perplexity: 7.3030
  - Best parameters:

    ```json
    {
      "lr_start": 0.1,
      "lr_end": 0.001,
      "h_size": 328,
      "context_size": 6,
      "emb_dim": 24,
      "weight_decay": 0.00022564512422341903,
      "momentum": 0.9,
      "batch_size": 512
    }
    ```
## Setup

Ensure you have the correct Python version installed:

- Python >= 3.10
Install the required Python packages:

```bash
pip install -r requirements.txt
```

## Configuration

Configurations are stored in the `configs` directory. Ensure that your configuration files are correctly set up before running the training or inference scripts.
## Model Training

To train the model, use the `train.py` script with the appropriate configuration file:

```bash
python train.py --config configs/config.json
```

## Name Generation App

Run the name generation application using:
```bash
python app.py --prompt "Prompt Here" --num_names 5 --temperature 0.7
```

To change which model is used, edit the model path in `configs/app_config.json`.
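The `--temperature` flag presumably scales the logits before sampling, as is standard; a minimal sketch (the function name is hypothetical):

```python
import torch
import torch.nn.functional as F

def sample_next(logits, temperature=0.7, generator=None):
    """Lower temperature sharpens the distribution; higher flattens it."""
    probs = F.softmax(logits / temperature, dim=-1)
    return torch.multinomial(probs, num_samples=1, generator=generator).item()
```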
## Hyperparameter Tuning

Perform hyperparameter tuning using the `hyperparameter_search.py` script:

```bash
python hyperparameter_search.py
```

A typical configuration file (`configs/config.json`) looks like:
```json
{
  "runName": "gpt_but_2003",
  "dataPath": "data/names.txt",
  "epochs": 1000,
  "batchSize": 512,
  "learningRateDecay": [0.1, 0.001],
  "vocab": 27,
  "hiddenSize": 350,
  "embeddingSize": 12,
  "context": 6,
  "weightInitialization": "normal",
  "weightDecay": 0.0001,
  "momentum": 0.9,
  "generatorSeed": 42
}
```
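The training script presumably parses this file with the standard `json` module; a small sketch of reading the fields above:

```python
import json

with open("configs/config.json") as f:
    cfg = json.load(f)

lr_start, lr_end = cfg["learningRateDecay"]   # linear decay endpoints
vocab, context = cfg["vocab"], cfg["context"]
h_size, emb_dim = cfg["hiddenSize"], cfg["embeddingSize"]
```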