Fraud_Transaction_Detection

This is a short machine learning project to detect fraud transaction.

Transaction Data from the Kaggle Competition "IEEE Fraud Detection"

Models used:

Light GBM
XGBoost
Random Forest

Competition overview:

In this competition you are predicting the probability that an online transaction is fraudulent, as denoted by the binary target isFraud.

The data is broken into two files identity and transaction, which are joined by TransactionID. Not all transactions have corresponding identity information.

Categorical Features - Transaction

ProductCD
card1 - card6
addr1, addr2
P_emaildomain
R_emaildomain
M1 - M9

Categorical Features - Identity

DeviceType
DeviceInfo
id_12 - id_38

The TransactionDT feature is a timedelta from a given reference datetime (not an actual timestamp).

You can read more about the data from this post by the competition host.

Data

Download from here to ieee-fraud-detection/.

train_{transaction, identity}.csv - the training set
test_{transaction, identity}.csv - the test set (you must predict the isFraud value for these observations)
sample_submission.csv - a sample submission file in the correct format

Files:

requirements.txt contains packages used this work.
Fraud_Detection_EDA contains data exploring and feature engineering ideas with plots.
Fraud_Detection_reduced_fts contains feature selection work based on correlation between features.
Fraud_Detection_Model contains data preprocessing, feature engineering, modeling and prediction.

Result

The baseline model with simple data preprocessing and modeling with RandomForest Classifier gives an AUC score 0.8783.
My work with Lightgbm improved it by 11.1% and got an AUC score 0.97552.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Fraud_Detection_EDA.ipynb		Fraud_Detection_EDA.ipynb
Fraud_Detection_Model.ipynb		Fraud_Detection_Model.ipynb
Fraud_Detection_reduced_fts.ipynb		Fraud_Detection_reduced_fts.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fraud_Transaction_Detection

Models used:

Competition overview:

Files:

Result

About

Releases

Packages

Contributors 2

Languages

lee-junseok/Fraud_Transaction_Detection

Folders and files

Latest commit

History

Repository files navigation

Fraud_Transaction_Detection

Models used:

Competition overview:

Files:

Result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages