Skip to content

YASHK-arch/VISION_RUSH-Deepfake-Detection-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

13 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽญ Deepfake Video Detection MVP (Self-Trained ViT)

This repository contains a self-trained deepfake video detection system built using a Vision Transformer (ViT-B/14 with DINOv2 backbone).
The system analyzes videos temporally, identifies manipulated segments, and outputs timestamp-localized deepfake regions, with an emphasis on localization quality over raw classification accuracy.


๐Ÿš€ Key Features

  • โœ… Self-trained deepfake model (not a prebuilt classifier)
  • ๐ŸŽฏ Vision Transformer (ViT-B/14, DINOv2)
  • โฑ๏ธ Timestamp localization of manipulated segments
  • ๐Ÿ“Š Median smoothing + temporal segment merging
  • ๐ŸŽž๏ธ Video-level and segment-level confidence scores
  • ๐Ÿ–ฅ๏ธ Interactive Streamlit web interface
  • โšก Efficient inference via 2 FPS frame sampling

๐Ÿง  Model Overview

Component Description
Backbone vit_base_patch14_dinov2
Framework PyTorch + TIMM
Input Resolution 518 ร— 518
Output Binary classification (Real / Fake)
Weights Self-trained (df_detector_mvp.pth)

Each frame produces a single logit, converted to a probability using a sigmoid function.

About

self-trained deepfake video detection system built using a Vision Transformer (ViT-B/14 with DINOv2 backbone).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages