Skip to content

TrevorW-code/fraud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

fraud

Pronunciation: /frɔːd/ (FRAWD)

Simplified Synthetic Data

fraud is a python package designed to streamline synthetic data for finetuning machine learning models.

When finetuning for a domain specific task (i.e. extracting medical using NER), data scarcity can quickly become a limiting factor. Data annotation is the ideal solution; however it is often expensive, time-consuming, and resource-intensive.

Synthetic data offers an effective middle ground, enabling models to significantly enhance their performance by supplementing smaller datasets.

Usage

Here's a basic example to get you started.

import fraud as fr

synthetic_samples = fr.from_template('Could you please meet {name} at {time}', 20)

About

Synthetic Data Generator for ML Applications

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published