Skip to content

Latest commit

 

History

History
5 lines (4 loc) · 261 Bytes

README.md

File metadata and controls

5 lines (4 loc) · 261 Bytes

String Categorical Encoders

Scripts for paper "Encoding high-cardinality string categorical variables"

column_encoders.py contains the class ColumnEncoder, a wrapper for the min-hash encoder, the gamma-Poisson factorization and other categorical encoders.