This work describes investigations and results obtained using different gradient boosting algorithms, feature engineering techniques, and other machine learning techniques applied to a collection of Deep Learning based Synthetic Data Generators for single table data. Specifically, experiments shown were performed on tabular data provided by the Kaggle's 30 Days of ML Competition. Our method of Ordinal Encoding, boosting, hyperparameter optimization, ensembling, and stacking led us to reach 6th place in the competition out of 7573 teams worldwide among about 113,500 initial participants attending the connected course of Machine Learning.
Machine learning models and techniques applied to CTGAN-generated data
Martinelli M
2021
Abstract
This work describes investigations and results obtained using different gradient boosting algorithms, feature engineering techniques, and other machine learning techniques applied to a collection of Deep Learning based Synthetic Data Generators for single table data. Specifically, experiments shown were performed on tabular data provided by the Kaggle's 30 Days of ML Competition. Our method of Ordinal Encoding, boosting, hyperparameter optimization, ensembling, and stacking led us to reach 6th place in the competition out of 7573 teams worldwide among about 113,500 initial participants attending the connected course of Machine Learning.File | Dimensione | Formato | |
---|---|---|---|
prod_459282-doc_178915.pdf
non disponibili
Descrizione: Machine learning models and techniques applied to CTGAN-generated data
Dimensione
401.27 kB
Formato
Adobe PDF
|
401.27 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.