Published inTowards Data ScienceStop using 0.5 as the threshold for your binary classifierLearn how to set the optimal threshold for your Machine Learning model.Nov 29, 2022534Nov 29, 2022534
Published inTowards Data ScienceCan I Trust My Model’s Probabilities? A Deep Dive into Probability CalibrationA practical guide on probability calibrationNov 10, 202218Nov 10, 202218
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Parallelizing Experiments (Part III)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareNov 1, 2022175Nov 1, 2022175
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Running containerized experiments (Part II)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 26, 202210Oct 26, 202210
Published inTowards Data ScienceDeploying a Data Science Platform on AWS: Setting Up AWS Batch (Part I)A step-by-step guide to deploy a Data Science platform on AWS with open-source softwareOct 7, 202245Oct 7, 202245
Tips and Tricks to Use Jupyter Notebooks EffectivelyA few things to make you 10x more productive with Jupyter.Aug 8, 202227Aug 8, 202227
Published inTowards Data ScienceIntroducing Snapshot Testing for Jupyter Notebooksnbsnapshot is an open-source package that benchmarks notebook’s outputs to detect issues automatically.Jul 5, 2022331Jul 5, 2022331
Published inTowards Data ScienceFrom Jupyter to Kubernetes: Refactoring and Deploying Notebooks Using Open-Source ToolsA step-by-step guide to going from a messy notebook to a pipeline running in KubernetesJun 23, 202261Jun 23, 202261
Published inTowards Data ScienceAnalyze and plot 5.5M records in 20s with BigQuery and PloomberDevelop scalable pipelines on Google Cloud using open-source software.May 23, 202274May 23, 202274
Published inTowards Data ScienceA Gentle Introduction to Open-Source ContributionsA step-by-step guide for contributing to an open-source projectMay 10, 2022351May 10, 2022351