Back to blogs

Tutorial

AITA for making this? A public dataset of Reddit posts about moral dilemmas
Releasing an open natural language dataset based on r/AmItheAsshole.
  • Elle O'Brien
  • Feb 17, 20208 min read
Best practices of orchestrating Python and R code in ML projects
What is the best way to integrate R and Python languages in one data science project? What are the best practices?
  • Marija Ilić
  • Sep 26, 20176 min read
ML Model Ensembling with Fast Iterations
Here we'll talk about tools that help tackling common technical challenges of building pipelines for the ensemble learning.
  • George Vyshnya
  • Aug 23, 20178 min read
R code and reproducible model development with DVC
There are a lot of example on how to use Data Version Control (DVC) with a Python project. In this document I would like to see how it can be used with a project in R.
  • Marija Ilić
  • Jul 24, 20179 min read
How Data Scientists Can Improve Their Productivity
Data science and machine learning are iterative processes. It is never possible to successfully complete a data science project in a single pass.
  • Dmitry Petrov
  • May 15, 20174 min read