May '21 Heartbeat

This month you will find: 🚀 info on our growing team, 📖 MLOps learning opportunities, 🔃 tutorials with integrations, 🎥 conference videos from our team members, 💥 Discord server growth, and more!

  • Jeny De Figueiredo
  • May 21, 2021 • 9 min read

From the Community

It's been another month full of community goodness and we are grateful! Let's get right to it!

Thank you

Curvenote with DVC tutorials

Interested in versioning your data AND your notebooks?
Steve Purves, CTO and co-founder of Curvenote, gave a three-part tutorial on integrating DVC and Curvenote to create reproducible, collaborative version control for data scientists. The videos are accessible to beginners and include tips for intermediate Git users. Access the videos here.

DVC and Curvenote for the version control win!

CML with Jenkins in DAGsHub

Next up, Puneeth Pai of Thoughtworks wrote a two-part blog series with a how-to for achieving continuous machine learning using DVC pipelines with Jenkins and DAGsHub. Quoted in the article is our own David Ortega:

Treating experiments like potential new features in a software project opens up many possibilities for improving our engineering practices.

Check out these posts at the link below, or catch Puneeth at our next Meetup, where he will give a high-level overview of this content and answer questions.

CML with Jenkins in DAGsHub

The first of a two-part series on how to set up continuous machine learning using DVC pipelines with Jenkins and DAGsHub.

Discord Server Explosion

Our Discord server has exploded since last month, up 30% in membership 😱, thanks in large part to a Towards Data Science post from Sara Metwalli recommending 9 Discord Servers for Math, Python, and Data Science You Need to Join Today.

Sara encourages readers to connect, learn, and get inspired. 🚀 Thanks, Sara! We're on board with that! Rest assured our growing team is hard at work creating content, improving tools, and working on new tools 😶🤗 to continue to grow and serve our MLOps community!

In Other MLOps News…

Learning Opportunities

Andrew Ng of DeepLearning.AI and Coursera fame has just released a new specialization in MLOps, called Machine Learning Engineering for Production (MLOps) Specialization. The course "combines the foundational concepts of machine learning with the functional expertise of modern software development and engineering roles." Methodologies and capabilities of MLOps are introduced while addressing the challenges and consequences of machine learning engineering in production. I'm signed up! 🙋🏻‍♀️ How 'bout you?

Machine Learning Engineering for Production (MLOps) Specialization

Andrew Ng's new course on Coursera, providing the foundation for successful and efficient MLOps

Next for your learning pleasure, Simone Scardapane is in the process of fulfilling a "personal challenge" to create a PhD course for Reproducible Deep Learning that includes the use of open source tools including our own DVC! Head to the link to star the repo and cheer him on. We will be! 🙌🏼

Reproducible Deep Learning PhD Course

Simone Scardapane is in the process of fulfilling a "personal challenge" to create a PhD course for Reproducible Deep Learning that includes the use of open source tools including our own DVC! Head to the link to star the repo and cheer him on. We will be! 🙌🏼

You see what I did there, right? Reproducible… Deep Learning…
Get it? Layers of wit, people. I learned from the best! Just wanted to make sure you were paying attention!

DVC News

We've hit 30 team members! Our team is distributed all over the world and has grown so much that we now have two all-hands meetings! Affectionately called UTC + and UTC -, these meetings honor all our different time zones while allowing each group to watch the other's recording when they are awake! You know we're all about solving complicated problems. 💪🏼

Our team

New Team Members

Svetlana Sachkovskaya is originally from Belarus and currently lives in Poland. She has been a full stack developer for over seven years. She loves traveling and meeting new people, and is excited to work on open source software. In her spare time you may find her dancing the tango! 💃🏻 Welcome, Sveta!

Exemplifying our diverse team in one fell swoop, Casper da Costa-Luis has lived on three continents. He has been working on DVC for a couple of years and is a long-standing contributor to open source. He now joins us on the CML & Docs teams after completing his PhD in Medical Imaging. Fun facts about Casper include his becoming the U18 chess champion of Kenya when he was 14 and being a qualified SCUBA diver. 🤿

Emre Şahin joins us on the DVC team as a technical writer/ML enthusiast/AI dreamer/tutorial builder from Istanbul, Turkey. A self-described zealot for technologies, Emre has worked on many development- and ML-related projects and has been programming in Python since v. 1.7. We are excited for Emre to bring you excellent technical content! ✍🏼

Tapa Dipti Sitaula joins us as a Senior Product Engineer from Nepal. She previously worked as a Principal Engineer at a tech startup in India and has worked in various capacities in her career, from engineering to project management and communications. Her interests include learning languages and breaking gender stereotypes. We're right there with you, Tapa! 🚀

Open Positions

And we're still hiring!

Check out our three open roles for:

Does this sound like you or someone you know? Be in touch!

DVC Team Conference Talks

ML Repa Week took place last month and team members gave three great talks. Dmitry Petrov gave a talk on data versioning and machine learning experiments on top of Git. Elle O'Brien gave a talk on automating machine learning with GitHub Actions and GitLab CI. And finally, Mikhail Rozhkov gave a talk on setting up the workflow for machine learning batch scoring applications using DVC, MLflow and Airflow. Be sure to check out all three talks and the other great talks from the week-long conference.

DVC: Data Versioning and ML Experiments on Top of Git

Dmitry Petrov's talk at ML Repa Week on using DVC as an extension of Git for data versioning and machine learning experiments

Automating Machine Learning with GitHub Actions & GitLab CI

Elle O'Brien's conference talk about how to use GitHub Actions or GitLab CI to provide automation for your machine learning projects

Workflow & MLOps for Batch Scoring Applications with DVC, MLflow and Airflow

Mikhail Rozhkov's talk on how to set up a workflow for batch scoring applications integrating DVC, MLflow and Airflow

Next Meetup

Don't miss our Meetup on May 27th at 3:00pm UTC, where we will hear from Puneeth Pai as mentioned above 👆🏽, as well as another user putting DVC and CML into action on his team, and finally from David Ortega discussing CML pull requests! Bring your questions! We're here to help!

Tweet Love ❤️

That's quite a shout-out! Thanks to Jorge Orpinel and team for always raising the bar on our docs! Until next month! 👩🏽‍💻


Do you have any use case questions or need support? Join us in Discord!

Head to the DVC Forum to discuss your ideas and best practices.
