I get asked fairly often, usually by science PhD students, about transitioning from academia to data science. Here's a thread of the things I say most often.

Caveats: I finished my PhD. Also it's from Harvard. I don't know how many doors opened more bc of the fancy school than anything else, but I assume it's some % of them.
Also: white, cis, not obviously queer or disabled. YMMV.
I spent ~3 yrs of my PhD casually preparing.
1. Use your projects to build experience (remember you'll talk about them in interviews). Try to add ONE new DS skill each time. Examples:
- git
- basics of bash scripting / command line
- function-writing best practices
- pandas and numpy
- drake (reproducible workflows in R)
2. Python is more common for DS jobs, but plenty of R jobs exist too. If you know R, you can learn Python.
I used MIT 6.001x (free!) on edX to teach myself Python one summer (comes with bonus computer science basics); there are many other options. https://www.edx.org/course/introduction-to-computer-science-and-programming-7
3. Choosing a course on something valuable (getting/cleaning data, machine learning, git, SQL, linear algebra) and completing it is INFINITELY better than bookmarking endless unread tutorials and trying to figure out the perfect resource to learn from. (This is a self own.)
4. If you can, do an internship. There are internships for grad students, you can probably convince your advisors to let you, and 100% that internship should be paid. A+ way to get a foot in the door before graduating.
Happy to talk about my experience more if helpful.
5. Twitter is great! The DS community here is lovely, especially #rstats folks. I don't usually do follow-Friday style tweets but I do have a set of people I recommend to folks new to Twitter. Is this something people already on here would be interested in?
6a. Speaking of Twitter, it's useful for something I don't see people talk about often: learning the vibe of the data science community. How people talk, vocabulary, interesting topics, perennial debates, in-jokes. Don't underestimate the value of "sounding" like an insider.
6b. It's like learning the unwritten rules of academia, which Twitter is ALSO useful for. The contextual knowledge that people don't know they know, so they usually won't think to share it. It can help you go from outsider with nose pressed up against the glass, to inside.
7. The thing you knew I was going to say: portfolios. Pick a thing you're interested in, get some data, clean it, analyze it, visualize, explain the results, put it online.
More detail: this reddit thread and this podcast.
https://amp.reddit.com/r/datascience/comments/gf9hrs/what_makes_a_good_personal_project_from_the/ https://twitter.com/robinson_es/status/1319366753898692609
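For a sense of scale, here's a minimal sketch of the get / clean / analyze / visualize loop from point 7, assuming Python. It sticks to the standard library so it runs anywhere; in a real portfolio project you'd more likely reach for pandas and a plotting library. The bird data and every function name here are made up for illustration.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Made-up sample data standing in for whatever dataset you pick.
RAW_CSV = """species,mass_g
finch,18.5
finch,
sparrow,24.0
Finch ,21.5
sparrow,n/a
sparrow,26.0
"""


def load_rows(text):
    """Get the data: parse CSV text into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_rows(rows):
    """Clean: normalize species labels, drop rows with a missing or non-numeric mass."""
    cleaned = []
    for row in rows:
        try:
            mass = float(row["mass_g"])
        except (TypeError, ValueError):
            continue  # skip rows we cannot interpret
        cleaned.append({"species": row["species"].strip().lower(), "mass_g": mass})
    return cleaned


def mean_mass_by_species(rows):
    """Analyze: average mass per species."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["species"]].append(row["mass_g"])
    return {species: mean(values) for species, values in groups.items()}


def text_bar_chart(summary):
    """Visualize: one line of '#' characters per species, one '#' per gram (rounded)."""
    lines = []
    for species, value in sorted(summary.items()):
        lines.append(f"{species:<8} {'#' * round(value)} {value:.1f}")
    return "\n".join(lines)


summary = mean_mass_by_species(clean_rows(load_rows(RAW_CSV)))
print(text_bar_chart(summary))
```

The "explain the results" and "put it online" parts are the write-up around code like this: a README or short post saying what question you asked, what you dropped while cleaning and why, and what the numbers mean.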
8. Insight fellowships: I've heard mixed reviews. Can be helpful if you're not getting bites.
They teach you how to sound like a data scientist & polish your academic experience to make it relevant for data science positions. Also, you get a portfolio project out of it.
9. Avoid DataCamp for learning.
Did you know they recently reinstated this CEO with basically no changes or consequences? (cw: sexual assault)
https://www.buzzfeednews.com/article/daveyalba/datacamp-sexual-harassment-metoo-tech-startup
Instead, I recommend Coursera or edX!

10. That concludes my most-repeated pieces of advice when people ask me! I hope it's useful to more people than just the ones who have sent me emails or DMs. I'm happy to expand on any of this, or answer other questions.