Recently I wondered: is there a dataset containing a variety of data science interview questions and answers? I couldn't find one, so I decided to create my own! 🥳
I spent several days gathering over 300 data science interview questions and answers, and ended up with a dataset large enough to explore. To be honest, at the beginning I thought it would be really difficult to cover most types of data science questions, so I decided to gather only the non-coding ones.
I read hundreds of data science questions from sources such as:

- Simplilearn
- Springboard
- Towards Data Science
- Edureka
- Analytics Vidhya
- Other GitHub repos

Surprisingly, there turned out to be fewer distinct interview questions than I expected. After careful data selection and categorization, I completed an NLP project on the dataset. I hope you enjoy reading it!

| EDA - Questions | EDA - Answers |
|---|---|
| ![]() | ![]() |

| Method - Questions | Method - Answers |
|---|---|
| ![]() | ![]() |

| Model - Questions | Model - Answers |
|---|---|
| ![]() | ![]() |

| Statistics - Questions | Statistics - Answers |
|---|---|
| ![]() | ![]() |

| MultinomialNB | Random Forest | Decision Trees |
|---|---|---|
| ![]() | ![]() | ![]() |
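
This README does not show how the three models in the table above were trained, so the following is only a minimal sketch of one way such a comparison could be set up. It assumes scikit-learn, TF-IDF features, and the four category names used in the figures as labels. The toy questions, the `fit_classifiers` helper, and all parameter values are hypothetical and are not taken from the actual dataset or project code.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data, invented for illustration only.
# The real dataset has 300+ question/answer pairs.
questions = [
    "How would you handle missing values in a dataset?",
    "What is the difference between a histogram and a box plot?",
    "What is cross-validation and why is it used?",
    "How do you deal with an imbalanced dataset?",
    "How does a random forest differ from a decision tree?",
    "What is regularization in linear regression?",
    "What is a p-value?",
    "Explain the central limit theorem.",
]
labels = [
    "EDA", "EDA",
    "Method", "Method",
    "Model", "Model",
    "Statistics", "Statistics",
]


def fit_classifiers(texts, targets, seed=0):
    """Fit one TF-IDF + classifier pipeline per model name (assumed setup)."""
    models = {
        "MultinomialNB": MultinomialNB(),
        "Random Forest": RandomForestClassifier(n_estimators=100, random_state=seed),
        "Decision Trees": DecisionTreeClassifier(random_state=seed),
    }
    fitted = {}
    for name, clf in models.items():
        # TF-IDF values are non-negative, which MultinomialNB requires.
        pipe = make_pipeline(TfidfVectorizer(stop_words="english"), clf)
        pipe.fit(texts, targets)
        fitted[name] = pipe
    return fitted


if __name__ == "__main__":
    fitted = fit_classifiers(questions, labels)
    new_question = ["How do you detect outliers in a dataset?"]
    for name, pipe in fitted.items():
        print(name, "->", pipe.predict(new_question)[0])
```

With only eight toy examples the predictions are not meaningful. On the full dataset, a held-out split or cross-validation would be needed before comparing the three models.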

References:

- https://www.simplilearn.com/tutorials/data-science-tutorial/data-science-interview-questions
- https://www.edureka.co/blog/interview-questions/data-science-interview-questions/
- https://www.springboard.com/blog/data-science/data-science-interview-questions/
- https://towardsdatascience.com/over-100-data-scientist-interview-questions-and-answers-c5a66186769a
- https://towardsdatascience.com/120-data-scientist-interview-questions-and-answers-you-should-know-in-2021-b2faf7de8f3e
- https://github.com/alexeygrigorev/data-science-interviews
- https://github.com/kojino/120-Data-Science-Interview-Questions
- https://github.com/khanhnamle1994/cracking-the-data-science-interview