This article contains a list of project ideas, which can be used for projects and getting hands-on experience in Natural Language Processing. While "Hello World" problems helps in quick onboarding, the following 10 "Real World" problems should make you feel more comfortable solving NLP problems in the future. Each idea includes a link to a freely available public dataset, as well as suggested algorithm to solve the problem.
1. Tagging of Stack Overflow Questions
Problem: Train a machine learning model to predict tags for StackOverflow questions. This is a classic multi-label text classification problem, i.e. each question can have multiple tags associated with it.
Suggested Algorithm: Labeled LDA
Dataset: You can use any one or both of the following datasets