Lecture/Seminar: 4 hours per week
Lecture, seminars, demonstrations, and hands-on exercises/projects in the lab
Course Content:
1) Introduction to Natural Language Processing
2) NLP data representation
- Vector Space Model (One-hot encoding, Bag of Words, N-Grams, TF-IDF)
- POS Tagging
- Word Embedding
3) Text Categorization
- Naive Bayes Classifier, Logistic Regression, Support Vector Machine,
- Deep Learning Approaches such as CNN, LSTM, Pre-trained Models
4) Information Extraction
- Keyphrase Extraction, Name Entity Recognition,
- Relation Extraction
5) NLP Applications
- Chatbot
- Text Summarization
- Recommender System
- Machine Translation
- Question-answering System
- Review Analysis
- Sentiment Analysis
At the end of this course, successful students will be able to:
1) Demonstrate different NLP concepts like corpora, tokens, N-grams, grammar, etc.
2) Model different forms of NLP data using appropriate representation methods.
3) Apply suitable methods to solve different NLP problems including Part-of-speech (POS) tagging, chunking, Named-Entity recognition (NER), text categorization, etc.
4) Create a program for solving a particular NLP task.
5) Evaluate different NLP systems with appropriate metrics.
6) Apply deep learning methods to train NLP models.
7) Create NLP-related applications such as chatbot, sentiment analysis, recommender systems, etc.
Evaluation will be carried out in accordance with the Douglas College Evaluation Policy.
Labs/Assignments |
0-10% |
Project(s) |
15-25% |
Quizzes |
0 -10% |
Midterm Examination* |
30-40% |
Final Examination* |
35-40% |
Total |
100% |
Some of these assessments may involve group work.
* Practical hands-on computer exam
In order to pass the course, students must, in addition to receiving an overall course grade of 50%, also achieve a grade of at least 50% on the combined weighted examination components (including quizzes, tests, exams).
Students may conduct research as part of their coursework in this class. Instructors for the course are responsible for ensuring that student research projects comply with College policies on ethical conduct for research involving humans, which can require obtaining Informed Consent from participants and getting the approval of the Douglas College Research Ethics Board prior to conducting the research.
Practical Natural Language Processing by Sowmya Vajjala, Bodhisattwa Majumder, Anuj Gupta, Harshit Surana, latest edition, O'Reilly Media, Inc.
Natural Language Processing with PyTorch by Delip Rao, Brian McMahan, latest edition, O'Reilly Media, Inc.
Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit by Steven Bird, Ewan Klein, and Edward Loper (https://www.nltk.org/book/)
or other textbooks as approved by the department
CSIS 1175 (minimum grade C)