Fundamentals of Machine Learning in Data Science
Overview
- Programming language review for data analytics
- Basic syntax, variables, control flow, loops, install and import libraries for data processing such as SciPy, NumPy, Pandas, Sci-Kit Learn, TensorFlow or other similar libraries and packages
- Data And Features: using libraries such as NumPy and Pandas
- Represent data using lists, arrays for structured data
- Work with data frames using packages such as Pandas to represent diverse data
- Use Control Flow for filtering data and performing filtered computations
- Manipulate Data using functions and packages to process the data and perform computations
- Understand, determine and represent Features
- Perform Data Wrangling
- Exploring Data: - using libraries such as Matplotlib
- Visualize Data by creating plots using tools such as Matplotlib
- Perform high dimensionality visualizations
- Transforming Data: using libraries such as Sci-Kit learn
- Create Data Transformers and apply dimensionality reducing techniques as PCA
- Data Modeling: using libraries such as Keras, TensorFlow and scikit-learn
- Use machine learning techniques such as clustering, supervised learning, K-nearest neighbours, Regression to model the data
- Evaluating Data: Evaluate modeled data using evaluation techniques
- Create and apply confusion matrices
- Perform cross-validation using scoring metrics
- Implement and apply power tuning and pipelining to evaluate the data
Lecture, seminars, demonstrations, and hands-on exercises/projects
The course evaluation is consistent with the Douglas College Evaluation Policy.
Labs | 0 - 10% |
Assignments | 0 - 10% |
Projects | 15% - 30% |
Midterm Exam* | 30% - 40% |
Final Exam* | 30% - 40% |
Total | 100% |
* Practical hands-on computer exam
In order to pass the course, students must, in addition to receiving an overall course grade of 50%, also achieve a grade of at least 50% on the combined weighted examination components (including quizzes, tests, exams).
Students may conduct research as part of their coursework in this class. Instructors for the course are responsible for ensuring that student research projects comply with College policies on ethical conduct for research involving humans, which can require obtaining Informed Consent from participants and getting the approval of the Douglas College Research Ethics Board prior to conducting the research.
At the end of this course, the successful student will be able to:
- Use appropriate tools and libraries needed for data science.
- Explain the concepts of data processing and feature engineering.
- Conduct data wrangling for further machine learning processing.
- Apply statistisc and visulization techniques for data exploration.
- Employ suitable structures to transform data for modeling.
- Utilize various machine learning models and algorithms for different applications.
- Compare different models using appropriate evaluation metrics and strategies.
Custom courseware, class notes provided by the instructor, and online resources or other textbooks as approved by the department
Requisites
Course Guidelines
Course Guidelines for previous years are viewable by selecting the version desired. If you took this course and do not see a listing for the starting semester / year of the course, consider the previous version as the applicable version.
Course Transfers
These are for current course guidelines only. For a full list of archived courses please see https://www.bctransferguide.ca
Institution | Transfer Details for CSIS 3290 |
---|---|
Athabasca University (AU) | AU COMP 3XX (3) |
College of New Caledonia (CNC) | CNC CSC 2XX (3) |
Kwantlen Polytechnic University (KPU) | No credit |
Simon Fraser University (SFU) | No credit |
Thompson Rivers University (TRU) | TRU COMP 3XXX (3) |
University Canada West (UCW) | UCW CMPT 3XX (3) |
University of Northern BC (UNBC) | UNBC CPSC 3XX (3) |
University of the Fraser Valley (UFV) | UFV COMP 381 (3) |
Course Offerings
Winter 2025
CRN | Days | Instructor | Status | More details |
---|---|---|---|---|
CRN
17149
|
Mon | Instructor Last Name
Ma
Instructor First Name
Howard
|
Course Status
Open
|
CSIS 3290 050 - This section is restricted to students in PDD Data & Analytics, PBD Computer & Information Systems - Data Analytics stream, Computing Studies & Information Systems, and PDD Information & Communication Technology students. Students will NOT receive credit for both CSIS 3190 and CSIS 3290.
CRN | Days | Instructor | Status | More details |
---|---|---|---|---|
CRN
17370
|
Thu | Instructor Last Name
Ma
Instructor First Name
Howard
|
Course Status
Open
|
CSIS 3290 050 - This section is restricted to students in PDD Data & Analytics, PBD Computer & Information Systems - Data Analytics stream, Computing Studies & Information Systems, and PDD Information & Communication Technology students. Students will NOT receive credit for both CSIS 3190 and CSIS 3290.