Program Modules
TERM 1: Foundations of Data Science
Description: Introduces data science concepts, methodologies, industry applications, and the data-driven decision-making process.
Modules:
- Introduction to Data Science - What is Data Science?, Lifecycle, Data-driven organizations
- Data Types and Sources - Structured, Semi-structured, Unstructured data
- Data Science Ecosystem - Data Analyst, Data Scientist, Data Engineer, ML Engineer
- Data Science Applications - Finance, Healthcare, Retail, Government, Cybersecurity
- Emerging Trends - Big Data, AI-driven Analytics, Generative AI in Analytics
Labs: Data exploration exercises, Industry case studies
Modules:
- Python Fundamentals - Variables, Data Types, Operators
- Program Flow - Conditions, Loops, Functions
- Data Structures - Lists, Dictionaries, Sets, Tuples
- Object-Oriented Programming - Classes, Objects, Inheritance
- Data Science Libraries - NumPy, Pandas, Matplotlib
- Working with Files - CSV, JSON, Excel
Labs: Python coding exercises, Data manipulation projects
Modules:
- Descriptive Statistics - Mean, Median, Mode, Variance, Standard Deviation
- Probability - Probability distributions, Conditional probability
- Inferential Statistics - Sampling, Confidence intervals, Hypothesis testing
- Linear Algebra - Vectors, Matrices, Matrix operations
- Calculus Fundamentals - Derivatives, Optimization concepts
Labs: Statistical analysis exercises, Mathematical modeling
TERM 2: Data Management and Analytics
Modules:
- Database Fundamentals - Relational databases, Database design
- SQL Basics - SELECT, WHERE, ORDER BY, GROUP BY
- Advanced SQL - Joins, Subqueries, Views, Stored Procedures
- Data Warehousing - ETL processes, Data marts
- NoSQL Introduction - Document databases, Key-value stores
Labs: MySQL/PostgreSQL projects, Query optimization
Modules:
- Data Acquisition - APIs, Web data sources, Open datasets
- Data Cleaning - Missing values, Outlier detection, Duplicate records
- Data Transformation - Scaling, Encoding, Aggregation
- Feature Engineering - Feature selection, Feature extraction
- Data Quality Management
Labs: Real-world data cleaning projects
Modules:
- Visualization Principles - Chart selection, Visual perception
- Python Visualization - Matplotlib, Seaborn, Plotly
- Dashboard Development - Interactive dashboards, KPI monitoring
- Business Storytelling - Insight communication, Executive reporting
- Data Presentation Techniques
Labs: Dashboard projects, Executive reports
TERM 3: Analytics and Machine Learning
Modules:
- Data Profiling
- Correlation Analysis
- Pattern Discovery
- Trend Analysis
- Business Insights Generation
Labs: EDA projects using real datasets
Modules:
- Machine Learning Fundamentals - Supervised learning, Unsupervised learning
- Regression Models - Linear Regression, Multiple Regression
- Classification Models - Logistic Regression, Decision Trees, Random Forest
- Clustering - K-Means, Hierarchical Clustering
- Model Evaluation - Accuracy, Precision, Recall, ROC-AUC
- Model Optimization - Cross-validation, Hyperparameter tuning
Labs: Scikit-learn projects, Predictive analytics
Modules:
- Business Intelligence Concepts
- KPI Design
- Data Warehousing
- OLAP Analysis
- Executive Dashboards
- Decision Support Systems
Labs: BI reporting projects, Business case studies
TERM 4: Advanced Data Science Applications
Modules:
- Big Data Concepts
- Hadoop Ecosystem
- Distributed Processing
- Spark Fundamentals
- Data Lakes
- Cloud Data Platforms
Labs: Spark exercises, Big data processing
Modules:
- Data Ethics
- Privacy and Data Protection
- Data Governance Frameworks
- Data Quality Management
- Risk Management
- Regulatory Compliance
Labs: Governance assessments, Privacy impact analysis
Modules:
- Data Science in Finance
- Data Science in Healthcare
- Retail Analytics
- Marketing Analytics
- Cybersecurity Analytics
- Smart Cities Analytics
Labs: Industry-focused case studies, Applied analytics projects
Capstone Graduation Project (100 Hours)
Students must complete a full end-to-end data science project.
Project Phases:
- Phase 1: Problem Definition - Business requirements gathering, Project planning
- Phase 2: Data Collection - Dataset acquisition, Data validation
- Phase 3: Data Preparation - Cleaning, Transformation, Feature engineering
- Phase 4: Analysis and Modeling - Exploratory analysis, Machine learning
- Phase 5: Visualization and Reporting - Dashboard creation, Business recommendations
- Phase 6: Presentation and Defense