About Courses

About Courses

Organizations today operate in a world surrounded by data and data that is understood and analyzed smartly can play a pivotal role in determining the success of many businesses. Data Science, through its various inherent tools and techniques, is successfully adding value to all the business models by using statistics and deep learning to make better, relevant and timely decisions. Understanding the dynamics of data and knowing how to deal with data has therefore become a critical skill in an organizational context. No wonder that Harvard Business Review has termed Data Scientist as the Sexiest Job of the 21st Century!

The objective of this course is to introduce participants to the intricacies of data science and techniques of machine learning. This course will expose participants to hands on experience of popular and in-demand tools in the BDA and ML area and has been designed with an intention to impart practical problem solving skills to participants which in turn will enhance prospects of career growth in this sunrise domain.

Course-1

PG Certificate Program in Machine Learning and Big Data Analytics from CSTCP-IIIT Allahabad, Prayagraj

MODULE 1 – FUNDAMENTALS OF PYTHON

Data scientists must know how to code – start by learning the fundamentals of one of the most popular programming languages – Python.

  1. Basics of Python
  2. Conditional and Loops
  3. String and List Objects
  4. Functions & OOPs Concepts
  5. Exception Handling
  6. Database Programming
MODULE 2 – DATA WRANGLING

Once you have the core skill of programming covered– dip your feet in the nitty – gritties of working with data by learning how to wrangle and visualize them.

  1. Reading CSV, JSON, XML and HTML files using Python
  2. NumPy & Pandas
  3. Relational Databases and Data Manipulation with SQL
  4. Scipy Libraries
  5. Loading, Cleaning, Transforming, Merging, and Reshaping Data
MODULE 3 – STATISTICS AND PROBABILITY

It is impossible to use data without knowledge of statistics. Collect, organize, analyze, interpret, and present data using these concepts of statistics.

  1. Descriptive Statistics & Data Distributions
  2. Probability Concepts and Set Theory
  3. Probability Mass Functions
  4. Probability Distribution Functions
  5. Cumulative Distribution Functions
  6. Modeling Distributions
  7. Inferential Statistics
  8. Estimation
  9. Hypothesis Testing
  10. Implementation of Statistical Concepts in Python
MODULE 4 – MACHINE LEARNING MODELS IN PYTHON

Machines have increased the ability to interpret large volumes of complex data. Combine aspects of computer science with statistics to formulate algorithms that help machines draw insights from structured and unstructured data.

  1. Building Models Using Below Algorithms
  2. Linear and Logistics Regression
  3. Decision Trees
  4. Support Vector Machines (SVMs)
  5. Random Forests
  6. XGBoost
  7. K Nearest Neighbour & Hierarchical Clustering
  8. Principal Component Analysis
  9. Text Analytics and Time Series Forecasting
MODULE 5 – DATA VISUALIZATION USING MATPLOTLIB

Complex data sets call for simple representations that are easy to follow. Visualize and communicate key insights derived from data effectively by using tools like Matplotlib.

  1. Interactive Visualizations with Matplotlib
MODULE 6 – DEEP LEARNING USING TENSORFLOW

Go beyond superficial analysis of data by learning how to interpret them deeply. Use deep-learning nets to uncover hidden structures in even unlabeled and unstructured data using TensorFlow.

  1. Basics of Neural Network
  2. Linear Algebra
  3. Implementation of Neural Network in Vanilla
  4. Basics of TensorFlow
  5. Convolutional Neural Networks (CNNs)
  6. Recurrent Neural Networks (RNNs)
  7. Generative Models
  8. Semi-supervised Learning using GAN
  9. Seq-to-seq Model
  10. Encoder and Decode

End of Course-1