-
Aadhaar Data and Relational Databases
-
Advice for Applying PCA
-
Advice for Data Scientist and Recap
-
Advice, Recap & Conclusion
-
Algorithm
-
Analyzing Data
-
Anomaly Detection vs Supervised Learning
-
APIs
-
Auditing the data
-
Autonomous Driving (Examples)
-
Backpropagation Algorithm
-
Backpropagation Intuition
-
Bayesian Inference
-
Boosting
-
Boosting, Post-decessor
-
Boosting, Pre-decesor
-
Boxplot, histogram, visualization in R sysntax
-
Ceiling Analysis: What Part of the Pipeline to Work on Next
-
Choosing the Number of Principal Components
-
Choosing what features to use
-
Classification vs Regression
-
Cleaning the data
-
Coeffecient Determination
-
Conclusion and Project for Data Wrangling
-
Conclusion, other advice, assignment and Recap
-
Content-based Recommendation
-
Cost Function
-
Counting Words
-
CSV
-
CSV Wrangling
-
Data for Machine Learning
-
Data Formats
-
Data Types and Scales
-
Data Wrangling, Analyze Messy Data and Nick's Experiences
-
Database Schema
-
Decision Trees
-
Definition
-
Developing and evaluating an anomaly detection system
-
Error Analysis
-
Error metrics for Skewed Classes
-
Features And Polynomial Regression
-
Final Project
-
Fundamentals
-
Gaussian Distribution
-
Getting Lots of Data and Artificial Data Synthesis
-
Gradient Checking
-
Gradient Descent
-
How to handle missing data
-
ID3
-
Implementation detail: Mean Normalization
-
Implementation note: Unrolling parameters
-
Infinite Hypothesis Spaces
-
Instance Based Learning and Others
-
Intro
-
Intro Map Reduce
-
Intro to Data Visualization
-
Intro to EDA
-
Introducing Don and Rishiraj, advice on Communicate Findings
-
Introduction and Statistic(Rigor)
-
Introduction of Data Wrangling
-
Introduction to pandas and Numpy
-
Joint Distribution
-
Joshua intro and advice, addition, Recap and Conclusion
-
JSON Wrangling
-
K-means algorithm
-
Kernels I
-
Kernels II
-
kNN
-
Large Margin Intuition
-
Learning Theory
-
Learning With Large Datasets
-
Linear Regression, Gradient Descent, Cost Function
-
Machine Learning
-
Map-reduce and data-parallelism
-
Mini Batch Gradient Descent
-
Model Representation
-
More Map Reduce
-
Motivation I : Data Compression
-
Motivation II : Data Visualization
-
Multiple Variables
-
Neat Tricks
-
Neural Networks & Perceptron
-
Non-parametric test
-
Normal Equation
-
Online Learning
-
Openstreetmap Data
-
Optimization Objective
-
Pac Learning
-
Pandas and Dataframes
-
Polynomial Regression
-
Principal Component Analysis Algorithm
-
Principal Component Analysis problem formulation
-
Prioritizing What to Work On
-
Problem Description and Pipeline
-
Problem Formulation
-
Problem Motivation
-
Problem Set 2
-
Project Intro for Titanic
-
Putting it together
-
Queries
-
R Basics
-
Random Initialization
-
Reconstruction from compressed representation
-
Regression
-
Sanity Check for Missing Values
-
Scraping from Web
-
Server Computers (AD Ex.)
-
Sliding Windows
-
Stochastic Gradient Descent
-
Stochastic Gradient Descent Convergence
-
Summary
-
Support Vector Machines
-
t-test
-
Tools for Neural Networks & Others
-
Trading of Precision & Recall
-
Unsupervised Learning: Introduction
-
Using SVM
-
Vectorization: Low Rank Matrix Factorization
-
Visual Encodings
-
Visualizing Time Series Data
-
XML Parsing