@riyadparvez
Last active November 16, 2018 20:34

Requirements


Topics Covered


  • Data Exploration & Validation
    • Statistical Properties
      • Mean, variance, etc.
      • Correlation
      • Entropy
      • Mutual information, pointwise mutual information
    • Missing values
    • Outlier Detection
    • Feature imputation
    • Fairness in Machine Learning
  • Feature Engineering
    • Feature scaling
    • Feature normalization
    • Feature standardization
    • Feature encoding
      • One-hot encoding (and why it is needed)
    • Feature selection
      • Feature correlation: whether two features are correlated
      • Mutual information
      • Chi-squared (χ²) test
      • Dimensionality reduction - Principal Component Analysis (PCA)
    • Feature Importance
  • Algorithms
    • Linear Classification
    • Linear Regression
  • Validation
    • k-fold cross-validation
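
As a rough illustration of a few of the topics listed above (entropy, mutual information, standardization, min-max normalization, one-hot encoding, and k-fold splitting), here is a minimal sketch in plain Python. All function names are hypothetical and chosen only for this example. It assumes discrete values for the information-theoretic measures and non-constant numeric input for the scaling functions. In practice a library such as scikit-learn would normally be used instead.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y) for two paired discrete sequences."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))


def standardize(values):
    """Rescale to zero mean and unit (population) standard deviation."""
    mean = sum(values) / len(values)
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return [(v - mean) / std for v in values]


def min_max_scale(values):
    """Rescale linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def one_hot(categories):
    """Encode each category as a 0/1 vector over the sorted set of categories.

    This avoids implying an order between categories, which a plain
    integer code (red=0, green=1, blue=2) would do.
    """
    vocab = sorted(set(categories))
    return [[1 if c == v else 0 for v in vocab] for c in categories]


def k_fold_indices(n, k):
    """Yield (train, test) index lists for k contiguous, unshuffled folds."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        yield train, test
        start += size
```

For example, `list(k_fold_indices(5, 2))` produces two folds in which every sample appears in exactly one test set. Real data would usually be shuffled before splitting, which this sketch leaves out.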

Dataset

Other Useful Resources

Data Exploration & Visualization Tools

Interpretable Machine Learning Tools

AutoML

Datasets

A few good-enough off-the-shelf tools
