CS7641-SL

Classification vs regression

Discrete labels vs continuous label

Regression

Mapping continuous input to output.

People's height regresses to the mean.

What prevent us from all being the same height?

The noise from DNA combinations.

The idea to find the regression to function approximation.

Linear regression

Find the best fit functions for data point

Use calculus. Get assumption first, then find the optimal solution from calculus.

Historical facts
model selection and under/over fitting
Cross validation
Linear, polynomial regression
Best constant in terms of squared error: mean
Representation for regression.

Classification Learning

Instances: (Input vectors of values)
Concept: Function to map input to output. Idea to describe things.
Target concept: actual answer.
Hypothesis class: all possible functions.
Sample: training set. Input data with correct labels.
Candidate: concept might be target concept.
Testing set: Should not be the same as Training set.

Decision Trees

20 questions

Ask 20 questions to guess what's in another person's mind?

Each question further narrow down the scope.

Steps

Pick best attribute to split the data (in binary)
Asked question
Follow the answer path
Go to Step 1 until get an answer.

Expressiveness

Boolean functions:

AND

XOR: full truth table.

N-OR (Any)

Linear N nodes.

N-XOR (Parity)

2**N - 1 nodes.

ID3

Loop:

A < - Best attribute
Assign A as decision attribute for node
For each value of A, create a descendent of node
Sort training examples to leaves
If examples per

How to find the best attribute?

Information gain

If it's even, there is no way to guess what's the possibility of the result. Information is the maximum. If it's 100% sure, then information is none.

Entropy

A measure of randomness.

Gain = Entrosy(S) - \Sigma \dfrac{|Sv|}{|S|} Entropy(S_v)

The goal is maximize entropy gain.

Bias

Inductive Bias

Restriction Bias

Preference Bias

Decision Trees other considerations

Continuous attributes? Age, weight, distance

Classification vs regression​

Regression​

Linear regression​

Find the best fit functions for data point​

Classification Learning​

Decision Trees​

20 questions​

Steps​

Expressiveness​

N-OR (Any)​

N-XOR (Parity)​

ID3​

How to find the best attribute?​

Information gain​

Entropy​

Bias​

Decision Trees other considerations​