

Birla Institute of Technology & Science, Pilani
Work-Integrated Learning Programmes Division
First Semester 2017-2018

Mid-Semester Test
(EC-2 Regular/ Make-up)

Course No.        :  IS ZC464
Course Title      :  MACHINE LEARNING
Nature of Exam    :  Closed Book
Weightage         :  30%
Duration          :  2 Hours
Date of Exam      :  24/09/2017 (FN)
Note:
1.       Please follow all the Instructions to Candidates given on the cover page of the answer book.
2.       All parts of a question should be answered consecutively. Each answer should start from a fresh page. 
3.       Assumptions made, if any, should be stated clearly at the beginning of your answer.

Q.1.         Let the data D consist of just n coin flips, in which α1 is the number of heads and α0 is the number of tails [n = α1 + α0]. Assume that the flips are independent and identically distributed (i.i.d.).

Let X be a binary random variable which represents a coin flip:
X = 1, if the coin flips to heads
X = 0, if the coin flips to tails
Let θ refer to the true probability of heads (P(X = 1) = θ).

a)      Estimate θ by Maximum Likelihood Estimation (MLE).  [3]
b)      If a Beta distribution is used as the prior, show that the Maximum a Posteriori (MAP) estimate of θ is

        θ_MAP = (α1 + β1 − 1) / (α1 + α0 + β1 + β0 − 2)  [3]

Beta distribution:  P(θ) = (1 / B(β1, β0)) · θ^(β1 − 1) · (1 − θ)^(β0 − 1),
where B(β1, β0) is just a normalizing constant.
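
A minimal Python sketch of both estimators, assuming the Beta(β1, β0) prior defined above; the counts and hyperparameters below are illustrative, not taken from the question:

    # MLE and MAP estimates of theta for n i.i.d. coin flips.
    # alpha1/alpha0 (head/tail counts) and beta1/beta0 (Beta prior
    # hyperparameters) are illustrative values.

    def theta_mle(alpha1, alpha0):
        # MLE: the observed fraction of heads
        return alpha1 / (alpha1 + alpha0)

    def theta_map(alpha1, alpha0, beta1, beta0):
        # MAP under Beta(beta1, beta0): the prior acts like (beta1 - 1)
        # extra heads and (beta0 - 1) extra tails
        return (alpha1 + beta1 - 1) / (alpha1 + alpha0 + beta1 + beta0 - 2)

    a1, a0 = 7, 3                        # 7 heads, 3 tails (illustrative)
    b1, b0 = 2, 2                        # weak prior centred on theta = 0.5
    print(theta_mle(a1, a0))             # 0.7
    print(theta_map(a1, a0, b1, b0))     # 8/12 = 0.666...

Note that with a uniform prior (β1 = β0 = 1) the MAP estimate reduces to the MLE.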

Q.2.        Consider a training data set {(xn, tn)}, where n = 1, …, N.  A polynomial function of the form

        y(x, w) = w0 + w1·x + w2·x^2 + … + wM·x^M

is used to fit the data.

(a)    Write the sum-of-squared-errors function without using vector notation (see the sketch after this question).  [1]
(b)   What happens if N = 10 and M = 15? Discuss.  [1]
(c)    What do you understand by a linearly separable data set?  [1]
(d)   Explain the following terms in not more than three lines each:
(i)                 Good generalization.  [0.5]
(ii)               Hypothesis set.  [0.5]
(iii)             Supervised learning.  [0.5]
(iv)             Regularization.  [0.5]
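
For parts (a) and (b), a minimal sketch (using NumPy, with synthetic data) of the sum-of-squared-errors for a degree-M polynomial fit:

    import numpy as np

    def sse(w, x, t):
        # E(w) = sum_n (y(x_n, w) - t_n)^2, with w stored lowest degree first
        y = np.polyval(w[::-1], x)
        return np.sum((y - t) ** 2)

    rng = np.random.default_rng(0)
    N = 10
    x = np.linspace(0, 1, N)
    t = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(N)   # synthetic targets

    for M in (3, 9):
        # least-squares fit of a degree-M polynomial (coefficients lowest first)
        w = np.polynomial.polynomial.polyfit(x, t, M)
        print(M, sse(w, x, t))

With M + 1 > N (e.g. M = 15 for N = 10) the fit has more coefficients than data points, so many weight vectors drive the training error to (numerically) zero: a classic overfitting regime.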
    
    





Q.3.        Consider the data set given in the following table:
Outlook     Temperature   Humidity   PlayTennis
Overcast    Cool          Normal     Yes
Overcast    Hot           High       Yes
Overcast    Hot           High       Yes
Sunny       Cool          Normal     Yes
Overcast    Cool          Normal     No
Sunny       Hot           High       No
Sunny       Hot           High       No
(a)    Estimate all the parameters of a Naïve Bayes classifier from the data set given in the above table (see the sketch after this question).  [3]
(b)   Using the estimated parameters, classify the following instance:
< Outlook = Sunny, Temperature = Hot, Humidity = Normal >  [1]
(c)    Discuss the difference between generative and discriminative classifiers. Give an example of each.  [2]
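
A minimal sketch of parts (a) and (b): relative-frequency (maximum-likelihood) estimates of the Naïve Bayes parameters from the table, then a score for each class on the given instance:

    from collections import Counter, defaultdict

    data = [
        ("Overcast", "Cool", "Normal", "Yes"),
        ("Overcast", "Hot",  "High",   "Yes"),
        ("Overcast", "Hot",  "High",   "Yes"),
        ("Sunny",    "Cool", "Normal", "Yes"),
        ("Overcast", "Cool", "Normal", "No"),
        ("Sunny",    "Hot",  "High",   "No"),
        ("Sunny",    "Hot",  "High",   "No"),
    ]

    prior = Counter(row[-1] for row in data)     # class counts
    cond = defaultdict(Counter)                  # (attribute, class) value counts
    for *features, label in data:
        for i, value in enumerate(features):
            cond[(i, label)][value] += 1

    def score(instance, label):
        # P(label) * prod_i P(x_i | label), relative-frequency estimates
        p = prior[label] / len(data)
        for i, value in enumerate(instance):
            p *= cond[(i, label)][value] / prior[label]
        return p

    x = ("Sunny", "Hot", "Normal")
    print({c: score(x, c) for c in prior})       # pick the class with the larger score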

Q.4.        Let there be three hypotheses h1, h2, h3 in the hypothesis space. Suppose that the posterior probabilities of the three hypotheses given the data set D are as follows:
P(h1|D) = 0.4,             P(h2|D) = 0.3,              P(h3|D) = 0.3 

Suppose a new instance x is encountered, which is classified positive by h1, but negative by h2 and h3.
(a)    Which hypothesis is the MAP hypothesis? Explain.  [1]
(b)   Classify the new instance x using the Bayes optimal classifier (see the sketch after this question).  [3]
(c)    Write the Gibbs algorithm.  [2]
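
For part (b), the Bayes optimal classifier sums the posterior mass of the hypotheses voting for each label; a minimal sketch with the values given above:

    posterior = {"h1": 0.4, "h2": 0.3, "h3": 0.3}
    vote = {"h1": "+", "h2": "-", "h3": "-"}      # how each hypothesis labels x

    mass = {"+": 0.0, "-": 0.0}
    for h, p in posterior.items():
        mass[vote[h]] += p

    print(mass)                       # {'+': 0.4, '-': 0.6}
    print(max(mass, key=mass.get))    # Bayes optimal label for x

Note that the MAP hypothesis of part (a) and the Bayes optimal classifier can disagree, which is the point of the question.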
                                                                                                                    

Q.5.        Consider a collection S containing positive and negative examples of some target function. Assume it has two attributes, Humidity = {High, Normal} and Wind = {Weak, Strong}. What is the information gain of each attribute on the given data?
You may use the notation Gain(S, A) for the information gain of an attribute A.

More precisely, you need to calculate Gain(S, Humidity) = ?  and Gain(S, Wind) = ?
[3 + 3 = 6]
Which attribute is the best classifier?  [1]
 



           
[Figure: the collection S of training examples with their Humidity and Wind values, where + represents the positive examples and − represents the negative examples.]
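
Since the original figure is not reproduced here, the counts in this sketch are illustrative; only the entropy and information-gain computations carry over:

    from math import log2

    def entropy(pos, neg):
        # H(S) = -p+ log2 p+ - p- log2 p-, with 0 * log2(0) taken as 0
        total = pos + neg
        return -sum((k / total) * log2(k / total) for k in (pos, neg) if k)

    def gain(pos, neg, splits):
        # splits: one (pos_v, neg_v) pair per value v of the attribute
        total = pos + neg
        remainder = sum((p + n) / total * entropy(p, n) for p, n in splits)
        return entropy(pos, neg) - remainder

    # Illustrative: 9 positive / 5 negative overall, binary attribute
    print(gain(9, 5, [(6, 2), (3, 3)]))   # ~0.048 bits

The attribute with the larger gain is the better classifier at the root.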

************
