When choosing one feature from \(X_1, \ldots, X_n\) while building a Decision Tree, which of the following criteria is the most appropriate to maximize? (Here, \(H()\) means entropy, and \(P()\) means probability)

(a) \(P(Y | X_j)\)

(b) \(P(Y) - P(Y | X_j)\)

(c) \(H(Y) - H(Y | X_j)\)

(d) \(H(Y | X_j)\)

(e) \(H(Y) - P(Y)\)

1 Answer

Best answer
The most appropriate criterion to maximize when choosing a feature in a decision tree is \((c) \ H(Y) - H(Y | X_j)\).

Explanation:

\(H(Y)\) represents the entropy of the target variable \(Y\), measuring its uncertainty or randomness. \(H(Y | X_j)\) represents the conditional entropy of \(Y\) given a specific feature \(X_j\), indicating how much uncertainty remains about \(Y\) after knowing the value of \(X_j\).

Information Gain:

The difference between these two entropies, \(H(Y) - H(Y | X_j)\), is called the information gain associated with feature \(X_j\). It quantifies the reduction in uncertainty about \(Y\) achieved by knowing the value of \(X_j\).

Goal of Decision Trees:

Decision trees aim to create splits that reduce uncertainty about the target variable as much as possible. Therefore, maximizing the information gain, which means maximizing \(H(Y) - H(Y | X_j)\), is the most appropriate criterion for feature selection.
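The definitions above can be checked directly with a small sketch. The helper names (`entropy`, `conditional_entropy`, `information_gain`) and the toy dataset are illustrative, not from the question; the gain values follow from the formulas \(H(Y)\), \(H(Y|X_j)\), and \(H(Y) - H(Y|X_j)\):

```python
from collections import Counter
import math

def entropy(labels):
    """Shannon entropy H(Y) of a list of labels, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def conditional_entropy(feature, labels):
    """H(Y | X_j): entropy of Y within each value of X_j, weighted by frequency."""
    n = len(labels)
    groups = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    return sum((len(ys) / n) * entropy(ys) for ys in groups.values())

def information_gain(feature, labels):
    """H(Y) - H(Y | X_j): reduction in uncertainty about Y from knowing X_j."""
    return entropy(labels) - conditional_entropy(feature, labels)

# Toy example: a perfectly informative feature vs. an uninformative one.
y  = ['yes', 'yes', 'no', 'no']
x1 = ['a', 'a', 'b', 'b']   # splits Y perfectly: each branch is pure
x2 = ['a', 'b', 'a', 'b']   # each branch is still a 50/50 mix of labels

print(information_gain(x1, y))  # 1.0 bit -- the whole H(Y) is removed
print(information_gain(x2, y))  # 0.0 -- knowing X_j tells us nothing about Y
```

A tree learner choosing between \(X_1\) and \(X_2\) here would pick \(X_1\), since it maximizes \(H(Y) - H(Y | X_j)\).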
