Discriminative model

Discriminative models, also referred to as conditional models, are a class of models frequently used for classification. They are typically used to assign labels, such as pass/fail, win/lose, alive/dead or healthy/sick, to existing datapoints.

Types of discriminative models include logistic regression (LR), conditional random fields (CRFs), decision trees among many others. Typical generative model approaches include naive Bayes classifiers, Gaussian mixture models, variational autoencoders, generative adversarial networks and others.

Definition

Unlike generative modelling, which studies the joint probability $P(x,y)$ , discriminative modeling studies the $P(y|x)$ or maps the given unobserved variable (target) $x$ to a class label $y$ dependent on the observed variables (training samples). For example, in object recognition, $x$ is likely to be a vector of raw pixels (or features extracted from the raw pixels of the image). Within a probabilistic framework, this is done by modeling the conditional probability distribution $P(y|x)$ , which can be used for predicting $y$ from $x$ . Note that there is still distinction between the conditional model and the discriminative model, though more often they are simply categorised as discriminative model.

Pure discriminative model vs. conditional model

A conditional model models the conditional probability distribution, while the traditional discriminative model aims to optimize on mapping the input around the most similar trained samples.^[1]

Typical discriminative modelling approaches

The following approach is based on the assumption that it is given the training data-set $D=\{(x_{i};y_{i})|i\leq N\in \mathbb {Z} \}$ , where $y_{i}$ is the corresponding output for the input $x_{i}$ .^[2]

Linear classifier

We intend to use the function $f(x)$ to simulate the behavior of what we observed from the training data-set by the linear classifier method. Using the joint feature vector $\phi (x,y)$ , the decision function is defined as:

f(x;w)=\arg \max _{y}w^{T}\phi (x,y)

According to Memisevic's interpretation,^[2] $w^{T}\phi (x,y)$ , which is also $c(x,y;w)$ , computes a score which measures the compatibility of the input $x$ with the potential output $y$ . Then the $\arg \max$ determines the class with the highest score.

Logistic regression (LR)

Since the 0-1 loss function is a commonly used one in the decision theory, the conditional probability distribution $P(y|x;w)$ , where $w$ is a parameter vector for optimizing the training data, could be reconsidered as following for the logistics regression model:

P(y|x;w)={\frac {1}{Z(x;w)}}\exp(w^{T}\phi (x,y))

, with

Z(x;w)=\textstyle \sum _{y}\displaystyle \exp(w^{T}\phi (x,y))

The equation above represents logistic regression. Notice that a major distinction between models is their way of introducing posterior probability. Posterior probability is inferred from the parametric model. We then can maximize the parameter by following equation:

L(w)=\textstyle \sum _{i}\displaystyle \log p(y^{i}|x^{i};w)

It could also be replaced by the log-loss equation below:

l^{\log }(x^{i},y^{i},c(x^{i};w))=-\log p(y^{i}|x^{i};w)=\log Z(x^{i};w)-w^{T}\phi (x^{i},y^{i})

Since the log-loss is differentiable, a gradient-based method can be used to optimize the model. A global optimum is guaranteed because the objective function is convex. The gradient of log likelihood is represented by:

{\frac {\partial L(w)}{\partial w}}=\textstyle \sum _{i}\displaystyle \phi (x^{i},y^{i})-E_{p(y|x^{i};w)}\phi (x^{i},y)

where $E_{p(y|x^{i};w)}$ is the expectation of $p(y|x^{i};w)$

Navigácia: Veda >

Analytika
Antropológia
Aplikované vedy
Bibliometria
Dejiny vedy
Encyklopédie
Filozofia vedy
Forenzné vedy
Humanitné vedy
Knižničná veda
Kryogenika
Kryptológia
Kulturológia
Literárna veda
Medzidisciplinárne oblasti
Metódy kvantitatívnej analýzy
Metavedy
Metodika

Metodológia vedy
Náboženstvo a veda
Náučná literatúra
Podvody vo vede
Popularizácia vedy
Potravinárstvo
Prírodné vedy
Pseudoveda
Scientometria
Spoločenské vedy
Teórie
Teatrológia
Technické vedy
Technika
Terminológia
Umenie
Výskum

Veda
Veda a technika podľa štátu
Veda a technika podľa kontinentu
Veda a technika podľa roka
Veda v kozme
Vedci
Vedecká literatúra
Vedecké databázy
Vedecké experimenty
Vedecké konferencie
Vedecké metódy
Vedecké ocenenia
Vedecké organizácie
Vedecké parky
Vedeckí spisovatelia
Vzdelávanie
Záhady

Príbuzné výrazy:

Text je dostupný za podmienok Creative Commons Attribution/Share-Alike License 3.0 Unported; prípadne za ďalších podmienok.
Podrobnejšie informácie nájdete na stránke Podmienky použitia.

[1]

[2]