CrossEntropyLoss
class torch.nn.CrossEntropyLoss(weight=None, size_average=None, ignore_index=-100, reduce=None, reduction='mean')[source]
This criterion combines LogSoftmax and NLLLoss in one single class.

It is useful when training a classification problem with C classes. If provided, the optional argument weight should be a 1D Tensor assigning weight to each of the classes. This is particularly useful when you have an unbalanced training set.

The input is expected to contain raw, unnormalized scores for each class. input has to be a Tensor of size either $(\text{minibatch}, C)$ or $(\text{minibatch}, C, d_1, d_2, ..., d_K)$ with $K \geq 1$ for the K-dimensional case (described later).

This criterion expects a class index in the range $[0, C-1]$ as the target for each value of a 1D tensor of size minibatch; if ignore_index is specified, this criterion also accepts this class index (this index may not necessarily be in the class range).

The loss can be described as:

$$\text{loss}(x, \text{class}) = -\log\left(\frac{\exp(x[\text{class}])}{\sum_j \exp(x[j])}\right) = -x[\text{class}] + \log\left(\sum_j \exp(x[j])\right)$$

or in the case of the weight argument being specified:

$$\text{loss}(x, \text{class}) = \text{weight}[\text{class}] \left(-x[\text{class}] + \log\left(\sum_j \exp(x[j])\right)\right)$$

The losses are averaged across observations for each minibatch. If the weight argument is specified then this is a weighted average:

$$\text{loss} = \frac{\sum_{i=1}^{N} \text{loss}(i, \text{class}[i])}{\sum_{i=1}^{N} \text{weight}[\text{class}[i]]}$$

Can also be used for higher dimension inputs, such as 2D images, by providing an input of size $(\text{minibatch}, C, d_1, d_2, ..., d_K)$ with $K \geq 1$, where $K$ is the number of dimensions, and a target of appropriate shape (see below).
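The equivalence with an explicit LogSoftmax followed by NLLLoss, and the formula above, can be checked numerically. A minimal sketch, assuming arbitrary sizes of 3 samples and 5 classes (not part of the original example set):

```
import torch
import torch.nn as nn

input = torch.randn(3, 5)            # raw, unnormalized scores (3 samples, 5 classes)
target = torch.tensor([1, 0, 4])     # class indices in [0, C-1]

# CrossEntropyLoss applied directly to the raw scores.
ce = nn.CrossEntropyLoss()(input, target)

# Equivalent two-step computation: LogSoftmax over the class dimension, then NLLLoss.
log_probs = nn.LogSoftmax(dim=1)(input)
nll = nn.NLLLoss()(log_probs, target)

# Manual evaluation of loss(x, class) = -x[class] + log(sum_j exp(x[j])),
# averaged over the minibatch (the default 'mean' reduction with no weight).
manual = (-input[torch.arange(3), target] + input.exp().sum(dim=1).log()).mean()

print(torch.allclose(ce, nll), torch.allclose(ce, manual))  # True True
```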
- Parameters
- weight (Tensor, optional) – a manual rescaling weight given to each class. If given, has to be a Tensor of size C.
- size_average (bool, optional) – Deprecated (see reduction). By default, the losses are averaged over each loss element in the batch. Note that for some losses, there are multiple elements per sample. If the field size_average is set to False, the losses are instead summed for each minibatch. Ignored when reduce is False. Default: True
- ignore_index (int, optional) – Specifies a target value that is ignored and does not contribute to the input gradient. When size_average is True, the loss is averaged over non-ignored targets.
- reduce (bool, optional) – Deprecated (see reduction). By default, the losses are averaged or summed over observations for each minibatch depending on size_average. When reduce is False, returns a loss per batch element instead and ignores size_average. Default: True
- reduction (string, optional) – Specifies the reduction to apply to the output: 'none' | 'mean' | 'sum'. 'none': no reduction will be applied, 'mean': the weighted mean of the output is taken, 'sum': the output will be summed. Note: size_average and reduce are in the process of being deprecated, and in the meantime, specifying either of those two args will override reduction. Default: 'mean'. (How weight, ignore_index and reduction interact is sketched after this list.)
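The following is a short sketch, not from the original documentation, of how the weight, ignore_index and reduction arguments interact; the class count of 5, the ignored index of 2 and the specific weights are arbitrary choices:

```
import torch
import torch.nn as nn

input = torch.randn(4, 5)              # 4 samples, 5 classes (arbitrary)
target = torch.tensor([0, 2, 3, 2])    # entries equal to 2 will be ignored below

# Per-class rescaling weights; must be a Tensor of size C = 5.
weight = torch.tensor([1.0, 2.0, 1.0, 0.5, 1.0])

# reduction='none' keeps one loss value per sample instead of reducing to a scalar.
per_sample = nn.CrossEntropyLoss(weight=weight, reduction='none')(input, target)
print(per_sample.shape)                # torch.Size([4])

# ignore_index=2: targets equal to 2 do not contribute to the loss; with the
# default 'mean' reduction the result is a weighted average over the rest.
mean_loss = nn.CrossEntropyLoss(weight=weight, ignore_index=2)(input, target)
print(mean_loss)                       # scalar
```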
 
 - Shape:
- Input: $(N, C)$ where $C$ = number of classes, or $(N, C, d_1, d_2, ..., d_K)$ with $K \geq 1$ in the case of K-dimensional loss.
- Target: $(N)$ where each value is $0 \leq \text{targets}[i] \leq C-1$, or $(N, d_1, d_2, ..., d_K)$ with $K \geq 1$ in the case of K-dimensional loss.
- Output: scalar. If reduction is 'none', then the same size as the target: $(N)$, or $(N, d_1, d_2, ..., d_K)$ with $K \geq 1$ in the case of K-dimensional loss. (A sketch of the K-dimensional case follows this list.)
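For the K-dimensional case, a minimal sketch with arbitrarily chosen shapes (batch of 2, 3 classes, a 4×4 spatial grid, i.e. K = 2), illustrating the input, target and output shapes listed above:

```
import torch
import torch.nn as nn

loss = nn.CrossEntropyLoss()

# Input: (N, C, d_1, d_2) = (2, 3, 4, 4) — raw scores per spatial position.
input = torch.randn(2, 3, 4, 4, requires_grad=True)

# Target: (N, d_1, d_2) = (2, 4, 4) — class indices in [0, C-1].
target = torch.randint(0, 3, (2, 4, 4))

output = loss(input, target)   # scalar with the default 'mean' reduction
output.backward()

# With reduction='none', the output has the same size as the target.
per_position = nn.CrossEntropyLoss(reduction='none')(input, target)
print(per_position.shape)      # torch.Size([2, 4, 4])
```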
 
- Examples:

```
>>> loss = nn.CrossEntropyLoss()
>>> input = torch.randn(3, 5, requires_grad=True)
>>> target = torch.empty(3, dtype=torch.long).random_(5)
>>> output = loss(input, target)
>>> output.backward()
```