public class SparseAveragedPerceptron extends SparsePerceptron
During training, after each example e_i is processed, the weight vector w_i becomes the active weight vector used to make predictions on future training examples. If a mistake was made on e_i, w_i will differ from w_{i-1}; otherwise, it remains unchanged.

After training, each distinct weight vector arrived at during training is associated with an integer weight equal to the number of training examples processed while that weight vector was active. A new weight vector w* is computed by taking the average of all these weight vectors, weighted as described. w* is used to make all predictions returned to the user through methods such as Classifier.classify(Object) or Classifier.discreteValue(Object).

The above description is a useful way to think about the operation of this Learner. However, note that this implementation never explicitly stores w*; it is computed efficiently on demand. Thus, interspersed online training and evaluation is efficient and operates as expected.
It is assumed that Learner.labeler is a single discrete classifier that produces the same feature for every example object, and that the values this feature may take are available through the Classifier.allowableValues() method. The second value returned from Classifier.allowableValues() is treated as "positive", and it is assumed there are exactly 2 allowable values. Assertions will produce error messages if these assumptions do not hold.
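Below is a minimal, hedged sketch of driving this learner directly through its low-level array interface (see learn(int[],double[],int[],double[]) and score(int[],double[]) below). The package name and the binary label encoding used here (label index 1 with value 1.0 for the "positive" class, 0 for the negative class) are assumptions for illustration, not guarantees made by this page.

```java
import edu.illinois.cs.cogcomp.lbjava.learn.SparseAveragedPerceptron;

public class SparseAveragedPerceptronSketch {
    public static void main(String[] args) {
        // Documented two-argument constructor: learning rate 0.1, threshold 0.0.
        SparseAveragedPerceptron p = new SparseAveragedPerceptron(0.1, 0.0);

        // Two toy examples in sparse form: parallel arrays of feature indices and values.
        int[][] features = { { 0, 2 }, { 1, 2 } };
        double[][] values = { { 1.0, 1.0 }, { 1.0, 1.0 } };
        int[] labels = { 1, 0 };  // assumed encoding: 1 = "positive", 0 = negative

        // Several passes of online training; the averaged state is maintained internally.
        for (int round = 0; round < 5; ++round)
            for (int i = 0; i < features.length; ++i)
                p.learn(features[i], values[i],
                        new int[] { labels[i] }, new double[] { 1.0 });

        // score(...) returns w * x + bias, computed from the averaged state on demand.
        System.out.println("score of example 0: " + p.score(features[0], values[0]));
    }
}
```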
Nested Class Summary

Modifier and Type | Class and Description
---|---
static class | SparseAveragedPerceptron.AveragedWeightVector: This implementation of a sparse weight vector associates two doubles with each Feature.
static class | SparseAveragedPerceptron.Parameters: Simply a container for all of SparseAveragedPerceptron's configurable parameters.
Field Summary

Modifier and Type | Field and Description
---|---
protected double | averagedBias: Keeps the extra information necessary to compute the averaged bias.
protected SparseAveragedPerceptron.AveragedWeightVector | awv: Holds the same reference as LinearThresholdUnit.weightVector, cast to SparseAveragedPerceptron.AveragedWeightVector.
static SparseAveragedPerceptron.AveragedWeightVector | defaultWeightVector: Default for LinearThresholdUnit.weightVector.
Fields inherited from class LinearThresholdUnit
allowableValues, bias, defaultInitialWeight, defaultLearningRate, defaultThickness, defaultThreshold, initialWeight, learningRate, negativeThickness, positiveThickness, threshold, weightVector

Fields inherited from class Learner
candidates, encoding, extractor, labeler, labelLexicon, lcFilePath, lexFilePath, lexicon, lossFlag, predictions, readLexiconOnDemand

Fields inherited from class Classifier
containingPackage, name
Constructor Summary

Constructor and Description
---
SparseAveragedPerceptron(): The learning rate and threshold take default values, while the name of the classifier gets the empty string.
SparseAveragedPerceptron(double r): Sets the learning rate to the specified value, and the threshold takes the default, while the name of the classifier gets the empty string.
SparseAveragedPerceptron(double r, double t): Sets the learning rate and threshold to the specified values, while the name of the classifier gets the empty string.
SparseAveragedPerceptron(double r, double t, double pt): Use this constructor to fit a thick separator, where both the positive and negative sides of the hyperplane will be given the specified thickness, while the name of the classifier gets the empty string.
SparseAveragedPerceptron(double r, double t, double pt, double nt): Use this constructor to fit a thick separator, where the positive and negative sides of the hyperplane will be given the specified separate thicknesses, while the name of the classifier gets the empty string.
SparseAveragedPerceptron(SparseAveragedPerceptron.Parameters p): Initializing constructor.
SparseAveragedPerceptron(String n): The learning rate and threshold take default values.
SparseAveragedPerceptron(String n, double r): Sets the learning rate to the specified value, and the threshold takes the default.
SparseAveragedPerceptron(String n, double r, double t): Sets the learning rate and threshold to the specified values.
SparseAveragedPerceptron(String n, double r, double t, double pt): Use this constructor to fit a thick separator, where both the positive and negative sides of the hyperplane will be given the specified thickness.
SparseAveragedPerceptron(String n, double r, double t, double pt, double nt): Use this constructor to fit a thick separator, where the positive and negative sides of the hyperplane will be given the specified separate thicknesses.
SparseAveragedPerceptron(String n, SparseAveragedPerceptron.Parameters p): Initializing constructor.
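As a quick orientation, the sketch below instantiates a few of the constructor variants listed above. Every numeric value is arbitrary and chosen only to show the argument order (name, learning rate, threshold, positive thickness, negative thickness); the package name is assumed.

```java
import edu.illinois.cs.cogcomp.lbjava.learn.SparseAveragedPerceptron;

public class ConstructorSketch {
    public static void main(String[] args) {
        // All defaults; the classifier name is the empty string.
        SparseAveragedPerceptron plain = new SparseAveragedPerceptron();
        // Learning rate 0.5; the threshold takes its default.
        SparseAveragedPerceptron fastRate = new SparseAveragedPerceptron(0.5);
        // Named learner with learning rate 0.1, threshold 0.0, symmetric thickness 1.0.
        SparseAveragedPerceptron symmetric =
                new SparseAveragedPerceptron("sym", 0.1, 0.0, 1.0);
        // Named learner with separate positive (2.0) and negative (0.5) thicknesses.
        SparseAveragedPerceptron asymmetric =
                new SparseAveragedPerceptron("asym", 0.1, 0.0, 2.0, 0.5);
    }
}
```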
Method Summary

Modifier and Type | Method and Description
---|---
void | demote(int[] exampleFeatures, double[] exampleValues, double rate): Scales the feature vector produced by the extractor by the learning rate and subtracts it from the weight vector.
void | forget(): Resets the weight vector to all zeros.
Learner.Parameters | getParameters(): Retrieves the parameters that are set in this learner.
void | initialize(int numExamples, int numFeatures): Initializes the weight vector array to the size of the supplied number of features, with each cell taking the default value of LinearThresholdUnit.initialWeight.
void | learn(int[] exampleFeatures, double[] exampleValues, int[] exampleLabels, double[] labelValues): This method works just like LinearThresholdUnit.learn(int[],double[],int[],double[]), except that it also notifies its weight vector when it gets an example correct, in addition to updating it when it makes a mistake.
void | promote(int[] exampleFeatures, double[] exampleValues, double rate): Scales the feature vector produced by the extractor by the learning rate and adds it to the weight vector.
void | read(edu.illinois.cs.cogcomp.core.datastructures.vectors.ExceptionlessInputStream in): Reads the binary representation of a learner with this object's run-time type, overwriting any and all learned or manually specified parameters as well as the label lexicon, but without modifying the feature lexicon.
double | score(int[] exampleFeatures, double[] exampleValues): The score of the specified object is equal to w * x + bias, where * is dot product, w is the weight vector, and x is the feature vector produced by the extractor.
void | setParameters(SparseAveragedPerceptron.Parameters p): Sets the values of parameters that control the behavior of this learning algorithm.
void | write(edu.illinois.cs.cogcomp.core.datastructures.vectors.ExceptionlessOutputStream out): Writes the learned function's internal representation in binary form.
void | write(PrintStream out): Writes the algorithm's internal representation as text.
Methods inherited from class SparsePerceptron
getLearningRate, setLearningRate

Methods inherited from class LinearThresholdUnit
allowableValues, classify, clone, computeLearningRate, discreteValue, featureValue, getAllowableValues, getBias, getInitialWeight, getNegativeThickness, getPositiveThickness, getThreshold, getWeightVector, score, scores, setInitialWeight, setLabeler, setNegativeThickness, setParameters, setPositiveThickness, setThickness, setThreshold, shouldDemote, shouldPromote

Methods inherited from class Learner
classify, classify, classify, classify, countFeatures, createPrediction, createPrediction, demandLexicon, discreteValue, discreteValue, doneLearning, doneWithRound, emptyClone, featureValue, featureValue, getCurrentLexicon, getExampleArray, getExampleArray, getExtractor, getLabeler, getLabelLexicon, getLexicon, getLexiconDiscardCounts, getLexiconLocation, getModelLocation, getPrunedLexiconSize, learn, learn, learn, learn, read, readLabelLexicon, readLearner, readLearner, readLearner, readLearner, readLearner, readLearner, readLexicon, readLexicon, readLexiconOnDemand, readLexiconOnDemand, readModel, readModel, readParameters, realValue, realValue, realValue, save, saveLexicon, saveModel, scores, scores, scoresAugmented, setCandidates, setEncoding, setExtractor, setLabelLexicon, setLexicon, setLexiconLocation, setLexiconLocation, setLossFlag, setModelLocation, setModelLocation, setParameters, setReadLexiconOnDemand, unclone, unsetLossFlag, write, writeLexicon, writeModel, writeParameters

Methods inherited from class Classifier
classify, discreteValueArray, getCompositeChildren, getInputType, getOutputType, realValueArray, test, toString, valueIndexOf
Field Detail

public static final SparseAveragedPerceptron.AveragedWeightVector defaultWeightVector
Default for LinearThresholdUnit.weightVector.

protected SparseAveragedPerceptron.AveragedWeightVector awv
Holds the same reference as LinearThresholdUnit.weightVector, cast to SparseAveragedPerceptron.AveragedWeightVector.

protected double averagedBias
Keeps the extra information necessary to compute the averaged bias.
Constructor Detail

public SparseAveragedPerceptron()
The learning rate and threshold take default values, while the name of the classifier gets the empty string.

public SparseAveragedPerceptron(double r)
Sets the learning rate to the specified value, and the threshold takes the default, while the name of the classifier gets the empty string.
Parameters:
r - The desired learning rate value.

public SparseAveragedPerceptron(double r, double t)
Sets the learning rate and threshold to the specified values, while the name of the classifier gets the empty string.
Parameters:
r - The desired learning rate value.
t - The desired threshold value.

public SparseAveragedPerceptron(double r, double t, double pt)
Use this constructor to fit a thick separator, where both the positive and negative sides of the hyperplane will be given the specified thickness, while the name of the classifier gets the empty string.
Parameters:
r - The desired learning rate value.
t - The desired threshold value.
pt - The desired thickness.

public SparseAveragedPerceptron(double r, double t, double pt, double nt)
Use this constructor to fit a thick separator, where the positive and negative sides of the hyperplane will be given the specified separate thicknesses, while the name of the classifier gets the empty string.
Parameters:
r - The desired learning rate value.
t - The desired threshold value.
pt - The desired positive thickness.
nt - The desired negative thickness.

public SparseAveragedPerceptron(SparseAveragedPerceptron.Parameters p)
Initializing constructor; the settings are taken from the given SparseAveragedPerceptron.Parameters object.
Parameters:
p - The settings of all parameters.

public SparseAveragedPerceptron(String n)
The learning rate and threshold take default values.
Parameters:
n - The name of the classifier.

public SparseAveragedPerceptron(String n, double r)
Sets the learning rate to the specified value, and the threshold takes the default.
Parameters:
n - The name of the classifier.
r - The desired learning rate value.

public SparseAveragedPerceptron(String n, double r, double t)
Sets the learning rate and threshold to the specified values.
Parameters:
n - The name of the classifier.
r - The desired learning rate value.
t - The desired threshold value.

public SparseAveragedPerceptron(String n, double r, double t, double pt)
Use this constructor to fit a thick separator, where both the positive and negative sides of the hyperplane will be given the specified thickness.
Parameters:
n - The name of the classifier.
r - The desired learning rate value.
t - The desired threshold value.
pt - The desired thickness.

public SparseAveragedPerceptron(String n, double r, double t, double pt, double nt)
Use this constructor to fit a thick separator, where the positive and negative sides of the hyperplane will be given the specified separate thicknesses.
Parameters:
n - The name of the classifier.
r - The desired learning rate value.
t - The desired threshold value.
pt - The desired positive thickness.
nt - The desired negative thickness.

public SparseAveragedPerceptron(String n, SparseAveragedPerceptron.Parameters p)
Initializing constructor; the settings are taken from the given SparseAveragedPerceptron.Parameters object.
Parameters:
n - The name of the classifier.
p - The settings of all parameters.
Method Detail

public Learner.Parameters getParameters()
Retrieves the parameters that are set in this learner.
Overrides:
getParameters in class SparsePerceptron

public void setParameters(SparseAveragedPerceptron.Parameters p)
Sets the values of parameters that control the behavior of this learning algorithm.
Parameters:
p - The parameters.

public double score(int[] exampleFeatures, double[] exampleValues)
The score of the specified object is equal to w * x + bias, where * is dot product, w is the weight vector, and x is the feature vector produced by the extractor.
Overrides:
score in class LinearThresholdUnit
Parameters:
exampleFeatures - The example's array of feature indices.
exampleValues - The example's array of feature values.
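The arrays passed to score are the sparse representation of one example: exampleFeatures holds lexicon indices of the active features, and exampleValues holds their strengths, in parallel. The hedged sketch below layers the conventional linear-threshold decision rule (predict positive when the score reaches the threshold) on top of score; the feature indices and values are arbitrary, and the package name is assumed.

```java
import edu.illinois.cs.cogcomp.lbjava.learn.SparseAveragedPerceptron;

public class ScoreSketch {
    // Conventional LTU decision rule built on top of score(); illustrative only.
    static boolean predictPositive(SparseAveragedPerceptron learner,
                                   int[] featureIndices, double[] featureValues) {
        double s = learner.score(featureIndices, featureValues);  // w * x + bias
        return s >= learner.getThreshold();
    }

    public static void main(String[] args) {
        // An untrained learner is used here only so the sketch runs end to end.
        SparseAveragedPerceptron learner = new SparseAveragedPerceptron(0.1);
        System.out.println(predictPositive(learner,
                new int[] { 3, 7, 42 }, new double[] { 1.0, 1.0, 0.5 }));
    }
}
```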
public void promote(int[] exampleFeatures, double[] exampleValues, double rate)
Scales the feature vector produced by the extractor by the learning rate and adds it to the weight vector.
Overrides:
promote in class SparsePerceptron
Parameters:
exampleFeatures - The example's array of feature indices.
exampleValues - The example's array of feature values.
rate - The learning rate at which the weights are updated.

public void demote(int[] exampleFeatures, double[] exampleValues, double rate)
Scales the feature vector produced by the extractor by the learning rate and subtracts it from the weight vector.
Overrides:
demote in class SparsePerceptron
Parameters:
exampleFeatures - The example's array of feature indices.
exampleValues - The example's array of feature values.
rate - The learning rate at which the weights are updated.

public void learn(int[] exampleFeatures, double[] exampleValues, int[] exampleLabels, double[] labelValues)
This method works just like LinearThresholdUnit.learn(int[],double[],int[],double[]), except that it also notifies its weight vector when it gets an example correct, in addition to updating it when it makes a mistake.
Overrides:
learn in class LinearThresholdUnit
Parameters:
exampleFeatures - The example's array of feature indices.
exampleValues - The example's array of feature values.
exampleLabels - The example's label(s).
labelValues - The labels' values.

public void initialize(int numExamples, int numFeatures)
Initializes the weight vector array to the size of the supplied number of features, with each cell taking the default value of LinearThresholdUnit.initialWeight.
Overrides:
initialize in class LinearThresholdUnit
Parameters:
numExamples - The number of examples.
numFeatures - The number of features.

public void forget()
Resets the weight vector to all zeros.
Overrides:
forget in class LinearThresholdUnit
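Because forget() returns the learner to an all-zero weight vector, a single instance can be reused across independent training runs (for example, cross-validation folds), and initialize(int,int) can pre-size the weight vector when the number of features is known in advance. A hedged sketch with fabricated toy data, an assumed package name, and the assumed label encoding used earlier:

```java
import edu.illinois.cs.cogcomp.lbjava.learn.SparseAveragedPerceptron;

public class ReuseSketch {
    public static void main(String[] args) {
        SparseAveragedPerceptron learner = new SparseAveragedPerceptron("reusable", 0.1);

        // Two toy "folds"; each example is a pair of active feature indices.
        int[][][] folds = { { { 0, 1 }, { 2, 3 } }, { { 0, 3 }, { 1, 2 } } };
        int[][] labels = { { 1, 0 }, { 0, 1 } };   // assumed: 1 = positive, 0 = negative

        for (int f = 0; f < folds.length; f++) {
            learner.forget();                        // reset the weight vector to all zeros
            learner.initialize(folds[f].length, 4);  // optional: pre-size for 4 features
            for (int i = 0; i < folds[f].length; i++)
                learner.learn(folds[f][i], new double[] { 1.0, 1.0 },
                              new int[] { labels[f][i] }, new double[] { 1.0 });
        }
    }
}
```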
public void write(PrintStream out)
Writes the algorithm's internal representation as text, including LinearThresholdUnit.learningRate, LinearThresholdUnit.initialWeight, LinearThresholdUnit.threshold, LinearThresholdUnit.positiveThickness, LinearThresholdUnit.negativeThickness, LinearThresholdUnit.bias, and finally averagedBias.
Overrides:
write in class SparsePerceptron
Parameters:
out - The output stream.
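For quick inspection of the learned parameters, the text form can be written to any PrintStream, for example System.out. A minimal sketch (package name and label encoding assumed as before):

```java
import edu.illinois.cs.cogcomp.lbjava.learn.SparseAveragedPerceptron;

public class DumpSketch {
    public static void main(String[] args) {
        SparseAveragedPerceptron learner = new SparseAveragedPerceptron("dumpme", 0.1);
        learner.learn(new int[] { 0, 2 }, new double[] { 1.0, 1.0 },
                      new int[] { 1 }, new double[] { 1.0 });
        // Human-readable dump: learning rate, initial weight, threshold, thicknesses,
        // bias, and the averaged bias, as listed in the description above.
        learner.write(System.out);
    }
}
```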
public void write(edu.illinois.cs.cogcomp.core.datastructures.vectors.ExceptionlessOutputStream out)
Writes the learned function's internal representation in binary form.
Overrides:
write in class LinearThresholdUnit
Parameters:
out - The output stream.

public void read(edu.illinois.cs.cogcomp.core.datastructures.vectors.ExceptionlessInputStream in)
Reads the binary representation of a learner with this object's run-time type, overwriting any and all learned or manually specified parameters as well as the label lexicon, but without modifying the feature lexicon.
Overrides:
read in class LinearThresholdUnit
Parameters:
in - The input stream.

Copyright © 2016. All rights reserved.