function package

The function package re-exports every operation from the variable package and adds higher-level neural-network primitives. You can use function as the single import for most forward-pass code.
Import path: github.com/itsubaki/autograd/function
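
A minimal complete program, assuming variable.New and Backward behave as in the project README:
package main

import (
	"fmt"

	F "github.com/itsubaki/autograd/function"
	"github.com/itsubaki/autograd/variable"
)

func main() {
	x := variable.New(1.0)
	y := F.Tanh(x)
	y.Backward()

	fmt.Println(y)      // tanh(1.0)
	fmt.Println(x.Grad) // 1 - tanh(1.0)^2
}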

Arithmetic

These functions are aliases for the identically named functions in the variable package.
var Add  = variable.Add   // func(x ...*variable.Variable) *variable.Variable
var AddC = variable.AddC  // func(c float64, x ...*variable.Variable) *variable.Variable
Element-wise addition. AddC adds a scalar constant.
import F "github.com/itsubaki/autograd/function"

y := F.Add(x1, x2)
y := F.AddC(1.0, x)
var Sub  = variable.Sub
var SubC = variable.SubC
Element-wise subtraction. SubC(c, x) computes c - x.
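Usage mirrors Add:
y := F.Sub(x1, x2)
y := F.SubC(1.0, x)  // 1 - x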
var Mul  = variable.Mul
var MulC = variable.MulC
Element-wise multiplication. MulC(c, x) multiplies by a scalar.
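Likewise for multiplication:
y := F.Mul(x1, x2)
y := F.MulC(2.0, x)  // 2 * x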
var Div  = variable.Div
var DivC = variable.DivC
Element-wise division. DivC(c, x) computes c / x.
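And for division:
y := F.Div(x1, x2)
y := F.DivC(1.0, x)  // 1 / x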
var Neg = variable.Neg
Negates each element.
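For example:
y := F.Neg(x)  // -x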

Math

var Sin  = variable.Sin
var Cos  = variable.Cos
var Tanh = variable.Tanh
Element-wise sine, cosine, and hyperbolic tangent, all differentiable.
y := F.Sin(x)
y := F.Cos(x)
y := F.Tanh(x)
var Exp = variable.Exp
var Log = variable.Log
Element-wise exponential and natural logarithm.
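Both take a single variable:
y := F.Exp(x)
y := F.Log(x)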
var Pow    = variable.Pow    // func(p float64) func(x ...*variable.Variable) *variable.Variable
var Square = variable.Square // func(x ...*variable.Variable) *variable.Variable
Pow(p) returns a function that raises each element to the power p. Square is a convenience for Pow(2.0).
y := F.Pow(3.0)(x)   // x^3
y := F.Square(x)     // x^2

Tensor operations

var Sum      = variable.Sum      // func(axes ...int) func(x ...*variable.Variable) *variable.Variable
var SumTo    = variable.SumTo    // func(shape ...int) func(x ...*variable.Variable) *variable.Variable
var Mean     = variable.Mean     // func(axes ...int) func(x ...*variable.Variable) *variable.Variable
var Variance = variable.Variance // func(axes ...int) func(x ...*variable.Variable) *variable.Variable
Reduction operations. Pass axis indices to reduce along specific dimensions; omit for global reduction.
total := F.Sum()(x)       // scalar sum
rowm  := F.Mean(1)(x)     // mean along axis 1
v     := F.Variance()(x)  // global variance
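SumTo reduces to an explicit target shape rather than naming axes; for example, collapsing a (4, 3) input to a single row (shape values are illustrative):
s := F.SumTo(1, 3)(x)  // sum down to shape (1, 3)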
var BroadcastTo     = variable.BroadcastTo
var Reshape         = variable.Reshape
var Transpose       = variable.Transpose
var TransposeMatMul = variable.TransposeMatMul
var MatMul          = variable.MatMul
Shape and layout operations. See the variable package for full descriptions.
y  := F.BroadcastTo(4, 3)(x)
y  := F.Reshape(2, -1)(x)
xt := F.Transpose()(x)
y  := F.MatMul(x, w)
var Max  = variable.Max   // func(axes ...int) func(x ...*variable.Variable) *variable.Variable
var Min  = variable.Min   // func(axes ...int) func(x ...*variable.Variable) *variable.Variable
var Clip = variable.Clip  // func(min, max float64) func(x ...*variable.Variable) *variable.Variable
Max and Min reduce along the given axes (omit the axes for a global reduction); Clip bounds each element to [min, max].
m := F.Max(1)(x)
c := F.Clip(0.0, 1.0)(x)
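Min follows the same calling convention as Max:
mn := F.Min(0)(x)  // minimum along axis 0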
var GetItem = variable.GetItem // func(indices []int, axis int) func(x ...*variable.Variable) *variable.Variable
var Concat  = variable.Concat  // func(axis int) func(x ...*variable.Variable) *variable.Variable
var Split   = variable.Split   // func(size []int, axis int) func(x ...*variable.Variable) []*variable.Variable
Indexing and combining operations.
rows  := F.GetItem([]int{0, 2}, 0)(x)
cat   := F.Concat(1)(x1, x2)
parts := F.Split([]int{3, 3}, 0)(x)

Neural network functions

Sigmoid

func Sigmoid(x ...*variable.Variable) *variable.Variable
Numerically stable sigmoid using the identity σ(x) = 0.5 + 0.5·tanh(0.5x). Backward: gy * y * (1 - y).
y := F.Sigmoid(x)
Use Sigmoid instead of SigmoidSimple in production; it avoids overflow for large inputs.

ReLU

func ReLU(x ...*variable.Variable) *variable.Variable
Rectified Linear Unit: y = max(0, x). Gradient is 1 where input is positive, 0 elsewhere.
y := F.ReLU(x)

Softmax

func Softmax(axis int) func(x ...*variable.Variable) *variable.Variable
Numerically stable softmax along the given axis: subtracts the per-row maximum before exponentiation.
axis (int, required): Axis along which to normalise probabilities. Typically 1 for (N, C) logit tensors.
probs := F.Softmax(1)(logits)

SoftmaxCrossEntropy

func SoftmaxCrossEntropy(x ...*variable.Variable) *variable.Variable
Combined softmax + negative log-likelihood loss. Expects:
  • x[0]: logits with shape (N, C)
  • x[1]: integer class labels with shape (N,) (stored as float64)
Returns a scalar loss -(1/N) * Σ log p(y_i).
loss := F.SoftmaxCrossEntropy(logits, labels)
loss.Backward()
Passing raw logits (before softmax) is more numerically stable and more efficient than applying Softmax first.
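A minimal end-to-end sketch; variable.New and variable.NewOf are assumed to construct variables from flat values and rows respectively, as in the project README:
// two samples, three classes (values are illustrative)
logits := variable.NewOf(
	[]float64{2.0, 1.0, 0.1},
	[]float64{0.5, 2.5, 0.3},
)
labels := variable.New(0, 1) // class indices, stored as float64

loss := F.SoftmaxCrossEntropy(logits, labels)
loss.Backward()
fmt.Println(loss)        // scalar loss
fmt.Println(logits.Grad) // gradient of the loss w.r.t. the logits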

Linear

func Linear(x ...*variable.Variable) *variable.Variable
Affine transformation: y = x @ w [+ b].
  • x[0]: input (N, in_features)
  • x[1]: weight (in_features, out_features)
  • x[2] (optional): bias (1, out_features)
y := F.Linear(x, w, b)   // with bias
y := F.Linear(x, w)      // no bias
For a higher-level layer that manages its own parameters, use layer.Linear. See layer package.
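Because Linear and the activations share the same variadic calling convention, a small forward pass composes directly. Here w1, b1, w2, b2 are hypothetical parameter variables you create and update yourself:
h := F.ReLU(F.Linear(x, w1, b1)) // hidden layer
y := F.Linear(h, w2, b2)         // output layer
probs := F.Softmax(1)(y)         // class probabilities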

MeanSquaredError

func MeanSquaredError(x ...*variable.Variable) *variable.Variable
Mean squared error: (1/N) * Σ (x0 - x1)².
loss := F.MeanSquaredError(pred, target)
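Like the other losses, the result is a scalar variable you can backpropagate from; variable.New is assumed to build a row vector as in the README:
pred := variable.New(1.0, 2.0, 3.0)
target := variable.New(1.5, 2.0, 2.5)

loss := F.MeanSquaredError(pred, target)
loss.Backward()
// pred.Grad now holds the gradient of the loss w.r.t. pred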

DropoutSimple

func DropoutSimple(ratio float64, s ...randv2.Source) func(x ...*variable.Variable) *variable.Variable
Inverted dropout. At training time, randomly zeroes elements with probability ratio and scales surviving elements by 1/(1-ratio). At test time (when variable.Config.Train == false) returns the input unchanged.
ratio (float64, required): Probability of zeroing each element, e.g. 0.5.
s (...randv2.Source, optional): Random source for reproducibility.
dropout := F.DropoutSimple(0.5)
y := dropout(x)

// In test mode, dropout is bypassed
span := variable.TestMode()
defer span.End()
y := dropout(x) // returns x unchanged

Accuracy

func Accuracy(y, t *variable.Variable) *variable.Variable
Computes classification accuracy as the fraction of samples where argmax(y) == t. The return value is not differentiable.
y (*variable.Variable, required): Predicted logits or probabilities with shape (N, C).
t (*variable.Variable, required): Integer class labels with shape (N,).
acc := F.Accuracy(y, t)
fmt.Printf("accuracy: %.2f\n", acc.At())

Simple / reference implementations

These functions implement the same operations using basic differentiable primitives. They are useful for understanding the math or for testing, but prefer the non-Simple variants in practice.
Function                         Equivalent to
SigmoidSimple(x)                 1 / (1 + exp(-x))
SoftmaxSimple(x, axis)           exp(x) / sum(exp(x))
LinearSimple(x, w, b...)         x @ w + b
MeanSquaredErrorSimple(x0, x1)   (1/N) * sum((x0-x1)²)
// reference sigmoid
y := F.SigmoidSimple(x)

// reference linear (saves memory by clearing intermediate)
y := F.LinearSimple(x, w, b)