Skip to content

CovMatrix

Sample covariance matrix.

Parameters

  • ddof – defaults to 1

    Delta Degrees of Freedom.

Examples

>>> import numpy as np
>>> import pandas as pd
>>> from river import misc

>>> np.random.seed(42)
>>> X = pd.DataFrame(np.random.random((8, 3)), columns=["red", "green", "blue"])
>>> X
        red     green      blue
0  0.374540  0.950714  0.731994
1  0.598658  0.156019  0.155995
2  0.058084  0.866176  0.601115
3  0.708073  0.020584  0.969910
4  0.832443  0.212339  0.181825
5  0.183405  0.304242  0.524756
6  0.431945  0.291229  0.611853
7  0.139494  0.292145  0.366362

>>> cov = misc.CovMatrix()
>>> for x in X.to_dict(orient="records"):
...     cov = cov.update(x)
>>> cov
        blue     green    red
 blue    0.076    0.020   -0.010
green    0.020    0.113   -0.053
  red   -0.010   -0.053    0.079

There is also an update_many method to process mini-batches. The results are identical.

>>> cov = misc.CovMatrix()
>>> cov = cov.update_many(X)
>>> cov
          blue     green    red
 blue    0.076    0.020   -0.010
green    0.020    0.113   -0.053
  red   -0.010   -0.053    0.079

The covariances are stored in a dictionary, meaning any one of them can be accessed as such:

>>> cov["blue", "green"]
Cov: 0.020292

Diagonal entries are variances:

>>> cov["blue", "blue"]
Var: 0.076119

Methods

clear

D.clear() -> None. Remove all items from D.

copy
fromkeys
get

D.get(k[,d]) -> D[k] if k in D, else d. d defaults to None.

Parameters

  • key
  • default – defaults to None
items

D.items() -> a set-like object providing a view on D's items

keys

D.keys() -> a set-like object providing a view on D's keys

pop

D.pop(k[,d]) -> v, remove specified key and return the corresponding value. If key is not found, d is returned if given, otherwise KeyError is raised.

Parameters

  • key
  • default – defaults to <object object at 0x7f36fb3d5150>
popitem

D.popitem() -> (k, v), remove and return some (key, value) pair as a 2-tuple; but raise KeyError if D is empty.

setdefault

D.setdefault(k[,d]) -> D.get(k,d), also set D[k]=d if k not in D

Parameters

  • key
  • default – defaults to None
update

Update with a single sample.

Parameters

  • x (dict)
update_many

Update with many samples.

Parameters

  • X (pandas.core.frame.DataFrame)
values

D.values() -> an object providing a view on D's values