EmpiricalCovariance¶
Empirical covariance matrix.
Parameters¶
-
ddof – defaults to
1
Delta Degrees of Freedom.
Attributes¶
- matrix
Examples¶
>>> import numpy as np
>>> import pandas as pd
>>> from river import covariance
>>> np.random.seed(42)
>>> X = pd.DataFrame(np.random.random((8, 3)), columns=["red", "green", "blue"])
>>> X
red green blue
0 0.374540 0.950714 0.731994
1 0.598658 0.156019 0.155995
2 0.058084 0.866176 0.601115
3 0.708073 0.020584 0.969910
4 0.832443 0.212339 0.181825
5 0.183405 0.304242 0.524756
6 0.431945 0.291229 0.611853
7 0.139494 0.292145 0.366362
>>> cov = covariance.EmpiricalCovariance()
>>> for x in X.to_dict(orient="records"):
... cov = cov.update(x)
>>> cov
blue green red
blue 0.076 0.020 -0.010
green 0.020 0.113 -0.053
red -0.010 -0.053 0.079
There is also an update_many
method to process mini-batches. The results are identical.
>>> cov = covariance.EmpiricalCovariance()
>>> cov = cov.update_many(X)
>>> cov
blue green red
blue 0.076 0.020 -0.010
green 0.020 0.113 -0.053
red -0.010 -0.053 0.079
The covariances are stored in a dictionary, meaning any one of them can be accessed as such:
>>> cov["blue", "green"]
Cov: 0.020292
Diagonal entries are variances:
>>> cov["blue", "blue"]
Var: 0.076119
Methods¶
revert
Downdate with a single sample.
Parameters
- x (dict)
update
Update with a single sample.
Parameters
- x (dict)
update_many
Update with a dataframe of samples.
Parameters
- X (pandas.core.frame.DataFrame)