iter_libsvm¶
Iterates over a dataset in LIBSVM format.
The LIBSVM format is a popular way in the machine learning community to store sparse datasets. Only numerical feature values are supported. The feature names will be considered as strings.
Parameters¶
-
filepath_or_buffer
Type → str
Either a string indicating the location of a file, or a buffer object that has a
read
method. -
target_type
Default →
<class 'float'>
The type of the target value.
-
compression
Default →
infer
For on-the-fly decompression of on-disk data. If this is set to 'infer' and
filepath_or_buffer
is a path, then the decompression method is inferred for the following extensions: '.gz', '.zip'.
Examples¶
import io
from river import stream
data = io.StringIO('''+1 x:-134.26 y:0.2563
1 x:-12 z:0.3
-1 y:.25
''')
for x, y in stream.iter_libsvm(data, target_type=int):
print(y, x)
1 {'x': -134.26, 'y': 0.2563}
1 {'x': -12.0, 'z': 0.3}
-1 {'y': 0.25}