Skip to content

iter_libsvm

Iterates over a dataset in LIBSVM format.

The LIBSVM format is a popular way in the machine learning community to store sparse datasets. Only numerical feature values are supported. The feature names will be considered as strings.

Parameters

  • filepath_or_buffer (str)

    Either a string indicating the location of a CSV file, or a buffer object that has a read method.

  • target_type – defaults to <class 'float'>

    The type of the target value.

  • compression – defaults to infer

    For on-the-fly decompression of on-disk data. If this is set to 'infer' and filepath_or_buffer is a path, then the decompression method is inferred for the following extensions: '.gz', '.zip'.

Examples

>>> import io
>>> from river import stream

>>> data = io.StringIO('''+1 x:-134.26 y:0.2563
... 1 x:-12 z:0.3
... -1 y:.25
... ''')

>>> for x, y in stream.iter_libsvm(data, target_type=int):
...     print(y, x)
1 {'x': -134.26, 'y': 0.2563}
1 {'x': -12.0, 'z': 0.3}
-1 {'y': 0.25}

References