Clusterfun can be installed with pip:
pip install clusterfun
Clusterfun requires Python 3.8 or higher.
Plots accept data in the form of a pandas DataFrame, which will be installed automatically if not already present.
import pandas as pd
import clusterfun as clt
df = pd.read_csv("https://raw.githubusercontent.com/gietema/clusterfun-data/main/wiki-art.csv")
clt.scatter(df, x="x", y="y", media="img_path", color="painter")
As you can see, a clusterfun plot takes as input a pandas dataframe and column names indicating which columns to use for the visualisation. In this way, it is similar to the seaborn
or the plotly
library. But in clusterfun, you can:
This makes clusterfun ideal for quickly visualising image data, which can be useful in the context of building datasets, exploring edge cases and debugging model performance.