Return the path of the scikit-learn data directory.
This folder is used by some large dataset loaders to avoid downloading the data several times.
By default the data directory is set to a folder named ‘scikit_learn_data’ in the user home folder.
Alternatively, it can be set by the ‘SCIKIT_LEARN_DATA’ environment variable or programmatically by giving an explicit folder path. The ‘~’ symbol is expanded to the user home folder.
If the folder does not already exist, it is automatically created.
The path to scikit-learn data directory. If None, the default path is ~/scikit_learn_data.
The path to scikit-learn data directory.
>>> import os >>> from sklearn.datasets import get_data_home >>> data_home_path = get_data_home() >>> os.path.exists(data_home_path) True
© 2007–2025 The scikit-learn developers
Licensed under the 3-clause BSD License.
https://scikit-learn.org/1.6/modules/generated/sklearn.datasets.get_data_home.html