Skip to main content

Hugging Face - Datasets

Imports datasets from Hugging Face (https://huggingface.co/datasets)

Only datasets with Parquet exports can be imported with this connector.

Configuration

InputTypeDescriptionDefault Value
dataset_namestringDataset Name.
dataset_subsetsarrayDataset Subsets. Dataset Subsets to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details)
dataset_splitsarrayDataset Splits. Splits to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details)

Streams

Stream NamePrimary KeyPaginationSupports Full SyncSupports Incremental
rowsDefaultPaginator
splitsNo pagination

Changelog

Expand to review
VersionDatePull RequestSubject
0.0.52024-12-2150079Update dependencies
0.0.42024-12-1449609Update dependencies
0.0.32024-12-1249233Update dependencies
0.0.22024-12-1148911Starting with this version, the Docker image is now rootless. Please note that this and future versions will not be compatible with Airbyte versions earlier than 0.64
0.0.12024-11-28Initial release by @michel-tricot via Connector Builder
Was this page helpful?