Datasets

Machine-readable datasets for AI agents, published on Hugging Face. Available as JSON and Parquet, updated automatically from source. Free, CC-BY-4.0, no email gate.

Need a dataset wired into an agent pipeline?

We use both datasets to power agent tooling at AutomateLab. If you want one integrated into your own retrieval pipeline or fine-tuning run, we can help scope and build the integration.

Get in touch