from datasets import load_dataset


dataset = load_dataset("openwebtext")