This page collects references to published workload traces from LLM inference services.
- Azure/AzurePublicDataset includes several conversation traces from Azure services. For example, it has inputs and timings for a multimodal image generation service.
- facebook/natural_reasoning is a set of science-based question/answer pairs that may be useful for fine-tuning reasoning models.