Prowys Research

Publications.

The full catalogue of Prowys research. Quarterly briefings, standalone papers, sector notes, and work co-authored with partner universities and research labs.

Research

We publish the research we wish existed before we started.

Quarterly market briefings, standalone papers, sector notes, and co-authored work with partner universities and research labs. Some papers are open; others are released on request under NDA.

Quarterly briefing · Q3 2026 · Latest

Global allocation outlook: H200 and the GB200 rollout window.

Re-routing of Tier-1 OEM allocations through specialist integrators is shortening lead times for H200 SKUs by an average of nine weeks, reshaping cluster economics for inference-first buyers across major markets.

24 pages · PDF · September 2026 · Authored by R. Al-Mansouri, K. Demir

Read the briefing →

Fig. 1 · H200 lead time (weeks) · 2024–2026

Source: Prowys research, OEM data · n=27 deployments

Library

Brief · Aug 2026 · 14pp · PDF

Cost-per-token at production scale: what to model, what to ignore.

A working framework for token cost, covering serving stack, batch dynamics, utilization, and the four line items most buyers undercount.

Authors: R. Al-Mansouri, S. Nair

On request · Request →

Co-authored · Jul 2026 · 18pp · PDF

Inference networking: 400G vs 800G at production scale.

A side-by-side comparison of three reference fabrics deployed in the past 18 months. Networking choice now drives effective tokens-per-watt more than GPU SKU.

Authors: K. Demir (Prowys), A. Belhaj (Partner lab)

In partnership with National research institute · Anonymised

Open · Read →

Sector note · Jul 2026 · 8pp · PDF

Sovereign compute and the residency question.

How three national regulators are converging, and diverging, on model residency, audit, and cross-border inference. Practical implications for buyer timelines.

Authors: S. Nair, F. Hosseini

Open · Read →

Co-authored · Jun 2026 · 22pp · PDF

Liquid cooling readiness in modern data centres.

A field study across nine facilities. Power envelope, water budget, and the operational gap between rated and deliverable rack density.

Authors: F. Hosseini (Prowys), Dr. M. Saif (Partner univ.)

In partnership with Partner university · Centre for Energy Systems

On request · Request →

Quarterly briefing · Jun 2026 · 28pp · PDF

GPU market mid-year: H100 floor, B200 ramp, MI300X repricing.

A mid-cycle reading of the GPU market: what is moving, what is mispriced, and what is now obsolete for new deployments.

Authors: R. Al-Mansouri, K. Demir

On request · Request →

Co-authored · May 2026 · 16pp · PDF

Multilingual model evaluation: an enterprise harness.

Evaluation criteria, datasets, and reproducible scoring for enterprise multilingual LLM selection. Released as an open benchmark.

Authors: K. Demir, A. Ouazzani; with a partner NLP research group

In partnership with Partner research institute · NLP

Open · Read →

Brief · May 2026 · 12pp · PDF

Utilization, not contracting: the second-year token cost.

Why year-two cost-per-token is set by FinOps and scheduler choice, not the original GPU contract. Five reference clusters benchmarked.

Authors: S. Nair, R. Al-Mansouri

On request · Request →

Sector note · Apr 2026 · 10pp · PDF

Generative AI in enterprise banking: where the cost case is real.

Five use cases ranked by realised ROI across six Tier-1 banks. What is shipping, what is shelved, and why.

Authors: F. Hosseini

Open · Read →

Co-authored papers are released jointly under Prowys' and the partner institution's marks. Bespoke buyer-side research is commissioned on a per-engagement basis.

Commission research