Research
We publish the research we wish existed before we started.
Quarterly market briefings, standalone papers, sector notes, and co-authored work with partner universities and research labs. Some papers are open; others are released on request under NDA.
BriefAug 202614pp · PDF
Cost-per-token at production scale: what to model, what to ignore.
A working framework for token cost, covering serving stack, batch dynamics, utilization, and the four line items most buyers undercount.
AuthorsR. Al-Mansouri, S. Nair
Co-authoredJul 202618pp · PDF
Inference networking: 400G vs 800G at production scale.
A side-by-side of three reference fabrics deployed in the past 18 months. Networking choice now drives effective tokens-per-watt more than GPU SKU.
AuthorsK. Demir (Prowys), A. Belhaj (Partner lab)In partnership with National research institute · Anonymised
Sector noteJul 20268pp · PDF
Sovereign compute and the residency question.
How three national regulators are converging, and diverging, on model residency, audit, and cross-border inference. Practical implications for buyer timelines.
AuthorsS. Nair, F. Hosseini
Co-authoredJun 202622pp · PDF
Liquid cooling readiness in modern data centres.
A field study across nine facilities. Power envelope, water budget, and the operational gap between rated and deliverable rack density.
AuthorsF. Hosseini (Prowys), Dr. M. Saif (Partner univ.)In partnership with Partner university · Centre for Energy Systems
Quarterly briefingJun 202628pp · PDF
GPU market mid-year: H100 floor, B200 ramp, MI300X repricing.
Mid-cycle reading of the GPU market. What is moving, what is mispriced, what is now obsolete for new deployments.
AuthorsR. Al-Mansouri, K. Demir
Co-authoredMay 202616pp · PDF
Multilingual model evaluation: an enterprise harness.
Eval criteria, datasets, and reproducible scoring for enterprise multilingual LLM selection. Released as an open benchmark.
AuthorsK. Demir, A. Ouazzani; with a partner NLP research groupIn partnership with Partner research institute · NLP
BriefMay 202612pp · PDF
Utilization, not contracting: the second-year token cost.
Why year-two cost-per-token is set by FinOps and scheduler choice, not the original GPU contract. Five reference clusters benchmarked.
AuthorsS. Nair, R. Al-Mansouri
Sector noteApr 202610pp · PDF
Generative AI in enterprise banking: where the cost case is real.
Five use-cases ranked by realised ROI across six Tier-1 banks. What is shipping, what is shelved, and why.
AuthorsF. Hosseini