How to run Qwen 3 with Ollama on Ori GPU Instances
Discover how to deploy the Qwen 3 235B model with Ollama and Open WebUI on a cloud GPU, and check out our model analysis.
Learn how to run Meta’s multimodal Llama 4 models with Hugging Face Transformers and vLLM on an Ori cloud GPU, and check our comparison of Llama 4 vs...
Explore reinforcement learning (RL), how it works, and essential RL techniques such as Q-learning, policy gradient, and actor-critic methods.
Accelerate your AI with NVIDIA H200 GPUs on Ori to train models and run inference more efficiently than ever before.
Discover how to easily deploy Mistral Small 3 on a cloud GPU with vLLM, and check out our model analysis with verbal, math, and coding prompts.
Learn how to easily deploy DeepSeek R1 Distill 70B on an H100 GPU with Ollama and OpenWebUI, plus our thoughts about the model and its innovative...
Learn how to deploy and scale Qwen 2.5 1.5B effortlessly with Ori Inference Endpoints.
Learn how to deploy Meta’s new text-generation model Llama 3.3 70B with Ollama and Open WebUI on an Ori cloud GPU.
Inside the NVIDIA H200: Specifications, use cases, performance benchmarks, and a comparison of H200 vs H100 GPUs.
Discover how to deploy Genmo Mochi 1 with ComfyUI on an Ori GPU instance, and read our analysis of this new open source video generation model.
Learn more about the NVIDIA L40S, a versatile GPU designed to power a wide variety of applications, and check out NVIDIA L40S vs NVIDIA H100...
Find out how Ori Serverless Kubernetes is helping nCompass run cost-effective LLM inference at scale.