Thursday, May 7, 2026
World News Prime
No Result
View All Result
  • Home
  • Breaking News
  • Business
  • Politics
  • Health
  • Sports
  • Entertainment
  • Technology
  • Gaming
  • Travel
  • Lifestyle
World News Prime
  • Home
  • Breaking News
  • Business
  • Politics
  • Health
  • Sports
  • Entertainment
  • Technology
  • Gaming
  • Travel
  • Lifestyle
No Result
View All Result
World News Prime
No Result
View All Result
Home Business

Unlock GPU Clouds with Cisco AI PODs and Rafay

October 14, 2025
in Business
Reading Time: 5 mins read
0 0
0
Unlock GPU Clouds with Cisco AI PODs and Rafay
Share on FacebookShare on Twitter


Enterprises are making daring strikes into AI, and Cisco AI PODs present a strong, pre-validated basis for deploying AI infrastructure at scale. They carry collectively compute, storage, and networking in a modular design that simplifies procurement and deployment. Nevertheless, deploying {hardware} is just the start. The following essential step is making this highly effective infrastructure consumable as a service.

That is the place Rafay enhances Cisco AI PODs. Rafay’s GPU Platform as a Service (PaaS) provides the essential consumption layer, turning the {hardware} right into a ruled, self-service GPU cloud. Collectively, Cisco and Rafay allow organizations to operationalize AI sooner by providing safe, multi-tenant entry, standardized workload SKUs, and policy-driven governance.

This put up explores how this joint resolution transforms uncooked GPU energy right into a production-ready AI platform, enabling developer self-service whereas sustaining enterprise-grade management.

From Infrastructure to Consumption: The Platform Problem

Organizations have accelerated investments in AI infrastructure, deploying platforms like Cisco AI PODs with the most recent NVIDIA {hardware} to allow generative AI, Retrieval-Augmented Era (RAG), and large-scale inference. As adoption grows, a brand new problem emerges: allow a number of groups to securely and effectively eat this shared infrastructure.

Platform groups should steadiness entry throughout completely different teams, every with distinctive wants and safety necessities. With no standardized consumption layer, this results in a number of issues:

Underutilized GPUs: Business benchmarks report common GPU utilization charges typically fall beneath 30%. That is partly as a result of AI workloads are “bursty” and most environments lack the mechanisms to slice and share GPU assets effectively. When costly GPUs sit idle, it represents a big alternative value.
Handbook Provisioning: Platform groups typically depend on handbook configurations, ad-hoc scripts, and repair tickets to handle entry. These workflows decelerate supply, introduce inconsistencies, and make it tough to implement governance.
Siloed Sources: With no unified platform, GPU infrastructure typically turns into siloed by workforce, limiting sharing and stopping a holistic view of utilization and prices. Builders and researchers should navigate complicated inside processes simply to run a job.

To unravel this, enterprises must function their GPU infrastructure as a service—one which helps shared assets, multitenant isolation, and automatic coverage enforcement.

The Joint Resolution: Cisco AI PODs + Rafay GPU PaaS

Cisco and Rafay have collaborated to ship a modular, absolutely validated GPU cloud structure. This resolution combines Cisco’s best-in-class AI POD infrastructure with Rafay’s GPU Platform as a Service, remodeling GPU {hardware} right into a safe, self-service, multitenant cloud.

Cisco AI PODs present the compute, cloth, storage, and pre-validated design. Primarily based on Cisco Validated Designs (CVDs), they combine next-generation Cisco UCS platforms (just like the C885A M8 Server) and the most recent NVIDIA GPUs to energy all the AI lifecycle.
Rafay GPU PaaS delivers the orchestration, coverage enforcement, and developer abstraction layer. It transforms the foundational {hardware} right into a production-grade GPU cloud that’s easy to eat.

This mixed structure allows organizations to quickly launch and function GPU clouds with full-stack orchestration, declarative SKU provisioning, and built-in value attribution.

Developer Self-Service By a Curated Catalog

On the core of Rafay’s platform is the SKU Studio, a purpose-built catalog system that empowers platform groups to ship AI-ready infrastructure and functions as reusable SKUs.

Every SKU is a modular abstraction that bundles:

Compute Configuration: GPU/MIG profiles, CPU, reminiscence, and storage.
Software Stack: Pre-integrated instruments like vLLM, Triton, or Jupyter Notebooks.
Coverage Controls: Time-to-Dwell (TTLs), RBAC, multitenancy, and quotas.
Billing Metadata: Utilization items and value attribution.

Builders can entry GPU environments immediately by a self-service portal (GUI, API, or CLI) while not having to file help tickets. For instance, an information scientist can choose an “H100-Inference-vLLM” SKU, which routinely provisions a selected GPU slice, deploys a safe container, and applies a 48-hour TTL. This streamlines workflows and ensures safety greatest practices are utilized constantly.

Safe Multi-Tenancy and Governance

Sharing costly GPU assets requires strict isolation and governance. Rafay gives native, safe multi-tenancy that enables groups to securely share infrastructure with out interference.

Key safety controls are routinely enforced:

Hierarchical RBAC: Defines permissions and entry scope for tenants, tasks, and workspaces.
Namespace Isolation: Ensures workloads are separated on the cluster and community degree.
Useful resource Quotas: Prevents any single workforce or job from monopolizing assets.
Centralized Audit Logs: Supplies an entire audit path of person actions for compliance.

These built-in protections permit platform groups to take care of full oversight and management whereas empowering builders with the liberty they should innovate.

Complete GPU Administration and Visibility

To maximise ROI, it is advisable to understand how your GPUs are getting used. Rafay gives end-to-end visibility, metering, and value attribution tailor-made for multitenant environments.

Platform groups can use declarative blueprints to standardize GPU operator configurations and slicing methods (like MIG) throughout all clusters. Multi-tenant dashboards provide detailed insights into:

GPU stock and allocation
SKU utilization patterns
Occasion-level exercise and person attribution
Well being standing and uptime tendencies

A billing metrics API aggregates utilization knowledge, calculates billable compute, and generates auditable experiences, enabling chargebacks and monetary accountability.

Who Advantages from a Unified GPU Cloud?

This collectively validated resolution is designed for a various vary of shoppers who must operationalize GPU infrastructure with safety, velocity, and scale.

Enterprise IT Groups: Acquire federated self-service, quota enforcement, and centralized visibility. This reduces infrastructure duplication and embeds governance into day by day operations.
Sovereign & Public Sector Organizations: Meet compliance wants in air-gapped environments with safe multitenancy, coverage enforcement, and centralized audit logging.
Cloud & Managed Service Suppliers: Monetize GPU infrastructure with a white-labeled, multitenant platform that features automated tenant onboarding and built-in chargeback metering.
Current Cisco Prospects: Prolong the ROI of present UCS deployments by including GPU orchestration as a seamless overlay with no re-architecture required.
Greenfield AI Builders: Begin recent with a pre-validated, absolutely built-in resolution that reduces the time from procurement to operational AI companies from months to weeks.

Operationalize Your AI Infrastructure Right now

Pairing Cisco’s validated AI infrastructure with Rafay’s GPU PaaS management airplane permits organizations to remodel GPU techniques into absolutely ruled inside platforms. The result’s a consumption-driven structure the place builders achieve self-service entry, operators implement quotas and monitor consumption, and the enterprise maximizes the worth of its AI investments.

This structure presents a transparent path ahead: ship GPU infrastructure as a service, allow safe and compliant multitenancy, and make consumption predictable and cost-aligned from day one.

To see this highly effective resolution in motion, be a part of our upcoming webinar. Specialists from Cisco and Rafay will show rework your GPU infrastructure right into a production-ready AI service.

Dwell Webinar: From AI PODs to GPU CloudOctober 21, 2025 at 8:00 a.m. PST / 3:00 p.m. GMT

 

 

We’d love to listen to what you assume. Ask a Query, Remark Under, and Keep Related with #CiscoPartners on social!

Cisco Companions Fb  |  @CiscoPartners X/Twitter  |  Cisco Companions LinkedIn



Source link

Tags: AI infrastructureCiscoCisco Partnerscloud governanceCloudsGPUGPU PaaSMuticloud GovernanceNVIDIAPODsRafayUnlock
Previous Post

Pokemon developer Game Freak’s plans up to 2030 have been leaked

Next Post

Managing stock in a business sale

Related Posts

Blue chips falter amid wait for Middle East deal
Business

Blue chips falter amid wait for Middle East deal

May 7, 2026
Revealed: The five high street banks with the most customer complaints
Business

Revealed: The five high street banks with the most customer complaints

May 7, 2026
Month of Developer Productivity: Build and Forget
Business

Month of Developer Productivity: Build and Forget

May 7, 2026
Zoom Grants Solopreneurs With 0,000 Cash Grants
Business

Zoom Grants Solopreneurs With $150,000 Cash Grants

May 6, 2026
Lucy Lukic On Canada’s Shift Toward Living Benefits
Business

Lucy Lukic On Canada’s Shift Toward Living Benefits

May 7, 2026
Ford Has a Secret Team Working on a ,000 Electric Truck to Beat China. It’s a ‘Model T Moment,’ Says CEO.
Business

Ford Has a Secret Team Working on a $30,000 Electric Truck to Beat China. It’s a ‘Model T Moment,’ Says CEO.

May 6, 2026
Next Post
Managing stock in a business sale

Managing stock in a business sale

Father Joe: Al Pacino & Kiefer Sutherland to star in Luc Besson action thriller

Father Joe: Al Pacino & Kiefer Sutherland to star in Luc Besson action thriller

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Kyrgyzstan Under the Khanstitution: 5 Years On

Kyrgyzstan Under the Khanstitution: 5 Years On

January 12, 2026
Injection π23 Tabula Rasa Brings Classic Survival Horror to Xbox Series X|S – Xbox Wire

Injection π23 Tabula Rasa Brings Classic Survival Horror to Xbox Series X|S – Xbox Wire

December 29, 2025
BBQ Traditions Around the World | Enchanting Travels

BBQ Traditions Around the World | Enchanting Travels

January 17, 2026
Leon Kennedy voice actor shares his dream Resident Evil game

Leon Kennedy voice actor shares his dream Resident Evil game

March 4, 2026
The Top 10 Websites of All Time According to AI

The Top 10 Websites of All Time According to AI

August 27, 2025
Retired? You may be eligible for the UAE 5-Year Retirement Visa: Eligibility, requirements, and benefits explained | World News – The Times of India

Retired? You may be eligible for the UAE 5-Year Retirement Visa: Eligibility, requirements, and benefits explained | World News – The Times of India

September 15, 2025
Trump warns tariffs will jump if EU does not fulfil trade deal by new deadline

Trump warns tariffs will jump if EU does not fulfil trade deal by new deadline

May 7, 2026
NFL schedule release 411: Potential release dates, most interesting games and more

NFL schedule release 411: Potential release dates, most interesting games and more

May 7, 2026
Blue chips falter amid wait for Middle East deal

Blue chips falter amid wait for Middle East deal

May 7, 2026
B1/B2 visa update: US Consulate opens priority slots for parents aged 50 years and above – The Times of India

B1/B2 visa update: US Consulate opens priority slots for parents aged 50 years and above – The Times of India

May 7, 2026
Who Is Hannah Harper? All About the ‘American Idol’ Season 24 Finalist

Who Is Hannah Harper? All About the ‘American Idol’ Season 24 Finalist

May 7, 2026
US jobless claim applications rise to 200K but remain historically low

US jobless claim applications rise to 200K but remain historically low

May 7, 2026
World News Prime

Discover the latest world news, insightful analysis, and comprehensive coverage at World News Prime. Stay updated on global events, business, technology, sports, and culture with trusted reporting you can rely on.

CATEGORIES

  • Breaking News
  • Business
  • Entertainment
  • Gaming
  • Health
  • Lifestyle
  • Politics
  • Sports
  • Technology
  • Travel

LATEST UPDATES

  • Trump warns tariffs will jump if EU does not fulfil trade deal by new deadline
  • NFL schedule release 411: Potential release dates, most interesting games and more
  • Blue chips falter amid wait for Middle East deal
  • About Us
  • Advertise With Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Policy
  • Terms and Conditions
  • Contact Us

© 2025 World News Prime.
World News Prime is not responsible for the content of external sites.

No Result
View All Result
  • Home
  • Breaking News
  • Business
  • Politics
  • Health
  • Sports
  • Entertainment
  • Technology
  • Gaming
  • Travel
  • Lifestyle

© 2025 World News Prime.
World News Prime is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In