gpu-scheduling

Here are 8 public repositories matching this topic...

NexusGPU / tensor-fusion

Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.

kubernetes ai gpu inference pytorch nvidia gpu-acceleration autoscaling amd-gpu vgpu gpu-usage gpu-virtualization gpu-scheduling karpenter dynamic-resource-allocation remote-gpu llm-serving rcuda gpu-pooling

Updated Dec 25, 2025
Go

yalue / cuda_scheduling_examiner_mirror

Star

A tool for examining GPU scheduling behavior.

benchmark gpu cuda mandelbrot cuda-kernels gpu-scheduling

Updated Aug 17, 2024
Cuda

tungngreen / PipelineScheduler

Star

PipelineScheduler optimizes workload distribution between servers and edge devices, setting optimal batch sizes to maximize throughput and minimize latency amid content dynamics and network instability. It also addresses resource contention with spatiotemporal inference scheduling to reduce co-location interference.

model-serving batch-inference gpu-scheduling dnn-serving

Updated Dec 4, 2025
C++

raj200501 / GPUOptimizerML

Star

The GPU Optimizer for ML Models enhances GPU performance for machine learning. It offers advanced scheduling, real-time monitoring, and efficient resource management through a user-friendly web interface and robust API, integrating big data technologies for seamless data processing and model optimization. @NVIDIA

model-management gpu-optimization real-time-monitoring secure-api big-data-integration gpu-scheduling

Updated Jun 29, 2024
Python

Gitdigital-products / fraud-detection-service

Star

# fraud-detection-service The **fraud-detection-service** detects fraudulent orders and user activity. ## Endpoints - `GET /health` — service status - `POST /fraud/check` — check an order for fraud (sample) - `GET /fraud/:orderId` — get fraud status for an order (sample) ## Tracing This service reports telemetry

Updated Dec 9, 2025
JavaScript

leoho0722 / llm-gpu-scheduler

Star

Design of a GPU Dynamic LLM Inference Task Scheduling Architecture Based on KubeAI

kubernetes monitoring prometheus gpu-scheduling llm-inference kubeai langtrace

Updated Aug 27, 2025
Python

chicogong / dtask-scheduler

Star

A distributed CPU/GPU task scheduler for large-scale batch jobs across thousands of machines. Zero dependencies, sub-millisecond latency.

golang distributed-systems queue high-performance scheduler job-scheduler distributed-computing zero-dependency task-scheduler task-queue control-plane batch-processing worker-pool load-balancing gpu-scheduling distributed-scheduler worker-agent

Updated Dec 23, 2025
Go

janakan2466 / kaleidoscope-infrastructure

Star

HPC research toolkit infrastructure for interfacing & analyzing LLMs (Kit is composed of: API gateway service, GPU scheduler, model servicer, and web interface)

slurm high-performance-computing natural-language-inference full-stack-application natural-language-procressing gpu-scheduling

Updated Dec 5, 2024
Python

Improve this page

Add a description, image, and links to the gpu-scheduling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpu-scheduling topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpu-scheduling

Here are 8 public repositories matching this topic...

NexusGPU / tensor-fusion

yalue / cuda_scheduling_examiner_mirror

tungngreen / PipelineScheduler

raj200501 / GPUOptimizerML

Gitdigital-products / fraud-detection-service

leoho0722 / llm-gpu-scheduler

chicogong / dtask-scheduler

janakan2466 / kaleidoscope-infrastructure

Improve this page

Add this topic to your repo